Cloud Hadoop: Scaling Apache Spark

Apache Hadoop and Spark make it possible to generate genuine business insights from big data. The Amazon cloud is a natural home for this powerful toolset, providing a variety of services for running large-scale data-processing workflows. Learn to implement your own Apache Hadoop and Spark workflows on AWS in this course with big data architect Lynn Langit. Explore deployment options for production-scale jobs using virtual machines with EC2, managed Spark clusters with EMR, or containers with EKS. Learn how to configure and manage Hadoop clusters and Spark jobs with Databricks, and use Python or the programming language of your choice to import data and execute jobs. Plus, learn how to use Spark libraries for machine learning, genomics, and streaming. Each lesson helps you understand which deployment option is best for your workload.
