Big Data Analytics with Hadoop and Apache Spark

Apache Hadoop was a pioneer in the world of big data technologies, and it continues to be a leader in enterprise big data storage. Apache Spark is the top big data processing engine and provides an impressive array of features and capabilities. When used together, the Hadoop Distributed File System (HDFS) and Spark can provide a truly scalable big data analytics setup. In this course, learn how to leverage these two technologies to build scalable and optimized data analytics pipelines. Instructor Kumaran Ponnambalam explores ways to optimize data modeling and storage on HDFS; discusses scalable data ingestion and extraction using Spark; and provides tips for optimizing data processing in Spark. Plus, he provides a use case project that allows you to practice your new techniques.
