Apache Spark is one of the most widely used and supported open-source tools for machine learning and big data. In this course, discover how to work with this powerful platform for machine learning. Instructor Dan Sullivan discusses MLlib—the Spark machine learning library—which provides tools for data scientists and analysts who would rather find solutions to business problems than code, test, and maintain their own machine learning libraries. He shows how to use DataFrames to organize data structure, and he covers data preparation and the most commonly used types of machine learning algorithms: clustering, classification, regression, and recommendations. By the end of the course, you will have experience loading data into Spark, preprocessing data as needed to apply MLlib algorithms, and applying those algorithms to a variety of machine learning problems.
Learn More- Career Communities
- Identity Resources
- Career Planning
- Access Career Tools
- Build Skills at Denison Edge
- Build Your Resume and Cover Letter
- Search for an Internship or Job
- Network with Confidence
- Leverage Winter Break
- Participate in the Denison Internship Program
- Prepare for an Interview
- Plan for Graduate School
- Research Industries and Companies
- Utilize Financial Resources
- Alumni
- Academic Partnerships
- Meet The Team
- Student Employment