Professional Documents
Culture Documents
B3. Machine Learning With Apache Spark - Coursera
B3. Machine Learning With Apache Spark - Coursera
Enroll
Starts Dec 22
Course
Gain insight into a topic and learn the fundamentals
Intermediate level
Recommended experience
14 hours (approximately)
Flexible schedule
Learn at your own pace
Describe ML, explain its role in data engineering, summarize generative AI, Evaluate ML models, distinguish between regression, classification, and
discuss Spark's uses, and analyze ML pipelines and model persistence. clustering models, and compare data engineering pipelines with ML
pipelines.
Construct the data analysis processes using Spark SQL, and perform Demonstrate connecting to Spark clusters, build ML pipelines, perform
regression, classification, and clustering using SparkML. feature extraction and transformation, and model persistence.
Machine Learning Machine Learning Pipelines Data Engineer SparkML Apache Spark
Details to know
Enroll
Starts Dec 22
Start by learning ML fundamentals before unlocking the power of Apache Spark to build and deploy ML models for data engineering applications.
Dive into supervised and unsupervised learning techniques and discover the revolutionary possibilities of Generative AI through instructional
readings and videos.
Gain hands-on experience with Spark structured streaming, develop an understanding of data engineering and ML pipelines, and become proficient
in evaluating ML models using SparkML.
In practical labs, you'll utilize SparkML for regression, classification, and clustering, enabling you to construct prediction and classification models.
Connect to Spark clusters, analyze SparkSQL datasets, perform ETL activities, and create ML models using Spark ML and sci-kit learn. Finally,
demonstrate your acquired skills through a final assignment.
This intermediate course is suitable for aspiring and experienced data engineers, as well as working professionals in data analysis and machine
learning. Prior knowledge in Big Data, Hadoop, Spark, Python, and ETL is highly recommended for this course.
Read less
In this module, you will gain knowledge of machine learning techniques that enable computers to perform tasks without explicit programming. You will
explore the lifecycle of machine learning models and understand the crucial role of data engineering in machine learning projects. The module covers
supervised and unsupervised learning techniques, including classification, regression, and clustering. Furthermore, you will acquire valuable insights
into Generative AI and its potential to revolutionize multiple industries, enhance people's lives, and generate newer and previously unimaginable data
and experiences.
What's included
Regression • 6 minutes
Classification • 6 minutes
Clustering • 5 minutes
Hands-on Lab: Building and Training a Prediction Model using Linear Regression • 30 minutes
This module will introduce you to Spark and provide an overview of its key features and applications in the field of data engineering. You will discover
the process of connecting to a Spark cluster using SN labs and delve into various topics such as regression, mileage prediction, classification, diabetic
classification, clustering, and clustering load data using SparkML. Additionally, you will gain insights into how to construct these models using Spark
ML. Moreover, this module will cover GraphFrames on Apache Spark and guide you in hands-on labs.
What's included
This module begins with Apache Spark Structured Streaming and its role in processing streaming data with Spark SQL. You will acquire knowledge
about key terms associated with Structured Streaming. The module then covers the Extract-Transform-Load process and provides hands-on experience
in transferring data from one source to another destination with varying data formats or structures. Additionally, you will gain a practical understanding
of feature extraction and transformation using Spark extract and transform features. The module also delves into machine learning pipelines using
Spark, demonstrating the process and benefits involved. Lastly, you will grasp the concept of model persistence and its significant role in Machine
Learning.
What's included
Graded Quiz: Data Engineering for Machine Learning using Apache Spark • 30 minutes
Practice Quiz: Data Engineering for Machine Learning using Apache Spark • 10 minutes
Final Project
Module details
Module 4 • 3 hours to complete
In this module, you will apply the data engineering skills and techniques you have acquired throughout the course. The course concludes with a final
project and assignments that allow you to demonstrate your proficiency in these areas. You will step into the role of a data engineer working at a
renowned aeronautics consulting company recognized for its adeptness in handling large datasets. Your role as a data engineer is crucial as the data
scientists rely on your expertise to carry out ETL (Extract, Transform, Load) tasks and establish machine learning pipelines. While data scientists possess
expertise in machine learning, they depend on your specialized knowledge to handle various algorithms and data formats. Your contribution plays a
vital role in ensuring the smooth execution of their tasks.
Instructors
Instructor ratings 4.9 (9 ratings)
Offered by
IBM
Learn more
Recommended Degrees
IBM IBM
Course Course
Show 8 more
"To be able to take courses at my own pace and rhythm has been "I directly applied the concepts and skills I learned from my
an amazing experience. I can learn whenever it fits my schedule ● ○ courses to an exciting new project at work."
and mood."
4.7 29 reviews
5 stars 82.75%
4 stars 6.89%
3 stars 6.89%
2 stars 0%
1 star 3.44%
BS
5 · Reviewed on Dec 19, 2023
What Is a Machine Learning 7 Machine Learning Projects to How to Land a Machine Learning Machine Learning in Finance: 10
Engineer? (+ How to Get Started) Build Your Skills Internship: A 2024 Career Guide Applications and Use Cases
November 29, 2023 November 29, 2023 December 12, 2023 November 29, 2023
Article Article · 7 min read Article · 6 min read Article · 7 min read
Learn more
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Explore degrees
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy
Learn more
More questions
Visit the learner help center
Coursera Community
About Learners
What We Offer Partners
Leadership Beta Testers
Careers Translators
Catalog Blog
Coursera Plus The Coursera Podcast
Professional Certificates Tech Blog
MasterTrack® Certificates Teaching Center
Degrees
For Enterprise
For Government
For Campus
Become a Partner
Coronavirus Response
Social Impact
More
Press
Investors
Terms
Privacy
Help
Accessibility
Contact
Articles
Directory
Affiliates
Modern Slavery Statement
Manage Cookie Preferences
Learn Anywhere
Follow Us
© 2023 Coursera Inc. All rights reserved.