Professional Documents
Culture Documents
Edureka Data Science Masters Program Curriculum
Edureka Data Science Masters Program Curriculum
Discover Learning
Course Curriculum
masters
programme
About Edureka
Edureka's Masters Program provides an in-depth hands-on experience with tools &
systems used by Data Scientists. This program starts with Data Science training to
master data extraction, exploration techniques, and Machine Learning algorithms,
followed by Python Apache Spark and AI & Deep learning using Tensorflow. The
program is a combination of interactive online classroom and self-paced sessions
curated and led by industry experts. The exhaustive curriculum sets this program one
step ahead of short-term certifications and transforms you into an expert Data Scientist.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka!
Data Scientist
Index
1. Statistics Essentials for Analytics 01
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 01
The self-paced Statistics Essentials for Analytics Course is designed for the learners to
explained using dedicated examples. The use case is taken up at the end of each
module and insights are gathered, thus at the end of the course we have a Project
Learning Objectives
At the end of this module, you will be able to understand Skewness, Modality, Measures
of Center, Measures of Spread etc. You will also understand the relationship between
these terminologies. You will also be able to analyze airlines data set to gather insights.
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 02
Learning Objectives
At the end of this module, you will be able to understand the rules of probability, learn
about Disjoint and Independent events, understand the concept of probability, and
implement these concepts on a case-study. You will also learn and implement Bayes'
Topics
Learning Objectives
At the end of this module, you will be able to understand Normal distribution,
Topics
Normal Distribution
Data science is a "concept to unify statistics, data analysis and their related methods"
techniques and theories drawn from many fields within the broad areas of
databases, and visualization. The Data Science Certification Training Course enables
you to gain knowledge of the entire Life Cycle of Data Science, analyzing and visualizing
Learning Objectives
At the end of this Module, you should be able to: Define Data Science. Discuss the era of
Data Science. Describe the Role of a Data Scientist. Illustrate the Life cycle of Data
Science. List the Tools used in Data Science. State what role Big Data and Hadoop, R,
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 04
Topics
What does Data Science involve? Introduction to Big Data and Hadoop
Learning Objectives
After this module students will be able to: Define Statistical Inference. List the
Terminologies of Statistics. Illustrate the measures of Center and Spread. Explain the
Topics
Measures of Spread
Learning Objectives
After this module students will be able to: Discuss Data Acquisition techniques. List the
different types of Data. Evaluate Input Data. Explain the Data Wrangling techniques.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 05
Topics
Learning Objectives
After this module students will be able to: Define Machine Learning. Discuss Machine
Learning Use cases. List the categories of Machine Learning. Illustrate Supervised
Learning Algorithms.
Topics
Module 5 : Classification
Learning Objectives
After this module students will be able to: Define Classification. Explain different Types of
Classifiers such as Decision Tree, Random Forest, Naïve Bayes Classifier and Support
Vector Machine.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 06
Topics
What is Classification and its use cases? Algorithm for Decision Tree Induction
Learning Objectives
After this module students will be able to: Define Unsupervised Learning.
Discuss the Cluster Analysis like K - means Clustering, C - means Clustering and
Hierarchical Clustering
Topics
Learning Objectives
After this module students will be able to: Define Association Rules. Define
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 07
Topics
User-Based Recommendation
Learning Objectives
After this module students will be able to: Define Text Mining. Discuss Text Mining
Algorithms like Bag of Words Approach and Sentiment Analysis
Topics
Learning Objectives
After this module students will be able to: Describe Time Series data , Format your Time
Series data . List the different components of Time Series data. Discuss different kind of
Time Series scenarios. Choose the model according to the Time series scenario.
Implement the model for forecasting. Explain working and implementation of ARIMA
model. Illustrate the working and implementation of different ETS models. Forecast the
data using the respective model
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 08
Topics
Learning Objectives
After this module the will be able to: Define Reinforced Learning. Discuss Reinforced
Learning Use cases. Define Deep Learning. Understand Artificial Neural Network.
Discuss basic Building Blocks of Artificial Neural Network. List the important
Terminologies of ANN’s.
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 09
Project Work
Challenge : A company has been in industry since a long time. Their business had been
increasing quite well over past, however in recent years, there has been a slowdown in
terms of growth because their best and most experienced employees leaving
prematurely. The VP of the firm is not very happy with the company’s best and most
experienced employees leaving prematurely. The VP of the firm has employed you to find
out insights in the company employee data and find out an answer as to know why best
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 10
Learning Objectives
At the end of this Module, you should be able to understand Python – an Object oriented
Programming Language, List the Users of Python for Data Analytics, Define Identifiers
and Indentation, List Operations on Strings and Numbers, Run a Python Script.
Topics
Get an overview of Python Start Python
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 11
Learning Objectives
At the end of this module, you will be able to Define Reserved Keywords and Command
Line Arguments, Describe how to Get User Input from Keyboard, Describe Flow Control
and Sequences, Practice Working with Files, Define and Describe Dictionaries and Sets.
Topics
File attributes
Learning Objectives
At the end of this Module, you should be able to explain Functions and various forms of Function
Arguments, explain Standard Library, define modules, describe Zip Archives and Packaging.
Topics
Functions Sorting
Learning Objectives
At the end of this Module, you should be able to implement Regular Expression and its
Basic Functions; Use Classes, Objects, and Attributes, Develop applications based on
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 13
Learning Objectives
At the end of this Module, you should be able to debug python scripts using pdb, debug
python scripts using IDE, classify Errors, develop Unit Tests, create project Skeletons,
implement Database using SQLite and perform CRUD operations on SQLite database.
Topics
Learning Objectives
At the end of this Module, you should be able to Statistics - data terminology,
Variance, Std. Deviation using Numpy. Distributions, Probability and Bayes’ Theorem
using Scipy. Numpy - arrays, matrices, related operations. Scipy - overview, areas of
application.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 14
Topics
Learning Objectives
At the end of this Module, you should be able to Define Machine Learning and
regression analysis. Apply Unsupervised Learning process flow, clustering. Apply Linear
Regression, Multivariate Regression. Measure accuracy using Mean Squared Error, Cross
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 15
Learning Objectives
At the end of this Module, you should be able to: Feature engineer datasets using PCA,
Bias/Variance analysis. Apply classifications algorithms like KNN, Random Forests, SVM
etc. Apply clustering algorithms like K-Means, Hierarchical clustering etc. Compute
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 16
Learning Objectives
At the end of this Module, you should be able to discuss: Apache Spark - Concepts, RDD,
MLLib, Data frames. Transformations, Actions, Shuffling, Persistence and Data Removal.
Shared variables - accumulators and broadcast. Spark SQL and Data frames. Spark MLlib.
Topics
Learning Objectives
At the end of this Module, you should be able to discuss: Web scraping and its
advantages. Discuss Steps Involved in Web Scraping. Use BeautifulSoup package and its
functions. Scrape IMDB webpage. Fetch Streaming Tweets from Twitter. Perform
Sentiment Analysis on tweets Fetched from Twitter and determine which is more popular
Ferrari or Porsche.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 17
Topics
Project Work
quite well over past, however in recent times there has been a slowdown in terms of
employees have started leaving the Company. In order to prevent this proactively you
Data and find out an answer as to know why the best and most experienced
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 18
The Edureka Apache Spark & Scala course will enable learners to understand how
Spark enables in-memory data processing and runs much faster than Hadoop
MapReduce. Learners learn about RDDs, different APIs which Spark offers such as
Spark Streaming, MLlib, SparkSQL, GraphX. This Edureka course is an integral part of a
Learning Objectives
In this module, you will understand the basics of Scala that are required for program-
ming Spark applications. You can learn about the basic constructs of Scala such as vari-
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 19
Learning Objectives
In this module, you will learn about object oriented programming and functional
Topics
Learning Objectives
In this module, you will understand what is big data, challenges associated with it and
the different frameworks available. The module also includes a first-hand introduction
to Spark.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 20
Topics
How Hadoop Solves the Big Data Problem Hadoop Cluster and Its Architecture
Learning Objectives
In this module, you will understand different frameworks available for Big Data
Analytics and the module also includes a first-hand introduction to Spark, demo on
Topics
Big Data Analytics with Batch & Spark Components & It’s
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 21
Learning Objectives
In this module, you will learn one of the fundamental building blocks of Spark -
Actions and Functions performed on RDD). You will learn about Spark applications, how it
Topics
Challenges in Existing Computing Methods RDD Lineage
Learning Objectives
In this module, you will learn about Spark SQL which is used to process structured
data with SQL queries. You will learn about data-frames and datasets in Spark SQL and
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 22
Learning Objectives
In this module you will learn about what is the need for machine learning, types of
ML concepts, clustering and MLlib (i.e. Spark’s machine learning library), various
Topics
Learning Objectives
In this module, you will understand Kafka and Kafka Architecture. Afterwards you
will go through the details of Kafka Cluster and you will also learn how to configure
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 23
Topics
Learning Objectives
In this module you will get an introduction to Apache Flume and its basic
architecture and how it is integrated with Apache Kafka for event processing.
Topics
Flume Sinks
Learning Objectives
In this module you will get an opportunity to work on Spark streaming which is
used to build scalable fault-tolerant streaming applications. You will learn about
DStreams and various Transformations performed on it. You will get to know about main
streaming operators, Sliding Window Operators and Stateful Operators.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 24
Topics
Project Work
Project 1 : US Election
Industry: Government
Technologies Used:
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 25
over Bernie Sanders from Democrats and on the other hand, Donald Trump was
nominated from Republican Party to contest for the presidential position. As an analyst,
you have been tasked to understand different factors that led to the winning of Hillary
Technologies Used:
Industry: Transportation
Technologies Used:
Problem Statement: A US cab service start-up (i.e. Instant cabs) wants to meet the
demands in an optimum manner and maximize the profit. Thus, they hired you as a data
analyst to interpret the available Uber’s data set and find out the beehive customer
pick-up points & peak hours for meeting the demand in a profitable manner.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 26
Technologies Used:
Problem Statement: You will be given a CDR (Call Details Record) file, you need to find
out top 10 customers facing frequent call drops in Roaming. This is a very important
report which telecom companies use to prevent customer churn out, by calling them
back and at the same time contacting their roaming partners to improve the connectivity
issues in.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 27
Delve into neural networks, implement Deep Learning algorithms, and explore layers
of data abstraction with the help of this Deep Learning using TensorFlow Certification
Training. In this Training, you will be able to learn the basic concepts of TensorFlow, the
main functions, operations and the execution pipeline. Starting with a simple “Hello
Word” example, throughout the course you will be able to see how TensorFlow can be
used in curve fitting, regression, classification and minimization of error functions. This
concept is then explored in the Deep Learning world. You will evaluate the common,
and not so common, deep neural networks and see how these can be exploited in the
real world with complex raw data using TensorFlow. In addition, you will learn how to
apply TensorFlow for backpropagation to tune the weights and biases while the Neural
Networks are being trained. Finally, the course covers different types of Deep
Learning Objectives
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 28
Topics
• Likelihood
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 29
Learning Objectives
At the end of this Module, you should be able to: Illustrate How Deep Learning Works?
Topics
What is Tensorflow?
Learning Objectives
At the end of this Module, you should be able to: Understand limitations of A Single
Topics
Perceptron
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 30
Learning Objectives
At the end of this Module, you should be able to: Define TensorFlow. Illustrate how
Topics
Learning Objectives
At the end of this Module, you should be able to: Define CNNs. Discuss the
Applications of CNN. Explain the Architecture of a CNN. ist Convolution and Pooling
Layers in CNN. Illustrate CNN. Discuss Fine-tuning and Transfer Learning of CNNs.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 31
in a CNN
Learning Objectives
At the end of this Module, you should be able to: Define RNN. Discuss the Applications
Explain Recursive Neural Tensor Network Theory. Illustrate the working of Neural
Network Model.
Topics
Learning Objectives
At the end of this Module, you should be able to: Define RBM. Discuss the Applications
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 32
Topics
Module 8 : Keras
Learning Objectives
At the end of this Module, you should be able to: Define Keras. Understand Keras
Model Building Blocks. Illustrate Different Compositional Layers for a Keras Model.
Implement a Use-Case Step by Step. Understand few features available with Keras.
Topics
How to compose Models in Keras Saving and Loading a model with Keras
Module 9 : Tflearn
Learning Objectives
At the end of this Module, you should be able to: Define TFlearn. Understand TFlearn
Model Building Blocks. Illustrate Different Compositional Layers for a TFlearn Model.
Implement a Use-Case Step by Step. Understand few features available with TFlearn.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 33
Topics
Learning Objectives
At the end of this Module, you should be able to: Define RBM. Discuss the Applications
Project Work
1. To create an image classifier using CNN, to classify images in one of the predefined
100 classes.
2. To create a script generator using LSTM, for generating scripts for any popular
3. Capstone project, here you can choose a dataset of your own, explore the different
challenges faced on the dataset domain and try to solve any one of them with any
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 34
Course Curriculum
Tableau is one of the hottest trends in business intelligence. With its intuitive and
organizations, big and small. It is one of the many data-related tools that work
alongside R, Python or D3.js that can help you to create complex and beautiful data
visualizations. More than 35,000 companies worldwide have truly transformed the way
they uncover insights from data they possess. Learn the key concepts of Data
Visualization and how data can be transformed by cleaning, splitting, pivoting, and
connection with datasets, perform Joins on the data sets and explore the new Cross
Join feature. Discover new ways to analyze your data such as, quick highlighting,
reference lines, and the new clustering function. Create personalized, dynamic
visualizations by using parameters to take user input and drive the visualization.
Understand good design practices for dashboards and how to make them fully
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 35
Learning Objectives
In this module, you will learn to: Identify the prerequisites, goal, objectives, methodology,
material, and agenda for the course. Discuss the basic of Data Visualization. Get a brief
idea about Tableau, establish connection with the dataset, and perform Joins operation
Topics
Data Visualization. Joins and Union
Learning Objectives
In this module, you will learn to: Manage extracts and metadata (by creating hierarchy
and folders). Describe what is Visual Analytics, why to use it, and it’s various scopes.
Explain aggregating and disaggregating data and how to implement data granularity
using marks card on aggregated data. Describe what is highlighting, with the help of a
use-case. Illustrate basic graphs including bar graph, line graph, pie chart, dual axis
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 36
Learning Objectives
In this module, you will learn to: Perform sorting techniques including quicksort, using
measures, using header and legend, and sorting using pill with the help of a use case.
Master yourself into various filtering techniques such as Parametrized filtering, Quick
Filter, Context Filter. Learn about various filtering option available with the help of use
case and different scenarios. Illustrate grouping using data-window, visual grouping, and
Calculated Grouping (Static and Dynamic). Illustrate some more graphical visualization
including Heat Map, Circle Plot, Scatter Plot, and Tree Maps.
Topics
Sorting. Grouping
Learning Objectives
In this module, you will learn to: Explain the basic concepts of sets followed by creating
sets using Marks Card, computation sets and combined sets. Describe the concepts of
forecasting with the help of Forecasting problem as a use-case. Discuss the basic
concept of clustering in Tableau. Add Trend lines and reference line to your visualization.
Discuss about Parameter in depth using Sets and Filter.
Topics
Clustering. Parameters.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 37
Learning Objectives
In this module, you will learn to: Describe the basic concepts of Dashboard and its UI.
Build a dashboard by adding sheets and object into it. Modify the view and layout. Edit
dashboard using actions (filter, highlighting, URL). Create stories for your Visualization
and Dashboards.
Topics
Module 6: Mappings
Learning Objectives
In this module, you will learn to: Map the coordinates on the map, plot geographic data,
and use layered view to get the street view of the area. Edit the ambiguous and
unrecognized location plotted on the map. Customize territory in a polygon map.
Connect to the WMS Server, use a WMS background map and saving it. Add a
background image and generate its coordinate and plot the points.
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 38
Module 7: Calculations
Learning Objectives
In this module, you will learn to: Perform Calculations using various types of functions
such as Number, String, Date, Logical, and Aggregate. In addition, you will get to know
about Quick Table Calculation. Cover the following LOD expressions – Fixed, Included,
and Excluded.
Topics
Aggregate Functions.
Learning Objectives
In this module, you will learn to: Tackle complex scenarios by using LOD expressions.
Topics
Use Case II - Profit per Business Day. Use Case V - Finding the second order date.
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
edureka! 39
Module 9: Charts
Learning Objectives
In this module, you will learn to: Plot various types of Charts using Tableau 10 and have
Topics
Learning Objectives
In this module, you will learn to: You will know the basics of Big Data, Hadoop, and R.
You will discuss the integration between Hadoop and R and will integrate R with Tableau.
In addition, you will get to publish your workbook on Tableau Server.
Topics
www.edureka.co © 2017 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.