
Advanced Certification in Data Analytics

The Advanced Certification in Data Analytics course by the E&ICT Academy of IIT Kanpur, in association with
Edvancer, is a comprehensive, state-of-the-art program in this domain. This 180-learning-hour
program for working professionals & freshers is designed, curated and delivered by faculty from the IITs and industry.
The program blends academic rigour with significant industry exposure, equipping you with advanced
conceptual knowledge and future-ready skill sets. Once completed satisfactorily, the program
aims to prepare you for a smooth, accelerated transition to a high-end career in data
analytics. On successful completion of the program you will receive a certificate from E&ICT Academy, IIT Kanpur,
a joint initiative of IIT Kanpur and the Ministry of Electronics & Information Technology, Government of India.

Full Curriculum
Module 1: Business Analytics in R
What is this module about?: Business Analytics is the scientific process of deriving insights from raw
data to support decision making and is the core of data analytics. Through this module you will learn how
to use predictive analytics techniques and the R language to solve business problems. This is
a comprehensive module which will take you from the basics of statistical techniques, through the basic and
advanced aspects of the R language, right up to building predictive models.

Tools to be learnt: R, R Studio


Class Duration: 66 hours of live classes + 12 hours of pre-recorded videos

Topic What does it mean?


Introduction to business analytics
• What is analytics & why is it so important?
• Applications of analytics
• Different kinds of analytics
• Various analytics tools
• Analytics project methodology
• Real world case study
What does it mean?: In this section we shall provide you with an overview of the world of analytics. You will learn about the various applications of analytics and how companies are using analytics to prosper, and study the analytics project methodology through a real-world case study.
R Training
Fundamentals of R
• Installation of R & R Studio
• Getting started with R
• Basic & advanced data types in R
• Variable operators in R
• Working with R data frames
• Reading and writing data files to R
• R functions and loops
• Special utility functions
• Merging and sorting data
• Case study on data management using R
• Practice assignment
What does it mean?: This part is all about learning how to manage and manipulate data and datasets, the very first step of analytics. We shall teach you how to use R to work with data using a case study.
Data visualization in R
• Need for data visualization
• Components of data visualization
• Utility and limitations
• Introduction to grammar of graphics
• Using the ggplot2 package in R to create visualizations
What does it mean?: Data visualization is extremely important for understanding what the data is saying and gaining insights at a glance. Visualization of data is a strong point of the R software, and you will learn how to use it in this module.
Data preparation and cleaning using R
• Needs & methods of data preparation
• Handling missing values
• Outlier treatment
• Transforming variables
• Derived variables
• Binning data
• Modifying data with Base R
• Data processing with dplyr package
• Using SQL in R
• Practice assignment
What does it mean?: Real-world data is rarely handed to you perfect on a platter. It is usually dirty, with missing data points, incorrect data, and variables that need to be changed or created before analysis. A typical analytics project will have 60% of its time spent on preparing data for analysis. This is a crucial process, as properly cleaned data will result in more accurate and stable analysis. We shall teach you the techniques required to be successful in this aspect.
Setting the base of business analytics
Understanding the data using univariate statistics in R
• Summarizing data, measures of central tendency
• Measures of variability, distributions
• Using R to summarize data
• Case study on univariate statistics using R
• Practice assignment
What does it mean?: This is where you shall learn how to start understanding the story your data is narrating by summarizing the data, checking its variability, and visualizing its shape. We shall take you through various ways of doing this using R and also solve a case study.
Hypothesis testing and ANOVA in R to guide decision making
• Introducing statistical inference
• Estimators and confidence intervals
• Central Limit theorem
• Parametric and non-parametric statistical tests
• Analysis of variance (ANOVA)
• Conducting statistical tests
• Practice assignment
What does it mean?: "With 95% confidence, we can say that there is an 85% chance that people visiting this site twice will enroll for the course." Here, you learn how to create a hypothesis, test and validate it through data within a statistical framework, and present it with clear and formal numbers to support decision making.
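As a rough illustration of the kind of statement quoted in this topic, the sketch below computes a 95% confidence interval for an enrollment rate. The module itself teaches this in R; Python's standard library is used here only for illustration. The visitor counts are hypothetical, the function name is invented for this example, and the normal-approximation (Wald) interval is just one of several possible methods.

```python
from math import sqrt
from statistics import NormalDist


def proportion_confidence_interval(successes, trials, confidence=0.95):
    """Normal-approximation (Wald) confidence interval for a proportion."""
    if trials <= 0 or not 0 <= successes <= trials:
        raise ValueError("need trials > 0 and 0 <= successes <= trials")
    p_hat = successes / trials
    # Two-sided critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = z * sqrt(p_hat * (1 - p_hat) / trials)
    # Clip to [0, 1] because a proportion cannot leave that range.
    return max(0.0, p_hat - margin), min(1.0, p_hat + margin)


# Hypothetical data: 170 of 200 repeat visitors enrolled (an 85% sample rate).
low, high = proportion_confidence_interval(170, 200)
print(f"95% CI for the enrollment rate: {low:.3f} to {high:.3f}")
```

The interval narrows as the number of observed visitors grows, which is why sample size matters when presenting such numbers to decision makers.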
Predictive modelling in R

1. Correlation and Linear regression
• Correlation
• Simple linear regression
• Multiple linear regression
• Model diagnostics and validation
• Case study
What does it mean?: A statistical model is the core of predictive analytics, and regression is one of the most powerful tools for making predictions by finding patterns in data. You shall learn the basics of regression modelling hands-on through real-world cases.
2. Logistic regression
• Moving from linear to logistic
• Model assumptions and Odds ratio
• Model assessment and gains table
• ROC curve and KS statistic
• Case Study
What does it mean?: Logistic regression is the workhorse of the predictive analytics world. It is used to make predictions in cases where the outcome is binary in nature, i.e. an X-or-Y scenario where we need to predict, given some data, whether X or Y will be the case. This is a must-know technique and we shall make you comfortable with it through real-world problems.
3. Techniques of customer segmentation
• Need for segmentation
• Criterion of segmentation
• Types of distances
• Hierarchical clustering
• K-means clustering
• Deciding number of clusters
• Case study
What does it mean?: Learn why and how to statistically divide a broad customer market into various segments of customers who are similar to each other, so as to be able to better target and meet their needs in a cost-effective manner. This is one of the most essential techniques in marketing analytics.
4. Time series forecasting techniques
• Need for forecasting
• What are time series?
• Smoothing techniques
• Time series models
• ARIMA
What does it mean?: The ability to forecast into the future is very important for any business, and forecasts need to be as accurate as possible for financial and strategic planning. In this module, learn the techniques of time series analysis without being misled by seasonal and cyclical impacts.

5. Decision trees & Random Forests
• What are decision trees
• Entropy and Gini impurity index
• Decision tree algorithms
• CART
• Random Forest
• Case Study
What does it mean?: Decision trees are predictive models which map observations about an item to conclusions about the item's target value. Learn the technique of decision trees, one of the most popular predictive analytics techniques.
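The entropy and Gini impurity measures listed in this topic can be sketched in a few lines. The module covers them in R; the Python below is only an illustration using their standard textbook definitions, and the function names and sample labels are hypothetical.

```python
from collections import Counter
from math import log2


def gini_impurity(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    if not labels:
        raise ValueError("labels must not be empty")
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy in bits: sum of -p * log2(p) over the classes."""
    if not labels:
        raise ValueError("labels must not be empty")
    n = len(labels)
    return sum(-(count / n) * log2(count / n) for count in Counter(labels).values())


# Hypothetical target values at one node of a tree.
mixed_node = ["yes", "yes", "no", "no"]
pure_node = ["yes", "yes", "yes"]
print(gini_impurity(mixed_node), entropy(mixed_node))
print(gini_impurity(pure_node), entropy(pure_node))
```

Both measures are zero for a node containing a single class and largest when the classes are evenly mixed, which is why tree algorithms prefer splits that reduce them.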
6. Boosting Machines
• Concept of weak learners
• Introduction to boosting algorithms
• Adaptive Boosting
• Extreme Gradient Boosting (XGBoost)
• Case study
What does it mean?: Want to win a data science contest on Kaggle or at a data hackathon, or be known as a top data scientist? Then learning boosting algorithms is a must, as they provide a very powerful way of analysing data and solving hard-to-crack problems.
7. Cross Validation & Parameter Tuning
• Model performance measure with cross validation
• Parameter tuning with grid & randomised grid search
What does it mean?: Learn how to make your model more accurate so that it performs at its best on real-world data.

Module 2: Data Analysis in SQL (Videos only)


What is this module about?: This "Data Analyst using SQL" video tutorial teaches you how to use the
popular SQL language to analyse data stored in databases. SQL is a requirement in almost all analytics roles
& this module will help prepare you to work as a data analyst. In this SQL tutorial you will learn how to
communicate with databases, extract data from them, manipulate the data, analyse it and create reports.

Tools to be learnt: MS SQL


Class Duration: 6 hours of pre-recorded videos

Topic What does it mean?


Introduction To SQL
• What is SQL?
• Why SQL?
• What are relational databases?
• SQL command group
• MS SQL Server installation
What does it mean?: Structured Query Language (SQL) is a standard language for storing, manipulating and retrieving data in databases. It is a heavily used language and a must-know for every data scientist. Here we will introduce you to SQL using MS SQL.
SQL Data Types & Operators
• SQL Data Types
• Filtering Data
• Arithmetic Operators
• Comparison operators
• Logical Operators
• Exercises
What does it mean?: Learn about various types of data and how to filter and conduct basic operations on data in databases using SQL.
Useful Operations in SQL
• Distinct Operation
• Top N Operation
• Sorting results
• Combine results using Union
• Null comparison
• Alias
• Exercises
What does it mean?: Learn more advanced operations on data.
Aggregating Data in SQL
• Aggregate functions
• Group By clause
• Having clause
• Over clause
• Exercises
What does it mean?: Aggregate data using various conditions and clauses in SQL to get the answers you are looking for.
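A minimal sketch of aggregating with GROUP BY and HAVING is shown below. The module uses MS SQL; this example instead runs the query through Python's built-in `sqlite3` module so that it is self-contained. The `orders` table and its rows are hypothetical.

```python
import sqlite3

# In-memory SQLite database standing in for MS SQL (an assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("asha", 120.0), ("asha", 80.0), ("ravi", 40.0), ("meera", 300.0)],
)

# GROUP BY collapses rows per customer; HAVING filters the aggregated groups.
customer_totals = conn.execute(
    """
    SELECT customer, COUNT(*) AS n_orders, SUM(amount) AS total
    FROM orders
    GROUP BY customer
    HAVING SUM(amount) >= 100
    ORDER BY total DESC
    """
).fetchall()

for customer, n_orders, total in customer_totals:
    print(customer, n_orders, total)
conn.close()
```

WHERE filters individual rows before grouping, while HAVING filters groups after aggregation, which is why the spending threshold sits in the HAVING clause here.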
Writing Sub-Queries in SQL
• What are sub-queries?
• Sub-query rules
• Writing sub-queries
• Exercises
What does it mean?: A subquery is a SQL query within a query. Subqueries are nested queries that provide data to the enclosing query. In this module you will learn how to write various sub-queries.
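To make the idea of a query nested inside another query concrete, here is a small sketch. It uses Python's built-in `sqlite3` module rather than the MS SQL Server taught in the module, and the `employees` table and its values are hypothetical.

```python
import sqlite3

# In-memory SQLite database standing in for MS SQL (an assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary REAL)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [("asha", 50000), ("ravi", 70000), ("meera", 90000), ("john", 30000)],
)

# The inner query computes the average salary once;
# the enclosing query uses that value as its filter.
above_average = conn.execute(
    """
    SELECT name
    FROM employees
    WHERE salary > (SELECT AVG(salary) FROM employees)
    ORDER BY name
    """
).fetchall()

print([name for (name,) in above_average])
conn.close()
```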
Common functions in SQL
• Ranking functions
• Date & time functions
• Logical functions
• String functions
• Conversion functions
• Mathematical functions
• Exercises
What does it mean?: Learn some of the common functions available in SQL to transform the data into more meaningful data.
Analytic Functions in SQL
• What are analytic functions?
• Various analytic functions
• SQL syntax for analytic functions
• Exercises
What does it mean?: Here you will learn various analytic functions in SQL to undertake data analysis.
Writing DML Statements
• What are DML Statements?
• Insert statement
• Update statement
• Delete statement
• Exercises
What does it mean?: DML is the abbreviation of Data Manipulation Language in SQL. It is used to retrieve, store, modify, delete, insert and update data in databases.
Writing DDL Statements
• What are DDL Statements?
• Create statement
• Alter statement
• Drop statement
• Exercises
What does it mean?: DDL refers to "Data Definition Language", a subset of SQL statements that change the structure of the database schema in some way, typically by creating, deleting, or modifying schema objects such as databases, tables, and views.
Using Constraints in SQL
• What are constraints?
• Not Null Constraint
• Unique constraint
• Primary key constraint
• Foreign key constraint
• Check constraint
• Default Constraint
• Exercises
What does it mean?: Constraints provide a standard mechanism to maintain the accuracy and integrity of the data inside a table. There are several different types of constraints in SQL which you will learn here.
SQL Joins
• What are joins?
• Cartesian Join
• Inner Join
• Left & Right Join
• Full Join
• Self Join
What does it mean?: A SQL Join statement is used to combine data or rows from two or more tables. Learn the various joins in SQL in this module.
Views in SQL
• What are views?
• Create View
• Drop view
• Update view
What does it mean?: A view is a virtual table that consists of columns from one or more tables. Though it is similar to a table, its data is not stored separately in the database; the view itself is a query stored as an object. Hence, a view is an object that derives its data from one or more tables. Learn how to create these views in this module.
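The point that a view is a stored query rather than stored data can be sketched as follows. Python's built-in `sqlite3` module is used here in place of MS SQL, and the `sales` table and `region_totals` view are hypothetical names chosen for this example.

```python
import sqlite3

# In-memory SQLite database standing in for MS SQL (an assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("north", 100.0), ("north", 25.0), ("south", 60.0)],
)

# The view stores only the query definition, not a copy of the rows.
conn.execute(
    "CREATE VIEW region_totals AS "
    "SELECT region, SUM(amount) AS total FROM sales GROUP BY region"
)

query = "SELECT total FROM region_totals WHERE region = 'north'"
total_before = conn.execute(query).fetchone()[0]

# A new row in the base table shows up in the view without recreating it.
conn.execute("INSERT INTO sales VALUES ('north', 50.0)")
total_after = conn.execute(query).fetchone()[0]

print(total_before, total_after)
conn.execute("DROP VIEW region_totals")
conn.close()
```

Because the view is re-evaluated each time it is queried, it always reflects the current contents of the underlying table.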

Module 3: Data Visualization in Tableau (Videos Only)


What is this module about?: This video-based Tableau tutorial teaches you how to visualize data using
Tableau Desktop 10. Tableau is one of the world’s most popular data visualization and BI tools, and through
this module you will learn how to create data dashboards and stories to present your analysis to top
management. You get to work on 17 case studies and 3 projects in this course.

Tools to be learnt: Tableau


Class Duration: 6 hours of pre-recorded videos

Topic What does it mean?


Introduction to Business Intelligence & Visualization
• What is Business Intelligence?
• What is data visualization?
• Need for Visualization
• Uses of visualization
What does it mean?: In this module, you will understand what exactly data visualization is, how it relates to business intelligence & analytics, and the need for and uses of visualization.
Introduction to Tableau
• What is Tableau
• Tableau vs. Excel
• Installing Tableau Desktop
• Overview of Tableau Desktop
• Various Applications of Tableau
• Components of Tableau Desktop
• Benefits of Tableau and Opportunities
• Tableau Products & Certifications
• Tableau Architecture
• Saving and publishing your work in Tableau
What does it mean?: We will introduce you to the Tableau software, which is one of the most popular tools globally for visualizing data. Here you will learn how to download and install Tableau on your system, why Tableau is used instead of Excel, and the various components of the Tableau software along with its uses and benefits.
Dive into Tableau
• Explore Tableau Interface
• Understand various Tableau terminologies
• Create Different Views to Analyze Data
• Case Study
What does it mean?: Here you will get to know Tableau in more detail and explore how to start the visual analysis of data.
Connecting Data Sources
• Connection Options
• Data Types
• Data Roles
• Joins & Over Joins
• Unions
• Custom SQL Query
• Data Blending
• Editing Connections
• Case Study 1
• Case Study 2
What does it mean?: In this module you will learn how to connect various sources of data to Tableau, join various datasets to one another and write custom SQL queries within Tableau.
Tableau Generated Fields
• Use of Measure Names and Measure Values
• Compare Multiple Measures
• Fetch Number of Records In Database
• Latitude & Longitude Fields
• Case Study
What does it mean?: Understand how to use various fields in Tableau to build various views of your data.
Data Manipulation in Tableau
• Creating Groups
• Creating Combined Fields
• Sorting
• Filtering Data
• Sets
• Binning Data
• Hierarchies
• Case Study 1
• Case Study 2
• Case Study 3
What does it mean?: This module will teach you how to manipulate data (sorting, combining fields, binning etc.) through multiple case studies.
Working with Dates in Tableau
• Changing Date Levels
• Different Date Parts
• Custom Dates
• Create Fiscal Dates
• Case Study
What does it mean?: Learn how to visualize data that contains dates or times.
Data Customization with Calculations
• Calculated Fields
• Arithmetic Calculations
• Date Calculation
• String Calculation
• Logical Calculation
• Type Conversion Calculation
• Table Calculation
• Level of Detail Calculations
• Case Study 1
What does it mean?: Undertake calculations on your data in Tableau to come up with new views of the data.
Adding Dynamism to a View with Parameters
• Introduction to Parameters
• Create a Parameter
• Explore Parameters
• Use Parameters in Calculations
• Parameters in Reference Lines
• Parameters in Filters
• Make Estimates using Parameters
• Case Study
What does it mean?: Parameters are dynamic values that can replace constant values in calculations, filters, and reference lines. Use parameters to make your visualization more interactive.
Geographical Analysis & Maps
• Where is a Geographical or Map View Useful?
• Creating a Map view
• Custom Geocoding
• WMS Maps
• Modify Locations
What does it mean?: Learn how to create map-based views of your location-related data.
Creating Visualizations
• Bar in Bar Chart
• Scatter Plots
• Histogram
• Heat Maps
• Highlighting in Tables
• Motion Charts
• Pie Chart
• Bullet Chart
• Box & Whisker Plot
• KPI Chart
• Market Basket Analysis Chart
• Pareto Chart
• Waterfall Chart
• Best Practice for Selecting Chart Type
• Case Study 1
• Case Study 2
What does it mean?: In this module you will learn how to create a number of charts, each with its own specific utility. Learn through multiple case studies.
Adding Statistics to Data
• Reference Lines
• Reference Bands
• Distribution Bands
• Trend Lines
• Forecasting
• Clustering
• Summary Card
• Case Study 1
What does it mean?: Understand how to add statistics to the charts and tables.
Formatting & Annotation
• Add Titles, Captions & Annotations
• Formatting Options - Fonts, Shading, Borders etc.
• Formatting Axes, Mark Labels and Legends
What does it mean?: Format your visualizations and make them more informative.
Dashboards & Stories
• What are Dashboards?
• Why and How are Dashboards Useful?
• Creating an Interactive Dashboard
• Adding Actions to a Dashboard
• Best Practices for Dashboard Design
• What is a Story?
• Creating a Story
• Adding a Background Image to a Story
• Case Study
What does it mean?: In this module you will learn how to create the end output of visualization in Tableau: entire dashboards & data-based storyboards to present to clients and management.
Module 4: Python For Data Analytics (Videos Only)
What is this module about?: This video-based Python tutorial teaches you the fundamentals
of Python and how to use Python for data handling, cleaning & analysis.

Tools to be learnt: Python


Class Duration: 8 hours of pre-recorded videos

Topic What does it mean?


Introduction to Machine Learning in Python
• What is machine learning & why is it so important?
• Applications of machine learning across industries
• Machine Learning methodology
• Machine Learning Toolbox
• Tool of choice - Python: what & why?
• How to approach a machine learning problem using a case study
• Course Components
What does it mean?: In this section we shall provide you with an overview of the world of machine learning (ML). You will learn about the various applications of machine learning and how companies from all sorts of domains are using it to solve their day-to-day and long-term business problems. We'll learn about the skill sets that make a machine learning expert capable of filling this vital role. Once the stage is set and we understand where we are heading, we discuss why Python is the tool of choice in ML.
Python Training
Introduction to Python
• Installation of Python framework and packages: Anaconda and pip
• Writing/Running python programs using Spyder, Command Prompt
• Working with Jupyter Notebooks
• Creating Python variables: Numeric, string and logical operations
• Basic Data containers: Lists, Dictionaries, Tuples & sets
• Importing & exporting data
• Practice assignment
What does it mean?: Python is one of the most popular & powerful languages for data science, used by top companies like Facebook, Amazon, Google and Yahoo. It is free and open source. This module is all about learning how to start working with Python. We shall teach you how to use the Python language to work with data.
Flow Control, Iterative Operations & Functions in Python
• Writing for loops in Python
• List & Dictionary Comprehension
• While loops and conditional blocks
• List/Dictionary comprehensions with loops
• Writing your own functions in Python
• Writing your own classes and functions as class objects
• Practice assignment
What does it mean?: This is where we move beyond simple data containers and learn about the possibilities and functionality offered by their associated operators. We are introduced to loops and to list and dictionary comprehensions. In addition to using existing functions and classes, we learn to write our own custom functions and classes. This module sets the stage for handling data and implementing ML algorithms in Python.
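The loops, comprehensions and custom functions named in this topic can be illustrated with a short sketch. The function name and the sample list below are hypothetical and are not taken from the course material.

```python
def word_lengths(words):
    """Map each word to its length using a dictionary comprehension."""
    return {word: len(word) for word in words}


tools = ["R", "SQL", "Tableau", "Python"]

# A for loop that builds a list step by step.
squares_loop = []
for n in range(5):
    squares_loop.append(n * n)

# The same result written as a list comprehension.
squares = [n * n for n in range(5)]

# A comprehension with a condition keeps only matching items.
long_names = [tool for tool in tools if len(tool) > 3]

lengths = word_lengths(tools)
print(squares, long_names, lengths)
```

A comprehension expresses the build-a-list loop in one line, which tends to read well when the transformation is simple; an explicit loop is often clearer once several steps are involved.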
Data Summary, Numerical and Visual EDA in Python
• Need & ways for data summary
• Introduction to pandas
• Summarising numeric data in Pandas
• Summarising categorical data
• Group wise summary of mixed data
• Need for visual summary
• Introduction to Seaborn
• Visual summary of different data combinations
• Practice Exercise
What does it mean?: Data summary is extremely important for understanding what the data is saying and gaining insights at a glance. Visualization of data is a strong point of Python, here using the Seaborn package.
Data Handling in Python using NumPy & Pandas
• Introduction to NumPy arrays, functions & properties
• Dataframe functions and properties
• Creating, Modifying, Renaming dataframe columns as Pandas series
• Sub-setting data on conditions, row and column indices, data types
• Creating flag features for categorical columns
• Imputing missing values, modifying column values with dictionary replacements
• Reading and writing external data
• Creating summary features
• Handling datetime data
• Manipulating data columns
What does it mean?: Python is a very versatile language, and in this module we expand on its capabilities related to data handling. Focusing on the NumPy and Pandas packages, we learn how to manipulate data & undertake feature engineering, which will eventually be useful in converting raw data into a form suitable for machine learning algorithms.

Contact Us
For further details & career counselling please reach out at:
 +91 8080928948
 info@edvancer.in
: https://edvancer.in/course/data-analytics-iitk-course-main/
