Professional Documents
Culture Documents
ML and AI Program
ML and AI Program
Landmark: Off Porwal Road, Near Sharda Super Market, Nimbalkar Nagar, Dhanori, Pune.
Project Details: 2 Tower of G+11 Floors,Final stage & 2nd Slab completed….Possession December 2015 & March 201
Product Details: Vitrified Flooring,4 Elevators,Gym,Play Ground Eqpts,Landscaped Garden.
Pune-411001.
ession December 2019.
r Kitchen,STP,Swimming Pool,Gym,Play Garden Eqpt.
cember 2016.
tor,Rainwater Harvesting,STP,ACC Blocks,SS Railings,False ceiling.
Fundamentals of R
Univariate statistics in R
Data visualization in R
Predictive analytics in R
1. Correlation and Linear regression
Correlation
Simple linear regression
Multiple linear regression
Model diagnostics and validation
Case study
2. Logistic regression
Hierarchical clustering
K-means clustering
Deciding number of clusters
Case study
5. Decision Trees
What are decision trees?
Entropy
Gini impurity index
Decison trees algorithms
ID3
C4.5
CART
CHAID
Regression trees
Introduction to Python
Linear Regression
Regularisation of Generalised Linear Models
Ridge and Lasso Regression
Logistic Regression
Case Study
Tree Models using Python
Factor Analysis
Case study
In this section we shall provide you an overview into the world of analytics.
You will learn about the various applications of analytics, how companies
are using analytics to prosper and study the analytics cycle.
This is where you shall learn how to start understanding the story your data
is narrating by summarizing the data, checking its variability and shape. We
shall take you through various ways of doing this using the R language and
also solve a real-world case study
Real world data is rarely going to be given to you perfect on a platter. It will
always be dirty with missing data points, incorrect data, variables needing to
be changed or created in order to analyze etc. A typical analytics project will
have 60% of its time spent on preparing data for analysis. This is a crucial
process as properly cleaned data will result in more accurate and stable
analysis. We shall teach you all the techniques required to be successful in
this aspect.
Learn why and how to statistically divide a broad customer market into
various segments of customers who are similar to each other so as to be
able to better target and meet their needs in a cost effective manner. This is
one of the most essential techniques in marketing analytics.
The ability to forecast into the future is very important for any business and
it is necessary to have as accurate a forecasting as possible for corporate
planning for finance, sales, marketing, strategy etc. In this module learn the
techniques of forecasting without being mis-led by seasonal and cyclical
impacts.
Decision trees are one of the most popular classification and prediction
methods for helping in decision making. Learn the various decision tree
algorithms and learn how to create a decision tree model.
In this section we shall provide you an overview into the world of data
science & machine learning. You will learn about the various applications of
data science, how companies from all sort of domains are solving their day
to day to long term business problems. We’ll learn about required skill sets
of a data scientist which make them capable of filling up this vital role. Once
the stage is set and we understand where we are heading we discuss why
Python is the tool of choice in data science.
Python is one of the most popular & powerful languages for data science
used by most top companies like Facebook, Amazon, Google, Yahoo etc. It is
free and open source. This module is all about learning how to start working
with Python. We shall teach you how to use the Python language to work
with data.
This is where you shall learn the functionalities and powerful capabilities of
Python that will make it easy for you to work with data and set the stage for
using Python for machine learning & data science.
Case Studies:
Case Studies: In the class we continue with the case studies taken in
previous module of simple linear models and see how the tree based
models compare in terms of performance in comparison to the linear
models. In take home exercises we have two case studies:
Capture risks associated with micro loans: In the 1st exercise you will work
on micro loans. Its inherently risky to hand out micro loans because of lack
of checks in the natural process of micro loans. and in this case study we try
to capture risk associated with these micro loans.
How do the tech specifications of a vehicle impact its emissions? In the 2nd
case study we find out effect of technical design specification of a vehicle on
average emission and thus its environmental impact.
Case Studies:
Predicting annual income based on census data: In the take home exercise,
find out whether someone is going to have annual income higher than a
certain amount just by simple census data and thus identifying potential
fraud cases when it comes to filing their taxes.
We step in a powerful world of “observation based algorithms” which can
capture patterns in the data which otherwise go undetected. We start this
discussion with KNN which is fairly simple. After that we move to SVM which
is very powerful at capturing non-linear patterns in the data.
Case Study: Since KNN and SVM take a lot of processing time, we have kept
the class discussion case study simple. Same implementation steps can be
used to work on any complex business problem as well.
Many machine learning algos become difficult to work with when dealing
with many variables in the data. We will learn methods which help solve this
problem and also clustering techniques. Case Studies:
Car Survey Data: We take up car survey data which contains technical &
price detail of vehicles through 11 numeric variables. We’ll see if these 11
variables represent any hidden factors representing different properties of a
vehicle.
Customer spend data at a retail chain: For DBSCAN we see how DBSCAN can
be used for anomaly detection using expense data of customers from a
retail chain.
Text data forms a big chunk of data available in the world today. Analysing
text data can give a business very powerful insights to take advantage of.
Python provides very useful ways to scrape data from the web or extract
data from social media sites using APIs and then analyse the data. Case
Studies: