Introduction
Classification
  Description: Predict whether a data point belongs to one of a set of predefined classes. The prediction is based on learning from a known data set.
  Algorithms: Decision trees, neural networks, Bayesian models, induction rules, k-nearest neighbors
  Examples: Assigning voters into known buckets by political party, e.g. soccer moms. Bucketing new customers into one of the known customer groups.

Regression
  Description: Predict the numeric target label of a data point. The prediction is based on learning from a known data set.
  Algorithms: Linear regression, logistic regression
  Examples: Predicting the unemployment rate for next year. Estimating insurance premiums.

Anomaly detection
  Description: Predict whether a data point is an outlier compared to the other data points in the data set.
  Algorithms: Distance based, density based, local outlier factor (LOF)
  Examples: Fraudulent transaction detection in credit cards. Network intrusion detection.

Time series
  Description: Predict the value of the target variable for a future time frame based on historical values.
  Algorithms: Exponential smoothing, ARIMA, regression
  Examples: Sales forecasting, production forecasting, virtually any growth phenomenon that needs to be extrapolated.

Clustering
  Description: Identify natural clusters within the data set based on inherent properties of the data.
  Algorithms: k-means, density-based clustering (DBSCAN)
  Examples: Finding customer segments in a company based on transaction, web, and customer call data.

Association analysis
  Description: Identify relationships within an itemset based on transaction data.
  Algorithms: FP-Growth, Apriori
  Examples: Finding cross-selling opportunities for a retailer based on transaction purchase history.
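To make the classification task above concrete, here is a minimal sketch of k-nearest neighbors applied to the "bucketing new customers into known customer groups" example. The feature names, the data points, and the helper `knn_predict` are all hypothetical and chosen only for illustration. A real project would more likely use a library implementation and scale the features first.

```python
from collections import Counter
from math import dist  # Euclidean distance, available in Python 3.8+


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest labeled points."""
    # Sort the known (labeled) points by Euclidean distance to the query
    # and keep the k closest ones.
    neighbors = sorted(train, key=lambda item: dist(item[0], query))[:k]
    # Count the class labels among those neighbors and return the most common.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical known data set:
# (annual spend in $1000s, store visits per month) -> customer group.
known_customers = [
    ((1.0, 1.0), "occasional"),
    ((1.5, 2.0), "occasional"),
    ((2.0, 1.5), "occasional"),
    ((8.0, 9.0), "frequent"),
    ((9.0, 8.5), "frequent"),
    ((8.5, 10.0), "frequent"),
]

new_customer = (7.5, 8.0)
print(knn_predict(known_customers, new_customer, k=3))  # prints "frequent"
```

The known data set plays the role of the "learning" step: there is no explicit model, and the prediction for a new point comes directly from the labels of the closest known points.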
Course outline

Process Basics

Core Algorithms
  Classification
    Decision Trees
    Rule Induction
    k-Nearest Neighbors
    Naïve Bayesian
    Artificial Neural Networks
    Support Vector Machines
  Clustering
    k-Means
    DBSCAN

Common Applications
Ten applications that build on the concepts of data science, exploring various
domains such as the following: