Professional Documents
Culture Documents
Crisp DM Framework: Data Mining Tasks: Description Estimation Prediction Classification Clustering Association
Crisp DM Framework: Data Mining Tasks: Description Estimation Prediction Classification Clustering Association
1
TOPICS
Topics
Business Objectives and Data Mining Problem
Data Preprocessing
Exploratory Data Analysis
Simple Linear Regression
Multiple Linear Regression
Logistic Regression
Model Evaluation Techniques
Integration of Predictive and Prescriptive
Analytics
Decision Trees
Ensemble Methods: Bagging and Boosting
Model Development
Conduct Exploratory
Derive and Analyze Data Analysis
Explore the data
Descriptive Statistics
Define functional
Perform Estimate regression form of the
Diagnostic Tests parameters relationship
NO
Model satisfies
diagnostic test Validate
YES the STOP
model(s)
R
4
Graphical output plot(),hist(),barplot()
par(mfrow=c(2,2))
Mathematical Function sqrt(),log(),min(), max()
Statistical summary(), mean(),sd(),quantile(), cor()
Data generation
sample()
set.seed()
Missing na.strings = c(“”,””)
na.omit(df)
is.na()
complete.cases(df)
Data input read.csv(strip.white,stringsAs Factors,header)
readxl()
anyDuplicated(df)
duplicated(df)
5
Models Commands Problems/ Case Studies Concepts