Data Science Hindi


Data Science

Proprietary content. © Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
What is Data?

Data is just a collection of facts.

Examples: "This is Sparta", 23, 8000

Data Back Then

Small

Structured

Data Now

Huge

Un-structured

Data Mining Use Case

Data Mining Life Cycle

Data Acquisition → Data Pre-Processing → Machine Learning Algorithm → Pattern Evaluation → Knowledge Representation

Data Acquisition

Data from Multiple Sources → Data Storage → Target Data

Data Pre-processing

Data Manipulation

Data Visualization

Machine Learning

Application of intelligent algorithms to find meaningful information from the data

Classification

Regression

Clustering

Pattern Evaluation

Once Data Mining techniques have been applied, the results have to be evaluated
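
One common way to evaluate results, sketched here for a Yes/No prediction task, is to compare predicted labels with the actual ones and compute accuracy, precision, and recall. The function name `evaluate` and the label lists are made up for illustration and do not come from the slides.

```python
def evaluate(actual, predicted, positive="Yes"):
    """Compare predicted labels with actual labels and return basic metrics."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == positive and p == positive)  # true positives
    tn = sum(1 for a, p in pairs if a != positive and p != positive)  # true negatives
    fp = sum(1 for a, p in pairs if a != positive and p == positive)  # false positives
    fn = sum(1 for a, p in pairs if a == positive and p != positive)  # false negatives
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }

# Hypothetical labels, for illustration only.
actual = ["Yes", "No", "Yes", "Yes", "No", "No"]
predicted = ["Yes", "No", "No", "Yes", "Yes", "No"]

metrics = evaluate(actual, predicted)
print(metrics)
```

Accuracy alone can mislead when one class is rare, which is why precision and recall are reported alongside it in this sketch.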

Knowledge Representation

The identified patterns must be represented using simple, aesthetic graphs

Data Mining Tasks (Anomaly Detection)

Identifying unusual patterns, or outliers, helps in understanding the variation in the data.
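
As a minimal sketch of one simple outlier check (a z-score rule, which is only one of several possible techniques and is not named in the slides), the snippet below flags values that lie more than a chosen number of standard deviations from the mean. The function name `find_outliers`, the threshold of 2.0, and the readings are assumptions made for illustration.

```python
import statistics

def find_outliers(values, threshold=2.0):
    """Return values whose z-score magnitude exceeds the threshold."""
    mean = statistics.mean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return []  # all values identical, so nothing stands out
    return [v for v in values if abs(v - mean) / stdev > threshold]

# Hypothetical sensor readings with one unusual value.
readings = [10, 12, 11, 13, 12, 11, 95]
outliers = find_outliers(readings)
print(outliers)
```

A single extreme value also inflates the mean and standard deviation, so on small samples a rule based on the median or the interquartile range may behave better.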

Data Mining Tasks (Association Rule Mining)

This method is used for finding interesting associations among different entities.

Example: the "beer and diapers" syndrome
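
To make the beer and diapers example concrete, the sketch below computes two standard association-rule measures, support and confidence, over a handful of shopping baskets. The baskets are invented for illustration, and the helper names `support` and `confidence` are assumptions rather than anything defined in the slides.

```python
# Hypothetical shopping baskets, for illustration only.
transactions = [
    {"beer", "diapers", "chips"},
    {"beer", "diapers"},
    {"milk", "diapers"},
    {"beer", "chips"},
    {"milk", "bread"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Estimated probability of the consequent given the antecedent."""
    both = support(set(antecedent) | set(consequent), transactions)
    base = support(antecedent, transactions)
    return both / base if base else 0.0

rule_support = support({"diapers", "beer"}, transactions)
rule_confidence = confidence({"diapers"}, {"beer"}, transactions)
print(rule_support, rule_confidence)
```

Here the rule "diapers, therefore beer" holds in two of the three baskets that contain diapers.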

What is Machine Learning?

Fish

Categories of Machine Learning

Supervised Learning

Unsupervised Learning

Supervised Learning

Input Variable X

Y = f(X)

Output Variable Y

Categories of Supervised Learning

Supervised Learning

Regression

Classification

Classification

Classification is the process of predicting the class of a new observation.

Example: Smoke (Yes/No) → Cancer (Yes/No)
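
A very small classifier for a Yes/No input and a Yes/No class can be sketched as a majority vote: for each input value, predict the class seen most often with it in the training data. This is a deliberately simple stand-in for a real algorithm; the names `train` and `predict` are assumptions, and the records are entirely made up and carry no medical meaning.

```python
from collections import Counter, defaultdict

def train(rows):
    """Learn the most frequent class label for each input value."""
    counts = defaultdict(Counter)
    for x, y in rows:
        counts[x][y] += 1
    return {x: c.most_common(1)[0][0] for x, c in counts.items()}

def predict(model, x):
    """Predict the class of a new observation from its input value."""
    return model[x]

# Made-up (smoke, cancer) records, for illustration only.
rows = [
    ("Yes", "Yes"), ("Yes", "Yes"), ("Yes", "Yes"), ("Yes", "No"),
    ("No", "No"), ("No", "No"), ("No", "No"), ("No", "Yes"),
]

model = train(rows)
print(predict(model, "Yes"), predict(model, "No"))
```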

Regression

This method is used to estimate the relationship between a dependent variable and one or more independent variables.

Dependent Variable

Independent Variable

Y = f(X)
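
As a sketch of the simplest case, where f is assumed to be a straight line Y = slope * X + intercept, ordinary least squares estimates the two coefficients from paired observations. The function name `fit_line` and the sample data are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Fit Y = slope * X + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the common 1/n factor cancels).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data that lies exactly on Y = 2X + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

slope, intercept = fit_line(xs, ys)
print(slope, intercept)

# Use the fitted line to estimate Y for a new X.
estimate = slope * 6 + intercept
```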

Unsupervised Learning (Clustering)

Input Data with no class labels
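
One widely used clustering technique is k-means, which the slides do not name; the sketch below is a bare-bones version for 2-D points. It assumes the first k points serve as starting centroids (real implementations usually pick them at random), and the function name `kmeans` and the points are made up for illustration.

```python
def kmeans(points, k, iterations=10):
    """Group 2-D points into k clusters; the first k points seed the centroids."""
    centroids = list(points[:k])
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(
                range(k),
                key=lambda i: (p[0] - centroids[i][0]) ** 2
                + (p[1] - centroids[i][1]) ** 2,
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its points.
        centroids = [
            (sum(p[0] for p in c) / len(c), sum(p[1] for p in c) / len(c))
            if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters

# Unlabeled, hypothetical points forming two visible groups.
points = [(1, 1), (9, 9), (1, 2), (2, 1), (8, 9), (9, 8)]
centroids, clusters = kmeans(points, k=2)
print(clusters)
```

No class labels are supplied anywhere; the two groups emerge only from the distances between points.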
