Professional Documents
Culture Documents
Introduction To Data Sceince
Introduction To Data Sceince
to
Data Science
Work is Worship 1
LEARNING OBJECTIVES
Work is Worship 2
What is Data and Information
Work is Worship 5
ACTIVITY
“Group the given data into Structured and Unstructured Data”.
Work is Worship 7
How to Structure Data in Python?
Example:
List1 = [80, 85, 90, 95, 100, 105, 110, 115, 120, 125]
print(List1)
Work is Worship 8
Types of Data
Data comes in different types.
Ex: Text ,Image, Video, Numbers, Spreadsheets, Sound.
Work is Worship 9
Types of Data
Quantitative Data
Continuous
Discrete Data
Data is
is Countable
Measurable
Discrete Continuous
Data Data
Weather Data
Education
Stock Market Data
Internet Search
Health Records
Entertainment
Social Media Posts
Augmented Furniture Shopping
E-Commerce
Casino Gambling
GPS Data
Magic Shows
Census Data
Traffic Data
Energy Consumption Data
Banking
Work is Worship 11
DIKW Model
Work is Worship 12
Examples of DIKW
Data Information Knowledge Wisdom
Knowledge is If the water is
Boiling Point of
1000c gained of water’s being touched,
water
boiling point. hands may burn.
Work is Worship 13
What are Data Foot Prints
Work is Worship 14
Active Data Passive Data
Work is Worship 16
What is Data Collection and Variables
Data Collection:
The method of gathering data for calculating and analyzing is known
as data collection.
Variable: A variable is an attribute of an object of study that may
vary for different cases.
Numerical variable
They represent values that have numbers. For Example, age, weight, height.
Categorical variable
These variables represent values that have words, for example, name, nationality,
sport, etc.
Work is Worship 17
What are Data Sources
Data sources can be classified into two types:
5V’s 3
5
www.kaggle.com
Work is Worship 19
Questioning Your Data
Work is Worship 20
Introduction to Data Science
Work is Worship 21
Careers in Data Science
Data Data Scientists are data enthusiasts who gather and analyze large sets of structured and
Scientist unstructured data. They analyze, process, and model data and later interpret the results to
create actionable plans for companies and organizations.
Business Business Intelligence Analysts use data to assess the market and find the latest business
Intelligence trends in the industry. This helps to develop a clearer picture of how a company should
Analyst shape its strategy.
Data Data Engineer examines not only the data for their own business but also that of third
Engineer parties. In addition to mining data, a data engineer creates robust algorithms to help
analyze the data further.
Data Data Architects work closely with users, system designers, and developers to create a
Architect blueprint that data management systems use to centralize, integrate and maintain the data
sources.
Senior Senior Data Scientists anticipate the business's needs in the future. Although they might
Data not be involved in gathering data, they play a high-level role in analyzing it.
Scientist
Work is Worship 22
Where is Data Science needed
For route planning: To discover the best routes to ship.
To foresee delays for flight/ship/train etc. (through predictive analysis).
To create promotional offers.
To analyze health benefit of training.
To predict who will win elections.
Data Science can be applied in nearly every part of a business where data
is available.
Consumer goods
Stock markets
Industry
Politics
E-commerce
Work is Worship 23
How does a Data Scientist work?
A Data Scientist requires expertise in several backgrounds:
Statistics, Programming (Python or R), Mathematics, Databases.
Ask the right questions : To understand the business problem.
Explore and collect data : From database, web logs, customer feedback.
Extract the data : Transform the data to a standardized format.
Clean the data : Remove erroneous values from the data.
Find and replace missing
values : Check for missing values and replace them with a
suitable value
Analyze data, find patterns and make future predictions.
Represent the result : Present the result with useful ways that the
"company" can understand.
Work is Worship 24
Database Table and Database Table Structure
Duration Average_Pulse Max_Pulse Calorie_Burnage Hours_Work Hours_Sleep
30 80 120 240 10 7
Work is Worship 25
Data Science & Python
Work is Worship 26
Python Libraries
Pandas- This library is used for structured data operations, like import
SciPy- This library has linear algebra modules. Scipy stands for
Work is Worship 28
Python DataFrame
Work is Worship 29
Data Science Functions
Ex:
import numpy as np
Calorie_burnage = [240, 250, 260, 270, 280, 290, 300, 310, 320, 330]
Average_calorie_burnage = np.mean(Calorie_burnage)
print(Average_calorie_burnage)
Work is Worship 30
Data Science Functions
Average_pulse_min = min(80, 85, 90, 95, 100, 105, 110, 115, 120, 125)
print (Average_pulse_min)
Work is Worship 31
Data Preparation
import pandas as pd
health_data = pd.read_csv("data.csv", header=0, sep=",")
print(health_data)
Work is Worship 32
Data Cleaning Functions
Remove Blank Rows - dropna() function
health_data.dropna(axis=0,inplace=True)
print(health_data)
Data Types Function – info() function
Ex: print(health_data.info())
astype() function
health_data["Average_Pulse"] = health_data['Average_Pulse'].astype(float)
health_data["Max_Pulse"] = health_data["Max_Pulse"].astype(float)
print (health_data.info())
Analyze the Data - describe() function
print(health_data.describe())
Work is Worship 33
Data Visualization
Data Visualization is the representation of data or information in a graph,
chart or other visual formats.
Charts
Graphs
Tables
Maps
Histograms
Work is Worship 34
Data Visualization
plt.plot(xpoints, ypoints)
plt.show()
Work is Worship 35
Data Visualization
Work is Worship 36
Data Visualization
Work is Worship 37
Data is the new science.
Big Data holds the answers.
Artificial Intelligence Controls the World!!!.
Work is Worship 38
Work is Worship 39