Professional Documents
Culture Documents
Data Sceince Ppt (Copy 3)
Data Sceince Ppt (Copy 3)
Presented By
Kiran Shivanand Totager
4HG21CS021
TABLE OF CONTENTS
Introduction to data science
Python for data science
Introduction to CSV file & pandas
Descriptive statistics
Histograms
Introduction to probability
Machine learning
Conclusion
INTRODUCTION
Data Science is a multi-disciplinary field
that uses scientific methods, algorithms,
and systems to extract knowledge and
insights from structured and unstructured
data.
Applications
Search Engines
Digital Advertising
Recommendation Systems
Image Recognition
PYTHON FOR DATASCIENCE
Operators
Variable & variable naming
conventions
Data types in python
Conditional statements
Looping statements
Functions
Packages in Python
INTRODUCTION TO CSV FILE & PANDAS
CSV file:
Comma Separated Values file is way of storing
information in tabular format within text file.
Pandas:
Pandas is python library .it is used for
key concepts
Measures of Measures of Types of Data:
Central Tendency: Variability:
Mean Range Categorical
Median Variance Numerical
Mode Standard Deviation
HISTOGRAMS
histogram is a graphical representation of data using bars of
different heights. It summarizes a data set by dividing it into
bins and plotting the count of data points in each bin.
PURPOSE :
VISUAL SUMMARY : provides a visual summary of the distribution and
frequency of data values.
COMPARISON : It allows you to compare measurements to
specifications.
Bernoulli Trials:
A Bernoulli trial is a random experiment
with precisely two possible outcomes:
success and failure.
Predictive modeling :
combines AI and historical data to predict
future outcomes accurately.
predictive modeling steps :
Define Your Data
Data Cleaning and
Objective Collection
Preparation