
JSPM’s

Jayawantrao Sawant College Of Engineering


Sr. No. 58, Handewadi Road, Hadapsar, Pune, Maharashtra 411028
Department of Electronics and Telecommunication Engineering

Experiment No. 1

Title: Basics of Data Science and Data Analytics


Aim: To study the basics of data science and data analytics
CO 1: Understand the basics of data science and its significance
ELO-1: The student will be able to install and use various libraries for data analytics
Objectives: After performing this experiment, the learner will be able to
1. Differentiate between data analysis and data analytics
2. Describe data and the various levels of data measurement
Pre-requisite:
Variable, Measurement and Data
•Variable – a characteristic of any entity being studied that is capable of taking on different values
•Measurement – a standard process used to assign numbers to particular attributes or characteristics of a variable
•Data – recorded measurements
Theory:
Data Generation
•Data can be generated by
–Humans,
–Machines or
–Human-machine combinations
•Data can be generated anywhere that information is produced and stored, in structured or
unstructured formats
Importance of Data
•Data helps one make better decisions
•Data helps one solve problems by finding the reasons for underperformance
•Data helps one evaluate performance
•Data helps one improve processes
•Data helps one understand consumers and the market
Data science is focused on making sense of complex datasets and in building predictive models from
those data. As such, it encompasses a wide array of different activities, from the upstream processes of
acquiring, cleaning and integrating data to downstream processes of analysis, modeling and prediction.
There are many facets of data science, including:
➢ Identifying the structure of data
➢ Cleaning, filtering, reorganizing, augmenting, and aggregating data
➢ Visualizing data
➢ Data analysis, statistics, and modeling
➢ Machine Learning
➢ Assembling data processing pipelines to link these steps
➢ Leveraging high-end computational resources for large-scale problems
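
The cleaning, filtering, and aggregating steps listed above can be sketched in a few lines of Python using only the standard library. The records, the field names (`city`, `temp_c`), and the 30.0 threshold are made-up illustration data and are not part of the lab material.

```python
from collections import defaultdict

# Hypothetical raw records; None marks a missing measurement.
raw_records = [
    {"city": "Pune", "temp_c": 31.0},
    {"city": "Pune", "temp_c": None},
    {"city": "Mumbai", "temp_c": 29.5},
    {"city": "Pune", "temp_c": 33.0},
    {"city": "Mumbai", "temp_c": 30.5},
]


def clean(records):
    """Cleaning: drop records whose measurement is missing."""
    return [r for r in records if r["temp_c"] is not None]


def filter_hot(records, threshold_c):
    """Filtering: keep records at or above an (assumed) threshold."""
    return [r for r in records if r["temp_c"] >= threshold_c]


def mean_by_city(records):
    """Aggregating: average temperature per city."""
    groups = defaultdict(list)
    for r in records:
        groups[r["city"]].append(r["temp_c"])
    return {city: sum(vals) / len(vals) for city, vals in groups.items()}


# A tiny pipeline that links the steps together.
cleaned = clean(raw_records)
hot = filter_hot(cleaned, threshold_c=30.0)
averages = mean_by_city(cleaned)
print(averages)
```

Keeping each step as its own small function makes it easy to reorder or replace a step, which is the idea behind assembling a data processing pipeline.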

SE [Electronics and Telecommunication] / 2022-23 Subject: Data Analytics Lab



Data science encompasses preparing data for analysis, including cleansing,
aggregating, and manipulating the data to perform advanced data analysis.
https://www.oracle.com/in/data-science/what-is-data-science/

Define data analytics and its types


Analytics is defined as “the scientific process of transforming data into insights for making
better decisions”.
•Analytics is the use of data, information technology, statistical analysis, quantitative methods, and
mathematical or computer-based models to help managers gain improved insight about their business
operations and make better, fact-based decisions. – James Evans
Why is analytics important?
Opportunities abound for the use of analytics and big data, such as:
1. Determining credit risk to deliver products and services
2. Developing new medicines
3. Finding more efficient ways
4. Preventing fraud
5. Uncovering cyber threats
6. Retaining the most valuable customers

Data analysis
•Data analysis is the process of examining, transforming, and arranging raw data in a specific way to
generate useful information from it
•Data analysis allows for the evaluation of data through analytical and logical reasoning to lead to some
sort of outcome or conclusion in some context
•Data analysis is a multi-faceted process that involves a number of steps, approaches, and diverse
techniques
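
As a minimal sketch of examining, transforming, and arranging raw data to generate useful information, the snippet below turns text readings into a small summary. The readings are hypothetical values chosen for illustration, and only Python's standard `statistics` module is assumed.

```python
import statistics

# Hypothetical raw readings as they might arrive from a log file (text).
raw_readings = ["12.5", "9.0", "15.5", "11.0"]

# Transform: convert the text values to numbers.
values = [float(x) for x in raw_readings]

# Arrange: sort so the smallest, largest, and middle values are easy to inspect.
ordered = sorted(values)

# Generate useful information: simple summary statistics.
summary = {
    "count": len(ordered),
    "min": ordered[0],
    "max": ordered[-1],
    "mean": statistics.mean(ordered),
    "median": statistics.median(ordered),
}
print(summary)
```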
Data analytics vs. Data analysis


Data Analytics & its elements

Classification of Data analytics

Based on the phase of workflow and the kind of analysis required, there are four major
types of data analytics: descriptive, diagnostic, predictive, and prescriptive.


Difference between Data analyst and Data Scientist

Types of Variables

Levels of Data Measurement


•Nominal — Lowest level of measurement
•Ordinal
•Interval
•Ratio — Highest level of measurement
1) Nominal
A nominal scale classifies data into distinct categories in which no ranking is implied.
Example: Gender, Marital Status
2) Ordinal
An ordinal scale classifies data into distinct categories in which ranking is implied.
Example:
•Product satisfaction ➔ Satisfied, Neutral, Unsatisfied
•Faculty rank ➔ Professor, Associate Professor, Assistant Professor
•Student grades ➔ A, B, C, D, F
3) Interval
An interval scale is an ordered scale in which the difference between measurements is a
meaningful quantity but the measurements do not have a true zero point.
Example: Temperature in Fahrenheit and Celsius, Year

4) Ratio
A ratio scale is an ordered scale in which the difference between the measurements is a
meaningful quantity and the measurements have a true zero point.
Example: Weight, Age, Salary
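
The practical meaning of a "true zero point" can be checked with a short calculation. In the sketch below, the temperature and weight values are arbitrary illustration data, and `KG_TO_LB` is an approximate conversion factor introduced only for this example. A ratio of two temperatures changes when the unit changes, so it carries no meaning, while a ratio of two weights stays the same.

```python
def celsius_to_fahrenheit(c):
    """Convert a Celsius reading to Fahrenheit."""
    return c * 9 / 5 + 32


# Interval scale: differences are meaningful, ratios are not.
c_low, c_high = 10.0, 20.0
f_low = celsius_to_fahrenheit(c_low)
f_high = celsius_to_fahrenheit(c_high)

ratio_celsius = c_high / c_low        # looks like "twice as hot"
ratio_fahrenheit = f_high / f_low     # same two temperatures, different ratio

# Ratio scale: weight has a true zero, so the ratio survives a unit change.
KG_TO_LB = 2.20462  # approximate conversion factor (assumption for this sketch)
w_low, w_high = 40.0, 80.0
ratio_kg = w_high / w_low
ratio_lb = (w_high * KG_TO_LB) / (w_low * KG_TO_LB)

print(ratio_celsius, ratio_fahrenheit)
print(ratio_kg, ratio_lb)
```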

Impact of choice of measurement scale


Data Level | Meaningful Operations | Statistical Methods
Nominal | Classifying and counting | Nonparametric
Ordinal | All of the above plus ranking | Nonparametric
Interval | All of the above plus addition and subtraction | Parametric
Ratio | All of the above plus multiplication and division | Parametric
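
The table can be read as a guide to which summary is safe at each level. The following is a rough sketch under that reading, using invented sample values and only the standard library: counting for nominal data, a rank-based median for ordinal data, a mean and a difference for interval data, and a ratio for ratio data. The name `SATISFACTION_ORDER` is a hypothetical helper for this example.

```python
import statistics
from collections import Counter

# Nominal: classifying and counting, so the mode is a valid summary.
marital_status = ["Single", "Married", "Married", "Single", "Married"]
counts = Counter(marital_status)
most_common_status = counts.most_common(1)[0][0]

# Ordinal: ranking is also allowed, so a median category can be found.
SATISFACTION_ORDER = ["Unsatisfied", "Neutral", "Satisfied"]  # low to high
responses = ["Satisfied", "Neutral", "Unsatisfied", "Satisfied", "Neutral"]
ranks = sorted(SATISFACTION_ORDER.index(r) for r in responses)
median_response = SATISFACTION_ORDER[ranks[len(ranks) // 2]]  # odd sample size

# Interval: addition and subtraction are allowed, so means and differences work.
temps_c = [20.0, 25.0, 30.0]
mean_temp = statistics.mean(temps_c)
temp_range = max(temps_c) - min(temps_c)

# Ratio: multiplication and division are allowed, so ratios are meaningful.
salaries = [30000, 60000]
salary_ratio = salaries[1] / salaries[0]

print(most_common_status, median_response, mean_temp, temp_range, salary_ratio)
```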

Conclusion: At the end of the experiment, we understood

1. __________________________________________________________________________
2. __________________________________________________________________________
3. __________________________________________________________________________
Questions:
1. What is data analysis?
2. What is the process of data analysis?
3. What are a list and a dictionary?
4. What is the difference between a list and a tuple?
5. Explain the differences between Python 2 and Python 3.
