Data Science Module1
By Gaurav Kumar
Data
Data are measurements or observations that are collected as a source of information.
From computer science come machine learning and high-performance computing technologies for dealing with scale.
From statistics comes a long tradition of exploratory data analysis, significance testing, and visualisation.
From application domains in business and the sciences come challenges worthy of battle and evaluation standards to assess when they have been adequately conquered.
Workflow
Obtain data that you hope will answer the question.
Big data refers to a huge volume of data that cannot be stored or processed by traditional data storage or processing units.
Big data refers to data sets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyse.
Traits of Big Data
Web scraping
https://data-lessons.github.io/library-webscraping-DEPRECATED/01-introduction/
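As a minimal sketch of what web scraping involves, the Python example below pulls the links out of a small HTML page using only the standard library. The sample HTML, the class name LinkExtractor, and the function extract_links are assumptions made up for illustration; they are not taken from the lesson linked above. A real scraper would first download the page and should respect the site's terms of use.

```python
from html.parser import HTMLParser

# Hypothetical page content, embedded so the example runs without a network.
SAMPLE_HTML = """
<html><body>
  <h1>Members</h1>
  <ul>
    <li><a href="/members/1">Alice Example</a></li>
    <li><a href="/members/2">Bob Example</a></li>
  </ul>
</body></html>
"""


class LinkExtractor(HTMLParser):
    """Collect the href and visible text of every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []            # list of (href, text) pairs
        self._current_href = None  # href of the <a> tag being read, if any
        self._text_parts = []      # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._current_href = dict(attrs).get("href")
            self._text_parts = []

    def handle_data(self, data):
        # Only keep text that appears inside a link.
        if self._current_href is not None:
            self._text_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current_href is not None:
            text = "".join(self._text_parts).strip()
            self.links.append((self._current_href, text))
            self._current_href = None


def extract_links(html):
    """Parse an HTML string and return its links as (href, text) pairs."""
    parser = LinkExtractor()
    parser.feed(html)
    return parser.links


for href, text in extract_links(SAMPLE_HTML):
    print(href, text)
```

The same idea scales up: obtain the HTML, locate the elements that hold the data, and store the extracted values in a structured form such as a list or a table.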
Reporting vs Analysis
Reporting: Collecting data in one place and presenting it in visual representations.
Analysis: Interpreting your data and giving it context.
Data analysis is a more difficult task than data reporting because it requires knowledge of models, whereas reports are easier to grasp (e.g. charts and dashboards are reports, not analysis reports). Analysis understands the data and makes action suggestions.
In reporting, data is organised into summaries, whereas analysis involves inspecting, cleaning, and transforming data before creating models.
Reporting translates data into information, whereas analysis turns information into insights.
Reporting allows users to ask 'What' questions about the data. Ex: 'What is the average performance of our sales team?' Analysis should go further and ask 'What can we do about it?'
https://bidataintel.com/2021/05/data-analysis-and-reporting-differences/
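The difference between reporting and analysis can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sales records, the names report_average_deals, clean, and fit_line, and the choice of a straight-line model are all hypothetical and do not come from the article above.

```python
from statistics import mean

# Hypothetical records: (sales rep, calls made, deals closed).
# None marks a missing value that must be handled before modelling.
records = [
    ("Asha", 10, 2),
    ("Asha", 20, 4),
    ("Ravi", 30, 6),
    ("Ravi", None, 5),
    ("Meena", 40, 8),
]


def report_average_deals(rows):
    """Reporting: organise the data into a summary (average deals per rep)."""
    by_rep = {}
    for rep, _calls, deals in rows:
        by_rep.setdefault(rep, []).append(deals)
    return {rep: mean(values) for rep, values in by_rep.items()}


def clean(rows):
    """Analysis, step 1: inspect and clean by dropping incomplete rows."""
    return [row for row in rows if None not in row]


def fit_line(rows):
    """Analysis, step 2: fit deals = slope * calls + intercept (least squares)."""
    xs = [calls for _rep, calls, _deals in rows]
    ys = [deals for _rep, _calls, deals in rows]
    x_bar, y_bar = mean(xs), mean(ys)
    numerator = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    denominator = sum((x - x_bar) ** 2 for x in xs)
    slope = numerator / denominator
    return slope, y_bar - slope * x_bar


# Reporting answers a 'What' question about the data.
print("Average deals per rep:", report_average_deals(records))

# Analysis cleans the data and builds a model that can suggest an action.
slope, intercept = fit_line(clean(records))
print("Estimated extra deals per additional call:", slope)
```

The report stops at the summary, while the analysis produces an estimate (extra deals per additional call) that could support a decision, for example whether to increase call volume.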