LPR - Week 1 070253 Final Edit
REVIEW
Datalchemist
Classification - Internal
MEET US!
ADAM YAZLI PUTRA
AJENG YUNITA
OUTLINE
• Data Analytics
• Data Analytics Overview
• Data Analytics Workflow and CRISP-DM Framework
• Level of Analytics
• Data Analytics Practitioner
OVERVIEW
• Data: Conceived of symbols and signs, usually sourced from Information Technology.
• Data Engineering and Data Warehousing help us convert Data to Information.
• Data Analytics (statistical methods, descriptive and diagnostic analysis, predictive models, etc.) helps us convert Information to Knowledge by visualizing and taking insights.
• Decision making based on data, or even better, data-driven decision making, gives us Wisdom from the Knowledge.
DATA ANALYTICS WORKFLOW & CRISP DM FRAMEWORK
The data analytics workflow typically consists of several key stages, each
contributing to the overall process of deriving insights from data.
A common framework includes the following stages:
Data Collection:
• Determine the sources of data needed for analysis.
• Collect and gather the required data.
Communicate Results:
• Present the findings to relevant stakeholders, providing insights to support decision-making processes.
DATA ANALYTICS WORKFLOW & CRISP DM FRAMEWORK
CRISP-DM, which stands for CROSS-INDUSTRY STANDARD PROCESS FOR DATA MINING, is a widely used process model for data mining and analytics projects.
It provides a structured approach to guide practitioners through the stages of a data mining project, from understanding the business problem to deploying the final model.
Deployment: Develop a strategy for deploying the model into the business environment.
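The stages that CRISP-DM walks through can be written down as an ordered list. The sketch below is illustrative only: the six phase names follow the published CRISP-DM model rather than the text of this slide, and the class and helper names (`CrispDmPhase`, `next_phase`) are hypothetical. Real projects often loop back to earlier phases, which this simple linear helper does not model.

```python
from enum import IntEnum


class CrispDmPhase(IntEnum):
    """The six CRISP-DM phases in their usual order."""
    BUSINESS_UNDERSTANDING = 1
    DATA_UNDERSTANDING = 2
    DATA_PREPARATION = 3
    MODELING = 4
    EVALUATION = 5
    DEPLOYMENT = 6


def next_phase(phase):
    """Return the phase that usually follows, or None after deployment."""
    if phase is CrispDmPhase.DEPLOYMENT:
        return None
    return CrispDmPhase(phase + 1)


# Print the phases from business understanding through deployment.
for phase in CrispDmPhase:
    print(phase.value, phase.name.replace("_", " ").title())
```

Using an ordered enumeration keeps the sequence explicit, from understanding the business problem to deploying the final model.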
LEVEL OF ANALYTICS
• Example: Analyzing factors that led to a spike or drop in sales (advanced statistics)
• Example: Sales forecasting (statistical learning)
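The sales forecasting example can be illustrated with one of the simplest statistical learning methods, a least-squares trend line. This is a minimal sketch, not the method used in the course: the monthly sales figures are made up for illustration, and `fit_line` is a hypothetical helper.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical monthly sales (illustrative numbers only).
months = [1, 2, 3, 4]
sales = [10.0, 12.0, 14.0, 16.0]

slope, intercept = fit_line(months, sales)

# Extend the fitted trend one month ahead.
forecast_month_5 = intercept + slope * 5
print(forecast_month_5)
```

A real forecast would also need to account for seasonality and uncertainty, which a straight trend line ignores.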
DATA PRACTITIONER
Data Engineering
• Handle large-scale data processing and ensure data availability.
• Implement ETL (Extract, Transform, Load) processes to clean, transform, and integrate data.
• Set up and manage the infrastructure for data storage and processing.
• Work closely with data scientists, analysts, and other stakeholders to understand data requirements and provide the necessary infrastructure and tools for analysis.
Tools: SQL, Excel, SAS, Pentaho, Spark, Hadoop, Domain Knowledge

Visualization and Analytics
• Utilize statistical analysis and visualization techniques to present findings in a meaningful way.
• Design and create dashboards and reports that visually represent key performance indicators (KPIs) and other relevant metrics.
• Focus on descriptive analytics and summary reports.
• Descriptive, diagnostic
Tools: SQL, Excel, R, Python, Tableau, Power BI, Google Data Studio, Domain Knowledge

Machine Learning
• Problem solving
• Business decision strategy
• Analyze data, find root causes, and recommend business decisions.
• Define future business strategy.
• Descriptive, Prescriptive and Predictive
Tools: SQL, Excel, SAS, Pentaho, Spark, Hadoop, Domain Knowledge
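The ETL (Extract, Transform, Load) step mentioned for data engineering can be sketched in a few lines. This is a minimal illustration using only the Python standard library; the CSV content, the `orders` table, and the function names are all hypothetical, and production pipelines would typically use dedicated tools such as those listed above.

```python
import csv
import io
import sqlite3

# Hypothetical raw input with messy spacing, mixed case, and a missing amount.
RAW_CSV = """order_id,region,amount
1, north ,100.50
2,SOUTH,
3,North,20
"""


def extract(text):
    """Extract: read CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows without an amount and normalize the region."""
    cleaned = []
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # skip incomplete records
        region = row["region"].strip().lower()
        cleaned.append((int(row["order_id"]), region, float(amount)))
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into a database table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)

# Once loaded, the data is ready for analysis with SQL.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM orders GROUP BY region"
).fetchall()
print(totals)
```

Keeping the three steps as separate functions makes each one easy to test and replace independently.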
OVERVIEW
Understanding Excel and Function
• Text formatting
• Cell formatting
• Column and Row
• Cell
• Worksheets
IF
• Performs a logical test and returns one value if the test is true and another if false.
COUNTIF
• Counts the number of cells within a range that meet the given condition.
SUMIF
• Adds up all the numbers in a range that meet a specified condition.
IF:
• It performs a logical test and returns one value if the test is true and another value if the test is false.
• Syntax: =IF(logical_test, value_if_true, value_if_false)
• Example: =IF(A1>10, "Yes", "No")
OR:
• The OR function returns TRUE if at least one of the conditions specified is true; otherwise, it returns FALSE.
• Syntax: =OR(condition1, condition2, ...)
• Example: =OR(A1>10, B1<20)
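For readers who also work in Python, the behavior of the IF and OR examples can be mirrored with two small helpers. This is only a sketch: `excel_if` and `excel_or` are hypothetical names, and the cell values assigned to `a1` and `b1` are made up.

```python
def excel_if(logical_test, value_if_true, value_if_false):
    """Mirror =IF(logical_test, value_if_true, value_if_false)."""
    return value_if_true if logical_test else value_if_false


def excel_or(*conditions):
    """Mirror =OR(condition1, condition2, ...): True if any condition holds."""
    return any(conditions)


# Hypothetical cell values standing in for A1 and B1.
a1, b1 = 15, 25

print(excel_if(a1 > 10, "Yes", "No"))  # like =IF(A1>10, "Yes", "No")
print(excel_or(a1 > 10, b1 < 20))      # like =OR(A1>10, B1<20)
```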
NOT:
TEXT FUNCTIONS
CONCATENATE (or CONCAT):
• Combines two or more text strings into one.
• Syntax: =CONCATENATE(text1, [text2], ...)
• Example: =CONCATENATE(A1, " ", B1)
LOWER:
• Converts all letters in a text string to lowercase.
• Syntax: =LOWER(text)
• Example: =LOWER(A1)
LEFT:
• Returns a specified number of characters from the beginning of a text string.
• Syntax: =LEFT(text, num_chars)
• Example: =LEFT(A1, 5)
RIGHT:
MID:
• Returns a specific number of characters from a text string, starting at a specified position.
• Syntax: =MID(text, start_num, num_chars)
• Example: =MID(A1, 3, 4)
LEN:
SUBSTITUTE:
• Replaces occurrences of a specified substring with another substring in a text string.
• Syntax: =SUBSTITUTE(text, old_text, new_text, [instance_num])
• Example: =SUBSTITUTE(A1, "apple", "orange")
TRIM:
• Removes all spaces from a text string except for single spaces between words.
• Syntax: =TRIM(text)
• Example: =TRIM(A1)
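The text functions above have close counterparts in Python string operations. The helpers below are a rough sketch with hypothetical names; they ignore Excel edge cases such as error values, and `substitute` leaves out the optional instance_num argument.

```python
def concatenate(*texts):
    """Like =CONCATENATE(text1, [text2], ...)."""
    return "".join(texts)


def lower(text):
    """Like =LOWER(text)."""
    return text.lower()


def left(text, num_chars):
    """Like =LEFT(text, num_chars)."""
    return text[:num_chars]


def mid(text, start_num, num_chars):
    """Like =MID(text, start_num, num_chars); start_num is 1-based."""
    start = start_num - 1
    return text[start:start + num_chars]


def substitute(text, old_text, new_text):
    """Like =SUBSTITUTE(text, old_text, new_text) without instance_num."""
    return text.replace(old_text, new_text)


def trim(text):
    """Like =TRIM(text): keep only single spaces between words."""
    return " ".join(part for part in text.split(" ") if part)


print(concatenate("Data", " ", "Analytics"))
print(left("Analytics", 5))
print(mid("Analytics", 3, 4))
print(substitute("apple pie", "apple", "orange"))
print(trim("  data   driven "))
```

Note that Excel counts character positions from 1, while Python slices count from 0, which is why `mid` subtracts one from start_num.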
LOGICAL FUNCTIONS
SUM
• Adds up all the numbers in a range.
• Example: =SUM(A1:A10)
SUMIF
• Adds up all the numbers in a range that meet a specified condition.
• Example: =SUMIF(A1:A10, ">10")
COUNT
• Counts the number of cells that contain numbers in a range.
• Example: =COUNT(C1:C8)
COUNTIF
• Counts the number of cells within a range that meet the given condition.
• Example: =COUNTIF(C1:C10, ">50")
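The four aggregation functions above can be approximated in Python to make their differences concrete. This is a simplified sketch: the `cells` list is made-up data, the helper names are hypothetical, and the conditional helpers only handle numeric conditions passed as Python functions rather than Excel criteria strings such as ">10".

```python
# Hypothetical range contents: numbers mixed with text and an empty cell.
cells = [5, 12, "n/a", 30, None, 8]


def is_number(value):
    """True for numeric cells only (text, blanks, and booleans are skipped)."""
    return isinstance(value, (int, float)) and not isinstance(value, bool)


def excel_sum(values):
    """Like =SUM(range): add up the numeric cells."""
    return sum(v for v in values if is_number(v))


def excel_count(values):
    """Like =COUNT(range): count the cells that contain numbers."""
    return sum(1 for v in values if is_number(v))


def sumif(values, condition):
    """Like =SUMIF(range, criteria) for a numeric condition."""
    return sum(v for v in values if is_number(v) and condition(v))


def countif(values, condition):
    """Like =COUNTIF(range, criteria) for a numeric condition."""
    return sum(1 for v in values if is_number(v) and condition(v))


print(excel_sum(cells))
print(excel_count(cells))
print(sumif(cells, lambda v: v > 10))
print(countif(cells, lambda v: v > 10))
```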
IF
AVERAGE
Analysis with Excel
LOOKUP FUNCTION
• The LOOKUP function is one of the basic lookup and reference functions in Microsoft Excel.
VLOOKUP & HLOOKUP
VLOOKUP
HLOOKUP
INDEX & MATCH
INDEX
The INDEX function is used to return a value or the reference to a value from within a table or range. There are two ways to use the INDEX function: the array form returns the value of a specified cell or array of cells, and the reference form returns a reference to specified cells.
MATCH
The MATCH function is used to locate the position of a lookup value in a row, column, or table. The function searches for a specified item in a range of cells, and then returns the relative position of that item in the range.
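INDEX and MATCH are usually combined: MATCH finds the position of a lookup value, and INDEX returns the value at that position. The Python sketch below imitates that pattern under a few assumptions: the product table is made up, the helper names are hypothetical, `match` only covers exact matching (match_type 0 in Excel), and `index` only covers the array form.

```python
def match(lookup_value, lookup_array):
    """Like =MATCH(lookup_value, lookup_array, 0): 1-based exact position."""
    for position, value in enumerate(lookup_array, start=1):
        if value == lookup_value:
            return position
    raise ValueError(f"{lookup_value!r} not found")  # Excel would show #N/A


def index(array, row_num, column_num=1):
    """Like the array form of =INDEX(array, row_num, column_num), 1-based."""
    return array[row_num - 1][column_num - 1]


# Hypothetical table: product ID, product name, price.
table = [
    ["P001", "Laptop", 1200],
    ["P002", "Mouse", 25],
    ["P003", "Monitor", 300],
]
product_ids = [row[0] for row in table]

# Equivalent in spirit to =INDEX(A1:C3, MATCH("P002", A1:A3, 0), 3)
row = match("P002", product_ids)
price = index(table, row, 3)
print(row, price)
```

Separating the position lookup from the value lookup is what lets INDEX and MATCH search in any column, not only the first one.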
Chart And Visual
A chart or graph is a visual representation of data that helps to convey information in a clear and concise manner. There are many different types of charts and graphs, each with its own strengths and weaknesses.
VISUALIZATION TYPE
• SCATTER PLOT
• MAP CHART
• TREEMAP