
SEHH1071

COMPUTATIONAL TOOLS FOR STATISTICS

INTRODUCTION TO DATA - ALL ABOUT DATA!
OUTLINE

• Basics and trends in data

• Big Data Analysis vs Traditional Data Analysis

• Application of Data Analysis


RECAP

Data                          Information
Stored                        Presented
Raw                           Processed
Technical based               Application based
Collected                     Analyzed
Not ready for understanding   Easy to understand
Input                         Output

WHAT IS SO SPECIAL ABOUT BIG DATA?
• Definition
– Big data refers to datasets whose size is beyond the capability of traditional software and tools
to capture, organize and process within a reasonable time frame

– Big data is characterized by high volume, high velocity and high variety, and requires data mining
and tools for unstructured data to extract patterns and information for decision making.

WHAT IS BIG DATA? WHY?

Some big data facts about Instagram (IG):

- >800 million Instagram users

- 500 million are active every day

- >30% of users look at their IG more than once every day

- >90 million photos and videos are shared every day

Diagram labels: Volume, Velocity, Variety

Beyond the capacity of the conventional database system
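
To make the velocity figure concrete, the daily upload count quoted above can be converted to a per-second rate. This is a minimal arithmetic sketch; the function name is illustrative, and because the slide gives ">90 million" the result is only a lower bound.

```python
# Lower-bound figure quoted on the slide (">90 million" uploads per day).
UPLOADS_PER_DAY = 90_000_000
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def uploads_per_second(uploads_per_day: int) -> float:
    """Convert a daily upload count into an average per-second rate."""
    return uploads_per_day / SECONDS_PER_DAY


if __name__ == "__main__":
    rate = uploads_per_second(UPLOADS_PER_DAY)
    print(f"At least about {rate:.0f} photos and videos per second")
```

An average of more than a thousand new items every second, around the clock, is the kind of sustained load the slide describes as beyond a conventional database system.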


BIG DATA VS TRADITIONAL DATABASE
• New source of data

• Data frequency

• Data/problem structure

• Not Only SQL (NoSQL)
– Unstructured (may not have a predefined schema, unlike a relational database)
– Expands horizontally when scale increases

• Relational database
– Data are stored in inter-related tables that contain rows and columns
– Foreign keys are used to reference other tables

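
The contrast between the two storage styles can be sketched in a few lines of Python. This is an illustration only: the table names, column names and sample records are made up, SQLite stands in for a relational database, and plain JSON records stand in for a schemaless store (no real NoSQL product is used).

```python
import json
import sqlite3


def relational_example() -> list:
    """Relational style: fixed columns, tables linked by a foreign key."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
    conn.execute(
        "CREATE TABLE posts ("
        "id INTEGER PRIMARY KEY, "
        "user_id INTEGER NOT NULL REFERENCES users(id), "  # foreign key
        "caption TEXT)"
    )
    conn.execute("INSERT INTO users (id, name) VALUES (1, 'alice')")
    conn.execute("INSERT INTO posts (user_id, caption) VALUES (1, 'sunset')")
    # The foreign key lets us join the two tables back together.
    rows = conn.execute(
        "SELECT users.name, posts.caption FROM posts "
        "JOIN users ON users.id = posts.user_id"
    ).fetchall()
    conn.close()
    return rows


def document_example() -> list:
    """Schemaless style: each record carries its own set of fields."""
    raw_records = [
        '{"user": "alice", "caption": "sunset", "hashtags": ["#sky"]}',
        '{"user": "bob", "video_seconds": 12, "geotag": "Hong Kong"}',
    ]
    return [json.loads(record) for record in raw_records]


if __name__ == "__main__":
    print(relational_example())
    for doc in document_example():
        print(sorted(doc.keys()))
```

In the relational version every post must fit the declared columns, while the two JSON records share only the `user` field and need no schema change to coexist.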


APPLICATION OF DATA ANALYSIS NOWADAYS
• Banking
– Customer spending pattern
– Credit risk analysis
• Communication and media
– Accurate and useful information delivery
– Pattern of information demand
• Healthcare
– Effective and efficient utilization of medical resources
• Education
– Analysis of learning through online platforms
• Government and charity
– Operation efficiency
• Transportation
– Routing
• Business
– Clustering and segmentation
– Recommendation

https://www.youtube.com/watch?v=rl7ZBqjB6MI
APPLICATION OF DATA ANALYSIS NOWADAYS
• Case 1: IG Story for marketing

– Goals:
• Use KPIs to analyze the effectiveness of
your advertisements on IG

– Inputs:
• Views
• Tags (Geotag, hashtag)
• Taps (tap back, tap forward)

– Outputs (e.g. Instagram insight)
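
As a rough illustration of turning the inputs above into KPIs, the sketch below divides tap counts by views. The `StoryStats` class and the two rate definitions are assumptions made for this example; they are not Instagram's official metrics, and the numbers are invented.

```python
from dataclasses import dataclass


@dataclass
class StoryStats:
    """Hypothetical raw counts for one IG Story."""
    views: int
    taps_forward: int
    taps_back: int


def tap_rates(stats: StoryStats) -> dict:
    """Express taps as a share of views (illustrative KPI definitions)."""
    if stats.views <= 0:
        raise ValueError("views must be positive")
    return {
        "tap_forward_rate": stats.taps_forward / stats.views,
        "tap_back_rate": stats.taps_back / stats.views,
    }


if __name__ == "__main__":
    story = StoryStats(views=200, taps_forward=50, taps_back=25)
    for name, value in tap_rates(story).items():
        print(f"{name}: {value:.1%}")
```

Under these assumed definitions, a high tap-forward rate could suggest viewers are skipping the advertisement, while a high tap-back rate could suggest they are replaying it.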


APPLICATION OF DATA ANALYSIS NOWADAYS
• Case 2: Anthropological research study
using social media photos

– Goals:
• Investigate the cultural phenomenon

– Inputs:
• Analysis of over 100 million photos
uploaded to social media

Source: https://www.technologyreview.com/s/608116/data-mining-100-million-instagram-photos-reveals-global-clothing-patterns/
APPLICATION OF DATA ANALYSIS NOWADAYS
• Case 3: Public Transport System in Jakarta

– Goals: Improving the public transport system in the city (e.g. bus scheduling)

– Inputs:
• Real-time GPS data from buses
• Passenger tap-in data

– Outputs:

• Real-time arrival times


• Congestion information
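
One simple way to see how a GPS reading could become an arrival-time estimate is sketched below. It is a deliberately naive illustration, not the method used in the Jakarta project: it assumes straight-line (great-circle) distance and a constant speed, ignoring the road network and congestion. All function names and coordinates are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in km between two (latitude, longitude) points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def eta_minutes(bus: tuple, stop: tuple, speed_kmh: float) -> float:
    """Naive arrival estimate: straight-line distance divided by speed."""
    if speed_kmh <= 0:
        raise ValueError("speed_kmh must be positive")
    return haversine_km(*bus, *stop) / speed_kmh * 60


if __name__ == "__main__":
    bus_position = (-6.2000, 106.8200)   # made-up GPS reading
    bus_stop = (-6.1800, 106.8300)       # made-up stop location
    print(f"Estimated arrival in {eta_minutes(bus_position, bus_stop, 20.0):.1f} min")
```

A real system would combine many such readings with passenger tap-in data and observed traffic speeds, which is where the congestion information listed above comes in.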



Source: Global Pulse, 2017. Using Big Data Analytics for Improved Public Transport
