Professional Documents
Culture Documents
Big Data Intro PDF
Big Data Intro PDF
o Fundamentals
o Challenges
o Opportunities & Applications
Fundamentals:
Big Data
There are some things that are so
big that they have implications for
everyone, whether we want it or
not.
• High-velocity
– Speed rate in collecting or acquiring or generating or
processing of data
• High-variety
– Different data type such as audio, video, image data
The Currently
three V’s of big more
three data constitute a
V’s are also
comprehensive definition
considered
Variety Velocity
Volume
11
Big Data
Volume…
…refers to the vast amounts of data generated
every second. We are not talking Terabytes but
Ze;abytes or Brontobytes. If we take all the
data generated in the world between the
beginning of ?me and 2008, the same amount
of data will soon be generated every minute.
New big data tools use distributed systems so
that we can store and analyse data across
databases that are do;ed around anywhere in
the world.
Velocity…
• Data Volume
– 44x increase from 2009 2020
– From 0.8 zettabytes to 35zb
• Data volume is increasing exponentially
Exponen'al increase in
collected/generated data
4.6
30 billion RFID billion
tags today
12+ TBs (1.3B in 2005)
camera
of tweet data phones
every day world wide
100s of
millions
data every day
of GPS
? TBs of
enabled
devices sold
annually
25+ TBs of 2+
log data
every day billion
people on
the Web
76 million smart meters by end
in 2009… 2011
200M by 2014
Variety (Complexity)
• Streaming Data
– You can only scan the data once
Social Banking
Media Finance
Our
Gaming
Customer Known
History
Entertain Purchase
Velocity (Speed)
• Data is begin generated fast and need to be processed fast
• Online Data Analytics
• Late decisions è missing opportunities
• Examples
– E-Promotions: Based on your current location, your purchase
history, what you like è send promotions right now for store next to
you
– Healthcare monitoring: sensors monitoring your activities and
body è any abnormal measurements require immediate reaction
Real-time/Fast Data
Mobile devices
(tracking all objects all the ?me)
• The progress and innovation is no longer hindered by the ability to collect data
• But, by the ability to manage, analyze, summarize, visualize, and discover
knowledge from the collected data in a timely manner and in a scalable fashion
Real-Time Analy9cs/Decision Requirement
Product
Recommenda9ons Learning why Customers
Influence
that are Relevant Behavior Switch to compe9tors
& Compelling and their offers; in
9me to Counter
Friend Invita9ons
Improving the Customer to join a
Marke9ng Game or Ac9vity
Effec9veness of a that expands
Promo9on while it business
is s9ll in Play
Preven9ng Fraud
as it is Occurring
& preven9ng more
proac9vely
Harnessing Big Data
Trends:
• SPIMES (Space & Time-Spatiotemporal data)
• Streaming Data
– You can only scan the data once
(Big) Data Features
Analysis
The ‘Datafication’ Volume
Analysing
of our World;
Big Data:
• Activities
• Text
• Conversations
Velocity
analytics
• Words
• Sentiment
• Voice
analysis
• Social Media
• Face Value
• Browser logs
recognition
Variety
• Photos
• Voice
• Videos
analytics
• Sensors
• Movement
• Etc.
analytics
Veracity
• Etc.
Big Data
Opportunities
Potentials…..
Risk Management
and Fraud Smart Ci?es
Detec?on
Customer Intelligence
• Big data analytics can provide organizations with the ability to profile
and segment customers based on different socioeconomic
characteristics, as well as increase levels of customer satisfaction
and retention.
• This can allow them to make more informed marketing decisions,
and market to different segments based on their preferences along
with the recognition of sales and marketing opportunities.
• By performing sentiment analysis on this data, firms can be alerted
beforehand when customers are turning against them or shifting to
different products, and accordingly take action.
Supply Chain
• Big data analytics can be used to forecast demand changes, and
accordingly match their supply. This can increasingly benefit the
manufacturing, retail, as well as transport and logistics industries.
• By analyzing stock utilization and geospatial data on deliveries,
organizations can automate replenishment decisions, which will reduce lead
times and minimize costs and delays, as well as process interruptions.
• Furthermore, alternate pricing scenarios can be run instantly, which can
enable a reduction in inventories and an increase in profit margins.
• Accordingly, big data can lead to the identification of the root causes of cost,
and provide for better planning and forecasting.
Quality Management
• Especially for the manufacturing, energy and utilities, and
telecommunications industries, big data can be used for quality
management, in order to increase profitability and reduce costs by
improving the quality of goods and services provided.
• In the manufacturing process, predictive analytics on big data can
be used to minimize the performance variability, as well as
prevent quality issues by providing early warning alerts.
• Real-time data analyses and monitoring of machine logs can
enable managers to make swifter decisions for quality management.
Risk Management & Fraud Detection
• Industries such as investment or retail banking, as well as insurance, can
benefit from big data analytics in the area of risk management. Since the
evaluation and bearing of risk is a critical aspect for the financial services
sector, big data analytics can help in selecting investments by analyzing the
likelihood of gains against the likelihood of losses.
• Internal and external big data can be analyzed for the full and dynamic
appraisal of risk exposures
• High-performance analytics can also be used to integrate the risk profiles
managed in isolation across separate departments, into enterprise wide risk
profiles.
• Banking, and insurance industries, big data analytics can be used to detect
and prevent fraud. Analytics are already commonly used in automated
fraud detection, but organizations and sectors are looking towards
harnessing the potentials of big data in order to improve their systems.
Smart Cities
Photo: Riox.io
The need for smarter cities
Ø Growing population
Ø Aging infrastructures
How can Big data help the
different sectors mentioned?
• Etc…..
More Challenges…..
Integra9on
<10% of Big Data is relational
Complex data integration
Security
Security of source data, processing, and of “knowledge”
Privacy
Cost of analy9cs
Interpretations of outcomes!
Availability of the data scien9st
HR’s!
Big Data
Infrastructure &
Example Technologies
Big Data: 3 worlds
Storage
Processing
• Hadoop / MapReduce
Analy9cs
• Teradata DBMS
• Aster Data
• Data Mining
• TWM
• Rapid Miner
• Dashboard
• Tableau
From Big Data 1.0 to Big Data 2.0
• 64One way to think about the state of big data technologies is to draw an analogy
with the business adoption of Internet technologies.
• In Web 1.0, businesses busied themselves with getting the basic internet
technologies in place, so that they could establish a web presence, build
electronic commerce capability, and improve the efficiency of their operations.
• Once firms had incorporated Web 1.0 technologies thoroughly (and in the
process had driven down prices of the underlying technology) they started to
look further. They began to ask what the Web could do for them, and how it
could improve things they’d always done—and we entered the era of Web 2.0,
where new systems and companies began taking advantage of the interactive
nature of the Web.
From Big Data 1.0 to Big Data 2.0
65
• We should expect a Big Data 2.0 phase to follow Big Data 1.0. Once
fashion, they should begin asking: “What can I now do that I couldn’t
Tableau, RapidMiner.
IoT
IoT
• The internet of things (IoT) is the internetworking of
physical devices, vehicles (also referred to as
"connected devices" and "smart devices"), buildings and
other items—embedded with electronics, software,
sensors, actuators, and network connectivity that enable
these objects to collect and exchange data.
IoT Applications.. Examples
• Electrical Power
• Mass Produc?on
2nd
• Electronics
• Programmable logic controller (PLC)
3rd • Internet & IT/IS Systems
• Industry 4.0
• IoT
4th • FoF
Industry 4.0
• Nowadays, machines are connected via collaborative community creating the
so-called Industry 4.0, whereby machines are connected as a collaborative
community.
• Achieving faultless interaction between surrounding machines and their
corresponding systems turns regular machines into self-aware or smart
machines, and subsequently improves its performance in the environment
where it operates.
• IoT has been an enabler to Industry 4.0. IoT refers to the networked
interconnections of objects that are fitted with ubiquitous intelligence. IoT is
opening huge opportunities of new applications, which promise better quality of
our lives. The connection of physical objects to the Internet makes it conceivable
to access distant sensor data and control the physical world from distance.
• A smart object is a term given to an embedded system, which is connected to
the Internet.
FoF
• The first time the term FoF was mentioned goes back to 1986 where Irwin
Welber gave a keynote speech to the 86 international symposium on robot
manipulators.
• Welber (1987) explained that the FoF would look like a large-scale intelligent
machine, which operates with a highly integrated as well as organized
knowledge base. He also highlighted the need for both suppliers and customers
to become integral part of the FoF environment.
• Becoming more and more adaptive is the primary motivation for manufacturing
enterprises to adopt the FoF initiative. This becomes necessary due to the new
challenges manufacturing enterprises are facing including the need to cut down
production costs, making customized products, as well as being able to respond
to market changes faster than ever before.
Greening
Sustainable business, or green business, is an enterprise that has minimal nega?ve
impact on the global or local environment, community, society, or economy.
Osen, sustainable businesses have progressive environmental and human rights
policies. In general, business is described as green if it matches the following four
criteria:
1. It incorporates principles of sustainability into each of its business decisions.
2. It supplies environmentally friendly products or services that replaces demand for
nongreen products and/or services.
3. It is greener than tradi?onal compe??on.
4. It has made an enduring commitment to environmental principles in its business
opera?ons.
e.g. Greening of Supply Chains
Green Data Centers- energy
Photo: NCC
Green Universities
Source: Duurzaambedrijfsleven
Augmented Reality
• Gaming;
• Product customization;
• Product info & comparisons;
• Marketing;
• Retail;
• Real-time & location-based apps;
• And others.
Augmented Reality- Real-9me Transla9on
Download Blippar
“What is the best measure for success?? Happiness”.
Richard Branson
91
Smarter Cities
http://www.ibm.com/smarterplanet/us/en/smarter_cities/overview/,
Frost & Sullivan (2014). “2014 Global Best-in-Class Smart City IntegratorProduct Leadership Award Visionary Innovation
Leadership Award”.
http://en.wikipedia.org/wiki/Masdar_City
https://en.wikipedia.org/wiki/Big_data
Social TV.
http://socialitv.web.ua.pt/index.php/author/jferraz/
Thank you for attention!
Questions?