Professional Documents
Culture Documents
1 Introduction
1 Introduction
Documents:
PhD. Pham Tien Lam
• Xá c điṇ h khá ch hà ng tiềmnăng
• Gợ i ý sả n phẩm cho khá ch hà ng?
• Tă ng cườ ng hiệ u quả củ a quả ng
cá o?
• Hệ thống mở cử a tự độ ng?
• Hệ thống tự độ ng trả lờ i khá ch
hà ng?
• Tối ưu hoá quá trı̀nh vậ n tả i hà ng
hoá ?
• Dự đoá n giá chứ ng khoá n?
• Tự độ ng dic̣ h vă n bả n?
• Data
?
https://ourworldindata.org
• Data as models of What you created while using your
reality phone
• Make a calls
Geographical
•• Capture photos
Text message
Transport Cultural
• Browse internet
• Reading books
Natura
l
Scientific
• Learning
• ….
Meteorological Financia
All of your data are
l collected
Statistical
https://financesonline.com/how-much-data-is-created-every-day/
• Big data is a term that describes the large volume of data — both structured and
unstructured.
• Big data can be analyzed for insights that lead to better decisions and strategic
business moves.
https://www.weforum.org/agenda/2019/04/how-much-data-is-generated-each-day-
cf4bddf29f/
Volume Variety
BIG
Velocity
DATA Veracity
https://xcelpros.com/erps-make-big-data-and-big-business-a-good-
match/
Data Formats in Big
Data
Sensor
data
Machine
Learnin
COMPUTER g MATH AND
SCIENCE STATISTICS
DATA
SCIENTIST
Traditional Data
software Analyst
BUSINESS/ DOMAIN
EXPERTISE
• Data analyst
• Machine learning
engineer
• Deep learning engineer
• Data engineer
• Data scientist
• Risk analyst
• Business analyst
• …
• How can we
make it happen?
• What
will PRESCRIPTIVE
• Why did happen? ANALYTICS
• What it PREDICTIVE
happened happen? ANALYTICS
DIAGNOSTIC
? ANALYTICS
DESCRIPTIVE
ANALYTICS
• is an interdisciplinary field that involves using mathematical and statistical
methods to extract insights and knowledge from data.
• Goals: extract valuable information from data and use it to make informed
decisions, whether that's in the context of a business, government, or any
other organization.
• Data Science is being used in various industries, such as healthcare, finance,
marketing, and more.
• Define your problems
• Data collection: web, databases, log, APIs, and others
• Data preparation (data model):
➡ Data cleaning: missing data, inconsistency,
duplications
➡ Data transformations
• Exploratory analysis: in sight of data
• Modeling: statistics, machine learning
• Report
• Deploy and maintenance
KNOWLEDGE
https://www.visualcapitalist.com/how-big-tech-makes-their-billions-
2022
• Business with
data
https://www.visualcapitalist.com/how-big-tech-makes-their-billions-
2022
• Business with
data
• Phenikaa
Dữ liệu lớn và nhu cầu của xã hội
• A machine which can do like
human?
Siri bot Cortana bot
sophia robot
Boston dynamic robots
• is the field of computer science focused on creating machines that can
perform tasks that typically require human intelligence, such as learning,
problem-solving, and pattern recognition.
• Goals: is to create machines that can perform tasks that were previously
performed by humans, and do so more efficiently and accurately.
• AI applications in various industries, such as autonomous vehicles, speech
recognition, recommendation systems, and more.
Artificial
Intelligence
Machine
Learnin
g
Statistics
Deep
Learnin
g
Data
Minin
g
Traditional Computer y = F(x) Artificial
Intelligent
Work
flow
• •
• Functiona •
l
Functional
1. Problem Settings
2.Data Collection
D = [(xi, yi), i =
1,2,...,m]
4.Model selection
5.Deploy suitable model (Using the best model to make
prediction)
" A computer program is said to learn from experience E with respect to some class of
tasks T and performance measure P, if its performance at tasks in T, as measured by P,
improves with experience E."
Tom Mitchell
Training a model on labeled data and making predictions based on that training data.
Unsupervised Learning
Training a model on unlabeled data and finding patterns or relationships within the data
Semi-supervised Learning
Trainning
Label Learning
s Dataset Final New
Algorith
model Data
Raw data m
Test Dataset
Label
s
Trevor
Hastie
Robert
Tibshirani
The Elements of
Jerome
Friedman
Statistical
Learning
Data Mining, Inference,
and Prediction
Second Edition
Visualization ML
DL
ID
Scientific computing E
Data processing/
analysis Natural
processing
Language