Professional Documents
Culture Documents
Lec 01
Lec 01
Stream Processing
Class Rules
1
2/23/2024
Course Assessment
A few suggestions….
2
2/23/2024
Data Deluge
3
2/23/2024
7
C o p y r i ght © S A S I n s titu te In c . A ll r igh ts r e s e r v e d .
8
C o p y r i ght © S A S I n s titu te In c . A ll r igh ts r e s e r v e d .
4
2/23/2024
Data Volume
10
5
2/23/2024
Data Velocity
11
Data Variety
• structured data
• unstructured data
• business applications
• unstructured text documents
(articles, blogs, and so on)
• emails
• digital images
• video and audio clips
• streaming data
• stock ticker data
• RFID tag data
• sensor data
12
6
2/23/2024
Data Variability
13
Data Complexity
14
7
2/23/2024
15
16
8
2/23/2024
Data Streaming
17
Communications Surveillance
T
Transportation
OF
hings Agriculture
Energy
Manufacturing
Finance /
Insurance
Retail
Health Care 18
C o p y r i ght © S A S I n s titu te In c . A ll r igh ts r e s e r v e d .
18
9
2/23/2024
19
20
10
2/23/2024
Streaming Analytics
Stream – Understand – Act
Deploy
Enrich
21
Streaming Data
22
11
2/23/2024
23
24
12
2/23/2024
Unbounded data
• Conceptually infinite, ever-growing set of data items/events
• Practically continuous stream of data, which needs to be
processed/analyzed
Push model
• The source controls data production and procession
• Publish/subscribe model
Concept of time
• Often need to reason about when data is produced and when processed
data should be output
• Processing time, ingestion time, event time
25
The value of data changes from the individual item to the aggregate over this time line.
26
13
2/23/2024
27
Data Streaming
28
14
2/23/2024
29
30
15
2/23/2024
Static vs Streaming
31
32
16
2/23/2024
Challenges of Streaming
33
Challenges of DSP
34
17