Professional Documents
Culture Documents
Rise of Data Science in Age of Big Data
Rise of Data Science in Age of Big Data
Revolution Confidential
Revolution Confidential
Revolution Confidential
Revolution Confidential
Revolution Confidential
Ed Chen http://blog.echen.me/hurricane-sandy-outages/
Revolution Confidential
New York Times, June 25 2009 (3 hours after Michael Jacksons death) http://www.nytimes.com/interactive/2009/06/25/arts/0625-jackson-graphic.html
Revolution Confidential
Revolution Confidential
Revolution Confidential
ML
New Data
scoring rules
scoring rules
Accuracy
Revolution Confidential
11
P roblem: L ac k of c redibility
Revolution Confidential
12
P roblem: C omplexity
Revolution Confidential
13
Revolution Confidential
14
Revolution Confidential
15
Revolution Confidential
Companies that have massive amounts of data without massive amounts of clue are going to be displaced by startups that have less data but more clue. -- Tim OReilly
Google Research, The Unreasonable Effectiveness of Data: http://googleresearch.blogspot.com/2009/03/unreasonable-effectiveness-of-data.html Tim OReilly on Google+: https://plus.google.com/107033731246200681024/posts/4Xa76AtxYwd TechnoCalifornia: http://technocalifornia.blogspot.com/2012/07/more-data-or-better-models.html
16
Revolution Confidential
S&P 500
17
Revolution Confidential
18
Revolution Confidential
19
Revolution Confidential
R is Hot
bit.ly/r-is-hot
20
Custom graphics
21
Revolution Confidential
Data
Model Estimation
Predictions
Model Refinement
22
Revolution Confidential
Data
Disk
Core 0
(Thread 0)
Core 1
(Thread 1)
Core 2
(Thread 2)
Core n
(Thread n)
23
Revolution Confidential
BIG DATA
Data Partition Data Partition
Data Partition
Compute Node
Compute Node
Master Node
Compute Node
Revolution Confidential
25
Revolution Confidential
Map-Reduce
RHadoop: http://bit.ly/RHadoop
26
B ig Data A pplianc es
Revolution Confidential
Revolution Confidential
28
Revolution Confidential
Revolution R Enterprise
29
Revolution Confidential
Revolution R Enterprise
www.revolutionanalytics.com/products
30
Revolution Confidential
Image www.tinyplanetphotography.com
31
A nd the future?
Even more data Cloud computing Demand for Data Scientists
Revolution Confidential
Revolution Confidential
Files Clusters
Data Appliances
Hadoop NoSQL
Exploration Modeling
Storage Preprocessing
33
Revolution Confidential
34
Revolution Confidential
DJ Patil in OReilly Radar: http://oreil.ly/I3H5fI Statistics and Data Science graduates Kaggle and Chorus Revolution Analytics R Training:
http://www.revolutionanalytics.com/services/training/
35
Revolution Confidential
Data Scientists need a technology platform to think about, explore, and model data Revolution R Enterprise is R for Big Data
36
R es ourc es
www.revolutionanalytics.com/products
Revolution Confidential
T hank you.
Revolution Confidential
The leading commercial provider of software and support for the popular open source R statistics language.
www.revolutionanalytics.com
650.646.9545
Twitter: @RevolutionR
38