Agile Data Warehouse PDF
Oliver Ratzesberger
Officer
Teradata Corporation
Senior Director
eBay
TERADATA
Raising Intelligence
Adastra Information
Management
Conference
2009
Oliver Ratzesberger
Senior Director
eBay
oratzesberger@ebay.com
Agenda
Options
The Business Need
[Figure: Value plotted against Time; labels: Initial Deployment, Repeated Use]
Use Case
[Figure: methodology diagram. Recoverable labels: Strategy, Research, Opportunity Assessment, Enterprise Assessment, Project Management, Design, Equip, Build, Analyze, Integrate, Manage, Application Requirement, Logical Model, System Architecture, Package Adaptation, Data Mapping, Infrastructure, Education & User Curriculum]
Implementation
Options
Option 1:
Separate Development System
Option 2:
Downsized Development System
Option 3:
Federated Development System
Option 4:
Production Sandbox
Governance
Controls
No "production" reporting from the sandbox environment.
Automated resource governors to prevent "runaway" queries in the sandbox area.
Data residency no more than XX days.
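The three controls above could be expressed as a small policy object. The following is only a sketch under stated assumptions: the class names, the CPU-seconds budget, and the 30-day and 600-second values are hypothetical placeholders (the slide leaves the residency limit as "XX days"), and nothing here reflects how the presenters actually enforced these rules.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass
class SandboxTable:
    # Hypothetical record for one table living in the sandbox area.
    name: str
    created: date


@dataclass
class SandboxPolicy:
    # Both limits are illustrative assumptions, not values from the slides.
    max_residency_days: int
    max_query_cpu_seconds: float

    def expired_tables(self, tables, today):
        """Return tables older than the data residency limit."""
        cutoff = today - timedelta(days=self.max_residency_days)
        return [t for t in tables if t.created < cutoff]

    def should_abort(self, cpu_seconds_used):
        """Resource governor check: flag a query that exceeds its CPU budget."""
        return cpu_seconds_used > self.max_query_cpu_seconds

    def allows_report(self, is_production_report):
        """No production reporting is allowed from the sandbox."""
        return not is_production_report


policy = SandboxPolicy(max_residency_days=30, max_query_cpu_seconds=600.0)
tables = [
    SandboxTable("proto_clicks", date(2009, 1, 5)),
    SandboxTable("proto_bids", date(2009, 3, 1)),
]
expired = policy.expired_tables(tables, today=date(2009, 3, 10))
print([t.name for t in expired])
```

With these sample dates only proto_clicks has outlived the assumed 30-day limit, so it is the only table a cleanup job would drop.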
Case Study
The eBay Experience
>50 TB/day
>100k
>50A10
new records/day
>50k
Active/Active
>5000
data elements
chains of logic
turning over a TB every 5 seconds
24x7x365
Millions of queries/day
Always online
99.9+% Availability
Near-Real-time
[Figure: architecture diagram. Recoverable labels: Unica, MicroStrategy, SAS, Crystal, SQL; Primary, Secondary; Relational Data; Local Interconnect; Wide-area Interconnect, 1000 miles; Sun Fire, Solaris; Linux MPP; MPP/HPC/Grid; XML, name/value, raw; 2.5PB, 2.2PB, 2.2PB, 6.6PB; Phoenix, A2; Data Integration, Informatica]
[Figure: recoverable labels: Unknown, Exploration, NEW, Design; text fragments: "The metrics", "or dimensions", "can't be static", "in potential ROI"]
redundancy
Increased complexity
Loss of lineage over time
Analytics as a Service
Massive scale
Analytical Utility Computing
[Slide text largely illegible; legible fragments: Combine, private, Utility data, access, existing]
Analytics as a Service
From a simple web-based table upload:
[Screenshot: table upload web form (CSV upload); remaining text not recoverable]
SQL
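To make the idea of a table upload concrete, here is a minimal sketch of turning an uploaded CSV file into a queryable table. It assumes an in-memory SQLite database as a stand-in for the sandbox; the function names, the type-inference rule, and the sample data are all hypothetical and are not taken from the screenshot or from any eBay or Teradata tool.

```python
import csv
import io
import sqlite3


def infer_type(values):
    """Pick INTEGER, REAL, or TEXT depending on whether every value parses."""
    for sql_type, cast in (("INTEGER", int), ("REAL", float)):
        try:
            for v in values:
                cast(v)
            return sql_type
        except ValueError:
            continue
    return "TEXT"


def upload_csv(conn, table_name, csv_text):
    """Create table_name from the CSV header row and load the data rows.

    Sketch only: identifiers are interpolated directly, so this is not
    safe for untrusted table or column names.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    column_defs = []
    for i, name in enumerate(header):
        col_type = infer_type([r[i] for r in data])
        column_defs.append('"' + name + '" ' + col_type)
    column_sql = ", ".join(column_defs)
    conn.execute('CREATE TABLE "' + table_name + '" (' + column_sql + ")")
    placeholders = ", ".join("?" for _ in header)
    conn.executemany(
        'INSERT INTO "' + table_name + '" VALUES (' + placeholders + ")", data
    )
    return len(data)


conn = sqlite3.connect(":memory:")
sample = "item_id,price,category\n1,9.99,books\n2,14.50,toys\n"
loaded = upload_csv(conn, "proto_items", sample)
total = conn.execute('SELECT SUM(price) FROM "proto_items"').fetchone()[0]
print(loaded, round(total, 2))
```

Once the rows are loaded, the uploaded data can be queried with ordinary SQL alongside any other table in the same database, which is the point of the upload step.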
More than 75 active right now
small (100GB-5TB)
free
EDW
agile prototyping.
Governance Rule 1
DW data:
Governance Rule 2
Governance Rule 3
Key Process 1
users.
Key Process 2
Help Desk support is critical.
Direct access for PET personnel to the most senior
architectural and technical personnel.
"Bidirectional" mentoring:
Best and brightest technical resources get closer to the business ...
Business gets closer to fast and effective implementations.
It does not take long for PET personnel to become self-sufficient.
Key Learning
benefits.
Failure = Learning
Do so with great effectiveness ...
Fail fast, fail early.
Questions?