Tulley


Teradata

Leaders in Enterprise Data Warehousing

John Tulley
Vice President, Teradata Canada

Email: John.tulley@ncr.com
Office: 905-478-8997
NCR Corporate Overview

• Fortune 500 company
• Global operations in more than 100 countries & territories
• 28,500 employees
• 2004 Revenue $5.984B
• 1999-2004 >51% revenue growth

[Chart: 2004 Revenue by Business Unit: Teradata, Financial, Retail, Systemedia, Customer Service, Payment & Imaging, Other]
[Diagram: business units: Retail Solutions, Teradata Data Warehouse, Financial Solutions, Systemedia, Worldwide Customer Services]

2
Top Industry Leaders Rely on Teradata

Teradata Top 10
• 80% of Top 10 Global Telco Firms
• 60% of Top 10 Most Admired Global Companies
• 60% of Top 10 Global Airlines
• 50% of Top 10 Global Retailers
• 50% of the Top 10 Transportation Logistic Firms
FORTUNE Global Rankings, July 2005

• Leading industries
> Banking
> Government
> Insurance & Healthcare
> Manufacturing
> Retail
> Telecommunications
> Transportation Logistics
> Travel
• World class customer list
> More than 800 customers
> Over 1200 installations
• Global presence
> Over 100 countries
• 4,000 world-wide professionals dedicated to data warehousing
3
The Teradata Difference

What We Do….
• Enterprise data warehouse
• Windows 2003/Unix/Linux scales from Intel laptop to MPP
• Analytic capabilities transform data into information.
• Extreme high availability
• Industry leader in analytical applications
• Integration with SAP, Siebel, Hyperion
• Partnerships include Accenture, BearingPoint, Capgemini, Deloitte, EDS, Lockheed Martin
• Strong customer references

All we do is Data Warehousing!

4
Teradata - the recognized leader in data warehousing and high-performance decision analytics.
….Gartner ASEM

[Chart: Gartner ASEM ratings of data warehouse platforms, each rated on a scale from Worst to Best]

Platforms compared (vendor, hardware, operating system, DBMS):
• IBM S/390, OS/390, DB2 EEE
• Sun Enterprise, Solaris, Oracle
• HP HP9000, HP-UX, Oracle
• IBM SP RS/6000, AIX, DB2 EEE
• Compaq Alpha, Tru64, Oracle
• Generic Intel IA-32, Win2000, SQL Server
• Unisys ES7000, Win2000, SQL Server
• Teradata

Rating criteria:
• Data Mgmt.
• Data Admin.
• Scalability and Suitability
• Concurrent Query Mgmt.
• DW Track Record
• Query Perform.

Source: Gartner ASEM Ratings 2004

5
Industry Leadership Recognition

• Gartner - “Dominant Lead” – 5th Consecutive Year
> “DBMS is surely the place where NCR Teradata sets the gold standard. As in previous years, the Teradata score was 98%, leaving little scope (and need) for improvement.”
– Gartner's [Application Server Evaluation Model] ASEM Data Warehouse Server Update, A. Butler, K. Strange, J. Enck, M. Chuba, November 2004
> “Teradata [database management system] DBMS capabilities remain unchallenged by its competitors in the market.”
– Gartner’s Magic Quadrant for Data Warehouse DBMSs, 2004, Kevin H. Strange, June 2004
> “Teradata continues to drive a strong vision.”
– Gartner Research, MarketScope: Customer Relationship Marketing, 1Q04, G. Herschel, J. Radcliffe, Feb 2004
> Gartner Dataquest recognized Teradata as the growth leader in the RDBMS market, with above market growth of 17.4% (2005)
> Teradata is rated “Positive” in Gartner’s MarketScope for Campaign Management, the highest rating awarded (2005)

• META Group
> “Teradata has displayed unmatched (but often copied) strength of vision and focus in the [enterprise data warehouse] EDW market.”
– METAspectrum Market Summary, Enterprise Data Warehouse METAspectrumSM Evaluation, 2004

6
Industry Awards and Recognition - 2005

BI Excellence Award
Sponsor: Gartner Group
• Continental Airlines - winner
• Cardinal Health - finalist

1to1 Impact Award
Sponsor: Peppers & Rogers
• Continental Airlines recognized as Technology Optimization winner

Technology Leadership Award
Sponsor: Frost & Sullivan
• Teradata selected for BI Leadership Award – CRM Analytics

Editors’ Choice Awards
Sponsor: Intelligent Enterprise
• Teradata selected for the “Dozen” Most Influential Companies
• Winner, Customer Analytics category

TDWI Best Practices Award
• sunrise TDC Switzerland AG – winner - Customer Relationship Management

NEXUS Awards
Sponsor: New Zealand Direct Marketing Association
• Bank of New Zealand, silver award - data mining & analytics; bronze award - data management

7
Government Agencies with Teradata Presence

• US Air Force
• US Navy
• US Transportation Command
• Defense Commissary Agency
• Army, Air Force Exchange
• Intelligence Community
• US Postal Service
• Dept. of Justice
• Dept. of Housing and Urban Development
• Dept. of Agriculture
• Arizona, Iowa, Florida, Texas, Illinois, New York, Utah, Michigan
• RAMQ – Quebec
• Australian Tax Office
• South African Tax Office
• Italian Post Office

8
Teradata Solutions Methodology

Project Management

• Strategy: Opportunity Assessment; Enterprise Assessment
• Research: Business Value; EDW Roadmap; Information Sourcing
• Analyze: Application Requirement; Logical Model; Data Mapping; Infrastructure & Education
• Design: System Architecture; Package Adaptation; Custom Component; Test Plan; Education Plan
• Equip: Hardware Platform; Software Platform; Support Management; Operational Mentoring; Technical Education
• Build: Physical Database; ECTL Application; Information Exploitation; Operational Applications; Backup & Recovery; User Curriculum
• Integrate: Components for Testing; System Test; Production Install; Initial Data; Acceptance Testing; User Training; Value Assessment
• Manage: Help Desk; Capacity Planning; System Performance; Business Continuity; Data Migration; HW/SW Upgrade; Availability SLA; System DBA; Solution Architect

Technology Neutral Services

Teradata’s success is the combination of hardware, software and methodology
9
Data Warehouse Needs Will Evolve

• Query complexity grows
• Workload mixture grows
• Data volume grows
• Schema complexity grows
• Depth of history grows
• Number of users grows
• Expectations grow

[Chart: five stages of data warehouse evolution; axes: Workload Complexity, Data Sophistication]
1. REPORTING: WHAT happened? Primarily Batch & Some Ad Hoc Reports
2. ANALYZING: WHY did it happen? Increase in Ad Hoc Analysis
3. PREDICTING: WHAT WILL happen? Analytical Modeling Grows
4. OPERATIONALIZING: WHAT IS happening? Continuous Update/Short Queries
5. ACTIVATING: MAKE it happen! Event-Based Triggering Takes Hold

Workload legend: Batch, Ad Hoc, Analytics, Continuous Update/Short Queries, Event-Based Triggering
10
Enterprise Analytical Topologies

Data Mart Centric (Independent Data Marts)
Flow: Sources, Marts, Users
Pros
• Easy to Build Organizationally
• Easy to Build Technically
Cons
• Business Enterprise view unavailable
• Redundant data costs
• High ETL costs
• High App costs
• High DBA and operational costs

Virtual, Distributed, Federated (Leave Data Where it Lies)
Flow: Sources, Middleware, Users
Pros
• No need for ETL
• No need for separate platform
Cons
• No ETL
• Meta data issues
• Network bandwidth and join complexity issues
• Only viable for low volume

Hub-and-Spoke Data Warehouse (Dependent Data Marts)
Flow: Sources, ODS, DW, Marts, Users
Pros
• Allows easier customization of user interfaces & reports
Cons
• Business Enterprise view challenging
• Redundant data costs
• High DBA and operational costs
• Data latency
• ODS duplication

Enterprise Data Warehouse (Centralized Integrated Data With Direct Access)
Flow: Sources, DW, Users
Pros
• Enterprise view
• Design consistency & data quality
• Data reusability
Cons
• Requires vision
• Requires Data Owners to willingly participate
11
Typical Data Warehouse Architecture

What’s wrong with this picture?

[Diagram: Transaction Systems; Operational Data Stores; Central store, Hub, Clearing house; Data Marts]

1. There are too many copies of the data. Will they all be the same?
2. There is too much latency - too long to get the data to the people who need it. Everyone sees different inconsistent points in time
3. The solution is too complex. Every line on the chart represents an ETL process that requires $$ for Life Cycle Maintenance
4. The solution is too expensive. There are numerous components that lead to increased costs. Costs often hidden in distributed organization.

12
Teradata’s Enterprise Data Warehouse
An Integrated, Centralized Data Warehouse Solution

[Diagram: Teradata Enterprise Data Warehouse architecture]

• Transactional Users
• Transactional Data
• Middleware/Enterprise Message Bus
• Data Transformation: Optional ETL Hub; Optional ELT
• Optional Operational Data Store (ODS)
• “Enterprise” Data Warehouse: Single version of data
> Logical Data Model
> Physical Data Base Design
> Data Replication
> Logical Application (Views)
> Virtual Views
> Metadata
• Optional Co-Located Dimensional Dependent DM (Data Marts)
• Decision Users: Strategic Users, Tactical Users, Reporting OLAP Users, Data Miners, Event-driven/Closed Loop

Surrounding services:
• Enterprise, System, & Database Management
• Business & Technology – Consultation
• Support & Education Services

[Inset: sample logical data models with ORDER, ORDER ITEM BACKORDERED, ORDER ITEM SHIPPED, CUSTOMER, and ITEM entities, and a SALES model with PERIOD, PRODUCT, CUSTOMER, and MARKET dimensions]

13
TERADATA is an Open System
Virtually any application or middleware framework can be integrated with TERADATA !!!

[Diagram: integration paths into TERADATA]
• WEB: JMS, JAVA, JDBC
• WEB: JMS, EJB, JDBC
• JSP, TAP Appl, JDBC
• IIOP, CORBA, ODBC
• ASP, .NET, OLE-DB
• Messages: Publish & Subscribe, Message Bus, Queues
• TERADATA Adapter(s) and Utilities

14
Teradata Active Data Warehouse in action
[Diagram: Transactional Environment and Decision Making Environment connected through TERADATA]

Participants: Front Line, Base Supply, Secure Wireless, DOD Supplier, Warfighter Support, Strategic & Tactical Queries

1. Continuous Transaction feeds on supplies usage
2. Conditioning & Loading of trans data
3. Stored Procedures trigger based event detection sends alert to Warfighter, Warfighter Support, & DOD Supplier via MSTR Narrowcaster
4. … and/or DOD Vendor notified and reorders
5. Warfighter receives alert via Secure Blackberry, adjusts Battle Plans to align with rush replenishment

Components:
• Enterprise Application Integration (EAI): Web Services, WebSphere, Tibco, .NET
• Secure DOD Network
• Business Services: OLAP Queries, Intel Agents, Rules Engine, Event Engine
• Information Exchange: Ascential, Informatica, MQ Adapter
• Data Acquisition: T-Pump, MQ Adapter, Fast Load, Multi Load
• Direct Data Access: T-Pump, MQ Adapter, Fast Export, Stored Procedures, Q Tables, UDF, Triggers
• Legacy Systems
16
So what is Teradata?
What is Teradata?

• RDBMS designed to run the world’s largest databases
• Latest Intel technology nodes
• UNIX-MP-RAS, Windows 2003
• Linux in Fall 2005
• Scales linearly from Laptop to MPP
• Has a parallel aware optimizer that
allows multiple complex queries to run
concurrently
• Standard access language (SQL)
• Uses a “Shared-Nothing” architecture
• Unlimited, unconditional parallelism
• Linear Scalability allows for increased
workload without decreased throughput.

18
Teradata Hardware Architecture

• SMP Nodes
> Latest Intel SMP CPUs
> Configured in 2 to 8 node cliques
> Windows, Unix or Linux
• BYNET Interconnect
> Fully scalable bandwidth
> 1 to 1024 nodes
• Connectivity
> Fully scalable
> Channel - ESCON
> LAN, WAN
• Storage
> Independent I/O
> Scales per node
• Server Management
> One console to view the entire system

[Diagram: BYNET Interconnect linking SMP Node1 through SMP Node4, each node running PEs and AMPs, with Server Management alongside]

19
Teradata Shared Nothing Architecture

[Diagram: four nodes, each with processors (P) on an FSB and its own Memory and I/O]

• Similar to Large SMP, except Interconnect runs at I/O Rates and not Memory Rates
• Longer Lifetime: I/O Interfaces have a 3-5 Year Lifetime
• Scaling Is By Increasing Link Data Rates and Parallel Links

20
SMP vs. MPP: The Teradata Advantage

• 2-Way SMP
> 1.8 Relative CPU’s
> 4 GB Memory
> 3.2 GB/Sec BUS
> 3.2 GB/Sec Memory
> 1.5 GB/Sec I/O
• 4-Way SMP
> 3.1 Relative CPU’s
> 4 GB Memory
> 3.2 GB/Sec BUS
> 3.2 GB/Sec Memory
> 1.5 GB/Sec I/O
• 2 2-Way Teradata Nodes
> 3.6 Relative CPU’s
> 8 GB Memory
> 6.4 GB/Sec BUS
> 6.4 GB/Sec Memory
> 3 GB/Sec I/O
• 32 2-Way Teradata Nodes
> 57.6 Relative CPU’s
> 128 GB Memory
> 102.0 GB/Sec BUS
> 102.0 GB/Sec Memory
> 48 GB/Sec I/O
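
The multi-node figures on this slide follow from multiplying the 2-way node's figures by the node count, which is what a shared-nothing design implies. The sketch below reproduces that arithmetic; the `NodeSpec` class and its field names are illustrative, not part of any Teradata tool. Note that 32 × 3.2 GB/Sec works out to 102.4, slightly above the 102.0 printed on the slide.

```python
from dataclasses import dataclass, astuple


@dataclass(frozen=True)
class NodeSpec:
    """Figures for one 2-way node, taken from the slide (names are illustrative)."""
    relative_cpus: float
    memory_gb: float
    bus_gb_per_sec: float
    memory_gb_per_sec: float
    io_gb_per_sec: float

    def scaled(self, nodes: int) -> "NodeSpec":
        # Shared-nothing assumption: nodes share no resources, so every
        # aggregate figure is the per-node figure times the node count.
        return NodeSpec(*(round(value * nodes, 1) for value in astuple(self)))


TWO_WAY_NODE = NodeSpec(1.8, 4, 3.2, 3.2, 1.5)

two_nodes = TWO_WAY_NODE.scaled(2)
thirty_two_nodes = TWO_WAY_NODE.scaled(32)

print(two_nodes)
print(thirty_two_nodes)
```

By contrast, the 4-way SMP on the slide adds CPUs (3.1 relative) while memory, bus, and I/O figures stay at the 2-way values, which is the point of the comparison.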

21
Teradata Data Distribution
Dividing the Work

• Rows are distributed evenly by hash partitioning
> Done in real-time as data are loaded, appended, or changed.
> No reorgs, repartitioning, space management
• Shared nothing software:
> Each VAMP owns an equal slice of the data.
> Each VAMP works exclusively & independently on its rows
> Nothing centralized: No single point of control for any operation (I/O, Buffers, Locking, Logging, Dictionary)

[Diagram: rows of Table A, Table B, and Table C pass their Primary Index through the Teradata Parallel Hash Function, which yields a RowHash (Hash Bucket) stored with the Data Fields; rows are spread across VAMP1 through VAMPn, each with its own processor (P), memory (M), and disk (D)]
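
The idea of hash distribution can be sketched in a few lines. This is a minimal illustration under stated assumptions, not Teradata's implementation: MD5 stands in for the proprietary hash function, the bucket count of 65,536 is assumed, and the bucket-to-AMP mapping is a plain modulo rather than a real hash map. The names `row_hash`, `amp_for`, and `distribute` are hypothetical.

```python
import hashlib
from collections import Counter

HASH_BUCKETS = 65536  # assumed bucket count, for illustration only


def row_hash(primary_index_value: str) -> int:
    """Hash the primary index value (MD5 is a stand-in hash function)."""
    digest = hashlib.md5(primary_index_value.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


def amp_for(primary_index_value: str, n_amps: int) -> int:
    """Map a row to the AMP that owns it: value -> hash bucket -> AMP."""
    bucket = row_hash(primary_index_value) % HASH_BUCKETS
    return bucket % n_amps  # simplified bucket-to-AMP assignment


def distribute(keys, n_amps: int) -> Counter:
    """Count how many rows land on each AMP."""
    return Counter(amp_for(key, n_amps) for key in keys)


counts = distribute((f"CUST-{i}" for i in range(100_000)), n_amps=4)
for amp, rows in sorted(counts.items()):
    print(f"AMP {amp}: {rows} rows")
```

Because placement depends only on the hashed key, a row's owner can be computed at load time or at query time without consulting any central directory, and distinct keys spread close to evenly across the AMPs.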

22
File System

• File system architecture is fundamentally different


> Broke all the rules
> No Pages, BufferPools, TableSpaces, Extents,...
> Data location and management are entirely automatic
> Space allocation is entirely dynamic
• Absolutely minimal labor required
> No reorgs
– Don’t even have a reorg utility
> No index rebuilds
> No re-partitioning
> No detailed space management
> Easy database and table definition
> Minimum ongoing maintenance
– All performed automatically

23
Self Managing Architecture

• Teradata’s self-managing philosophy provides the lowest total cost of ownership of any RDBMS
> Automatic, random and even data distribution
> Parallel-aware optimizer eliminates query tuning
> Parallel utilities with low setup and checkpoint restart
> Single operational view of entire MPP complex (AWS)
> Single point of control for the DBA (Teradata Manager)
> SQL-ready database management information (log files)
What Teradata DBAs Don’t Worry About!

1. Install the Database


2. Understand, monitor and tune extensive operating system
parameters
3. Understand, monitor and tune extensive database parameters
4. Determine the size and physical location and/or space allocations
of tables and index partitions
5. Perform periodic table and index re-orgs
6. Manually restart multi-step load process when failure occurs
7. Ability to run queries and data maintenance 24x7
8. Sort data before loading
9. Calculate and configure fail-over plans in a clustered
multiprocessing environment
10. Spend a lot of time planning and expanding the system
11. Query tuning for decision support

25
Teradata High Availability

• Teradata software provides high availability beyond other databases
> Compensates for hardware failures:
– Automatic failover for dynamic workload rebalancing (migrating VPROCS)
– Online, continuous backup (Fallback)
> Recycles before the operating system completes its reboot (multi-node system)

[Diagram: BYNET Interconnect linking SMP Node1 through SMP Node4, each node running PEs and AMPs]

26
Teradata’s Multidimensional Scalability
(It’s more than just big data)

• Amount of Detailed Data
• Concurrent Users
• Multiple Subject Areas
• Sophisticated Queries
> Simple Direct at the start
> Moderate Multi-table Join
> Regression analysis
> Query tool support

[Diagram: sample logical data model with ORDER, ORDER ITEM BACKORDERED, ORDER ITEM SHIPPED, CUSTOMER, and ITEM entities]

28
EDW Requires Multi-dimensional Scalability

[Chart: eight scalability dimensions]
• Data Volume (Raw, User Data)
• Mixed Workload
• Query Concurrency
• Data Freshness
• Query Complexity
• Query Freedom
• Schema Sophistication
• Query Data Volume
29
The Teradata Difference
“Multi-dimensional Scalability”

[Chart: eight scalability dimensions: Data Volume (Raw, User Data), Mixed Workload, Query Concurrency, Data Freshness, Query Complexity, Query Freedom, Schema Sophistication, Query Data Volume]

• Teradata can Scale Simultaneously Across Multiple Dimensions: Driven by Business!
• Competition Scales One Dimension at the Expense of Others: Limited by Technology!
30
31
The Teradata Difference
“Multi-dimensional Scalability”

[Chart: Teradata vs. Others across several dimensions]
• Data Storage (raw, user data): 5 TB, 10 TB, 15 TB, 20 TB, 100’s TBs +
• Schema Sophistication: Simple Star; Multiple, Integrated Stars; Normalized; Multiple, Integrated Stars and Normalized
• # of Concurrent Queries: up to 1,000’s
• Workload Mix: Batch Reporting, Repetitive Queries; “Iterative”, Ad Hoc Queries; Data Analysis/Mining; Near Real Time Data Feeds; Active Data Warehousing
• Query Data Volumes: MBs, GBs, TBs
• Query Complexity: 3-5 Way Joins; 5-10 Way Joins; 15+ way Joins + OLAP operations + Aggregation + Complex “Where” constraints + Views
• Parallelism

32
State of Michigan, Department of Community Health (DCH)
Teradata Customer Since 1991

Customer Profile
As the largest department in the State of Michigan, DCH is responsible for managing delivery of health care services to more than 1.2 million clients and overseeing an annual budget of $9.5 billion. DCH administers many of the state’s most critical programs, including Medicaid, WIC, and child immunizations.

Business Solutions
• Data warehouse integrates claims/encounters; beneficiary eligibility data; provider data; birth records; death records; long-term care assessments; WIC data; immunizations; lead screening; newborn screening; & notifiable diseases.
• Fraud & abuse
• Contract management with health plans
• Healthcare cost & quality assessment
• Overpayment & COB analysis
• Program effectiveness
• Predict State’s healthcare needs
• Prioritize health initiatives for future

Implementation Summary
• Integrated data from nine separate health-related agencies
• Managed and used by agency subject matter/programmatic experts, not by the IT department
• Over 200 users in Medicaid and 8,000 state-wide

Realizations and ROI
• Estimated annual savings of $75 million–$100 million due to advanced health care analysis
• Medicaid administrative costs have been reduced by 25 percent
• Recoveries for Medicaid fraud have doubled
• Maximized Medicaid program savings while sustaining quality care
• Warehouse helped Michigan go from “last to first” in child immunization rates
• Track and substantiate savings in Medicaid pharmacy costs
• 2004 TDWI Best Practice Award Winner – Government and Non-Profit Category

33
The New York State Department of Health (DoH)
Teradata Customer Since 1999

Customer Profile
New York’s Medicaid program provides critical health care services to more than 3.7 million participants – 2.4 million in New York City alone. To serve this constituency, the state processes and analyzes more than 300 million claims totaling more than $38 billion annually. It is the largest Medicaid program in the US.

Business Solutions
New York is making more rapid, informed decisions about programs, policies, and people across its vast Medicaid system.
• Fraud & abuse
• Tracking bio-terrorism indicators daily by pharmaceutical purchases with acute illness data from hospital emergency rooms
• Determining disease patterns and trends and the best possible treatment
• Tracking drug pattern usage to prevent abuse
• Program effectiveness
• Service delivery effectiveness
• Enhanced audit control
• Forecasting the cost and utilization of expensive prescription drugs
• Identification of overpayments
• Responding quickly to legislative inquiries

Implementation Summary
• More than five years of History
• 1.3 Billion Claims
• 650 users from 17 counties, expected to grow to thousands

Realizations and ROI
• First year in operation paid for entire implementation of the DW!
• Better analysis of integrated data resulted in recoveries in the millions!
• $16m - Coordination of Benefits, $5m - duplicate payments, $1 million - overpayments
• $187 million saved due to better policy decisions based on medical and pharmaceutical analysis
• Millions saved due to efficiency of analysis such as Audit process reduced to 2 hours from 8 weeks
• 2004 NASCIO Award – Best Information Architecture Category

34
Iowa Department of Revenue

Tax Compliance: Business Benefits
• Have more accurate leads because of better information
• Experienced substantial savings; staff can --
> Analyze greater volumes of data
> Manage a greater number of cases
> Exercise a higher level of control over taxpaying behavior
> Before the EDW, this additional work would have required a 20-25% increase in audit staff
• Generated $69.7M in incremental collections and refund reductions in 2003
> $30.6M through office examinations
> $17.4M in refund reductions
> $ 9.1M from tax gap revenues
> $ 7.5M in out-of-state audits of multi-state businesses
> $ 5.1M from in-state field audits
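
As a quick check, the five components listed above do add up to the $69.7M headline figure. The snippet below is only that arithmetic; the dictionary keys are shortened labels chosen for illustration, and `Decimal` is used so the one-decimal amounts sum exactly.

```python
from decimal import Decimal

# 2003 incremental collections and refund reductions, in millions of USD,
# as listed on the slide (keys are shortened labels).
collections_2003_musd = {
    "office examinations": Decimal("30.6"),
    "refund reductions": Decimal("17.4"),
    "tax gap revenues": Decimal("9.1"),
    "out-of-state audits": Decimal("7.5"),
    "in-state field audits": Decimal("5.1"),
}

total_musd = sum(collections_2003_musd.values(), Decimal("0"))
print(f"Total: ${total_musd}M")
```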

35
The Teradata Mission

Teradata Active Data Warehousing

strategic, tactical, and event-driven decision making in a single centralized Active Data Warehouse: a mission-critical, up-to-date version of the enterprise data

[Diagram: Sources feeding the Active Data Warehouse, serving strategic and tactical Users]

“Any Question, By Any User, At Any Time”

All Decision Making…from One Copy of the Data.
36
The Industry Leader in Data Warehousing

john.tulley@ncr.com

37
