Professional Documents
Culture Documents
Tara Data
Tara Data
John Tulley
Vice President, Teradata Canada Email: John.tulley@ncr.com Office: 905-478-8997
Retail Solutions
Financial Solutions
Systemedia
Leading industries > Banking > Government > Insurance & Healthcare > Manufacturing > Retail > Telecommunications > Transportation Logistics > Travel World class customer list > More than 800 customers > Over 1200 installations Global presence > Over 100 countries 4,000 world-wide professionals dedicated to data warehousing
Teradata - the recognized leader in data warehousing and high-performance decision analytics. .Gartner ASEM
IBM S/390 OS/390 DB2 EEE Sun Enterprise Solaris Oracle HP HP9000 HP-UX Oracle IBM SP RS/6000 AIX DB2 EEE Compaq Alpha Tru64 Oracle Generic Unisys Intel IA-32 ES7000 Win2000 Win2000 SQL Server SQL Server
Teradata
Data Mgmt.
Data Admin.
Scalability and Suitability Concurrent Query Mgmt. DW Track Record Query Perform.
Worst
Best
> DBMS is surely the place where NCR Teradata sets the gold standard. As in previous years, the Teradata score was 98%, leaving little scope (and need) for improvement.
Gartner's [Application Server Evaluation Model] ASEM Data Warehouse Server Update, A. Butler, K. Strange, J. Enck, M. Chuba, November 2004
> Teradata[database management system] DBMS capabilities remain unchallenged by its competitors in the market.
Gartners Magic Quadrant for Data Warehouse DBMSs, 2004, Kevin H. Strange, June 2004
> Gartner Dataquest recognized Teradata as the growth leader in the RDBMS market, with above market growth of 17.4%. 2005
META Group
> Teradata has displayed unmatched (but often copied) strength of vision and focus in the [enterprise data warehouse] EDW market.
METAspectrum Market Summary, Enterprise Data Warehouse METAspectrumSM Evaluation, 2004
Research
Business Value
Analyze
Application Requirement Logical Model Data Mapping Infrastructure & Education
Design
System Architecture
Equip
Hardware Platform
Build
Physical Database
Integrate
Components for Testing System Test
Manage
Help Desk
Enterprise Assessment
EDW Roadmap
Information Sourcing
Production Install
Initial Data Acceptance Testing User Training Value Assessment
Workload Complexity
Data Sophistication
10
DW Marts Users
Independent Data Marts P Easy to Build Organizationally r o Easy to Build Technically s C Business Enterprise view unavailable o n Redundant data costs s High ETL costs
High App costs High DBA and operational costs
No ETL Meta data issues Network bandwidth and join complexity issues Only viable for low volume
Business Enterprise view challenging Redundant data costs High DBA and operational costs Data latency ODS duplication
11
Transaction Systems
Central store, Hub, Clearing house 2. There is too much latency - too long to get the data to the people who need it. Everyone sees different inconsistent points in time
Data Marts
4. The solution is too expensive. There are numerous components that lead to increased costs. Costs often hidden in distributed organization.
12
Optional ELT
ORDER ITEM SHIPPED QUANTITY SHIP DATE ITEM ITEM NUM BER QUANTITY DESCRIPTION
PRODUCT PRODUCT KEY PRODUCT NAME DISTRIBUTOR PRODUCT DESCRIPTION PRODUCT HEIGHT PRODUCT WIDTH PRODUCT DEPTH PRODUCT WEIGHT
SALES PERIOD KEY PRODUCT KEY CUSTOMER KEY MARKET KEY DOLLARS UNITS
Logical (Views)
Application Co-Located
CUSTOMER CUSTOMER KEY CUSTOMER NAME CUSTOMER CITY CUSTOMER POST CUSTOMER ST CUSTOMER ADDR CUSTOMER PHONE CUSTOMER FAX
MARKET MARKET KEY CITY STATE ZIP ZIP4 DISTRICT REGION COUNTRY
Optional
Virtual Views
Decision Users
Strategic Users Tactical Users Reporting OLAP Users Data Miners Event-driven/ Closed Loop
13
Metadata
Dimensional
Dependent DM
Optional
Data Transformation
TERADATA Utilities
Adapter(s)
TERADATA
TERADATA Utilities Adapter(s)
Queues
14
Message Bus
Secure Wireless
DOD Supplier
Warfighter Support
5.Warfighter receives alert via Secure Blackberry, adjusts Battle Plans to align with rush replenishment Enterprise Application Integration Web Services WebTibco .NET Sphere (EAI) Business Services OLAP Rules Event Intel Queries Agents Engine Engine
Information Exchange
MQ Adapter T-Pump, MQ Adapter
Fast Export
Legacy Systems
3.Stored Procedures trigger based event detection TERADATA sends alert Stored Procedures to Q Tables Warfighter, UDF, Triggers Warfighter Support, & DOD Supplier via MSTR Narrowcaster Decision Making Environment
16
Transactional Environment
So what is Teradata ?
What is Teradata?
RDBMS designed to run the worlds largest databases Latest Intel technology nodes UNIX-MP-RAS, Windows 2003 Linux in Fall 2005 Scales linearly from Laptop to MPP Has a parallel aware optimizer that allows multiple complex queries to run concurrently Standard access language (SQL) Uses a Shared-Nothing architecture Unlimited, unconditional parallelism Linear Scalability allows for increased workload without decreased throughput.
18
BYNET Interconnect
> Latest Intel SMP CPUs > Configured in 2 to 8 node cliques > Windows, Unix or Linux > Fully scalable bandwidth > 1 to 1024 nodes > Fully scalable > Channel - ESCON > LAN, WAN > Independent I/O > Scales per node
Connectivity
Storage
Server Management
> One console to view the entire system
Server Management
19
P
FSB Memory
P
FSB
I/O
I/O
Memory
P
FSB Memory
P
FSB
I/O
I/O
Memory
Similar to Large SMP, except Interconnect runs at I/O Rates and not Memory Rates Longer Lifetime: I/O Interfaces have a 3-5 Year Lifetime Scaling Is By Increasing Link Data Rates and Parallel Links
20
4-Way SMP
VAMP1
P
VAMP2
P
VAMP3
P
VAMP4 VAMPn
P P P P P P
22
File System
File system architecture is fundamentally different
> > > > Broke all the rules No Pages, BufferPools, TableSpaces, Extents,... Data location and management are entirely automatic Space allocation is entirely dynamic
No index rebuilds No re-partitioning No detailed space management Easy database and table definition Minimum ongoing maintenance
All performed automatically
23
Teradatas self-managing philosophy provides the lowest total cost of ownership of any RDBMS
> > > > > > Automatic, random and even data distribution Parallel-aware optimizer eliminates query tuning Parallel utilities with low setup and checkpoint restart Single operational view of entire MPP complex (AWS) Single point of control for the DBA (Teradata Manager) SQL-ready database management information (log files)
10. Spend a lot of time planning and expanding the system 11. Query tuning for decision support
25
BYNET Interconnect
SMP Node2 PE PE AMP SMP Node3 PE PE AMP SMP Node4 PE PE AMP
> Recycles before the operating system completes its reboot (multi-node system)
26
Concurrent Users
Sophisticated Queries
Simple Direct at the start Moderate Multi-table Join Regression analysis Query tool support
ORDER ITEM SHIPPED QUANTITY SHIP DATE ITEM ITEM NUMBER QUANTITY DESCRIPTION
28
Mixed Workload
Query Concurrency
Data Freshness
Query Complexity
Query Freedom
Schema Sophistication
29
Mixed Workload
Teradata can Scale Simultaneously Across Multiple Dimensions Driven by Business!
Query Concurrency
Competition Scales One Dimension at the Expense of Others Limited by Technology!
Data Freshness
Query Complexity
Query Freedom
Schema Sophistication
30
Mixed Workload
Teradata can Scale Simultaneously Across Multiple Dimensions Driven by Business!
Query Concurrency
Competition Scales One Dimension at the Expense of Others Limited by Technology!
Data Freshness
Difference!
Query Freedom Query Data Volume Schema Sophistication
31
Teradata
The
Query Complexity
15 TB 1,000s
Schema Sophistication
10 TB
# of Concurrent Queries
5 TB Simple Star Batch Reporting, Repetitive Queries Iterative, Ad Hoc Queries Data Analysis/Mining Near Real Time Data Feeds
15+ way Joins + OLAP operations + Aggregation + Complex Where constraints + Views Parallelism
GBs
Query Complexity
TBs
Workload Mix
32
Business Solutions
Data warehouse integrates claims/encounters; beneficiary eligibility data; provider data; birth records; death records; long-term care assessments; WIC data; immunizations; lead screening; newborn screening; & notifiable diseases.
Implementation Summary
Integrated data from nine separate health-related agencies Managed and used by agency subject matter/programmatic
experts, not by the IT department
Overpayment & COB analysis Program effectiveness Predict States healthcare needs Prioritize health initiatives
for future
33
New Yorks Medicaid program provides critical health care services to more than 3.7 million participants 2.4 million in New York City alone. To serve this constituency, the state processes and analyzes more than 300 million claims totaling more than $38 billion annually. It is the largest Medicaid program in the US.
Implementation Summary
More than five years of History 1.3 Billion Claims 650 users from 17 counties that is expected to grow to
thousands
Fraud & abuse Tracking bio-terrorism indicators daily by pharmaceutical purchases with acute illness data from hospital emergency rooms Determining disease patterns and trends and the best possible treatment Tracking drug pattern usage to prevent abuse Program effectiveness Service delivery effectiveness Enhanced audit control Forecasting the cost and utilization of expensive prescription drugs Identification of overpayments Responding quickly to legislative inquiries
34
Sources
tactical
strategic
Users
Any Question, By Any User, At Any Time All Decision Makingfrom One Copy of the Data.
36
john.tulley@ncr.com
37