Data AI Modernization - CP4D Overview

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 22

Data & AI Modernization

IBM Cloud Pak for Data


The AI Ladder
A prescriptive approach to the journey to AI

INFUSE - Operationalize AI throughout the business

ANALYZE - Build and scale AI with trust and explainability

MODERNIZE
ORGANIZE - Create a business-ready analytics foundation Make your data ready for an
AI and hybrid multicloud
world
COLLECT - Make data simple and accessible

One Platform,
Talent &
Any Cloud
Skills
IBM Uniquely Delivers the IA Foundation

System Admin Data Engineer Data Scientist Business Analyst


Unified: APIs Integrated User Experience

Extensible: Accelerators and Solutions


Modular: - provision services & scale out when needed

Collect & Connect Services Organize and Integrate Services Analyze and Infuse Services
The AI ladder –
– Data virtualization & Connectors Data Discovery and search – Data Science and Visualizations
– Provision SQL and – Data transformation – Dashboards and BI Reporting
NOSQL Databases & – Data Curation – AUTO-AI, ML deployments
Warehouses
– Data cataloging and and operations
– Event Ingestion and Classification – AI Trust and Transparency -
Streaming Analytics
– Business glossary Explainability and Bias detection
– Distributed compute with
– Policies and rules – AI services – NLU, Sentiment &
Apache Spark
Text analytics, Speech-to-text ,
– Data Profiling & Quality Text-to-Speech, Chat interfaces

[Bedrock] – User Access Management – Manage:- Monitor & Meter – Operator: Install, Patch & Upgrade
Foundational – Security Contexts & RBAC – Scale
– Service Provisioning
services
– Volume Management – Diagnostics – Backup & Migrate
The IBM Data and AI Portfolio
Everything you need for enterprise AI, on any cloud

Pre-built Use Cases


Watson Applications

Prepare Build Run Manage

Watson Watson
Watson Watson
Knowledge Machine
Studio OpenScale
Catalog Learning

Hybrid Data Management Data Ops & Governance


Business and NPS & Db2 Family InfoSphere Family
technical services

Unified Hybrid Data and AI Platform


Cloud Pak for Data

Hyperconverged
System

4
Deployment flexibility
to run anywhere

A true
hybrid multicloud
strategy

Managed by Client

Managed by IBM/Vendor
Cloud Pak for Data v4.0 Packaging

Cloud Pak for Data Base Platform Services Cloud Pak for Data Cartridges

Db2 Warehouse Netezza Performance Server Db2 AESE Master Data Management
Data Virtualization Db2 Big SQL Informix Virtual Data Pipeline (Actifio)
IBM Streams Guardium Integration DataStage OpenPages

Watson Knowledge Catalog (including IGC) Data Management Console Information Server Open Data for Industries

Information Analyzer (included in WKC) Watson Machine Learning– Accelerator Cognos Analytics Knowledge Accelerators

Watson Studio (includes Data Refinery) Data Privacy (Beta)


Planning Analytics Product Master NEW!
Watson Assistant
Watson Machine learning (includes AutoAI ) IBM Match360 with Watson
Watson Discovery
Watson OpenScale SPSS Modeler

Cognos Dashboards Embedded Decision Optimization


NEW! Watson Speech Services
Financial Crimes Insights
Analytics Engine for Apache Spark Hadoop Execution Engine
Financial Services Workbench
Collect Make data simple and
accessible

Cloud Pak for Data:


• Data Virtualization
• Db2 Warehouse
• Performance Server
• Streams
Netezza Performance Server Highlights Business Value
Simplicity Build once, Run anywhere
Minimal administration and tuning Flexible deployment options, no
vendor lock-in, 100% compatibility,
Scalable Hybrid Analytics risk free frictionless migration
petabyte scaling, independently scale
compute and storage in cloud Faster Actionable insights
Blazing speeds, up to 3X
Seamless Data Integration performance and 2X concurrency
Built-in Data Virtualization to in-place improvement over legacy Netezza
connect, manage and query data as one systems

Resiliency for Business Continuity Data Science & ML at Scale


Infrastructure resiliency, backup to object Operationalize AI with built-in
storage, replicated to multiple availability Watson DS/ML and 200+ in-
zones database algorithms

Use cases
On-premise, Cloud or Hybrid environments: Flexibility in choosing deployment and
On Prem IBM Cloud consumption model with License Flexibility that best suit the business needs
AWS Azure
Hybrid scenarios:
Seamless on-ramp to
Managed Cloud ▪ Dev/test environments on Cloud: Easily spin up/down a cloud instance, seamlessly move
data from on-premise to cloud instance with a single command

▪ Disaster recovery on Cloud: Backup to cloud object storage and restore to your Cloud data
warehouse, switch over if disaster strikes

▪ Make the move to Cloud on your own terms: If Cloud is your strategic direction, start
8
small, scale storage and compute independently, when you’re ready
Organize Create a business-ready
analytics foundation

Cloud Pak for Data :


• Watson Knowledge Catalog
(including Information
Analyzer, IGC)
• DataStage
Watson Knowledge Catalog Highlights Business Value
• Enhanced usability and • Speeds up metadata
improved robustness and tuning classification time for
for Data Quality Projects regulations by 90%.

• Infrastructure needs reduced


• Define and manage additional 50% with end-to-end DataOps
custom attributes for custom services on Cloud Pak for Data
and OOTB asset types and up to 158% ROI

• Add user groups as • Productivity increased by 95%


collaborators in catalogs, when Watson Knowledge
categories, workflows and data Catalog and other CP4D
protection rules services are deployed.

Use Cases
• End-to-end data governance - Single integrated solution to serve customers’
data needs from data ingestion, governance, quality and consumption

• Self-service access to trusted data for analytics -Enable data consumers to


use a self-service, integrated experience to search through catalogs,
collaborate with other users and visualize, shape & analyze data

• Support regulatory compliance - Quickly discover and inventory assets into


the catalog, automatically classify and tag them with business terms to
detect sensitive data

10
Link: ibm.biz/wkc-sales-kit
DataStage and Information Highlights Value

Server for Cloud Pak for Data Dynamic configuration for


DataStage and QualityStage jobs
• Up to 30% performance
improvement when executing
flows due to dynamic resource
Modern Flow Designer design allocation
interface with improved
performance • Design environment
performance improvements that
Include unstructured data as part boost user productivity
of data integration flow design
• Leverage existing job designs
Enhancement to Classic Designer with benefits of containerized
(Windows Client) support deployment

What’s new:
Improved performance in a modern interface on key aspects

• Slowly Changing Dimension (SCD) stage – very critical data movement task to
track history of dimension records or structured data – save operation
performance improved by 52% in new interface

• Leading and common targets / sources (Db2, Snowflake, Salesforce) save


operation performance improved between 31 to 48%.

• Add Azure Data Lake Store and Redshift Connector to Flow Designer in Cloud Pak
for Data

• SAP Packs supported via classic designer (windows client)

• Removal of requirement of NodePort entries and addition of end to end encryption


More information here from Windows Client to CP4D cluster 11
Master Data Management Highlights Business Value
Utilize native REST APIs or • Majority of MDM workload are
IBM App Connect connectors read/inquiry type (typical 80-
to accelerate application 95%)
integration
• Accelerated processing time:
Deploy one or more cache 3,000 read side TPS rate
compared to 1,000 TPS rate
instances to service
Update (~ 200% increase)
Custom
er Store
consumers
New Contrac
Lead t • Easy to deploy, offers agility,
Search across entities and and provides more scalability
traverse relationships for new than on-premises software at
Update
Commissio
Cognitive
Enrichme
insights or push data to your an affordable cost
n nt
data warehouse
Update
New New
d

Use cases/Capabilities
Cart Ticket
Invoice

• Provide application developers with an instance of master data for faster time
to market
• Support mobile and online applications requiring extremely low latency &
A modernized MDM isn’t just an investment in MDM’s premium capabilities but an high availability
investment in the Next Generation of MDM, this means operating on a platform • Utilize master data for downstream analytics
with:
• New AI/ML driven entity match engine • Support applications needing more local access to global Master Data
• Auto configuration • Set up data filters and enforce publishing policies to users and geographies as
• Data driven data model definition required
• Completely new UX
• Tight integration with Watson Knowledge Catalog* Link: https://ibm.seismic.com/Link/Content/DCaD39KjB-6USDZCh0Eg1oZw 12
Analyze Build and scale AI with trust
and explainability

Cloud Pak for Data:


• Watson Studio & Watson ML
• AutoAI
• Watson OpenScale
• Decision Optimization
Custom Runtimes Auto AI
Watson Studio and Watson
Machine Learning
Users
Users can
can bring
bring in
in libraries
libraries of
of their
their Use SDK to run AutoAI experiment
choice via custom images to through programming without UI
choice via custom images to
analyze
analyze data,
data, build
build models
models in
in
notebooks Tech Preview Feature: AutoAI
notebooks oror scripts
scripts and
and deploy
deploy in
in support multiple input datasets with
WML.
WML. configurable join relationships
Business Value: provides the
Business Value: Business Value: New features make
extensibility andprovides
flexibilitythe
extensibility and flexibilityteams to AutoAI suitable for automated and
required by data science customized workflow through
required by data science teams to
create AI solutions effectively. programming, saving users time
create AI solutions effectively.
for data preparation when dealing
with multiple datasets

Watson Studio and Watson Machine Learning


• Use Python 3.7.9 version with Notebooks and Scripts to build model and
deploy in production with Watson Machine learning.

• Bitbucket server and self-signed certificates support for Git integration

• Introducing Multi-Cloud Machine Learning to Cloud Pak for Data (Tech Preview)

• Business Value: Keep up to date with the latest innovations in open source for
your AI lifecycle. Use enhanced git integration for collaboration on your data
science projects. Train your machine learning models by utilizing the data
distributed across multiple parties or locations.
AutoAI

weeks

200%

53%

Provided by users Automated by AutoAI Provided to users

Deployable pipeline

Model Hyper Parameter Feature


Raw Labeled Data Prep Model Building
selection Optimization Engineering
Data Set

Finds best Finds top models Optimize on Finds best data Optimize on models Python Notebook
preprocessing selected models transformation after Feature
imputation / encoding sequence Engineering
and scaling strategies

15
Business Value
Watson OpenScale Highlights
• Share Watson OpenScale across
Role based user access
multiple teams of business users
and data scientists with
Explainability enhancements appropriate content and function
including “what if” interactivity visible to each
and improved understandability
• Enable end users of AI
Monitor models for indirect bias applications to better understand
model decisions

• Detects potential fairness issues


due to unseen correlations in
input data

Use Cases
• Monitor production models to ensure accuracy, fairness and
explainability

• Ensure models continue to preform as expected over time by


detecting and evaluating impact of inputs drifting from data used to
build models

• Enable model validators and risk managers to run tests, compare


candidates, document results and determine when AI/ML models are
ready for production.

16
IBM Streams Highlights Business Value
Custom application resource Improved resource utilization
templates by specifying vCPU and memory
requirements for each
Edge Analytics (beta) to analyze application
and act on data where its
created Millisecond latency can be
achieved to act in the moment
Auto-creation of Cloud Pak for
Data service(s) from any point Immediate creation of
in a Streams application OpenShift services for discovery
and access to analytic
applications over standard
REST interfaces

Use Cases and Capabilities


• Intelligently Collect data to analyze, filter and summarize real time
data before landing it in persistent stores

• Agent Assist to convert speech to text and perform natural language


processing to provide recommendations to call center agents

• Real-time situational awareness infused with AI for hospital patients,


manufacturing devices and automobiles to improve operations

• Geospatial analytics to create alerts when people or things enter an


area of interest for marketing or safety concerns
Link: Streams Seismic Sales Kit 17
Infuse Operationalize AI
throughout the business

Cloud Pak for Data:


• Cognos Analytics
• Watson Apps
• Financial Crime Insight
Cognos Analytics Highlights Business Value
Save time with automated data Empowers users with AI-
preparation, data discovery and infused self-service capabilities
dynamic visualizations and find
deeper insights faster Easily visualize data and share
insights across your team to
Execute complex queries faster and
drive confident decisions
analyze data where it resides using
data virtualization
Reduce the complexity of
Create an analytics foundation by deploying and managing a BI
integrating business intelligence environment to meet your users
predictive, prescriptive and business needs
planning analytics

Use cases/Capabilities
• Get the self-service you expect, the data governance you require, and
the reporting you trust, with a secure business intelligence platform

• Deploy one AI-infused business intelligence platform for all analytics


use cases, from marketing campaign performance to human resources
analysis, customer sentiment analysis to sales pipeline analysis

• Ensure Managed Reporting Production workload SLAs are met with


confidence using modern container architecture

• Enable the most secure and compliant strategy with data governance
Link: < to the detailed
for managed reportingdeep diveexploration
and data deck or recording>
19
Planning Analytics Differentiators Business Value
Adjust financial plans in real • 63% time saved in
time across departments completing annual budgeting
cycles
Protect your investment in
Microsoft Excel while • 80% faster planning system
transcending limitations of processing
spreadsheets
• 20% time saved in
Uncover deep insights through completing forecasts
AI-infused planning, without the
need for help from a data
scientist

Plan for anything, be ready for everything


• Steer business performance by bridging operations and finance for
any department allowing you to adapt to changing business
conditions

• See impact before executing – explore what-if scenarios and assess


impact to determine the best course of action

• Make changes in real-time – pivot plans, budgets, and forecasts


quickly to meet changing demands and priorities

20
Watson pre-
built Apps on Watson Assistant Watson Assistant Voice Interaction
Cloud Pak for
Data

Watson Discovery
Includes:
• Content Mining
• Content Intelligence
• Watson Knowledge Studio

Watson API Kit


Speech to Text, Text to Speech, NLU, WKS, Language Translator
Thank You!

IBM Cloud / April 2019 / © 2019 IBM Corporation

You might also like