Scaling Generative AI with watsonx - The power of choice and flexibility Slides PDF
• Granite models
• Use Cases
• Win Stories
• A look ahead
watsonx.ai Model PoV
2024, the breakout year for enterprise Generative AI adoption

Surpassed the experimental phase
55% of organizations are already in piloting or production mode with generative AI, reveals a recent Gartner poll.¹

Driver of growth and competitive differentiation
75% of CEOs believe generative AI is a source of competitive advantage, and 50%² are now integrating the tech into their products and services.

Governance, Risk and Compliance are key concerns
58% of business executives think major ethical risks abound with generative AI.³
The Challenge
CEOs say they feel over 6x more pressure from their boards and investors to accelerate generative AI adoption rather than to slow it down.
79% of business executives say AI ethics is important to their enterprise-wide AI approach.
Client Needs
As clients move from exploration to investigation and production with generative AI, they are looking for the right model choices, a robust platform to infuse AI into applications, and a reliable partner who can help scale and operationalize AI with minimal risks.

Enterprise-grade foundation models
• ‘Trust’ in AI models: Hinges on transparency in data management, methodical training procedures and rigorous evaluation standards.
• ‘Performance’: Deliver optimal performance measures for accuracy and latency with targeted enterprise business domains and use cases.
• ‘Cost-Effective’: Achieve lower inferencing costs and total cost of ownership while meeting performance requirements.

Trusted platform to scale AI with confidence
• Model customization: Tailor models with proprietary data and expertise to unique use cases, company and industry domain, with easy integrations to ready-to-use AI apps.
• Robust governance: Make AI safe and secure at scale with AI guardrails, continuous risk monitoring and integrated governance.
• Flexible deployment: Work with the infrastructure of choice with hybrid multi-cloud and on-prem options to avoid vendor lock-in and reduce total cost of ownership.

Reliable partner with deep AI expertise
• Enterprise AI leader: A partner who has a successful track record of bringing AI technologies and solutions to Fortune 500 clients and partners.
• Champion of Responsible AI: Thoughtfully applies AI Ethics, privacy and regulatory preparedness across the generative AI lifecycle.
IBM's differentiated approach to delivering enterprise-grade foundation models
Feedback from clients and AI communities rolls up
ENTERPRISE PLATFORM: Scale generative AI for targeted business domains and use cases on watsonx with trust and confidence.
IBM watsonx.ai Foundation Models Library – available today
IBM Granite | Llama 3 models | LAB Aligned models
Model in deprecation
What is IBM Granite? Trusted, performant, cost-effective AI foundation models purpose-built for enterprises.
(v1 breakdown)
Trusted
developed LLM alignment technique
→ LAB (Large-scale Alignment for chatBots) is a new training paradigm for LLMs in which IBM performs very large-scale targeted alignment on Granite LLMs.
Impact:
→ Dramatic improvement in Granite RAG performance
[Chart: y-axis Accuracy]
IBM is actively engaging with enterprise clients across a broad set of business
Business areas: Customer Service; HR, Finance, and Supply Chain; IT Operations

• Customer service: Empower customers to find solutions with easy, compelling experiences.
  → Scale live viewing experiences cost effectively
• HR automation: Reduce manual work and automate recruiting, sourcing and nurturing job candidates.
  → Process planning data up to 80% faster
• App modernization, migration: Generate code, tune code generation response in real time.
  → Reduce application support tickets by 70%
• Threat management: Reduce incident response times from hours to minutes or seconds.
  → Faster and less expensive drug discovery
• Knowledge worker: Enable higher value work, improve decision making, and increase productivity.
  → Reduce 90% of text reading and analysis work
• Regulatory compliance: Support compliance based on requirements / risks, proactively respond to regulatory changes.
  → Reduce time spent responding to issues
• Data platform engineering: Redesign the approach for data integration using generative AI.
  → Reduce data integration time by 30%+
• Environmental intelligence: Provide intelligence to proactively plan and manage impact of severe weather and climate.
  → Increase manufacturing output by 25%

Find more Client Stories & Use-Cases on Seismic!
Source: IBM internal data
Use case: App Modernization

Challenge
A fintech startup and IBM Business Partner headquartered in Sweden, Edger Finance aims to be the go-to solution that investors can use to navigate the stock market and make better investment decisions.
In 2023, Edger joined the IBM® Fintechx program and began collaborating with IBM Client Engineering and the IBM Innovation Studio. The goal for the engagement was to strengthen the firm’s processes and platform by piloting generative AI (gen AI).

Solution
The collaboration resulted in the creation of three AI-assisted processes that are offered in Swedish and English and were explored during a four-week minimum viable product (MVP) pilot:
– The first accelerates and simplifies the creation of a CEO summary from corporations’ quarterly reports.
– The second automates the extraction of data points that are within each report.
– The third allows investors to interact with the data in the report through a question-answer chat flow.

Result
90% improvement in the turnaround time for quarterly report data extracts

Case study
Use case: App Modernization

Challenge
Dun & Bradstreet, a leading global provider of business decisioning data and analytics, seeks to build AI use cases, implement watsonx, and develop applications that help address employee productivity, enhance customer experiences, mitigate business-to-business risks, automate workflows, and optimize efficiency.

Solution
Dun & Bradstreet and IBM have announced a strategic collaboration that will bring together Dun & Bradstreet’s Data Cloud and IBM’s watsonx to help organizations responsibly expand their use of generative AI. Dun & Bradstreet also intends to leverage watsonx for its workflows and solutions, supported by IBM Consulting.

Result
Minutes instead of days for the procurement process using Ask Procurement

Press release
Challenge
Each support representative at Sicredi is responsible for answering questions on a wide range of products. When a member reaches out in person or over the phone, the support rep is accountable for promptly and thoroughly resolving their query. Given the wide range of products they support, these representatives rely on a digital assistant to compile information to answer each member query. Given the previous configuration of the assistant, support representatives often needed to escalate a query to a product specialist in order to get it fully resolved. This contributed to longer wait times for members and a frustrating experience for support reps.

Solution
Sicredi chose to partner with IBM® Client Engineering to augment its support representatives’ efforts using generative AI. Sicredi spent three weeks co-creating the new assistant with IBM, and then spent 20 days testing it. Because the new assistant is enabled by the IBM watsonx.ai, IBM Watson® Discovery and IBM watsonx Assistant solutions, Sicredi’s team can submit a wide range of questions (varying in complexity) in natural language. Then the assistant will query Sicredi’s support documentation and generate an answer within a matter of seconds.

Results
• 10%–12% improvement in query resolution without escalation
• Seconds for new assistant to generate an answer
• 8% decrease in abandoned support calls

Case study
Client testimonial
watsonx.ai
Train, validate, tune, and deploy AI models
A next generation enterprise studio for AI builders to train, validate, tune, and deploy generative AI, foundation models, and machine learning capabilities.
Note: Available in software as of 4/28, targeted for SaaS in June
bloom    N/A  Yes
codegen  N/A  No
falcon   N/A  Yes
Coming soon
• Additional API endpoints for similarity search and reranking
• LangChain and additional orchestration framework integration support
• Multilingual Slate embeddings models (Q3), and other model additions
• Fine-tuning and BYOM support for embedding models

1. IAM authorization token for IBM Cloud
2. String inputs within the request body
3. Embeddings model ID specification
4. Watsonx.ai Project/Spaces ID for resource association
5. Response:
   a) Embeddings for each input string
   b) Token count for consumption tracking

Model                  Origin       Context length  Dim   Price per 1k tokens
bge-large-en-v1.5      Open Source  512             1024  $0.0001
multilingual-e5-large  Open Source  512             1024  $0.0001
all-MiniLM-L12-v2      Open Source  512             384   $0.0001
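The numbered callouts for the embeddings request (1 to 5) and the per-token price can be illustrated with a minimal Python sketch. It only assembles the request and parses a response; it sends nothing over the network. The endpoint URL, the JSON field names (`inputs`, `model_id`, `project_id`, `results`, `embedding`, `input_token_count`), and the helper function names are assumptions made for illustration and are not taken from the slides. The model ID string is copied from the table as written and may differ from the identifier the service expects.

```python
import json

# Assumed endpoint shape (region, path and version date are illustrative only).
EMBEDDINGS_URL = "https://us-south.ml.cloud.ibm.com/ml/v1/text/embeddings?version=2023-10-25"

# Price per 1k tokens as listed in the pricing table above.
PRICE_PER_1K_TOKENS = 0.0001


def build_embeddings_request(iam_token, inputs, model_id, project_id):
    """Assemble headers and JSON body for callouts 1 to 4 (hypothetical field names)."""
    headers = {
        "Authorization": f"Bearer {iam_token}",  # 1. IAM authorization token
        "Content-Type": "application/json",
        "Accept": "application/json",
    }
    body = {
        "inputs": list(inputs),    # 2. string inputs in the request body
        "model_id": model_id,      # 3. embeddings model ID
        "project_id": project_id,  # 4. project ID for resource association
    }
    return headers, body


def parse_embeddings_response(payload):
    """Callout 5: return one embedding per input string plus the token count."""
    vectors = [item["embedding"] for item in payload["results"]]
    return vectors, payload["input_token_count"]


def estimate_cost(token_count, price_per_1k=PRICE_PER_1K_TOKENS):
    """Estimated charge in dollars for a given number of consumed tokens."""
    return token_count / 1000 * price_per_1k


if __name__ == "__main__":
    headers, body = build_embeddings_request(
        "<IAM_TOKEN>",
        ["What is watsonx.ai?", "Granite models"],
        "bge-large-en-v1.5",
        "<PROJECT_ID>",
    )
    print("POST", EMBEDDINGS_URL)
    print(json.dumps(body, indent=2))

    # Mock response in the assumed shape, standing in for a real service call.
    mock_response = {
        "results": [{"embedding": [0.1, 0.2]}, {"embedding": [0.3, 0.4]}],
        "input_token_count": 12,
    }
    vectors, tokens = parse_embeddings_response(mock_response)
    print(len(vectors), "embeddings,", tokens, "tokens,", estimate_cost(tokens), "USD")
```

Keeping the request assembly separate from the transport makes it easy to check the payload before wiring in an HTTP client, and the returned token count feeds directly into the cost estimate.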
watsonx.ai Chat Mode
watsonx.ai Taxonomy Explorer
• Provides a degree of transparency to the model’s training and potential behavior
Looking ahead…
• Tuning enhancements
• Enhanced AI-builder / developer experience (e.g. Node.js support, additional SDK integrations)
• Exciting new LAB features from IBM Research – Announcements coming on May 6th and at THINK
https://www.ibm.com/events/think
https://www.ibm.com/community/ibm-techxchange-conference/