AI Fundamentals

You might also like

You are on page 1of 881

What is Artificial

Intelligence?
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
What is AI?

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is AI?

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is AI?

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is AI?

UNDERSTANDING ARTIFICIAL INTELLIGENCE


First things first: Computer Science
The group of technical knowledge needed for the automatic processing of information by
computers: hardware, software, data, networks, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


First things first: Computer Science
The group of technical knowledge needed for the automatic processing of information by
computers: hardware, software, data, networks, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


First things first: Computer Science
The group of technical knowledge needed for the automatic processing of information by
computers: hardware, software, data, networks, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Artificial Intelligence (AI)

Machines that learn to mimic reasoning, decision-making, and in general exhibit some
degree of human-like intelligence to solve a problem.

EU Commission, 2019:

"Systems that, given a goal, perceive their environment, interpret the collected data, reason
to derive knowledge, and decide the best action(s) to achieve the goal".

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI vs Artificial General Intelligence

Artificial Intelligence (AI) Artificial General Intelligence (AGI)

Perceives, interprets and learns from data. Equals or exceeds average human
Reasons and makes decisions intelligence

Excels at solving specific tasks Solves a breadth of tasks intelligently

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI vs Artificial General Intelligence

Examples of AI "Halfway" examples towards AGI

Voice assistants Self-driving cars

Facial recognition AlphaGo

Personalized recommendations Generative AI: Language Models (e.g. GPT)

Autonomous industrial robots

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
What AI can -and
cannot- do?
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
Things AI can do

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Predictions and inference
Machine Learning: Learn from data how to
make predictions or inferences.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Predictions and inference
Machine Learning: Learn from data how to
make predictions of inferences.

Predictions: forecasting what will happen in


the future, e.g. weather forecast.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Predictions and inference
Machine Learning: Learn from data how to
make predictions of inferences.

Predictions: forecasting what will happen in


the future, e.g. weather forecast.

Inference: determine output based on data


inputs (predictors), e.g. books you may like.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Pattern recognition
Identify patterns in the data to help make
decisions:

Predictions and inference

Clustering (segmentation)

Anomaly detection

Data generation (Generative AI)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Optimization
Find the best possible solution for a
problem at a minimum cost, under
constraints.

Logistics and delivery: smart routing

Energy: Power grid operation and control

Tourism: flights and hotel pricing

Marketing: maximum-revenue campaigns

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Automation
Automation: follow set of rules to perform
(usually repetitive) tasks.

Classifying documents, photos, etc.

Job application screening

Parcel management robots

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Limitations of AI
Social skills: emotional intelligence, empathy New, unseen situations, e.g. new items to
recommend

Bias: making unfair decisions to some groups Data ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Areas and related
disciplines of AI
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
Subdomains of Artificial Intelligence

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Subdomains of Artificial Intelligence
Machine Learning: Learn from data;
predictions, inference

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Subdomains of Artificial Intelligence
Machine Learning: Learn from data;
predictions, inference
Deep Learning: neural networks; solve
most challenging AI problems

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Subdomains of Artificial Intelligence
Machine Learning: Learn from data;
predictions, inference
Deep Learning: neural networks; solve
most challenging AI problems
Knowledge representation and reasoning:
reason, communicate with other AI systems

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Subdomains of Artificial Intelligence
Machine Learning: Learn from data;
predictions, inference
Deep Learning: neural networks; solve
most challenging AI problems
Knowledge representation and reasoning:
reason, communicate with other AI systems

Robotics: act and manipulate physical


environment

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Subdomains of Artificial Intelligence
Machine Learning: Learn from data;
predictions, inference
Deep Learning: neural networks; solve
most challenging AI problems
Knowledge representation and reasoning:
reason, communicate with other AI systems

Robotics: act and manipulate physical


environment

Computer Vision: visually perceiving


objects in the environment

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Subdomains of Artificial Intelligence
Machine Learning: Learn from data;
predictions, inference
Deep Learning: neural networks; solve
most challenging AI problems
Knowledge representation and reasoning:
reason, communicate with other AI systems

Robotics: act and manipulate physical


environment

Computer Vision: visually perceiving


objects in the environment

Natural Language Processing: analyze,


understand, communicate human language

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Subdomains of Artificial Intelligence
Machine Learning: Learn from data;
predictions, inference
Deep Learning: neural networks; solve
most challenging AI problems
Knowledge representation and reasoning:
reason, communicate with other AI systems

Robotics: act and manipulate physical


environment

Computer Vision: visually perceiving


objects in the environment

Natural Language Processing: analyze,


understand, communicate human language

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Examples of AI applications
Personalized product recommendations Warehouse management

Machine Learning Robotics, Computer Vision, Reasoning

Medical diagnosis Smart voice assistants

Computer Vision, Deep Learning NLP, Deep Learning

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Related disciplines

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Related disciplines

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Related disciplines

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Related disciplines

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Video takeaways
AI is an umbrella discipline with several popular areas.

Present AI systems and applications combine principles from multiple areas.

Math, Data Science, and Statistics are closely related disciplines to AI.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Algorithms and AI
systems demystified
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
What is an algorithm?
Algorithm: a set of (computer) instructions
to solve a problem or perform an action.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is an algorithm?
Algorithm: a set of (computer) instructions
to solve a problem or perform an action.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is an algorithm?
Algorithm: a set of (computer) instructions
to solve a problem or perform an action.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is an algorithm?
Algorithm: a set of (computer) instructions
to solve a problem or perform an action.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is an algorithm?
Algorithm: a set of (computer) instructions
to solve a problem or perform an action.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Algorithms in Computer Science vs AI algorithms

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Algorithms in Computer Science vs AI algorithms

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Algorithms in Computer Science vs AI algorithms

AI algorithms: learn by themselves to produce better outputs or processes from input data

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is an AI system?
AI system: infrastructure and components needed to implement and deploy AI algorithms in
the real world

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is an AI system?
AI system: infrastructure and components needed to implement and deploy AI algorithms in
the real world

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What is an AI system?
AI system: infrastructure and components needed to implement and deploy AI algorithms in
the real world

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Acquiring data
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data acquisition: sensing the environment
Collect outside sensory information
through sensors: mimic human senses

Transform perceptions into data

Occurs in:

NLP and audio: capturing speech, sounds.

Computer Vision: satellite images,


fingerprint, etc.

Robotics and sensors: temperature, touch,


motion, gravity, etc.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data acquisition: datasets
Dataset collection of data: data samples or instances of a given type of data

Structured: tabular format, spreadsheets

Unstructured: images, audio, videos, text, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data acquisition: datasets
Dataset collection of data: data samples or instances of a given type of data

Structured: tabular format, spreadsheets

Unstructured: images, audio, videos, text, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data acquisition: datasets
Dataset collection of data: data samples or instances of a given type of data

Structured: tabular format, spreadsheets

Unstructured: images, audio, videos, text, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data acquisition: datasets
Dataset collection of data: data samples or instances of a given type of data

Structured: tabular format, spreadsheets

Unstructured: images, audio, videos, text, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Learning from data
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Enter Machine Learning (ML)
Machine Learning: learn from data and identify patterns to perform inference tasks:
predictions, classifications, clustering, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Enter Machine Learning (ML)
Machine Learning: learn from data and identify patterns to perform inference tasks:
predictions, classifications, clustering, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Enter Machine Learning (ML)
Machine Learning: learn from data and identify patterns to perform inference tasks:
predictions, classifications, clustering, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Enter Machine Learning (ML)
Machine Learning: learn from data and identify patterns to perform inference tasks:
predictions, classifications, clustering, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Enter Machine Learning (ML)
Machine Learning: learn from data and identify patterns to perform inference tasks:
predictions, classifications, clustering, ...

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Supervised Learning: classification
Classification: assign each data observation the category (class) it may belong to

Binary classification: two classes, e.g. positive/negative, male/female, etc.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Supervised Learning: classification
Classification: assign each data observation the category (class) it may belong to

Binary classification: two classes, e.g. positive/negative, male/female, etc.

Multi-class classification: several mutually exclusive classes, e.g. multiple species


Supervised learning: Data annotation (getting labelled observations with known class a
priori) needed to learn/train a model capable of making inference

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Machine Learning algorithm vs Model

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Machine Learning algorithm vs Model

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Supervised Learning: regression and forecasting
Regression: assign each data observation a numerical output or label based on its inputs

Time series forecasting: predict future values of variable, based on its past behavior

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Unsupervised and reinforcement learning
Clustering: find subgroups of data with similar Anomaly detection: detecting abnormal data
characteristics (e.g. k-means algorithm) observations e.g. unusual card transactions

Association rule discovery: find common co- Reinforcement learning: learn by experience
occurrences of items in transaction data (trial and error) to master a complex task

UNDERSTANDING ARTIFICIAL INTELLIGENCE


How about Deep Learning?
Highly sophisticated models based on deep neural networks: solve very challenging tasks
where classical ML models become limited.

Need a lot of data to learn: sometimes millions of observations.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


How about Deep Learning?
Highly sophisticated models based on deep neural networks: solve very challenging tasks
where classical ML models become limited.

Need a lot of data to learn: sometimes millions of observations.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Interacting with the
Environment
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI functions and areas involved

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Robotics
Sensing and perception: collecting data or perceiving signals
Mobility: moving in the environment guided by perceptions of surroundings

Manipulation: the robot modifies its environment

Human-robot interaction: e.g. conversational robots endowed with NLP

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Computer Vision
Image processing: intelligently enhance images and video

Object detection: identify subjects in images/video for surveillance, logistics, etc.

Motion analysis: extract motion information like speed and direction of objects

Image and video generation: create realistic visual data from human text

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Natural Language Processing (NLP)
Text-based
Text classification

Sentiment analysis: extract positive and


negative feelings in text, e.g. customer
reviews.

Question answering (chatbots)

Text summarization

Speech-based
Text-to-speech

Speech-to-text

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Chapter summary
Takeaways from this chapter:

Algorithms are the building blocks of AI systems, along with data, hardware and other
components

Acquiring data, learning and reasoning from data, and interacting with the environment,
are three key functions in AI systems

Data collection into datasets are the fuel of most AI systems, especially those guided by
Machine Learning and Deep Learning

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Establishing an AI
culture
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
The value of AI in organizations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


The value of AI in organizations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


The value of AI in organizations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


The value of AI in organizations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


The value of AI in organizations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


The value of AI in organizations

AI for personalization example: identifying customers' shopping habits leads to more loyalty
and increased sales.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

1. Roadmap: obtain leadership support and a clear vision for AI adoption

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

1. Roadmap: obtain leadership support and a clear vision for AI adoption

2. Data strategy: plan to collect, use, and govern data for AI

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

1. Roadmap: obtain leadership support and a clear vision for AI adoption

2. Data strategy: plan to collect, use, and govern data for AI

3. Infrastructure resources: scalable computing infrastructure and AI tools

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

1. Roadmap: obtain leadership support and a clear vision for AI adoption

2. Data strategy: plan to collect, use, and govern data for AI

3. Infrastructure resources: scalable computing infrastructure and AI tools

4. Roles: talented AI, Machine Learning, and Data Science roles

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

5. Collaboration: cross-functional AI projects

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

5. Collaboration: cross-functional AI projects

6. Success: define and pursue success aims, e.g. customer-centric, impact on revenue, etc.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

5. Collaboration: cross-functional AI projects

6. Success: define and pursue success metrics, e.g. customer-centric, impact on revenue, etc.

7. AI & Data literacy: continuous AI and data evangelization for everyone

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization

5. Collaboration: cross-functional AI projects

6. Success: define and pursue success metrics, e.g. customer-centric, impact on revenue, etc.

7. AI & Data literacy: continuous AI and data evangelization for everyone

8. Responsible AI: ethical, secure, and accountable use of AI and data

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building an AI-driven organization
8 elements, 3 dimensions

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI-driven organization: roadmap

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI-driven organization: roadmap

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI-driven organization: roadmap

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI-driven organization: roadmap

Example: insurance company AI roadmap

1. Objective: efficient claim processing

2. Resources: data scientists; ML experts;


cloud infrastructure; customer, policy and
claim data
3. Implementation: ML model for automated
fraud detection and claim classification,
extendable to customer service

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Data strategy,
resources, and
people
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
Data strategy and governance
Data strategy: design and development of
data-centric approaches for information
extraction and business decision-making

Data strategy steps:

1. Setting data-oriented objectives

2. Find out necessary data

3. Determine data sources and types

4. Predictive and prescriptive analysis

5. Operationalize data-driven processes

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Resources: AI infrastructure
Cloud-based AI infrastructure On premises (self-hosted) AI infrastructure

Scalable computing resources, data Organizations own their hardware


storage, AI & ML development tools and software, data, and network resources to
pre-built models. Elastic, on-demand support AI operations

Pros: High scalability, Cost-effectiveness Pros: Enhanced data control, lower latency
Cons: Data location, Internet needed Cons: Upfront costs, limited scalability
1 Left image: Google Cloud Platform, Microsoft Azure, and Amazon Web Services logos

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Resources: MLOps methodology
Machine Learning Operations (MLOps): efficient and reliable management and operation
of ML (AI) systems in the enterprise

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Resources: MLOps methodology
Machine Learning Operations (MLOps): efficient and reliable management and operation
of ML (AI) systems in the enterprise

UNDERSTANDING ARTIFICIAL INTELLIGENCE


People: AI-related roles
AI Architect Data Scientist

Machine Learning and Data Engineer Others: AI Ethicist, Project Manager

1 Icon made by Freepik, juicy_fish, deemakdaksina from www.flaticon.com

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building your AI team
Leadership and management

AI manager / team lead

AI project manager(s)
Execution & MLOps

AI architects

Data scientists

ML & data engineers

Support

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building your AI team
End-to-end data scientists: responsible for whole MLOps lifecycle, over-ambitious skills

Dedicated teams: Dev + Ops teams, strong communication and collaboration needed

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Building your AI team
Leadership and management

AI manager / team lead

AI project manager(s)
Execution & MLOps

Data scientists

AI architects

ML & data engineers

Support

AI ethicist; domain experts.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Is your deployed AI
system successful?
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


AI course instructor, DataCamp
When to measure success?

UNDERSTANDING ARTIFICIAL INTELLIGENCE


When to measure success?

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Measuring performance offline - accuracy

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Measuring performance offline - accuracy

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Measuring performance offline - accuracy

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Measuring performance offline - accuracy

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Measuring performance offline - accuracy

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Beyond accuracy - error and other metrics

Metrics for search and recommendation engines: ranking quality -relevance of ranking items
to the user-, diversity in search results or recommendations, etc.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Measuring success in production
AI/ML metrics: accuracy, error, relevance, diversity, ...
Model degradation: the measured metric value gets worse over the time

Business metrics: Key Performance Indicators (KPIs)


Indicator of performance and progress of organization objectives
Example KPIs: conversion rate, satisfaction (retail) ; turnaround time (healthcare)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Risks: what could possibly go wrong?
Possible risks include:

Data bias

Lack of transparency
Ethical concerns

Dubious system reliability

Vulnerability to cyber threats

Proof-of-Concept (PoC):

Pilot demonstrator to validate feasibility and potential value + early risk identification

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Challenges and
success stories
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
Challenges

Challenges to build an AI-driven organization

Resources: people, infrastructure, budget

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Challenges

Challenges to build an AI-driven organization

Data: availability, quality, governance, privacy

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Challenges

Challenges to build an AI-driven organization

Culture: rigid mindset, siloed operations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Challenges

Challenges to build an AI-driven organization

Awareness: "Why AI is critical to the business?"

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Success stories: Google
Challenge:

Data quality and accessibility issues

Solution:

Data governance frameworks and data


integration strategies, to leverage large
volumes of data effectively

1 More info: https://www.youtube.com/watch?v=iCVJdFedSv4

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Success stories: Airbnb
Challenge:

Talent needed to become AI-driven

Solution:

Talent acquisition and talent development


through upskill training in AI and ML

1 More info: https://www.linkedin.com/pulse/what-made-airbnb-data-team-special-5-traits-i-look-when-claire-


lebarz/

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Success stories: IBM
Challenge:

Address ethical and regulatory AI issues

Solution:

AI Ethics Board for responsible AI,


guidelines to mitigate algorithmic bias,
engagement with policymakers

1 More info: https://www.ibm.com/downloads/cas/4DPJK92W

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Success stories: Netflix
Challenge:

Large-scale computing infrastructure


needed

Solution:

Cloud infrastructure investments, AI tools


for recommendation, data processing
workflows

1 More info: https://valohai.com/blog/building-machine-learning-infrastructure-at-netflix/

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Democratizing
Artificial Intelligence
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
AI democratization
AI is deeply impacting our lives

How to bring AI benefits to everyone and eliminate its potentially harmful side?

Access to:

Use of AI-based systems and solutions

Design of AI tools anyone can effortlessly use to supplement their tasks

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI literacy
AI literacy: individuals and organizations'
understanding of AI concepts,
technologies, and their implications in:

Organizations

Society, economy and the environment

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI literacy
AI literacy: individuals and organizations'
understanding of AI concepts,
technologies, and their implications in:

Organizations

Society, economy and the environment

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI literacy
AI literacy: individuals and organizations'
understanding of AI concepts,
technologies, and their implications in:

Organizations

Society, economy and the environment

UNDERSTANDING ARTIFICIAL INTELLIGENCE


How AI literacy contributes to AI democratization?
Empowered individuals: equipped with Ethics awareness: fairness, privacy,
knowledge and skills to engage with AI transparency, responsible AI

Inclusive participation: engage in AI-related Critical thinking: ability to evaluate AI


activities and participatory decisions systems and make informed judgments

1 Icons made by Paul J.& Freepik from www.flaticon.com

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data democratization
In organizations
Make information underlying data accessible to all roles
Competitive market advantage

Optimizing activities
Proactive strategic mindset

Data upskilling is crucial

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data democratization
In society
Make information accessible to individuals

Enable access, use and contribution to data-driven insights, through:


Open data policies and data-sharing
Data visualization and literacy

Empower communities with data

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Explainability and
interpretability
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
Explainability and interpretability
Explainability: humans' ability to access and understand AI outputs, e.g. predictions, decisions

Interpretability: understand AI systems' internal processes: algorithm, model, data workflow

UNDERSTANDING ARTIFICIAL INTELLIGENCE


White-box vs black-box AI systems
White-box: transparent and easily interpretable models/systems

UNDERSTANDING ARTIFICIAL INTELLIGENCE


White-box vs black-box AI systems
White-box: transparent and easily interpretable models/systems

UNDERSTANDING ARTIFICIAL INTELLIGENCE


White-box vs black-box AI systems
White-box: transparent and easily interpretable models/systems

UNDERSTANDING ARTIFICIAL INTELLIGENCE


White-box vs black-box AI systems
Black-box: higher complexity, little or no degree of understandability

UNDERSTANDING ARTIFICIAL INTELLIGENCE


White-box vs black-box AI systems
Black-box: higher complexity, little or no degree of understandability

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Basic Explainable AI (XAI) tools
XAI: methods and tools to increase AI systems and models' transparency and explainability

Model introspection: examining internal model parameters to understand decisions

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Basic Explainable AI (XAI) tools
XAI: methods and tools to increase AI systems and models' transparency and explainability

Model introspection: examining internal model parameters to understand decisions


Model documentation: shareable architecture and design considerations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Basic Explainable AI (XAI) tools
XAI: methods and tools to increase AI systems and models' transparency and explainability

Model introspection: examining internal model parameters to understand decisions


Model documentation: shareable architecture and design considerations

Model visualization: human-friendly representation of data features and model outputs


1 Heatmap source: https://towardsdatascience.com/

UNDERSTANDING ARTIFICIAL INTELLIGENCE


XAI tools: feature importance
Feature importance: impact or contribution of
features (predictors) in model outputs

Understand how data-driven models


(ML/DL) make decisions

Detect and mitigate issues, e.g. biases

Impact on model performance if a feature


were removed

SHAP (SHapley Additive exPlanations)

Feature importance visualizations toolbox

UNDERSTANDING ARTIFICIAL INTELLIGENCE


XAI tools: feature importance
SHAP (SHapley Additive exPlanations)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


XAI tools: feature importance
SHAP (SHapley Additive exPlanations)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


XAI tools: feature importance
SHAP (SHapley Additive exPanations)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Practical implications of XAI
Algorithmic transparency: Ethical considerations:
How algorithms process data and make XAI to address ethical AI concerns:
decisions biases, discrimination, compliance, etc.

Local and global interpretability: Human-AI collaboration:


Understand system behavior for a Reliable collaboration based on trust and
specific prediction, vs feedback

Understand system overall behavior on a


dataset or problem

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Social challenges:
ethics, fairness and
privacy
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
Responsible AI
Responsible AI: Ethical and accountable development and use of AI systems, with regard to
societal impact

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Responsible AI
Responsible AI: Ethical and accountable development and use of AI systems, with regard to
societal impact

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Responsible AI
Responsible AI: Ethical and accountable development and use of AI systems, with regard to
societal impact

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Responsible AI
Responsible AI: Ethical and accountable development and use of AI systems, with regard to
societal impact

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Responsible AI
Responsible AI: Ethical and accountable development and use of AI systems, with regard to
societal impact

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Responsible AI
Responsible AI: Ethical and accountable development and use of AI systems, with regard to
societal impact

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Responsible AI
Responsible AI: Ethical and accountable development and use of AI systems, with regard to
societal impact

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Ethics and fairness
AI ethics: adhere to ethical guidelines and
principles:

Fairness

Transparency

Privacy

Accountability

Liability for AI decisions

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Ethics and fairness
AI ethics: adhere to ethical guidelines and
principles.

Fairness

Transparency

Privacy

Accountability

Liability for AI decisions

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Ethics and fairness
AI ethics: adhere to ethical guidelines and
principles.

Fairness

Transparency

Privacy

Accountability

Liability for AI decisions

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Bias in AI systems: examples
Screening job resumes
Biased training data: mostly male hirings

Unfair treatment of female candidates

Solutions: active data collection, bias-


correction algorithms

E-commerce recommendations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Bias in AI systems: examples
Screening job resumes
Biased training data: mostly male

Unfair treatment of female candidates

Solutions: active data collection, bias-


correction algorithms

E-commerce recommendations
Popular products are overly promoted

New or different products are disregarded

Solutions: techniques and metrics for


diverse and fair recommendations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data privacy in AI systems
Data privacy: safeguarding sensitive or personal information from unauthorized access and
misuse

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data privacy in AI systems
Data privacy: safeguarding sensitive or personal information from unauthorized access and
misuse

1 GDPR: General Data Protection Regulation. CCPA: California Consumer Privacy Act.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Data privacy in AI systems
Data privacy: safeguarding sensitive or personal information from unauthorized access and
misuse

1 GDPR: General Data Protection Regulation. CCPA: California Consumer Privacy Act.

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
Social challenges:
the future of AI
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
How AI may shape our present (and future) society?
Healthcare: advanced diagnosis, personalized Governments and Law: Generative AI; new
treatments, surgical robots, etc. regulations about responsible use

Finance and cybersecurity: risk management, Sustainable Development Goals (SDGs)


fraud detection

1 Icon made by mynamepong, surang (flaticon.com). Image generated in https://stablediffusionweb.com/

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI and sustainability

1 Source: United Nations (https://sdgs.un.org/goals)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI and sustainability

1 Source: United Nations (https://sdgs.un.org/goals)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI and the future of workforce
Challenges:

Job displacement, rapidly evolving skillsets

Example: Large Language Models (e.g.


ChatGPT), prompt engineering skills
Opportunities:

New jobs, e.g. AI ethicist, AI educator


Industry transformation

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI and the future of education
Challenges:

Skills gap: educational institutions need


to keep up to date with relevant AI
training

Digital divide: ensure universal access to


AI-powered education

Opportunities:

Personalized learning

Automation of time-consuming
administrative tasks

UNDERSTANDING ARTIFICIAL INTELLIGENCE


AI and the future of the environment
Challenges:

Ecological degradation: energy


consumption, electronic waste, carbon
footprint

Opportunities:

Understand climate change

Optimize use of natural resources

Optimize renewable energy use

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Let's practice!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
One journey ends,
another begins
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E

Iván Palomares Carrascosa


Senior Data Science & AI Manager
Chapter 1: What is Artificial Intelligence

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Chapter 2: Tasks AI can solve

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Chapter 3: Harnessing AI in organizations

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Chapter 4: The human side of AI

UNDERSTANDING ARTIFICIAL INTELLIGENCE


What to learn next?
AI Essentials Skill Track
An exciting 6-course pathway to consolidate your AI literacy.

Implementing AI Solutions in Business


From use cases to proofs-of-concept, explore AI system deployment in business.

Artificial Intelligence (AI) Concepts in Python


A gentle and practical introduction to implementing AI and ML systems.

1 Image by upklyak (www.freepik.com)

UNDERSTANDING ARTIFICIAL INTELLIGENCE


Congratulations!
U N D E R S TA N D I N G A R T I F I C I A L I N T E L L I G E N C E
What is ChatGPT?
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
What is ChatGPT?

AI Chatbot application:

Answer questions

Perform tasks

User-inputted text

INTRODUCTION TO CHATGPT
What is ChatGPT?
Traditional chatbots

Predetermined responses

Limited questions

ChatGPT

More generalizable

Uses its understanding of language to


interpret the question and respond

Wide range of potential applications

INTRODUCTION TO CHATGPT
Generative AI

Subset of AI and Machine Learning

Generates new content


Uses patterns in information it has already
seen

INTRODUCTION TO CHATGPT
From prompt to response

Step 1: User writes a question or instruction: prompt

INTRODUCTION TO CHATGPT
From prompt to response

Step 2: ChatGPT interprets the prompt

Step 3: Generates new, relevant language in response

INTRODUCTION TO CHATGPT
From prompt to response

Step 4: Response is returned to the user

INTRODUCTION TO CHATGPT
Summarizing text

ChatGPT is great at summarizing text and


explaining concepts

Save time when summarizing reports


Interpret complex information more easily

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
Creating marketing content

INTRODUCTION TO CHATGPT
Creating marketing content

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
Why utilize ChatGPT?

ChatGPT can perform many tasks with


greater efficiency

New workflow: AI → Human

Save time and money

Greater personalization

INTRODUCTION TO CHATGPT
Let's practice!
I N T R O D U C T I O N T O C H AT G P T
Limitations of
ChatGPT
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
ChatGPT under the hood

INTRODUCTION TO CHATGPT
Demystifying the LLM

INTRODUCTION TO CHATGPT
Demystifying the LLM

INTRODUCTION TO CHATGPT
Demystifying the LLM

INTRODUCTION TO CHATGPT
Demystifying the LLM

INTRODUCTION TO CHATGPT
Demystifying the LLM

INTRODUCTION TO CHATGPT
Demystifying the LLM

INTRODUCTION TO CHATGPT
Demystifying the LLM

INTRODUCTION TO CHATGPT
Limitation 1 - Knowledge cutoff

Trained on data from up to a certain date:


GPT 3.5: January 2022

GPT 4: April 2023

Isn't aware of events beyond this date

INTRODUCTION TO CHATGPT
Limitation 2 - Training data bias

ChatGPT was on a huge text dataset,


including:
Books

Articles

Websites

Model may learn the biases from the


training data

Could bias the responses

INTRODUCTION TO CHATGPT
Limitation 3 - Context tracking

INTRODUCTION TO CHATGPT
Limitation 3 - Context tracking

INTRODUCTION TO CHATGPT
Limitation 3 - Context tracking

Struggles to keep track of the context if the


focus shifts

Can lead to inaccurate or irrelevant results

Tip: Keep conversations to a single topic

INTRODUCTION TO CHATGPT
Limitation 4 - Hallucination

Model confidently provides inaccurate


information

Often occurs when trying to go beyond the


model's knowledge or abilities

1 https://en.wikipedia.org/wiki/Hallucination_(artificial_intelligence)

INTRODUCTION TO CHATGPT
Limitation 5 - Legal and ethical considerations

Example: Creating a song in the style of an


existing artist
Who owns the new song?

INTRODUCTION TO CHATGPT
Limitation 5 - Legal and ethical considerations

Example: Creating a song in the style of an


existing artist
Who owns the new song?

INTRODUCTION TO CHATGPT
Limitation 5 - Legal and ethical considerations

Example: Creating a song in the style of an


existing artist
Who owns the new song?

Easy to fall into a legal gray area

Ownership and privacy → Chapter 2

INTRODUCTION TO CHATGPT
Let's practice!
I N T R O D U C T I O N T O C H AT G P T
Writing effective
prompts
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
Garbage in, garbage out

INTRODUCTION TO CHATGPT
How does ChatGPT interpret a prompt?

INTRODUCTION TO CHATGPT
How does ChatGPT interpret a prompt?

1. Identify the topic

INTRODUCTION TO CHATGPT
How does ChatGPT interpret a prompt?

2. Understand the prompt

INTRODUCTION TO CHATGPT
3. Generate response

INTRODUCTION TO CHATGPT
Prompt engineering

Prompt engineering is the process of


writing prompts to maximize the quality
and relevance of the response

INTRODUCTION TO CHATGPT
Writing tips for prompt engineering

Be clear and specific

Include any necessary information

Example: in a summarizing task, specify


the desired length

Keep it concise Use correct grammar and spelling

Remove any information that doesn't ChatGPT uses grammar when interpreting
provide useful context the task

INTRODUCTION TO CHATGPT
Provide examples if necessary...
Can be a much quicker way of providing
context

Example: Generating example customers

Want the form:

Full Name, Age (Occupation)

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
Let's practice!
I N T R O D U C T I O N T O C H AT G P T
Enabling people to
use ChatGPT
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
Augmenting workflows
Workflow: Standardized series of tasks to achieve an end goal

Aims:

Highest-quality output

Shortest timeframe

INTRODUCTION TO CHATGPT
A standard workflow

Example: Summarizing a project scoping


document
30 pages → summary of key findings

We extract the key findings and compile


the summary

Using a spelling and grammar checker to


proofread

INTRODUCTION TO CHATGPT
A ChatGPT-powered workflow

Example: Summarizing a project scoping


document
30 pages → summary of key findings

ChatGPT allows us to reverse the roles

Human now becomes the proofreader

Huge time-savings!

Allows us to focus on more creative tasks

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
Leaders

Use cases:

Compose emails

Draft presentations

Brainstorm strategic ideas

Summarize meeting notes

INTRODUCTION TO CHATGPT
Technical roles

Use cases:

Recall code syntax

Generate examples

Explain code

Troubleshoot errors

Write documentation

INTRODUCTION TO CHATGPT
HR and people teams

Use cases:

Brainstorm employee engagement and


wellbeing initiatives

Communicate more effectively and


efficiently

INTRODUCTION TO CHATGPT
Marketing

Use cases:

Write social media posts

Copyediting content

Generating marketing copy

Search Engine Optimization (SEO)

INTRODUCTION TO CHATGPT
Sales

Use cases:

Generate outreach templates

Personalize outreach content

Brainstorm strategies

Summarize information

INTRODUCTION TO CHATGPT
Let's practice!
I N T R O D U C T I O N T O C H AT G P T
Identifying use cases
for ChatGPT
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
Coming up...

INTRODUCTION TO CHATGPT
Can be inaccurate

No predictability in responses

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
Subject matter expertise is still very
important!

Rule-of-thumb: Don't ask ChatGPT to do


something that we couldn't do ourselves

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
1 https://openai.com/policies/terms-of-use

INTRODUCTION TO CHATGPT
Need consent to process the data

Must adhere to data governance laws, such


as GDPR

Legal counsel may be able to enable the


use case

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
Can claim ownership over ChatGPT output
Other considerations such as copyright
infringement may prevent ownership

Legal and ethics → Coming up!

1 https://openai.com/policies/terms-of-use

INTRODUCTION TO CHATGPT
INTRODUCTION TO CHATGPT
Example 1: Brainstorming ideas in HR

Improve employee wellbeing

Is this a suitable use?

1. Situation doesn't require definitive answers

2. HR Manager can verify response


3. Sensitive data isn't required
4. Response won't be used

INTRODUCTION TO CHATGPT
Example 2: Healthcare recommendations

Customers input their symptoms and


receive recommended action

Is this a suitable use?

Use case requires certainty due to


implications of poor recommendations
ChatGPT cannot provide this level of
certainty

INTRODUCTION TO CHATGPT
Let's practice!
I N T R O D U C T I O N T O C H AT G P T
Ownership and
privacy
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
Ownership and privacy

Ownership and privacy are key


considerations when validating the
suitability of ChatGPT

Neglecting them can risk financial


penalties, lawsuits, and brand damage

INTRODUCTION TO CHATGPT
Who owns the response?

INTRODUCTION TO CHATGPT
Who owns the response?
... As between you and OpenAI, and to the
extent permitted by applicable law... We
hereby assign to you all our right, title, and
interest, if any, in and to Output.

Assuming compliance, users can claim


ownership over the response

1 https://openai.com/policies/terms-of-use

INTRODUCTION TO CHATGPT
Who owns the response?
... output may not be unique and other Represent that Output was human-
users may receive similar output from our generated when it was not.
Services. Our assignment above does not
extend to other users' output or any Third
Party Output. Use our Services in a way that infringes,

Factual questions or generating small text misappropriates or violates anyone's rights.

snippets → cannot claim ownership Includes copyright infringement

OpenAI's terms of use are updated


frequently

1 https://openai.com/policies/terms-of-use

INTRODUCTION TO CHATGPT
Ownership and copyright

Copyright: the rights of the owner of the


intellectual property (IP) to use or distribute
the material

If generated content resembles copyrighted


property, infringement claims can be made

INTRODUCTION TO CHATGPT
Who owns the prompt?

INTRODUCTION TO CHATGPT
Who owns the prompt?

As between the parties and to the extent permitted by applicable law, you own all Input

1 https://openai.com/policies/terms-of-use

INTRODUCTION TO CHATGPT
Prompt privacy

ChatGPT is being continuously developed


and improved

OpenAI may use prompts and responses for


performance improvements

May need to opt-out of usage agreement

Risk of breaching data governance laws

1 https://openai.com/policies/terms-of-use

INTRODUCTION TO CHATGPT
Data governance

Govern how data can be collected, stored,


and used

Example: GDPR governs data usage


impacting EU citizens and residents

Use cases must adhere to data governance


laws

INTRODUCTION TO CHATGPT
AI ethics

Ensure data is used with people and


society's best interest in-mind

Ask whether the use will negatively or


positively impact people

INTRODUCTION TO CHATGPT
Let's practice!
I N T R O D U C T I O N T O C H AT G P T
Advancements in
generative AI
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
Coming up...

What's to come in generative AI?

What challenges need to be overcome?

INTRODUCTION TO CHATGPT
Performance improvements

More human-like content

Handle more complexity


Greater reliability

INTRODUCTION TO CHATGPT
What's driving the improvements?

Large Language Models (LLMs)


Learns from a huge text dataset

Algorithms detect patterns in text

Fine-tune the model by rating responses

INTRODUCTION TO CHATGPT
What's driving the improvements?

Amount of training data will increase

INTRODUCTION TO CHATGPT
What's driving the improvements?

Amount of training data will increase

Usage data will help in fine-tuning

INTRODUCTION TO CHATGPT
Building balanced datasets

Challenge: Ensuring data is high quality and


balanced

Quantity of data makes detecting bias


prior to training difficult

Goal: Develop more robust bias mitigation


procedures

INTRODUCTION TO CHATGPT
Opportunities for misuse

Misrepresenting AI-generated content

Creating malicious content (e.g., spam)

Intervention by lawmakers:

Regulations could help or hinder AI


advancement

INTRODUCTION TO CHATGPT
From generalized to specialized

ChatGPT is a generalizable model

Generative AI models will become more


specialized
Example: a model specifically designed to
write long and complex code

INTRODUCTION TO CHATGPT
Other types of generative AI

1 DALL-E 3

INTRODUCTION TO CHATGPT
AI for everyone!

Accessibility is key to ChatGPT's success

Democratization of AI tools
Everyone should benefit from the
technology

INTRODUCTION TO CHATGPT
Let's practice!
I N T R O D U C T I O N T O C H AT G P T
Congratulations!
I N T R O D U C T I O N T O C H AT G P T

James Chapman
Curriculum Manager, DataCamp
Chapter 1 - Interacting with ChatGPT
What can ChatGPT do?
What are its limitations?

How to write effective prompts → prompt engineering

INTRODUCTION TO CHATGPT
Chapter 2 - Adopting ChatGPT

Augmenting business workflows

Identifying appropriate use cases


Legal and ethical considerations

The future of generative AI

INTRODUCTION TO CHATGPT
Where next?

Courses: Skill Tracks:

Understanding Prompt Engineering AI Fundamentals

Generative AI Concepts AI Business Fundamentals

Large Language Models (LLMs) Concepts

AI Ethics

Artificial Intelligence (AI) Strategy

Implementing AI Solutions in Business

INTRODUCTION TO CHATGPT
Congratulations!
I N T R O D U C T I O N T O C H AT G P T
What is machine
learning?
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Lis Sulmont
Curriculum Manager, DataCamp
UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
Artificial intelligence (AI)

A huge set of tools for making computers


behave intelligently

UNDERSTANDING MACHINE LEARNING


Artificial intelligence (AI)

A huge set of tools for making computers


behave intelligently

Machine learning is the most prevalent subset


of AI

UNDERSTANDING MACHINE LEARNING


Defining machine learning:
A set of tools for making inferences and predictions from data

UNDERSTANDING MACHINE LEARNING


Defining machine learning: what can it do?
Predict future events
Will it rain tomorrow?
Yes (75% probability)

Infer the causes of events and behaviors


Why does it rain?
Time of the year, humidity levels, temperature, location, etc

Infer patterns
What are the different types of weather conditions?
Rain, sunny, overcast, fog, etc

UNDERSTANDING MACHINE LEARNING


Defining machine learning: how does it work?
Interdisciplinary mix of statistics and computer science
Ability to learn without being explicitly programmed

Learn patterns from existing data and applies it to new data

Relies on high-quality data

... more to come throughout the course!

UNDERSTANDING MACHINE LEARNING


Data science

Data science is about discovering and


communicating insights from data

UNDERSTANDING MACHINE LEARNING


Data science

Data science is about making discoveries and


creating insights from data

Machine learning is often an important tool


for data science work

UNDERSTANDING MACHINE LEARNING


Machine learning model
A statistical representation of a real-world process based on data

UNDERSTANDING MACHINE LEARNING


Machine learning model
A statistical representation of a real-world process based on data

UNDERSTANDING MACHINE LEARNING


Machine learning model
A statistical representation of a real-world process based on data

UNDERSTANDING MACHINE LEARNING


Machine learning model
A statistical representation of a real-world process based on data

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Machine learning
concepts
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Lis Sulmont
Curriculum Manager, DataCamp
Three types of machine learning

1) Reinforcement learning
2) Supervised learning
3) Unsupervised learning

UNDERSTANDING MACHINE LEARNING


Training data
Training data: existing data to learn from
Training a model: when a model is being built from training data
Can take nanoseconds to weeks

UNDERSTANDING MACHINE LEARNING


Supervised learning training data

UNDERSTANDING MACHINE LEARNING


Supervised learning training data

UNDERSTANDING MACHINE LEARNING


Supervised learning training data

UNDERSTANDING MACHINE LEARNING


Supervised learning training data

UNDERSTANDING MACHINE LEARNING


Supervised learning training data

UNDERSTANDING MACHINE LEARNING


After training (supervised learning)

UNDERSTANDING MACHINE LEARNING


After training (supervised learning)

UNDERSTANDING MACHINE LEARNING


After training (supervised learning)

UNDERSTANDING MACHINE LEARNING


Supervised vs unsupervised learning
Supervised learning
Training data is "labeled"

Unsupervised learning
Training data only has features

Useful for:
Anomaly detection

Clustering, e.g., dividing data into


groups

UNDERSTANDING MACHINE LEARNING


Unsupervised learning training data

UNDERSTANDING MACHINE LEARNING


Unsupervised learning training data

UNDERSTANDING MACHINE LEARNING


After training (unsupervised learning)

UNDERSTANDING MACHINE LEARNING


Unsupervised Learning
In reality, data doesn't always come with labels
Requires manual labor to label

Labels are unknown

No labels: model is unsupervised and finds its own patterns

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Machine learning
workflow
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Lis Sulmont
Curriculum Manager, DataCamp
Machine learning workflow

UNDERSTANDING MACHINE LEARNING


Our scenario
Our dataset: NYC property sales from 2015-
2019

Includes:

Square feet

Neighborhood

Year built

Sale price

And more!

Our target: Sale price

UNDERSTANDING MACHINE LEARNING


Step 1: Extract features

UNDERSTANDING MACHINE LEARNING


Step 2: Split dataset

UNDERSTANDING MACHINE LEARNING


Step 3: Train model

UNDERSTANDING MACHINE LEARNING


Step 3: Train model

UNDERSTANDING MACHINE LEARNING


Step 4: Evaluate

UNDERSTANDING MACHINE LEARNING


Step 4: Evaluate

Test dataset: "unseen" data

Many ways to evaluate:


What is the average error of the predictions?

What percent of apartments did the model accurately predict within a 10% margin?

UNDERSTANDING MACHINE LEARNING


Step 4: Evaluate

UNDERSTANDING MACHINE LEARNING


Step 4: Evaluate

UNDERSTANDING MACHINE LEARNING


Step 4: Evaluate

If not, tune the model and re-train it:


e.g., change the model's options, add/remove features

UNDERSTANDING MACHINE LEARNING


Machine learning workflow

UNDERSTANDING MACHINE LEARNING


Summary of steps
1. Extract features
Choosing features and manipulating the dataset

2. Split dataset
Train and test dataset

3. Train model
Input train dataset into a machine learning model

4. Evaluate
If desired performance isn't reached: tune the model and repeat Step 3

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Supervised learning
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Hadrien Lacroix
Content Developer at DataCamp
Modeling

UNDERSTANDING MACHINE LEARNING


Types

UNDERSTANDING MACHINE LEARNING


What is supervised learning?

UNDERSTANDING MACHINE LEARNING


Classification and regression

UNDERSTANDING MACHINE LEARNING


Classification

UNDERSTANDING MACHINE LEARNING


Classification
Classification = assigning a category
Will this customer stop its subscription?
Yes, No
Is this mole cancerous?
Yes, No
What kind of wine is that?
Red, White, Rosé
What flower is that?
Rose, Tulip, Carnation, Lily

UNDERSTANDING MACHINE LEARNING


Observations

UNDERSTANDING MACHINE LEARNING


Features

UNDERSTANDING MACHINE LEARNING


Target

UNDERSTANDING MACHINE LEARNING


Graphing our data

UNDERSTANDING MACHINE LEARNING


Splitting data

UNDERSTANDING MACHINE LEARNING


Manual classifier

UNDERSTANDING MACHINE LEARNING


Support vector machine - linear classifier

UNDERSTANDING MACHINE LEARNING


Support vector machine - polynomial classifier

UNDERSTANDING MACHINE LEARNING


Regression

UNDERSTANDING MACHINE LEARNING


Regression
Regression = assigning a continuous variable
How much will this stock be worth?

What is this exoplanet's mass?

How tall will this child be as an adult?

UNDERSTANDING MACHINE LEARNING


Predicting temperature

UNDERSTANDING MACHINE LEARNING


Training data

UNDERSTANDING MACHINE LEARNING


Linear regression

UNDERSTANDING MACHINE LEARNING


Model

UNDERSTANDING MACHINE LEARNING


Given humidity...

UNDERSTANDING MACHINE LEARNING


...find temperature

UNDERSTANDING MACHINE LEARNING


Testing data

UNDERSTANDING MACHINE LEARNING


Classification vs regression
Regression = continuous
Any value within a finite (height) or infinite (time) interval
20°F, 20.1°F, 20.01°F...
Classification = category
One of few specific values
Cold, Mild, Hot

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Unsupervised
learning
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Hadrien Lacroix
Content Developer at DataCamp
Unsupervised learning

UNDERSTANDING MACHINE LEARNING


Unsupervised learning
Unsupervised learning = no target column
No guidance

Looks at the whole dataset

Tries to detect patterns

UNDERSTANDING MACHINE LEARNING


Applications

UNDERSTANDING MACHINE LEARNING


Clustering

UNDERSTANDING MACHINE LEARNING


Clustering example

UNDERSTANDING MACHINE LEARNING


Species cluster

UNDERSTANDING MACHINE LEARNING


Color cluster

UNDERSTANDING MACHINE LEARNING


Origin cluster

UNDERSTANDING MACHINE LEARNING


Clustering models
K Means:
Specify the number of clusters

DBSCAN (density-based spatial clustering of applications with noise):


Specify what constitutes a cluster

UNDERSTANDING MACHINE LEARNING


Iris table

UNDERSTANDING MACHINE LEARNING


K-Means with 4 clusters

UNDERSTANDING MACHINE LEARNING


K-Means with 3 clusters

UNDERSTANDING MACHINE LEARNING


Ground truth

UNDERSTANDING MACHINE LEARNING


Anomaly detection

UNDERSTANDING MACHINE LEARNING


Detecting outliers
Anomaly detection = detecting outliers
Outliers = observations that differ from the rest

UNDERSTANDING MACHINE LEARNING


Outliers

UNDERSTANDING MACHINE LEARNING


Removing outliers

UNDERSTANDING MACHINE LEARNING


Some anomaly detection use cases
Discover devices that fail faster or last longer
Discover fraudsters that manage trick the system

Discover patients that resist a fatal disease

...

UNDERSTANDING MACHINE LEARNING


Association

UNDERSTANDING MACHINE LEARNING


Association

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Evaluating
performance
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Hadrien Lacroix
Content Developer at DataCamp
Evaluate step

UNDERSTANDING MACHINE LEARNING


Overfitting
Performs great on training data
Performs poorly on testing data

Model memorized training data and can't generalize learnings to new data

Use testing set to check model performance

UNDERSTANDING MACHINE LEARNING


Illustrating overfitting

UNDERSTANDING MACHINE LEARNING


Accuracy
Accuracy = correctly classified observations / all observations
48 / 50 = 96%

UNDERSTANDING MACHINE LEARNING


Limits of accuracy: fraud example

Accuracy of this model:

28 correctly classif ied


= 93.33%
30 total points
Misses majority of fraudulent transactions

Need a better metric

UNDERSTANDING MACHINE LEARNING


Confusion matrix

UNDERSTANDING MACHINE LEARNING


True positives

UNDERSTANDING MACHINE LEARNING


True positives

UNDERSTANDING MACHINE LEARNING


False negatives

UNDERSTANDING MACHINE LEARNING


False negatives

UNDERSTANDING MACHINE LEARNING


Remembering False Negatives

UNDERSTANDING MACHINE LEARNING


Fill out the rest...

UNDERSTANDING MACHINE LEARNING


False positives, true negatives

UNDERSTANDING MACHINE LEARNING


Remembering False Positives

1 https://www.flickr.com/photos/59632563@N04/6104068209

UNDERSTANDING MACHINE LEARNING


Sensitivity

How many fraudulent transactions did we classify correctly?

true positives
Sensitivity = = 1/3 = 33.33%
true positives + f alse negatives
Rather mark legitimate transactions as suspicious than authorize fraudulent transactions

UNDERSTANDING MACHINE LEARNING


Specificity
true negatives
Specif icity =
true negatives + f alse positives
Spam filter:

Rather send spam to inbox than send real emails to the spam folder

UNDERSTANDING MACHINE LEARNING


Evaluating regression

UNDERSTANDING MACHINE LEARNING


Evaluating regression

Error = distance between point (actual value) and line (predicted value)
Many ways calculate this. e.g, root mean square error

UNDERSTANDING MACHINE LEARNING


Unsupervised learning

1 https://www.flickr.com/photos/micahdowty/8540188997

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Improving
performance
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Hadrien Lacroix
Content Developer at DataCamp
Machine learning workflow

UNDERSTANDING MACHINE LEARNING


Several options
Dimensionality reduction
Hyperparameter tuning

Ensemble methods

UNDERSTANDING MACHINE LEARNING


Dimensionality reduction
Reducing the number of features

UNDERSTANDING MACHINE LEARNING


Dimensionality reduction: example
Irrelevance: some features don't carry useful information

UNDERSTANDING MACHINE LEARNING


Dimensionality reduction: example
Correlation: some features carry similar information

Keep only one feature


e.g. height and shoe size --> height

Collapse multiple features into one underlying feature


e.g. height and weight --> Body Mass Index

UNDERSTANDING MACHINE LEARNING


Hyperparameter tuning

UNDERSTANDING MACHINE LEARNING


Hyperparameter tuning

UNDERSTANDING MACHINE LEARNING


Hyperparameter tuning

UNDERSTANDING MACHINE LEARNING


Hyperparameter tuning

UNDERSTANDING MACHINE LEARNING


Hyperparameter tuning

UNDERSTANDING MACHINE LEARNING


Hyperparameter tuning: example
SVM algorithm hyperparameters:

kernel : "linear" --> "poly"

degree

gamma

shrinking

coef0

tol

...

UNDERSTANDING MACHINE LEARNING


Ensemble methods

UNDERSTANDING MACHINE LEARNING


Ensemble methods: classification

UNDERSTANDING MACHINE LEARNING


Ensemble methods: regression

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Deep learning
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Sara Billen
Curriculum Manager, DataCamp
What is deep learning?
AKA: Neural Networks
Basic unit: neurons (nodes)

Special area of Machine Learning

Requires more data

Best when inputs are images or text

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Predicting box office revenue

UNDERSTANDING MACHINE LEARNING


Deep learning
Neural networks are much larger
Deep learning: neural network with many
neurons

Can solve complex problems

UNDERSTANDING MACHINE LEARNING


When to use deep learning?
Lots of data
Access to processing power

Lack of domain knowledge

Complex problems
Computer vision

Natural language processing

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
The process
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Sara Billen
Curriculum Manager, DataCamp
Computer vision
Helps computers see and understand the content of digital images

UNDERSTANDING MACHINE LEARNING


Image data

UNDERSTANDING MACHINE LEARNING


Image data

UNDERSTANDING MACHINE LEARNING


UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
UNDERSTANDING MACHINE LEARNING
Training the neural network

UNDERSTANDING MACHINE LEARNING


Applications
Facial recognition
Self-driving vehicles

Automatic detection of tumors in CT scans

Deep fake

...

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Natural Language
Processing
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Sara Billen
Curriculum Manager at DataCamp
Natural Language Processing (NLP)
The ability for computers to understand the meaning of human language

UNDERSTANDING MACHINE LEARNING


Bag of words

UNDERSTANDING MACHINE LEARNING


Bag of words
"U2 is a great band" "Queen is a great band"

Word Count Word Count


U2 1 U2 0
Queen 0 Queen 1
is 1 is 1
a 1 a 1
great 1 great 1
band 1 band 1

UNDERSTANDING MACHINE LEARNING


Bag of words: n-grams
"That book is not great" 2-gram (bi-gram)

Word Count Word Count


That 1 That book 1
book 1 book is 1
is 1 is not 1
not 1 not great 1
great 1

UNDERSTANDING MACHINE LEARNING


Bag of words: limitations
Word counts don't help us consider
synonyms

Example: "blue"
"sky-blue"

"aqua"

"cerulean"

Want to group as a single feature

UNDERSTANDING MACHINE LEARNING


Word embeddings
Word embeddings
Create features that group similar words

Features have a mathematical meaning:

king - man + woman = queen

UNDERSTANDING MACHINE LEARNING


Language translation

UNDERSTANDING MACHINE LEARNING


Applications
Language translation

Chatbots

Personal assistants

Sentiment analysis

...

UNDERSTANDING MACHINE LEARNING


Deep learning
Two types of problems
Computer vision

Natural language processing

Why deep learning?


Complex problems

Automatic feature extraction

Lots of data

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Limits of machine
learning
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Sara Billen
Curriculum Manager, DataCamp
Data quality

Garbage in garbage out

Output quality depends on input quality

UNDERSTANDING MACHINE LEARNING


How it can go horribly wrong
Amazon's gender-biased recruiting Recruiting software to help review resumes
tool Preferred men because it learned from
historic data when more men were hired

It downgraded resumes that


contain the word "women"

implied the applicant was female

UNDERSTANDING MACHINE LEARNING


How it can go horribly wrong
Microsoft's AI chatbot

UNDERSTANDING MACHINE LEARNING


Beware
Don't blindly trust your model
Awareness is key

Pay attention to your data

A machine learning model is only as good as


the data you give it

UNDERSTANDING MACHINE LEARNING


Quality assurance
High-quality data requires:
Data analysis

Review of outliers

Domain expertise

Documentation

UNDERSTANDING MACHINE LEARNING


Explainability

UNDERSTANDING MACHINE LEARNING


Explainability

Transparency to increase trust, clarity, and understanding

Use cases: business adoption, regulatory oversight, minimizing bias

UNDERSTANDING MACHINE LEARNING


Explainable AI
Black box Explainable AI
Deep learning Traditional machine learning

Better for "What?" Better for "Why?"

Highly accurate predictions Understandable by humans

UNDERSTANDING MACHINE LEARNING


Example: Explainable AI

1. Prediction: Will the patient get diabetes?

2. Inference: Why will this happen

UNDERSTANDING MACHINE LEARNING


Example: Inexplicable AI

Prediction only: Which letter is this likely to be?

UNDERSTANDING MACHINE LEARNING


Let's practice!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
Congratulations!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G

Lis Sulmont
Curriculum Manager, DataCamp
Chapter 1
What is machine learning?
Machine learning concepts and workflow

UNDERSTANDING MACHINE LEARNING


Chapter 2
Different types of machine learning
How we evaluate and improve machine learning models

UNDERSTANDING MACHINE LEARNING


Chapter 3
Deep learning, including computer vision and natural language processing
Limits of machine learning

UNDERSTANDING MACHINE LEARNING


What's next?

UNDERSTANDING MACHINE LEARNING


What's next?
Machine Learning Scientist
Machine Learning Fundamentals

Supervised Machine Learning

Unsupervised Machine Learning

UNDERSTANDING MACHINE LEARNING


Congrats!
U N D E R S TA N D I N G M A C H I N E L E A R N I N G
The rise of LLMs in
the AI landscape
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Rapid developments in AI

1 Freepik, Tesla Youtube Channel

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


AI-powered recommendations

1 Netflix blog, Medium

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


AI and data-driven tasks

Sentiment analysis, fraud detection, and


more

Still, lacked human-like interaction

Enter Large Language Models

1 Unsplash

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


The AI landscape

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


The AI landscape

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


The AI landscape

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


The AI landscape

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


The AI landscape

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Definition of LLMs

Large
Training data and resources

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Definition of LLMs

Large
Training data and resources

Language
Human-like text

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Definition of LLMs

Large
Training data and compute power

Language
Human-like text

Models
Learn complex patterns using text data

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


The defining moment

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Popular language generators

1 https://zapier.com/blog/best-ai-chatbot/

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Applications

Sentiment analysis

Identifying themes
Translating text or speech

Generating code

Next-word prediction

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


What shall this course cover?

Conceptual understanding of LLMs

Training data considerations


Ethical, privacy and environmental
concerns

The future of LLMs

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Real-world
applications
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Business opportunities

Benefits

Automate tasks

Improve efficiency

Create revenue streams

Enable new capabilities

The possibilities are endless!

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Transforming finance industry

Unstructured data or text: data that lacks definition and is presented free-form

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Transforming finance industry

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Transforming finance industry

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Challenges in healthcare
Doctors' notes: Challenges:

Jargon Hard to understand terms

Abbreviations Difficult to interpret


Domain expertise Difficult to describe patient files

Varying writing style

Varied text data and acronyms

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Revolutionizing healthcare sector
Analyze patient data to offer personalized recommendations
Must adhere to privacy laws

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Education

Personalized coaching and feedback

Interactive learning experience


AI-powered tutor
Ask questions

Receive guidance

Discuss ideas

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Personalizing education: text generation

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Defining multimodal
Multimodal Non-multimodal

Many types of processing or generation One type of processing or generation

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Visual question answering

Answers to questions about visual content

Object identification & relationships


Scene description

Recognizes the zebra image

Responds with additional information

Makes a joke

1 https://arxiv.org/abs/2302.14045

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Challenges of
language modeling
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Sequence matters!

I only follow a healthy lifestyle. Only I follow a healthy lifestyle.

Different positions = different meanings

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Context modeling

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Context modeling

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Context modeling

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Context modeling

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Context modeling

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Long-range dependency

Recognize and connect distant words in a sentence

Challenging for traditional language models

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Single-task learning

Time and resource expensive


Less flexible compared to modern LLMs

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Multi-task learning

Improved performance on each individual task

Might impact accuracy and efficiency


Less training data needed because data is shared

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


To recap
Challenges of language Single-task learning:
modeling: Task-specific
Word sequences Less flexible

Traditional models and early LLMs

Understanding context

Multi-task learning:
Long-range dependency Versatile

Multiple tasks

More developed LLMs

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Novelty of LLMs
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Using text data

Unstructured data - messy and inconsistent

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Machines do not understand language!

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Need for NLP

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Unique capabilities of LLMs

Linguistic subtleties
Irony

Humor

Pun

Sarcasm

Intonation

Intent

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


What's your favorite book?

Natural response: "Oh, that's a tough one!"

Personal opinion: "My all-time favorite book is To Kill a Mockingbird by Harper Lee."

Supporting statement: "It's a powerful story about prejudice, justice, and the human
experience."

Follow-up question: "Have you read it?"

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Linguistic subtleties
Sarcasm: "Oh great, another meeting."

Traditional language model: Large language model:

Response: "What's the meeting about?" Response: "Sounds like you're looking
Neutral forward to it!"

Does not pick up sarcasm Playful

Engaging

Matches the sarcasm

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


How do LLMs understand

Trained on vast amounts of data

Largeness of LLMs: parameters


Parameters represent the patterns and rules

More parameters -> complex patterns

Generates sophisticated and accurate responses

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Parameters

Small number of bricks -> limited structures Larger number of bricks -> complex and
detailed structures

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Emergence of new capabilities

Emergent abilities
only present in large-scale models

Scale:
The volume of training data

The number of model parameters

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Emergence of new capabilities

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Building blocks of LLMs

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


To recap
LLMs: How?

Overcome data's unstructured nature LLMs' "largeness"

Extensive training data


Many parameters
Outperform traditional models
Emergent abilities

Understand linguistic subtleties

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Generalized
overview of NLP
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Where are we?

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Text pre-processing
Can be done in a different order as they are independent

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Tokenization
Splits text into individual words, or tokens

Text:
"Working with natural language processing techniques is tricky."

Tokenization:
["Working", "with", "natural", "language", "processing", "techniques", "is", "tricky", "."]

Converts into a list

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Stop word removal
Stop words do not add meaning
Eliminated through stop word removal

Before stop word removal:


["Working", "with", "natural", "language", "processing", "techniques", "is", "challenging", "."]

After stop word removal:


["Working", "natural", "language", "processing", "techniques", "challenging", "."]

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Lemmatization

Group slightly different words with similar Talking -> Talk


meaning
Talked -> Talk
Talk -> Talk

Reduces words to their base form

Mapped to root word

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Text representation

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Text representation

Text data into numerical form

Bag-of-words

Word embeddings

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Bag-of-words

Text into a matrix of word counts

0 represents the absence of a word

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Limitations of bag-of-words
Does not capture the order or context
Can lead to incorrect interpretations

Similar sentences but opposite meaning


"The cat chased the mouse swiftly."

"The mouse chased the cat."

Does not capture the semantics between the words


Treats related words as independent

Like "cat" and "mouse"

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Word embeddings
Capture the semantic meanings as Predator-prey relationship:
numbers

Cat Mouse
Plant -0.9 -0.8
Furry 0.9 0.7
Carnivore 0.9 -0.8

Cat [-0.9, 0.9, 0.9]

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Machine-readable form

Start with text pre-processing

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Machine-readable form

Convert pre-processed text to numerical format

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Fine-tuning
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Where are we?

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Pre-training Fine-tuning

School education University specialization

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


"Largeness" challenges
Fine-tuning can help

Powerful computers

Efficient model training methods

Large amounts of training data

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Computing power
Memory

Processing power

Infrastructure

Expensive

LLM:
100,000's Central Processing Units
(CPUs)

10,000's Graphic Processing Units (GPUs)

A personal computer: 4-8 CPU and 1-2


GPUs

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Efficient model training
Training time is huge

May take weeks or even months

Efficient model training = faster training


time

355 years of processing time on a single


GPU

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Data availability

Need of high-quality data

To learn the complexities and subtleties of


language

A few hundred gigabytes (GBs) of text data


More than a million books

Massive amount of data

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Overcoming the challenges
Fine-tuning
Addresses some of these challenges

Adapts a pre-trained model

Pre-trained model
Learned from general-purpose datasets

Not optimized for specific-tasks

Can be fine-tuned for a specific problem

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Fine-tuning vs. Pre-training
Fine-tuning Pre-training

Compute Compute
1-2 CPU and GPU Thousands of CPUs and GPUs

Training time Training time


Hours to days Weeks to months

Data Data
~1 gigabyte Hundreds of gigabytes

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Learning techniques
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Where are we?

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Getting beyond data constraints

Fine-tuning: training a pre-trained model for a specific task

But, what if there is little to no labeled data?

N-shot learning: zero-shot, few-shot, and multi-shot

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Transfer learning
Learn from one task and transfer to related
task

Transferring knowledge from piano to


guitar
Reading musical notes

Understanding rhythm

Grasping musical concepts

N-shot learning
Zero-shot - no task-specific data

Few-shot - little task-specific data


Multi-shot - relatively more training data

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Zero-shot learning
No explicit training
Uses language understanding and context

Generalizes without any prior examples

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Few-shot learning
Learn a new task with a few examples Prior knowledge to answer new question

One-shot learning: fine-tuning from one


example

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Multi-shot learning
Requires more examples than few-shot

Previous tasks, plus new examples

For example, a model trained on Golden


Retriever

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Multi-shot learning
Model output: Labrador Retriever

Saves time in collecting and labeling data

No compromise on accuracy

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Building blocks so far
Data preparation workflow

Fine-tuning

N-shot learning techniques

Next up: pre-training

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Building blocks to
train LLMs
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Where are we?

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Generative pre-training

Trained using generative pre-training


Input data of text tokens

Trained to predict the tokens within the dataset

Types:
Next word prediction

Masked language modeling

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Next word prediction
Supervised learning technique
Model trained on input-output pairs

Predicts next word and generates coherent


text

Captures the dependencies between words

Training Data
Pairs of input and output examples

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Training data for next word prediction
Input Output

The quick brown fox

The quick brown fox jumps

The quick brown fox jumps over

The quick brown fox jumps over the

The quick brown fox jumps over the lazy

The quick brown fox jumps over the lazy dog

The quick brown fox jumps over the lazy dog.

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Which word relates more with pizza?

More examples = better prediction

For example:
I love to eat pizza with _ _ _ _ _ _

Cheese is more related with pizza than


anything else

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Masked language modeling
Hides a selective word
Trained model predicts the masked word

Original Text: "The quick brown fox jumps over the lazy dog."

Masked Text: "The quick [MASK] fox jumps over the lazy dog."

Objective: predict the missing word

Based on learnings from training data

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Introducing the
transformer
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Where are we?

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


What is a transformer?
"Attention Is All You Need"
Revolutionized language modeling

Transformer architecture
Relationship between words

Components: Pre-processing, Positional


Encoding, Encoders, and Decoders

1 arXiv: Attention Is All You Need

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Inside the transformer

Input: Jane, who lives in New York and works as a software

Output: engineer, loves exploring new restaurants in the city.

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Transformers are like an orchestra

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Text pre-processing and representation
Text preprocessing: tokenization, stop word removal, lemmatization
Text representation: word embedding

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Positional encoding
Information on the position of each word
Understand distant words

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Encoders
Attention mechanism: directs attention to specific words and relationships
Neural network: process specific features

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Decoders
Includes attention and neural networks
Generates the output

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Transformers and long-range dependencies

Initial challenge: long-range dependency

Attention: focus on different parts of the input

Example: "Jane, who lives in New York and works as a software engineer, loves exploring
new restaurants in the city."

"Jane" --- "loves exploring new restaurants"

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Processes multiple parts simultaneously
Limitation of traditional language models:
Sequential - one word at a time

Transformers:
Process multiple parts simultaneously

Faster processing

For example:
"The cat sat on the mat"
Processes "cat," "sat," "on," "the," and "mat" at the same time

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Attention
mechanisms
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Attention mechanisms
Understand complex structures
Focus on important words

Book reading analogy:


Clues in a mystery book

Focus on relevant content

Concentrate on crucial input data

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Self-attention and multi-head attention
Self-attention Multi-head attention

Weighs the importance of each word Next level of self-attention

Captures long-range dependencies Splits input into multiple heads with each
head focusing on different aspects

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Attention in a party
Attention: Self and multi-head

Example:
Group conversation at a party

Selective attention to relevant speaker

Filter noise

Focus on key points

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Party continues
Self-attention Multi-head attention

Focus on each person's words Split attention into "multiple" channels

Evaluate and compare their relevance Focus on different aspects of conversation


Weigh each speaker's input Speaker's emotions, primary topic, and

Combines for a comprehensive related side-topics

understanding Process each aspect and merge

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Multi-head attention advantages
"The boy went to the store to buy some groceries, and he found a discount on his favorite
cereal."

Attention: "boy," "store," "groceries," and "discount"

Self-attention: "boy" and "he" -> same person

Multi-head attention: multiple channels


Character ("boy")

Action ("went to the store," "found a discount")

Things involved ("groceries," "cereal")

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Advanced fine-
tuning
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Where are we?

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Reinforcement Learning through Human Feedback

Pre-training

Fine-tuning

Reinforcement Learning through Human


Feedback (RLHF)

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Pre-training
Large amounts of text data:
Websites, books and articles

Transformer architecture

Learns general language patterns,


grammar, and facts

Next-word prediction

Masked language modeling

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Fine-tuning

N-shot training

Small labeled dataset for related task

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


But, why RLHF?
General-purpose training data lacks quality
Noise

Errors

Inconsistencies

Reduced accuracy

Example of reduced accuracy:

Trained on data from online discussion


forums
Unvalidated opinions and facts
Needs external expert validation

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Starts with the need to fine-tune
Pre-training
Learns underlying language patterns

Doesn't capture context-specific complexities

Fine-tuning
Quality labeled data improves performance

Enter RLHF!
Human feedback

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Simplifying RLHF

Model output reviewed by human

Updates model based on the feedback

Step 1:
Receives a prompt

Generates multiple responses

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Enters human expert

Step 2:
Human expert checks these responses

Ranks the responses based on quality


Accuracy

Relevance

Coherence

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Time for feedback
Step 3:
Learns from expert's ranking

To align its response in future with their


preferences

And it goes on!


Continues to generate responses

Receives expert's rankings

Adjusts the learning

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Recap
Pre-training to learn general language knowledge

Fine-tuning for specific tasks

RLHF techniques to enhance fine-tuning through human feedback

Combination is highly effective!

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Completing the LLM

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Data concerns and
considerations
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Data considerations

Data volume and compute power

Data quality
Labeling

Bias

Privacy

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Data volume and compute power
LLMs need a lot of data
Similar to a child learning to talk

570 GB, ~1.3 million books

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Data volume and compute power
LLMs need a lot of data
Similar to a child learning to talk

570 GB, ~1.3 million books

Extensive computing power; think of the


energy consumption

Can cost millions of dollars!

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Data quality
Quality data is essential

Accurate data = better learning = improved


response quality = increased trust

A child learning to talk


Gibberish-in -> gibberish-out

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Labeled data
Correct data label: accurate learning, generalize patterns, accurate responses
Labor-intensive: assigning correct label to each article

Incorrect labels impact model performance

Address errors: identify -> analyze -> iterate

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Data bias
Influenced by societal stereotypes
Lack of diversity in training data

Discrimination and unfair outcomes

Spot and deal with the biased data


Evaluate data imbalances

Promote diversity Example:


"The nurse said that..." -> "she" or "her"
Bias mitigation techniques: more diverse
examples

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Data privacy
Compliance with data protection and Sensitive or personally identifiable
privacy regulations information (PII)

Privacy is a concern Get permission


Training on data without permission can
lead to a breach

Legal, financial and reputational harm

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Ethical and
environmental
concerns
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Ethical concerns
Transparency risk

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Ethical concerns
Transparency risk
Accountability risk -

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Ethical concerns
Transparency risk
Accountability risk

Information hazards

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Transparency risk
Challenging to understand the output

Difficult to identify issues


Bias

Errors

Misuse

Black box

Example: reasoning behind predicting


disease outcomes

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Accountability risk
Responsibility of LLMs' actions

Who is responsible?
Incorrect and harmful advice

Model developer or the company?

Game without rules


No transparency

No accountability

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Information hazards

Disseminating harmful information

Harmful content generation

Misinformation spread
Malicious use

Toxicity

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Information hazards

Harmful content generation Misinformation spread

Harmful, offensive, or inappropriate Generate text on any topic

Prompt or biased training data But, no verification!

Example: Example:
Bullying vs. friendly school environment "What's a good diet for losing weight?"
Distressing and harmful Unsubstantiated diet plan

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Information hazards

Malicious use Toxicity

Bad actors exploiting LLMs Inappropriate content

Generate deceptive content Training or through manipulated prompts

Example: Example:
Fabricated news Insensitive response
Manipulating public and causing unrest Stereotype

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Environmental concerns

Ecological footprint of LLMs

Substantial energy resources to train

Impact through carbon emissions

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Cooling requires electricity too!
Produce considerable heat that needs
cooling

Imagine thousands of laptops overheating


Require complex cooling systems

Adds to environmental impact

Balance the cost and benefits


Use renewable energy
Energy-efficient tech
1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Where are LLMs
heading?
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
Journey so far

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Journey so far

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Journey so far

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Journey so far

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Model explainability

How do they arrive at their outputs?

Road-trip planning
Why this particular route?

Why these specific spots?

Builds trust and transparency

Identify and correct the biases or errors


1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Efficiency
Computational efficiency
High-quality output with less compute

Faster and efficient


Model compression

Optimization

Benefits: better storage, lower energy use

Accessibility and sustainability


Promotes green AI

Reduces operating costs

1 Freepik

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Unsupervised bias handling
Biased data -> discrimination
Unsupervised bias handling
Bias detection and mitigation techniques,
automatically

No need of explicit human-labeled data

Identifies and reduces by analyzing


patterns

Challenge
Subtle, difficult to detect

Might introduce new biases

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Enhanced creativity
Creativity in text-based and visual art
forms

Artistic content: learned patterns, not


emotional understanding

Lack human-like comprehension of art or


emotions

Demonstrate human-like emotional


behavior

Future: emotion inference

1 https://arxiv.org/pdf/2302.09582.pdf

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Let's practice!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
Time to wrap-up
LARGE LANGUAGE MODELS (LLMS) CONCEPTS

Vidhi Chugh
AI strategist and ethicist
How far we have come!
LLMs transforming interaction with technology

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


How far we have come!

Substantial data requirements

Challenges and risks - privacy, ethics, and environmental implications

Future research and development

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


There is more to it

Entire teams devoted to understanding LLMs

Exciting times ahead

Stay updated with the latest developments


More on data ethics

Introduction to ChatGPT

LARGE LANGUAGE MODELS (LLMS) CONCEPTS


Congratulations!
LARGE LANGUAGE MODELS (LLMS) CONCEPTS
What is generative
AI?
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
We've long dreamed of tools that can create
In ancient stories...

GENERATIVE AI CONCEPTS
We've long dreamed of tools that can create
In ancient stories... ...and modern virtual worlds

GENERATIVE AI CONCEPTS
Unparalleled creative tools
AI Images AI Chatbots

1 Cosmopolitan Magazine, Anthropic PBC

GENERATIVE AI CONCEPTS
What is generative AI?
Machine learning models that generate new content

1 Google Bard

GENERATIVE AI CONCEPTS
What is generative AI?

1 Facebook Make-a-Video

GENERATIVE AI CONCEPTS
What is generative AI?

1 Replit Ghostwriter

GENERATIVE AI CONCEPTS
How does it work?

GENERATIVE AI CONCEPTS
How does it work?

GENERATIVE AI CONCEPTS
How does it work?

GENERATIVE AI CONCEPTS
Create images

1 Cosmopolitan Magazine using the Dall-E model

GENERATIVE AI CONCEPTS
Hold conversations

1 Facebook's LLaMA model

GENERATIVE AI CONCEPTS
Input more than text

1 Runway ML's InPainting Tool, https://runwayml.com/inpainting/

GENERATIVE AI CONCEPTS
Real-world applications
Generative AI will impact a variety of industries and functions:

Sales: draft sales outreach emails

Finance: analyze financial data


Marketing: generate marketing ads to test

Legal: Simply explain complex regulations

Education: Customize learning for individual students

Medical: Read and analyze medical data

Industrial: Automate repetitive tasks for industrial engineering and design

Games & Entertainment: Create 3D models and scenes

GENERATIVE AI CONCEPTS
The end of work?

Lots of implementation challenges and risks

Still, lots of opportunity to utilize

GENERATIVE AI CONCEPTS
Course goals

We'll learn how these models:

Generate content

Present new legal and ethical considerations

Impact society in the coming years

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Generative AI in the
machine learning
landscape
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Models that analyze
Discriminative models

Answer closed-ended questions

Learn from training data

Guess correct answer or categorize

1 Wikimedia Commons

GENERATIVE AI CONCEPTS
Bagels and puppies

1 Wikimedia Commons

GENERATIVE AI CONCEPTS
Guessing with confidence

1 Puppy image from DALL·E 2

GENERATIVE AI CONCEPTS
Models that imagine
Generative models

Guess data for a prediction

Still require training


Generate new content

1 Puppy image from Dall-E 2

GENERATIVE AI CONCEPTS
Mixing for effect

Generative AI:

Combines generative models with other ML

Models must work together like parts of a


machine

Produce complex creative work

1 Cosmopolitan Magazine

GENERATIVE AI CONCEPTS
Generative adversarial networks (GANs)

Generators try to trick discriminators


Compare notes and get better in multiple rounds

GENERATIVE AI CONCEPTS
Bagel Puppy GAN

1 https://twitter.com/teenybiscuit/media

GENERATIVE AI CONCEPTS
Artificial general intelligence (AGI)
An AI that exhibits intelligence like a human would

Scope of knowledge

Reasoning across domains


Social skills

Creative thinking

Other cognitive competencies (vision, language)

GENERATIVE AI CONCEPTS
Use the right tool for the job
Discriminative Models Generative AI

Predict tomorrow's weather Write code for a website

Categorize books Answer unique customer service questions


Determine if a picture is a puppy or a bagel Draw a picture of a cat scuba diving

Artificial General Intelligence

Complete traditionally human jobs

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
The evolution of
generative AI
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Generative AI burst on the scene in 2023

1 Yahoo Finance

GENERATIVE AI CONCEPTS
Key factors driving development
Several factors drive generative AI development:

Computing power

Dataset availability
Competitive interests

Model design

GENERATIVE AI CONCEPTS
Computational power allowed large models

Parallelization and specialized hardware


Graphics Processing Units (GPUs)

Tensor Processing Units (TPUs)

Cloud computing

Hardware-software optimization

1 Compute Trends Across Three Eras of Machine Learning, https://arxiv.org/abs/2202.05924

GENERATIVE AI CONCEPTS
Models improved with massive datasets
Global Datasphere Growth

1 IDC's Global DataSphere, 2021

GENERATIVE AI CONCEPTS
Competitive pressures encouraged faster development
Commercial Political

GENERATIVE AI CONCEPTS
GANs unleashed high quality generation

GENERATIVE AI CONCEPTS
Transformers brought context and coherence
'it' refers to 'animal' 'it' refers to 'street'

1 https://ai.googleblog.com/2017/08/transformer-novel-neural-network.html

GENERATIVE AI CONCEPTS
Transformers brought context and coherence

Transformers:

Grasp the context of a given text

Analyze relationships between words

Generate responses that feel natural and informative

GENERATIVE AI CONCEPTS
RLHF engaged user feedback
Reinforcement Learning with Human Feedback (RLHF):

Reinforcement learning trains models through trial-and-error

Human feedback comes from users scoring model responses

GENERATIVE AI CONCEPTS
RLHF engaged user feedback

1 Midjourney

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Model design and
data collection
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Know how to fill the tank

1 GM Fairfax Assembly Plant

GENERATIVE AI CONCEPTS
Developing a model
Model Development Steps

1. Research and design

2. Training data collection


3. Model training

4. Model evaluation

GENERATIVE AI CONCEPTS
Stable Diffusion's research and development
Example output from Stable Diffusion Stable Diffusion's R&D

Purpose: Decide on image generation

Architecture: Settle on diffusion model


Resources: 256 GPUs, 150k hours, $600k

1 Stability AI, Emad Mostaque Twitter post

GENERATIVE AI CONCEPTS
Data collection: not your typical ML model
Training data preparation

Massive amounts required

Diverse, context-rich data


Requires preprocessing

1 Laion blog

GENERATIVE AI CONCEPTS
Data collection: privacy and security are critical

Training data preparation

Personally Identifiable Information (PII)

Anonymize or aggregate

Store in secure location with controlled


access

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Model training
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Pick your mode of train-sportation
1. Hardware

2. Time
Dataset size

Model complexity

Rounds of training

3. Cost

GENERATIVE AI CONCEPTS
Graduate to advanced techniques
Foundation generative AI model is a first
step

Advanced techniques specialize them for


specific contexts

Key advanced techniques:

1. Transfer learning and fine-tuning

2. RLHF

3. Custom embeddings

GENERATIVE AI CONCEPTS
From cats to lions
Transfer learning transfers knowledge from one task to another
Fine-tuning is a type of transfer learning for a small dataset

1 Creative Commons Attribution-Share Alike 4.0, Bing image generator

GENERATIVE AI CONCEPTS
Where does your feedback go?

1 Google Bard, 2023

GENERATIVE AI CONCEPTS
Thumbs up for better responses

1 Google Bard, 2023

GENERATIVE AI CONCEPTS
Embeddings as fingerprints
Similar to recognizing a fingerprint
Unique representations of data entities

Capture meaning, context, and


relationships in compact form

1 Wikimedia commons

GENERATIVE AI CONCEPTS
Embedding Dan

1 Daniel Tedesco, 2023

GENERATIVE AI CONCEPTS
Embedded Dan

1 Daniel Tedesco portraits from vana.com

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Model evaluation
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Why evaluate anyway?
Assess performance and effectiveness of a model:

Measure progress

Rigorous model comparison

Benchmark human performance

GENERATIVE AI CONCEPTS
Evaluating generative AIs
Quantitative Metrics Human-centric Metrics

Discriminative model evaluation metrics Human performance comparison

Generative model-specific metrics Intelligent evaluation

GENERATIVE AI CONCEPTS
Discriminative model evaluation techniques
Measure performance on well-defined tasks

Pros:

Widely accepted and understood

Easy to calculate and compare

Cons:

Do not capture subjective nature of


generated content

GENERATIVE AI CONCEPTS
Generative model-specific metrics
Customized for particular generative tasks

Pros:

Nuanced criteria, like realism, diversity, and


novelty

Many well-known metrics

Cons:

Cannot capture many subjective elements

Often do not generalize

GENERATIVE AI CONCEPTS
Human performance comparison

Pros:

Benchmarks against human abilities

Demonstrates practical applicability

Cons:

Unfair comparison

GENERATIVE AI CONCEPTS
Award-winning AIs
Human Competitions Human Standardized Tests

1 https://twitter.com/colostatefair/status/1565486317839863809, OpenAI

GENERATIVE AI CONCEPTS
The gold standard
Intelligent evaluation by humans or other AIs

Pros:

Captures subjective aspects

Cons:

Slow, costly, and difficult to standardize

Subject to human biases and irregularity

GENERATIVE AI CONCEPTS
Turing's classic test

Proposed by computer scientist Alan Turing

Human evaluator judges AI-generated


content

Passes if evaluator cannot distinguish AI


from human

But human behavior is not always the right


standard

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Evaluating and
mitigating social
bias
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
What do we mean by social bias?

Systematic unfairness in generative AI

Serious societal consequences

Fairness can be subjective

Focus on broadly shared values

GENERATIVE AI CONCEPTS
Where bias appears
Training data

The model itself

How the model is used

GENERATIVE AI CONCEPTS
Bias in data
Skewed or unrepresentative information in the training dataset

GENERATIVE AI CONCEPTS
Bias in models
Pursuing goals that result in biased outcomes

GENERATIVE AI CONCEPTS
Bias in use
Applying AI in wrong or malicious ways

GENERATIVE AI CONCEPTS
Identifying bias in data and models
Representation analysis compares how the model refers to different groups
Fairness metrics evaluate models for equal treatment, opportunity, and accuracy across
groups

Human audits ask real people to review a model's outputs to identify bias

GENERATIVE AI CONCEPTS
Mitigating bias in data and models

Diversify data collection

Adjust model to prioritize different data


Adversarial training

Continuous improvement

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Copyright and
ownership
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Who won?

The person wrote the prompt

The company built the model


The artists whose works trained the model

The AI which generated the art

1 Colorado State Fair

GENERATIVE AI CONCEPTS
Law vs. AI
Legal landscape is evolving to meet rapid AI advancement:

1. Intellectual property

2. Privacy implications
3. Evolving norms and regulations

GENERATIVE AI CONCEPTS
Follow IP best practices

Check copyright status of training data

Seek legal guidance about use


Stay informed of regulatory dynamics

GENERATIVE AI CONCEPTS
Privacy implications with every prompt
Read terms of service: understand how data is stored and used
Consider what we share: user data may be included in future training

Local alternatives: many generative AIs can be run at home

GENERATIVE AI CONCEPTS
Evolving norms
Different responses across industries
Norms in one context might not apply in
another

GENERATIVE AI CONCEPTS
Evolving regulations

Differ across jurisdictions

May depend on location of users, servers, and developers


Stay informed as landscape rapidly evolves

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Responsible
generative AI
applications
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
On the eve of the election

GENERATIVE AI CONCEPTS
Types of malicious use
Deepfakes
Misinformation campaigns

AI-enhanced hacking

1 Pablo Xavier

GENERATIVE AI CONCEPTS
Detection and prevention
Key usage principles

Human-in-the-loop

Harm prevention
Continuous monitoring

Points of Detection and Prevention

GENERATIVE AI CONCEPTS
Access
AI can unintentionally aid criminal groups'
non-criminal activities.

Avoid supporting malicious groups

Know Your Customer (KYC)


Verify user identity

GENERATIVE AI CONCEPTS
Prompts and responses
Moderating prompts Moderating responses

Similar to website or chat group Screen or filter responses before showing


moderation user

Jailbreaking prompts can still subvert


developer guidelines

GENERATIVE AI CONCEPTS
Applications
Malicious actors can apply benign responses
to illegal or unethical activity.

Invisible watermarks can help determine


source of content

May require law enforcement intervention

GENERATIVE AI CONCEPTS
Communication and feedback
Clear usage guidelines
Feedback loops
User studies and stakeholder roundtables

Partner with civil society organizations

Feedback opportunities in product

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Artificial general
intelligence (AGI)
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Revisiting AGI
An AI that exhibits intelligence like a human would:

Scope of knowledge

Reasoning across domains


Social skills

Creative thinking

Other cognitive competencies (vision, language)

GENERATIVE AI CONCEPTS
Immense pros
Productivity
Research progress

Engineering solutions

Companionship and wisdom

GENERATIVE AI CONCEPTS
Severe cons
Negative economic disruption
Malicious use

Value alignment problems

Existential catastrophe

GENERATIVE AI CONCEPTS
The safety debate
AGI can empower AGI can have negative consequences

GENERATIVE AI CONCEPTS
Controlling AGI outcomes
Requirements for aligning AGI and human values:

Clear rules and expectations

Constructive feedback

1. Hard constraints

2. Alignment strategies

3. Government intervention

GENERATIVE AI CONCEPTS
Hard constraints
1. Boxing restricts access to the wider world

2. Interruptibility adds a stop or off switch

GENERATIVE AI CONCEPTS
Alignment strategies
Iterative development

Constitutional AI

Multi-stakeholder engagement

1 Dave Gray

GENERATIVE AI CONCEPTS
Government intervention
CEOs of various AI companies meeting with
UK PM Rishi Sunak in 2023-
Beneficial regulations
Safety regulations

Rules for testing and oversight

Transparency standards

International collaboration

1 UK Prime Minister

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Bringing new AI into
old workflows
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Meeting our "replacement"

GENERATIVE AI CONCEPTS
Advantages and limitations

Advantages Limitations

Knowledge of trained fields Hallucination and potential bias


Very fast No common sense

Inexpensive Implementation challenges

GENERATIVE AI CONCEPTS
Augmentation

GENERATIVE AI CONCEPTS
Co-creation

GENERATIVE AI CONCEPTS
Replacement

GENERATIVE AI CONCEPTS
A novel implementation
Augmentation: AI suggests edits, human decides
Co-creation: AI and human collaboratively write a novel

Replacement: AI generates and publishes social media posts

GENERATIVE AI CONCEPTS
Identify opportunity

GENERATIVE AI CONCEPTS
Decompose the process

GENERATIVE AI CONCEPTS
Test an AI solution

GENERATIVE AI CONCEPTS
Scale up

GENERATIVE AI CONCEPTS
A new way of working

Treat AI as partners rather than


competitors

Prepare for lifelong learning, with AI


support

Be patient with integration

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Progress in
generative AI
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
A collaborative effort
Universities
Governments and civic organizations

Open-source communities

Startups and large companies

GENERATIVE AI CONCEPTS
Universities

New research, such as invention of GANs

New researchers
Partnerships with other sectors

1 University of Montreal website

GENERATIVE AI CONCEPTS
Governments and civic organizations
Governments

Establish and enforce regulatory


environment

Fund basic research

Civic institutions

Provide independent analyses

Resources, such as datasets

Project funding

1National Defense Magazine, https://www.nationaldefensemagazine.org/articles/2016/5/11/darpa-shows-off-


technology-at-demo-day

GENERATIVE AI CONCEPTS
Open-source communities
Some generative AI Open-source projects

Provide open access to tools and models

Lower barriers to entry, experimentation,


and sharing

Are difficult to sustain and maintain quality

Raise risks of misuse

GENERATIVE AI CONCEPTS
Startups and large companies
Seek competitive advantage
Bring generative AI to broad adoption

Showcase advances to attract talent and impress investors

Large companies additionally:

Fund research, such as introduction of transformers by Google

Acquire startups

Offer hardware and cloud resources

GENERATIVE AI CONCEPTS
The openness challenge

Pros Cons

Developer support Lose competitive advantage

Talent attraction Risk liability for misuse

Broader feedback

GENERATIVE AI CONCEPTS
The boundaries of generative AI development

Accelerators Decelerators

Decreasing hardware costs Technological limits

Research developments Overbearing regulation

Competitive and geopolitical pressures Closed ecosystems

Limited economic resources

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
Preparing for a
future of generative
AI
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Do more with less

Individuals become teams

Small teams create big things

Bureaucracies become more streamlined

GENERATIVE AI CONCEPTS
The AI divide

Access: availability, cost

Literacy: mindset, capability

1 International Telecommunications Union, 2023

GENERATIVE AI CONCEPTS
Education and jobs

Replacement, augmentation, and co-


creation, too

Reshape around generative AI


In education: access to AI, move away
from memorization

In the workplace: support from AI partner

Difficult transition

1 Various headlines from BBC, The Economist, Forbes, Business Insider, and Gitnux

GENERATIVE AI CONCEPTS
Media and entertainment
Creative explosion Which is real and which is AI-generated?
Personalized media

Requires new forms of trust

1 https://www.reddit.com/r/midjourney/comments/12uij2l/one_is_a_real_photo_and_one_is_ai_generated_can/

GENERATIVE AI CONCEPTS
Science and technological progress
Faster discoveries
Faster technology transfer

Human direction still needed

1 Deepmind website, https://www.deepmind.com/research/highlighted-research/alphafold

GENERATIVE AI CONCEPTS
Values: do they think and feel like us?

GENERATIVE AI CONCEPTS
Let's practice!
G E N E R AT I V E A I C O N C E P T S
You made it!
G E N E R AT I V E A I C O N C E P T S

Daniel Tedesco
Data Lead, Google
Congratulations

GENERATIVE AI CONCEPTS
Four chapters of fun

Chapter 1: Got to know generative AI

Chapter 2: Learned how these models are developed


Chapter 3: How to use generative AI and its content responsibly

Chapter 4: Got ourselves ready for the Age of Generative AI

GENERATIVE AI CONCEPTS
The learning just started
Explore more DataCamp courses:
Introduction to ChatGPT

Large Language Models (LLMs) Concepts

Experiment with generative AI in your own workflows

Stay up to date by following topical sources:


DataCamp's DataFramed Podcast

My podcast: www.youtube.com/@thecraftpodcast and Twitter @dtedesco1

GENERATIVE AI CONCEPTS
Congratulations!
G E N E R AT I V E A I C O N C E P T S
AI ethics: What's the
buzz?
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
Intro to ethics
AI growth and the surge in public attention

AI ethics: A crucial discussion

Definition of ethics: Guiding behavior


based on moral principles

The intersection of AI and ethics

1 Icons made by surang & Flowicon from www.flaticon.com

AI ETHICS
AI meets ethics
AI revolutionizes various sectors:
Healthcare

Media
Insurance

Examples
AI in healthcare - improves surgical
accuracy, early disease detection

AI in finance - automates processing,


fraud detection

1 Icons made by Freepik, wanicon, Pixel perfect from www.flaticon.com

AI ETHICS
Why AI ethics?
Why AI ethics?
Risks of unchecked biases, illustrated by
insurance claim denial scenario

Human influence on AI
Biases seep into decision-making

Wider impact
Legal professions, judiciary, public
decision-making

AI ETHICS
Ethics in practice
Aligning AI systems with ethical principles

Example:
Fairness in insurance model

Equal treatment for all claims

Guidelines in Insurance, Finance, and


Banking sectors

Ethical boundaries for fair, unbiased results

The potential of ethically built AI

1 Icons made by Freepik & noomath from www.flaticon.com

AI ETHICS
The big picture
AI ethics: Beyond avoiding harm or bias
Importance of accountability: Who is
responsible for AI's outcomes and
construction?

The role of transparency: Understanding


AI's decision-making process

Need for transparency and literacy in AI


development and results

1 Icons made by Freepik & Wichai.wi from www.flaticon.com

AI ETHICS
Wrapping up
AI ethics: The guiding beacon in an AI-
driven world

Ensures benefits of AI without


compromising moral values

Ethical AI: Not just good practice, but good


business

AI ETHICS
Let's practice!
AI ETHICS
Digging deeper: AI
ethics principles
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
Meet MedTech Innovations
MedTech Innovations
Healthcare company

Using AI to improve patient care and


uphold ethics

Personal reflections
Consider the application of AI ethics in...
Personal life

Career

Known businesses

AI ETHICS
The principle of fairness
MedTech's AI in patient care
Personalized treatment plans

Challenge:
Unintentional bias in AI systems

Potential discrimination

Principle of fairness
Equal treatment

Avoidance of discrimination

AI ETHICS
The principle of accountability
Scenario: MedTech's AI system mistake affecting patient treatment
Principle of accountability:
Someone should always be accountable for AI outcomes

AI ETHICS
The principle of transparency
Scenario: MedTech's AI system
recommends a specific treatment

Challenge: Understanding why AI made the


decision

Principle of transparency:
Decisions by AI should be explainable
and comprehensible

Sharing knowledge and information


across different stakeholders

AI ETHICS
Applying AI ethics
Fairness:
Continual testing of AI systems to detect and rectify bias

Accountability:
Clear responsibilities defined for each AI system's outcomes

Transparency:
Make AI systems explainable and understandable

Commitment:
Ethical adherence builds trust, mitigates risks

AI ETHICS
Why do they matter?
Understanding & applying principles: Ensures ethical AI use
Building trust: Transparency and accountability foster patient trust

Mitigating risks: Ethical AI use reduces potential risks

Promoting AI: Transparency and knowledge-building enhances societal trust and utilization
of AI

1 Icons made by Freepik & Smashicons from www.flaticon.com

AI ETHICS
Let's practice!
AI ETHICS
AI ethics: where's the
line?
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
The privacy-personalization paradox
AI personalizes user experiences,
enhancing appeal

The privacy-personalization paradox


Personalization can compromise user
privacy

Solution:
AI literacy

Clear privacy policies

Example: Spotify

AI ETHICS
The bias-fairness conundrum
Bias-fairness conundrum:
AI learns from data that can carry
societal biases

Result:
AI may unintentionally amplify these
biases

Example:
Early versions of ChatGPT

Solution:
Train AI models with fairer, bias-free data

AI ETHICS
The transparency-complexity trade-off
Transparency-complexity trade-off:
Complex AI models lack transparency
but are highly accurate

Simpler models are more transparent but


less accurate

AI literacy is vital for comprehension and


ethical implications

1 Icon made by Freepik from www.flaticon.com

AI ETHICS
The autonomy-control dilemma
Autonomy-control dilemma:
AI can act autonomously but might
operate outside human control

Question:
Should we prioritize autonomy or control?
No one-size-fits-all answer

Example:
Tesla's Autopilot system emphasizes
driver vigilance and readiness to take
control

AI ETHICS
Navigating the challenges
Navigating ethical dilemmas in AI requires thoughtful trade-offs
Importance of human element in decision-making

Striving for better decisions in complex situations

Need for diverse stakeholders' involvement and continuous AI monitoring

AI ETHICS
Let's practice!
AI ETHICS
Unpacking the
blackbox:
Transparency
AI ETHICS

Joe Franklin
Llama Enthusiast
Black-box nature
AI implementations are often black boxes
A black box in AI:
Known inputs and outputs

AI ETHICS
Ambiguousness is non-ideal
Ambiguity in AI: Ethical challenge

Question of trust:
Can we validate AI decisions without
understanding them?
Transparency:
Making an AI's decision-making process
understandable

Example:
Factors in AI sales model

AI ETHICS
Throughout the AI life cycle
Transparency in AI involves all stages of the
AI life-cycle

Purpose:
Understand the workings of the AI
system

Gauge comfort level with its operation

AI ETHICS
A deciding factor
Current state:
Transparency in AI is uncommon

Hesitation in AI adoption

Future implications:
Transparency will become a deciding
factor in users' choice of AI systems

Actionable:
Organizations should prioritize
transparency

1 Icon made by Eucalyp from www.flaticon.com

AI ETHICS
Openness is key
Openness about AI challenges and
learnings is key

Transparency encourages innovation in AI

It leads to more advanced, reliable AI


systems

1 Icon made by Freepik from www.flaticon.com

AI ETHICS
Embracing transparency in AI
Transparency in AI can be intimidating but
is beneficial for businesses

Transparency leads to predictable


regulations and public perception

Companies can compete based on


strengths, culture, customer relationships
rather than secrecy

AI ETHICS
Let's practice!
AI ETHICS
AI fairness: not just a
dream
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
Fairness in AI
Fairness: Ensure no group is favored over AI should predict patient outcomes
another equitably

Concerns race, gender, socioeconomic There should be no bias towards any


status, etc. specific group

AI ETHICS
Why does fairness matter?
AI's rapid processing can result in large-
scale impacts

Fairness prevents negative targeting of


vulnerable populations

Essential for responsible AI implementation,


ensures equitable consideration for all

1 Icons made by noomtah & Parzival' 1997 from www.flaticon.com

AI ETHICS
Promoting fairness
Fairness promotion is challenging but possible
Reduces potential bias by omitting certain variables

Variables include race, gender, age, socioeconomic status, sexual orientation, religion

AI ETHICS
Unintentional issues exist

Even with unawareness, unintentional bias


can still occur

Robust strategies needed to ensure fairness

1 Icons made by Freepik from www.flaticon.com

AI ETHICS
Minimizing bias
The main objective of AI fairness is minimizing bias
The first step is acknowledging bias exists

Remain skeptical and vigilant of AI

Conduct frequent monitoring and audits for fairness

AI ETHICS
Let's practice!
AI ETHICS
Safeguarding AI:
Accountability
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
Define accountability
Accountability:
Assigning responsibility for AI outcomes

Critical in AI's development, deployment,


and use

AI isn't a responsibility-evading "magic


wand"

AI ETHICS
Accountability is vital
People trust AI systems more when there is accountability
Accountability ensures ethical use and mitigates potential harm

Accountability means not absolving humans from responsibility

AI ETHICS
The paradox of accountability
Increasing AI accountability can improve
trust

Yet, excessive trust in AI can lead to


misguided decisions

Example:
Georgia Tech study where participants
followed misguided robot guidance

AI ETHICS
The Tesla story
Misunderstanding of the auto-pilot
capabilities among consumers

Criticism for Tesla's insufficient safeguards

Both Tesla and consumers share


responsibility

AI ETHICS
Achieving accountability
AI producers:
Achieving accountability involves
transparency and solving the 'Black Box'
problem
Attributing responsibility is key

AI consumers:
'Trust but verify'

Producers and consumers both play a role


in creating ethical AI

Challenges are opportunities for innovation

1 Icons made by Eucalyp & Sumitsaengtong from www.flaticon.com

AI ETHICS
No one-size-fits-all
Accountability in AI is a continuous journey
With each AI advancement, the accountability conversation evolves

No one-size-fits-all approach; varies across industries

1 Icon made by Freepik from www.flaticon.com

AI ETHICS
Let's practice!
AI ETHICS
Explainable AI
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
What's explainable AI?
AI systems whose internal workings are understood by humans
Goal: Making AI's decision-making clear, understandable, and explainable

Helps understand why and how AI makes decisions

Major step towards ethical AI usage

1 Icon made by vectorsmarket15 from www.flaticon.com

AI ETHICS
The central pillars
Transparency, fairness, accountability are
central

AI conclusions should be accessible and


logical to humans

Models built with explainability at their core

Uses interpretable models like decision


trees or linear regression

Power in seeing the process, despite


possibly lower performance

1 Icons made by juicy_fish & Becris from www.flaticon.com

AI ETHICS
How does it work?

AI ETHICS
How does it work?

AI ETHICS
Local Interpretable Model-agnostic Explanations (LIME)
LIME as a translator that helps the model
communicate

Creates a simpler version of the model's


decision process for a specific prediction

Example:
Explains a movie's hit prediction based
on factors like director popularity and
high budget

1 Icon made by Freepik from www.flaticon.com

AI ETHICS
SHapley Additive exPlanations (SHAP)
SHAP: A detective of AI, revealing feature
importance

SHAP in Action
Director: 50%

Cast: 30%

Genre: 15%

Budget: 5%

AI ETHICS
Future of XAI
Many more techniques and approaches exist in XAI
The gap between XAI and traditional AI is shrinking

Ongoing research is improving AI interpretability

AI ETHICS
Let's practice!
AI ETHICS
Ethical frameworks
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager
The background story
Numerous ethical frameworks guide AI decision-making
Deontological vs. consequentialist approaches

No universal framework for applying AI ethics

The diversity is actually beneficial

AI ETHICS
Ethical framework defined
Ethical frameworks provide scaffolding for ethical decisions

Example: AI in healthcare needs to respect privacy and ensure fairness

AI ETHICS
Organizational benefits
Benefits of ethical framework
Allows foresight in AI decision impact

Provides a clear starting point for AI


usage

Ethical frameworks & innovation


AI ethics no longer feared for stifling
innovation

Seen as promoting innovation by


alleviating ambiguity and challenges

1 Icon made by Prosymbols Premium from www.flaticon.com

AI ETHICS
Meet AgroTech!
Ethical frameworks vary across industries

AgroTech
New agricultural company, innovating
crop harvesting

Ethical framework pillars: environmental


sustainability, economic viability, social
equity

Guides the development of their Smart


Harvester drone series

AI ETHICS
Meet AgroTech!(2)
Smart harvesters shouldn't focus only on
expensive, resource-intensive crops

A potential risk: farmers incentivized to


plant specific crops, threatening
sustainability

Solutions must also be economically viable


to be used effectively

1 Icon made by Freepik from www.flaticon.com

AI ETHICS
Challenges are unavoidable
Balancing ethical considerations is complex
Cultural and regional variations complicate
frameworks

They guide AI development from a human


perspective

Aid in building trust with AI systems

1 Icons made by Flat Icons & Freepik from www.flaticon.com

AI ETHICS
Let's practice!
AI ETHICS
The value of ethical
AI
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
Balancing the scale
AI brings fast decision-making with widespread impact

Responsible AI use prevents crossing unanticipated barriers

Potential brand risk from system misbehavior is significant

AI ETHICS
AI ethics isn't optional
AI ethics: a necessity, not accessory
Balances the immense benefits and
potential pitfalls of AI

Case study: financial services industry


AI is indispensable

Absence could lead to disastrous


consequences

AI ETHICS
Bring in tangible impacts
Not just defensive but an offensive strategy
Propels organizations ahead of the curve

Creates trusted entities, enhancing


customer loyalty and brand reputation

Can lead to tangible impact on the bottom


line

1 Icon made by Vectoricons from www.flaticon.com

AI ETHICS
New field, new challenges
Recent emergence with large-scale AI use
Challenging to find examples of AI gone
bad

Too early in AI's evolution to see many


ethical missteps

1 Icon made by ultimatearm from www.flaticon.com

AI ETHICS
Let's practice!
AI ETHICS
The future of AI
ethics
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
Understanding the present

AI ETHICS
Anticipating future ethical dilemmas
With each AI advancement, new ethical
challenges emerge

Questions arise about data privacy,


potential bias, and decision-making
autonomy

Preparation is key to address future ethical


dilemmas

Future of AI ethics is unpredictable, but


patterns from history can guide us

1 Icon made by Freepik from www.flaticon.com

AI ETHICS
The dynamic nature of AI ethics
Evolves with technological advancements
Ethical principles must adapt to new AI
applications and societal values

Learn from advancements in data security,


privacy, and ethics

Stay alert to emerging trends, new


techniques, and potential pitfalls

1 Icons made by Freepik from www.flaticon.com

AI ETHICS
Ethical AI by Design
Awareness of technology and techniques is a prerequisite
Ethical AI by Design: integrating ethics from initial design stage of AI systems

Safeguards on data collection and storage

Place ethical principles at the forefront of the decision-making process

AI ETHICS
Ethical AI in practice
Healthcare AI systems:
Ensure transparency and explainability
for trust in AI-driven diagnoses

Retail AI systems:
Avoid bias and ensure accountability

AI ETHICS
Let's practice!
AI ETHICS
Honing ethics by
design
AI ETHICS

Joe Franklin
Associate Data Literacy and Essentials
Manager, DataCamp
Deceptively simple
AI Ethics by Design: Consider ethical ramifications of AI in advance

AI ETHICS
The big ones
Define objectives
Align with stakeholders

Collect and manage data

Design transparently

Evaluate bias

Address concerns

Review and iterate

AI ETHICS
Defining objectives

AI ETHICS
The gangs all here

AI ETHICS
The right data in the right place

AI ETHICS
Transparency in design

AI ETHICS
The end is only the beginning

AI ETHICS
It's a wonderful world out there

Introduction to Data Ethics by Shalini Kurapati

Explore the intersection of ethics and data. Learn valuable skills to collect and manage
data ethically.

Forming Analytical Questions by Konstantinos Kattidis

Discover how to ask a good question and connect with stakeholders to drive change with
analytics.

AI ETHICS
Thank you!
AI ETHICS

You might also like