
Artificial Intelligence and Intelligent Agents (F29AI)
MDP I: Intro to Markov Decision Processes
Arash Eshghi
Based on slides from Ioannis Konstas @HWU, Verena Rieser @HWU, Dan Klein @UC Berkeley
So far…
• How to formalise a real-world problem?
• How to reach a goal?
• Blind search: DFS, BFS, UCS
• Informed search: Greedy, A*
• How to maximise an outcome if there is an adversary?
• Adversarial search: minimax, alpha-beta pruning
• What to do if this adversary (environment) is probabilistic?
• Expectimax, maximum expected utilities
Today
• Non-deterministic worlds
• Markov Decision Processes (MDPs)
• Time-limited values

How can you plan optimally if your actions are non-deterministic?
(you only have a distribution over the outcomes)
Example: Grid World
• A maze-like problem
• The agent lives in a grid
• Walls block the agent’s path
• Noisy movement: actions do not always go as planned (see the transition sketch after this list)
• 80% of the time, the action North takes the agent North (if there is no wall there)
• 10% of the time, North takes the agent West; 10% of the time, East
• If there is a wall in the direction the agent would have been taken, the agent stays put
• The agent receives rewards each time step
• Small “living” reward each step (can be negative)
• Big rewards come at the end (good or bad)
• Goal: maximize sum of rewards
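
A minimal sketch of this noisy transition model in Python; the names NOISE, MOVES and transition are illustrative assumptions, not the course's code:

# Sketch of Grid World's noisy movement (assumed names, not course code).
# Each action succeeds 80% of the time and slips to each perpendicular
# direction 10% of the time; a move into a wall leaves the agent in place.

NOISE = {  # action -> [(actual direction, probability), ...]
    "N": [("N", 0.8), ("W", 0.1), ("E", 0.1)],
    "S": [("S", 0.8), ("E", 0.1), ("W", 0.1)],
    "E": [("E", 0.8), ("N", 0.1), ("S", 0.1)],
    "W": [("W", 0.8), ("S", 0.1), ("N", 0.1)],
}

MOVES = {"N": (0, 1), "S": (0, -1), "E": (1, 0), "W": (-1, 0)}

def transition(state, action, walls):
    """Return [(next_state, probability), ...] for taking `action` in `state`."""
    outcomes = []
    for direction, prob in NOISE[action]:
        dx, dy = MOVES[direction]
        nxt = (state[0] + dx, state[1] + dy)
        # If the resulting square is a wall, the agent stays put.
        outcomes.append((state if nxt in walls else nxt, prob))
    return outcomes

# Example: going North from (1, 1) with a wall at (0, 1):
# 80% -> (1, 2); 10% -> stays at (1, 1) (West is blocked); 10% -> (2, 1)
print(transition((1, 1), "N", walls={(0, 1)}))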
Grid World Actions
[Figure: action outcomes in a Deterministic Grid World vs. a Stochastic Grid World]
Markov Decision Processes

• An MDP is defined by:
• A set of states s ∈ S
• A set of actions a ∈ A
• A transition function T(s, a, s’)
• The probability that action a in state s leads to s’, i.e., T(s, a, s’) = P(s’ | s, a)
• Also called “the model”
• A reward function R(s, a, s’)
• Sometimes just R(s) or R(s’)
• A start state (or a distribution over start states)
• Maybe a terminal state

• MDPs are a family of non-deterministic search problems
• One way to solve them is with expectimax search – but we will have a new tool soon (a sketch of the definition as code follows below)
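
As a sketch, the definition above can be captured as a small Python interface; the class and method names here are assumptions for illustration, not a standard API:

from abc import ABC, abstractmethod

class MDP(ABC):
    """Sketch of the MDP definition above; names are illustrative assumptions."""

    @abstractmethod
    def states(self):
        """The set of states S."""

    @abstractmethod
    def actions(self, s):
        """The set of actions A available in state s."""

    @abstractmethod
    def transition(self, s, a):
        """T(s, a, s'): a list of (s', P(s' | s, a)) pairs -- 'the model'."""

    @abstractmethod
    def reward(self, s, a, s_next):
        """R(s, a, s'); some formulations use just R(s) or R(s')."""

    @abstractmethod
    def start_state(self):
        """The start state (or a sample from the start distribution)."""

    def is_terminal(self, s):
        """Terminal states are optional; default: none."""
        return False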
Video of Demo: Gridworld Manual Intro
[Video: Gridworld demo]
What is Markov about MDPs?

• “Markov” generally means that given the present state, the future and the past are independent.
• For Markov decision processes, “Markov” means action outcomes depend only on the current state.
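
In symbols, one standard way to write this property (not from the slides) is:

$$P(S_{t+1} = s' \mid S_t, A_t, S_{t-1}, A_{t-1}, \ldots, S_0) = P(S_{t+1} = s' \mid S_t, A_t)$$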

[Portrait: Andrey Markov (1856-1922)]
