Welcome to Scribd!

AI MDP Assignment

Uploaded by

0% found this document useful (0 votes)

97 views4 pages

This document outlines an assignment involving Markov decision processes (MDPs). It consists of three parts: 1. Writing a program to perform value iteration on a general grid-based MDP and output the final utility values. 2. Performing value iteration on a specific MDP grid and analyzing how the optimal policy changes across iterations, highlighting which iterations saw policy changes and why. 3. Modeling the specific MDP as a linear program to compute expected utilities, which should match the value iteration results to within a tolerance level. Deliverables include code for the general MDP solver, a write-up analyzing the policy changes in value iteration, and an Excel/LibreOffice file showing the linear program solution

Original Description:

Markov Decision Process

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

97 views4 pages

AI MDP Assignment

Uploaded by

gulshan kumar

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 4

Search inside document

ASSIGNMENT II

MARKOV DECISION PROCESS

You will be given a grid world based MDP parameterized as below
Input: First line of the input consists of 2 space separated integers N M the dimensions
of the grid world.
N lines follow, each having M real numbers specifying the rewards for getting into that
state
Next line consists of 2 space separated integers E W, number of end states and number
of walls
Next E lines follow - each having 2 integers - the coordinates of end states.
Next W lines follow - each having 2 integers - the coordinates of walls.
Next line has 2 integers specifying the coordinates of the start state.
Next line has 1 integer specifying the unit step reward.

The following input corresponds to the MDP below:

44
-x 0 0 0
0 0 0 0
x/10 0 0 0
0 -x/10 0 x
21
00
33
12
30
-x / 5
+X / 10

START -X / 10

Problem Specifications
● Rows are numbered 0,1,2,3 from top to bottom and columns are numbered 0,1,2,3
from left to right. Eg. Start cell is (3,0)
● The cell (3,3) is the positive(green) sink while (0,0) is the negative(red) sink
● The blue cells are blocked (assume them as walls)
● The borders of the grid are also walls
Agent can go North, South, East or West
Action from a state results in
○ Movement in intended direction with probability 0.8
○ Movement in directions perpendicular to the intended direction with 0.1 probability
each (0.8 + 0.1 + 0.1 = 1). Eg. If action is North, then actual movement will be to North
with 0.8 prob, to East with 0.1 prob, & to West with 0.1 prob.
●If an action results in reaching a cell with wall, agent will remain in the same cell
● No action needs to be performed at terminal states
NOTE:
You will be given numeric inputs for PART 1 of the assignment.
For Part 2 and 3 of the assignment replace X with your team number.

Problem Statement
The assignment consists of three parts:
Part A: Given any MDP of the aforementioned form - write a program that performs
Value Iteration(VI) on that MDP and gives final utility of each state as output. For
termination condition consider 1% change or less within tolerance.
Output format: if the grid is n * m , output should consist of n lines - having m space
separated values each upto 3 decimal precision.
For ex: if grid is 2 * 3
Output: 5.300 3.200 7.200
1.200 5.900 6.100
Part B: Perform the VI algorithm on the above given specific MDP [here x will be your
team number]. Now, carefully analyse how the policy changes over the iterations.
Observe the iterations in which there is a change in the optimal action to be taken in any
state and why the policy changed in that specific way.
You have to submit a handwritten / pdf note, highlighting the iterations in which the
policy changes and then explain the reason for policy changing in that specific way.
Part C: Model the above problem(specific MDP) using LP shown below

Q1. Model the parameters r,A and α

Q2. Use the excel LP solver to compute the x values and the expected utilities for this
MDP
Please verify that the expected utility obtained is equivalent to the one obtained using
the VI algorithm. The VI value and LP value can differ at max by delta*(1.2)

Deliverables
You are supposed to submit :
1. A working code for general MDP solving as specified in part A. 20 marks
2. Handwritten / pdf as specified in part B. 20 marks
3. Excel file or .ods (LibreOffice) file, showing the LP solved. 20 marks
Upload the above files in a zipped folder with the name
Assignment2_teamNumber.zip and submit the hard copy in class.
Deadline
● The zip folder needs to be uploaded on moodle on or before March 12, 11.55 pm.
Note: You need to load the solver add-in in Excel if not already installed. It comes
loaded with LibreOffice packages by default.

5 AGS Pre Algebra
Document123 pages
5 AGS Pre Algebra
dabicvasilije31
No ratings yet
Convolutional Neural Networks (LeNet) - DeepLearning 0.1 Documentation
Document12 pages
Convolutional Neural Networks (LeNet) - DeepLearning 0.1 Documentation
Sumit Patel
No ratings yet
Practical Exercise Epipolar Geometry
Document3 pages
Practical Exercise Epipolar Geometry
dansileshi
No ratings yet
AI Test1 Key
Document14 pages
AI Test1 Key
Lakshmi Ramu
No ratings yet
Final Exam Question Static
Document13 pages
Final Exam Question Static
Asmadi Yussuf
100% (1)
A Simple Quadrilateral Shell Element
Document9 pages
A Simple Quadrilateral Shell Element
koulliredouane
No ratings yet
Mathcad and Matlab Instruction Guide: by Breneman Whitfield (gth782h)
Document26 pages
Mathcad and Matlab Instruction Guide: by Breneman Whitfield (gth782h)
Puspal Ghosh
No ratings yet
Skip
Document22 pages
Skip
Anonymous 0MQ3zR
No ratings yet
Tutorial On Solver
Document26 pages
Tutorial On Solver
parth
No ratings yet
7.2. Obtención de Modelo Matemático
Document12 pages
7.2. Obtención de Modelo Matemático
matias
No ratings yet
Assignment 3
Document5 pages
Assignment 3
ritik12041998
No ratings yet
Efficient Implementation of 16-Bit Multiplier-Accumulator Using Radix-2 Modified Booth Algorithm and SPST Adder Using Verilog
Document12 pages
Efficient Implementation of 16-Bit Multiplier-Accumulator Using Radix-2 Modified Booth Algorithm and SPST Adder Using Verilog
Anonymous e4UpOQEP
No ratings yet
Question Bank
Document26 pages
Question Bank
Arun Choudhary
No ratings yet
Accuracy, Condition Numbers and Pivoting: The Effect of Rounding
Document4 pages
Accuracy, Condition Numbers and Pivoting: The Effect of Rounding
aleyhaider
No ratings yet
Lab 01
Document1 page
Lab 01
Veneiro
No ratings yet
Matlab Basics Tutorial: Electrical and Electronic Engineering & Electrical and Communication Engineering Students
Document23 pages
Matlab Basics Tutorial: Electrical and Electronic Engineering & Electrical and Communication Engineering Students
sushantnirwan
No ratings yet
Lab02 Worksheet
Document4 pages
Lab02 Worksheet
Spin Fotonio
No ratings yet
2 To 1 Mux
Document19 pages
2 To 1 Mux
TelmoDiez
100% (1)
SVM Example in R
Document4 pages
SVM Example in R
John Alexis Cabolis
No ratings yet
Building The Up
Document30 pages
Building The Up
abhiii86
No ratings yet
Lab 06
Document7 pages
Lab 06
Andy Meyer
No ratings yet
Sheet of Chapter (2) : MATLAB Matrices
Document5 pages
Sheet of Chapter (2) : MATLAB Matrices
Ahmad Ash Sharkawi
No ratings yet
A Scenario 4
Document28 pages
A Scenario 4
Vasi Acm
No ratings yet
Matlogix Question 2012
Document12 pages
Matlogix Question 2012
Aman04509
No ratings yet
Computer Graphis
Document7 pages
Computer Graphis
Albin Ps Plappallil
No ratings yet
Implentation of Goldschmidt's Algorithm For 16 Bit Division and Square Root
Document13 pages
Implentation of Goldschmidt's Algorithm For 16 Bit Division and Square Root
Cale Spratt
100% (1)
CG Lecture3 V
Document19 pages
CG Lecture3 V
Zeenah khalid ŔJ
No ratings yet
Chem 142 Experiment #1: Atomic Emission: Deadline: Due in Your TA's 3rd-Floor Mailbox 72 Hours After Your Lab Ends
Document8 pages
Chem 142 Experiment #1: Atomic Emission: Deadline: Due in Your TA's 3rd-Floor Mailbox 72 Hours After Your Lab Ends
api-301880160
No ratings yet
FSM Prob
Document3 pages
FSM Prob
Souhardya Mondal
No ratings yet
Output Primitives (Rasterization)
Document30 pages
Output Primitives (Rasterization)
Suada Bőw Wéěžý
No ratings yet
Numerical Solution of PDE: Comsol Multiphysics: Example 1, The Poisson Equation On An Ellipse
Document4 pages
Numerical Solution of PDE: Comsol Multiphysics: Example 1, The Poisson Equation On An Ellipse
Peanut d. Destroyer
No ratings yet
Ass2 2022 2
Document3 pages
Ass2 2022 2
saumya11599
No ratings yet
EENG 2611 HW Lab1 PDF
Document6 pages
EENG 2611 HW Lab1 PDF
Pipe Ogunbanwo
No ratings yet
Me3360 HW1-SP2013
Document3 pages
Me3360 HW1-SP2013
elm391229
No ratings yet
Appendix A. Verilog Examples: A.1 Combinational Logic Structures
Document14 pages
Appendix A. Verilog Examples: A.1 Combinational Logic Structures
Trenton Lindholm
No ratings yet
Efficient VLSI Implementation of Modulo Addition and Multiplication
Document10 pages
Efficient VLSI Implementation of Modulo Addition and Multiplication
vka_prince
No ratings yet
ENG3104 Assignment3
Document8 pages
ENG3104 Assignment3
Nayim Mohammad
No ratings yet
Green Energy
Document13 pages
Green Energy
Alin Òó
No ratings yet
Configuration of MTL Receiver and Transmitter
Document3 pages
Configuration of MTL Receiver and Transmitter
JHANSI
No ratings yet
C5-Error Correcting Codes
Document17 pages
C5-Error Correcting Codes
Lê Thanh Hùng
No ratings yet
Bode Plot
Document5 pages
Bode Plot
Alexandro Andra Pizzaro
No ratings yet
Mem351lab MatlabSimulink
Document13 pages
Mem351lab MatlabSimulink
ebrehe
No ratings yet
MATLAB
Document47 pages
MATLAB
Maryam Kausar
No ratings yet
Parallel Implementation of A 4 X 4-Bit Multiplier
Document4 pages
Parallel Implementation of A 4 X 4-Bit Multiplier
Mathew George
No ratings yet
Operations Research
Document58 pages
Operations Research
shailja Singh
No ratings yet
Digital Assignment - I Computational Fluid Dynamics: Course Code-Mee4006
Document19 pages
Digital Assignment - I Computational Fluid Dynamics: Course Code-Mee4006
SIDDHANT KUMAR 17BEM0015
No ratings yet
Lab #2 Signal Ploting and Matrix Operatons in Matlab
Document17 pages
Lab #2 Signal Ploting and Matrix Operatons in Matlab
Nihal Ahmad
No ratings yet
Computer Task 2
Document4 pages
Computer Task 2
Ivan Er
No ratings yet
Handout Nivel Retea
Document14 pages
Handout Nivel Retea
Alexz Alee
No ratings yet
ADS Tutorial 2
Document4 pages
ADS Tutorial 2
Minh Vu
No ratings yet
Untitled
Document63 pages
Untitled
魏延任
No ratings yet
Split The Non-Key Columns To Separate Tables With Key Column in Both
Document25 pages
Split The Non-Key Columns To Separate Tables With Key Column in Both
r.m.ram234
No ratings yet
PCM Codigo
Document14 pages
PCM Codigo
Leux Guillen
No ratings yet
Floorplan Design of VLSI
Document18 pages
Floorplan Design of VLSI
Atul Prakash Dwivedi
No ratings yet
Design A Full Adder by Using Two Half Adders. (7M, May-19)
Document10 pages
Design A Full Adder by Using Two Half Adders. (7M, May-19)
Sri Silpa Padmanabhuni
100% (1)
Control Systems Group Project 2
Document3 pages
Control Systems Group Project 2
Yu-Yun Chang
No ratings yet
Free Body Diagram and Newton's Law
Document4 pages
Free Body Diagram and Newton's Law
seeknana2833
No ratings yet
Applications of The Wavelet Transform in Image Processing: 1 Prelimiaries
Document16 pages
Applications of The Wavelet Transform in Image Processing: 1 Prelimiaries
Dundi Nath
No ratings yet
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
Bresenham Line Algorithm: Efficient Pixel-Perfect Line Rendering for Computer Vision
From Everand
Bresenham Line Algorithm: Efficient Pixel-Perfect Line Rendering for Computer Vision
Fouad Sabry
No ratings yet
Fundamental Math
From Everand
Fundamental Math
Russell Pead
No ratings yet
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
Rating: 2.5 out of 5 stars
2.5/5 (2)
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
Rating: 3 out of 5 stars
3/5 (4)
O Dependence vs. Independence
Document6 pages
O Dependence vs. Independence
eli brandt
No ratings yet
How To Trade With The VWAP Indicator
Document29 pages
How To Trade With The VWAP Indicator
habibimario
0% (1)
Circle Diagram For Three Phase Induction Motors: Tests Required
Document2 pages
Circle Diagram For Three Phase Induction Motors: Tests Required
Ekansh Dwivedi
No ratings yet
Worksheet-03: All All
Document11 pages
Worksheet-03: All All
fghreihgh
No ratings yet
MATHEMATICS Lesson Note JSS 1
Document6 pages
MATHEMATICS Lesson Note JSS 1
jonahorji
No ratings yet
5 More About Polynomials: Review Exercise 5 (P. 5.5)
Document47 pages
5 More About Polynomials: Review Exercise 5 (P. 5.5)
Eunice Ng
No ratings yet
Composite Pressure Hulls For Deep Ocean Submersibles
Document13 pages
Composite Pressure Hulls For Deep Ocean Submersibles
Suja Subramanian
No ratings yet
Report Cover Page and Format
Document6 pages
Report Cover Page and Format
luchi_babez
No ratings yet
Factorising Quadratics ( 1) : (Level 5)
Document5 pages
Factorising Quadratics ( 1) : (Level 5)
Chikanma Okoisor
No ratings yet
Lecture 13
Document14 pages
Lecture 13
rafey h
No ratings yet
Probability of Independent Events
Document10 pages
Probability of Independent Events
Yash Keith Smith
No ratings yet
Kinetics: The Rate of Chemical Reaction
Document4 pages
Kinetics: The Rate of Chemical Reaction
crybaby
0% (1)
Kashmir University UG
Document13 pages
Kashmir University UG
Zahid Quresh
100% (1)
Grade 8 February 12 - 16 2018
Document3 pages
Grade 8 February 12 - 16 2018
LAWRENCE MALLARI
No ratings yet
2008 - Holographic Measurement of The 26 M HartRAO Telescope
Document54 pages
2008 - Holographic Measurement of The 26 M HartRAO Telescope
zan wang
No ratings yet
Six Phase Permanent Magnet Machine With Fractional Slot Concentrated Winding
Document5 pages
Six Phase Permanent Magnet Machine With Fractional Slot Concentrated Winding
schlemihl69
No ratings yet
CS604 Final Term Solved MCQS With References by Awais
Document48 pages
CS604 Final Term Solved MCQS With References by Awais
Muhammad Awais
100% (2)
Empirical Correlations Drained Shear Strength For Slope Stability Analyses
Document10 pages
Empirical Correlations Drained Shear Strength For Slope Stability Analyses
PS
No ratings yet
Mine Ventilation
Document37 pages
Mine Ventilation
Kemal Cengiz
100% (3)
Statistical Concepts For Criminal Justice and Criminology 1st Edition Williams III Test Bank
Document13 pages
Statistical Concepts For Criminal Justice and Criminology 1st Edition Williams III Test Bank
JessicaDavisbrioj
100% (13)
Quantitative Analysis PDF
Document303 pages
Quantitative Analysis PDF
amiller1987
No ratings yet
Perry Como Papa Loves Mambo
Document10 pages
Perry Como Papa Loves Mambo
Anonymous gPhRqg3
0% (1)
Al-Mahdi High Mathematics 11 - Grade Name: - . - . - . - . - Derivatives W.S-4
Document4 pages
Al-Mahdi High Mathematics 11 - Grade Name: - . - . - . - . - Derivatives W.S-4
api-253679034
No ratings yet
Uncertainty in Measurement
Document12 pages
Uncertainty in Measurement
Lucas Santos
No ratings yet
1-4 Practice - A
Document2 pages
1-4 Practice - A
Stanley
No ratings yet
Martin Schickert - Ultrasonic NDE of Concrete
Document10 pages
Martin Schickert - Ultrasonic NDE of Concrete
Georgiana Matache
No ratings yet