Professional Documents
Culture Documents
Lect02 Agent
Lect02 Agent
Lect02 Agent
2024
Contents
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
Concept 1
The Structure An agent is anything that can be viewed as perceiving its environment through
of Agents
sensors and acting upon that environment through actuators
Agent Sensors
Percepts
Environment
?
Actions
Actuators
4
What is Agent (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • Organizations Microsoft, European Union, Real Madrid FC, an ant colony,...
Environments
The Structure
• People teacher, physician, stock trader, engineer, researcher, travel agent,
of Agents
farmer, waiter...
• Computers/devices thermostat, user interface, airplane controller, network
controller, game, advising system, tutoring system, diagnostic assistant,
robot, Google car, Mars rover...
• Animals dog, mouse, bird, insect, worm, bacterium, bacteria...
sensors
percepts
?
environment
agent
actions
actuators
5
What is Agent (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • Percept to refer to the agent’s perceptual inputs at any given instant.
Environments
The Structure
• Action to refer to agent’s behavior.
of Agents
• Percept sequence is a sequence of all past and present percepts the agent
has ever perceived.
• An action is described by the agent function that maps any given percept
sequence to an action
f : P∗ → A
• Agent program: the implementation of the agent function
6
Vacuum-cleaner
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• Percepts: location and contents, e.g.,
The Structure A B
of Agents [A, Dirty]
• Actions: Left, Right, Suck, NoOp
• Function:
Percept sequence Action
[A, Clean] Right
[A, Dirty] Suck
[B, Clean] Left
[B, Dirty] Suck
[A, Clean], [A, Clean] Right
.. ..
. .
7
Vacuum-cleaner (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
function Reflex-Vaccum-Agent([location,status])
The Structure
of Agents returns an action
if status = Dirty then return Suck
else if location = A then return Right
else if location = B then return Left
8
Good Behavior: The Concept of Rationality
Rationality
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • A rational agent is one that does the right thing
Environments
The Structure
• What is right thing?
of Agents
• When an agent is plunked down in an environment, it generates a
sequence of actions according to the percepts it receives. This sequence
of actions causes the environment to go through a sequence of states. If
the sequence is desirable, then the agent has performed well
• Performance measure evaluates any given sequence of environment
states (not agent states)
10
Rationality (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
A rule of thumb
The Structure It is better to design performance measures according to what one actually wants
of Agents
in the environment, rather than according to how one thinks the agent should
behave
11
Rationality (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
12
Rationality (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
Concept 2 (Definition of a rational agent)
The Structure For each possible percept sequence, a rational agent should select an action that
of Agents
is expected to maximize its performance measure, given the evidence provided by
the percept sequence and whatever built-in knowledge the agent has.
13
Omniscience, learning, and autonomy
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Structure
• Rational =6 clairvoyant, action outcomes may not be as expected
of Agents
• Hence, rational 6= successful
• Rational =⇒ exploration, learning, autonomy
14
Omniscience, learning, and autonomy (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
Information gathering
The Structure • Information gathering by doing actions in order to modify future percepts
of Agents
or exploration
• This is an important part of rationality
15
Omniscience, learning, and autonomy (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
Learning
The Structure • A rational agent also has to learn as much as possible from what it perceives.
of Agents
16
Omniscience, learning, and autonomy (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
Autonomy
The Structure • A rational agent should be autonomous – Learn what it can to compensate
of Agents
for partial or incorrect prior knowledge.
• If an agent just relies on the prior knowledge of its designer rather than
its own percepts then the agent lacks autonomy
17
The Nature of Environments
The task environment
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • Task environments are essentially the “problems” to which rational agents
Environments
The Structure
are the “solutions”
of Agents • The flavor of the task environment directly affects the appropriate design
for the agent program
• Task environment includes the PEAS (Performance, Environment,
Actuators, Sensors) description
• In designing an agent, the first step must always be to specify the task
environment as fully as possible.
19
An example: Automated taxi driver
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Structure
• safety, destination, profits, legality, comfort, . . .
of Agents
• Environment
• streets/freeways, traffic, pedestrians, weather, . . .
• Actuators
• steering, accelerator, brake, horn, speaker/display, . . .
• Sensors
• video, accelerometers, gauges, engine sensors, keyboard, GPS, . . .
20
Software agents
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • Sometimes, the environment may not be the real world.
Environments
The Structure
• Flight simulator, video games, Internet
of Agents • They are all artificial but very complex environments
• Those agents working in these environments are called software agents
(softbots).
• All parts of the agent are software.
21
Properties of task environments
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Structure
• Single agent vs. multiagent
of Agents
• Deterministic vs. stochastic
• Episodic vs. sequential
• Discrete vs. continuous
• Static vs. dynamic
• Known vs. unknown
22
Fully observable vs. partially observable
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• Fully observable: The agent’s
The Structure
of Agents sensory gives it access to the
complete state of the environment.
• The agent need not maintain
internal state to keep track of
the world.
• Partially observable
• Noisy and inaccurate sensors
• Parts of the state are simply
missing from the sensor data
• Unobservable: The agent has no
sensors at all
23
Single agent vs. multiagent
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• Single agent: An agent operates by
The Structure
of Agents itself in an environment.
• Solving crossword → single
agent, playing chess → two
agents
• Which entities must be viewed as
agents?
• Competitive vs. Cooperative
multiagent environment
• Playing chess → competitive,
driving on road → cooperative
24
Deterministic vs. stochastic
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• Deterministic: The next state of the
The Structure
of Agents environment is completely
determined by the current state and
the action executed by the agent.
• The vacuum world →
deterministic, driving on road
→ stochastic
• Most real situations are so complex
that they must be treated as
stochastic.
25
Episodic vs. sequential
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• Episodic: The agent’s experience is
The Structure
of Agents divided into atomic episodes, in
each of which the agent receives a
percept and then performs a single
action
• Sequential: A current decision could
affect future decisions
26
Discrete vs. continuous
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• The discrete/continuous distinction
The Structure
of Agents applies to the state of the
environment, to the way time is
handled, and to the percepts and
actions of the agent
27
Static vs. dynamic
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Structure
• Crossword puzzles → static, taxi driving → dynamic
of Agents
• Semidynamic: The environment itself does not change with the passage of
time but the agent’s performance score does
• Chess playing with a clock
28
Known vs. unknown
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • Known environment: the outcomes (or outcome probabilities if the
Environments
The Structure
environment is stochastic) for all actions are given.
of Agents • Unknown environment: the agent needs to learn how it works to make
good decisions.
29
Examples of different environments
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
The Structure
of Agents
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
agent = architecture + program
The Structure
of Agents
• Architecture: some sort of computing device with physical sensors and
actuators that this program will run on.
• Ordinary PC, robotic car with several onboard computers, cameras, and
other sensors, etc.
• Program has to be appropriate for the architecture.
• Program: Walk action → Architecture: legs
32
The agent programs
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • A trivial agent program: keep track of the percept sequence and index into a
Environments
The Structure
table of actions to decide what to do.
of Agents
function Table-Driven-Agent(percept)
returns an action
persistent: percepts, a sequence, initially empty
table, a table of actions,
indexed by percept sequences,
initially fully specified
append percept to the end of percepts
action ← Lookup(percepts, table)
return action
33
The agent programs (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Structure
• Let P be the set of possible percepts and
of Agents
• Let T be the lifetime of the agent (the total number of percepts it will
receive).
• The lookup table will contain T t
t=1 |P| entries → very huge table
P
34
Agent types
Agents and
Environments
Good Behavior:
The Concept of
Rationality
35
Simple reflex agents
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Structure
• Select actions based on the current percept, ignoring the rest of the percept
of Agents
history
• The connection from percept to action is represented by condition-action
rules.
IF current percept THEN action
• Limitations
• Knowledge sometimes cannot be stated explicitly → low applicability
• Work only if the environment is fully observable
36
Simple reflex agents (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
Environment
Condition-action rules What action I
should do now
Actuators
function Simple-Reflex-Agent(percept)
returns an action
persistent: rules, a set of condition-action rules
state ← Interpret-Input(percept)
rule ← Rule-Match(state, rules)
action ← rule.Action
return action 37
Model-based reflex agents
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of • Partially observability → the agent has to keep track of an internal state
Environments
The Structure
• Depend on the percept history and reflect some of the unobserved
of Agents
aspects
• The agent program updates the internal state information as time goes by by
encoding two kinds of knowledge
• How the world evolves independently of the agent
• How the agent’s actions affect the world
38
Model-based reflex agents (cont.)
Agents and
Environments
Good Behavior:
The Concept of
Rationality
Environment
of Agents
What my actions do
Agent Actuators
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• Current state of the environment is
The Structure Sensors
of Agents always not enough State
What the world
• The agent further needs some sort How the world evolves is like now
Environment
of goal information that describes
What it will be like
What my actions do if I do action A
Agent Actuators
40
Utility-based agents
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
• Goals alone are not enough to
The Structure Sensors
of Agents generate high-quality behavior in State
most environments
What the world
How the world evolves is like now
Environment
• Many action sequences to get the What my actions do
What it will be like
if I do action A
goals, some are better and some Utility How happy I will be
in such a state
worse What action I
performance measure.
• Goal → success, utility →
degree of success (how
successful it is)
41
Learning agents
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Nature of
Environments
A learning agent is divided into four conceptual Performance standard
The Structure
of Agents components Critic Sensors
Environment
• Performance element → Selecting external Learning
element
changes
Performance
element
actions
knowledge
learning
goals
42
Component representations
Agents and
Environments
Good Behavior:
The Concept of
Rationality
The Structure
of Agents
B C
B C
Figure 2: Three ways to represent states and the transitions between them. (a) Atomic
representation: a state (such as B or C) is a black box with no internal structure; (b)
Factored representation: a state consists of a vector of attribute values; values can be
Boolean, real-valued, or one of a fixed set of symbols. (c) Structured representation: a
state includes objects, each of which may have attributes of its own as well as
relationships to other objects.
43
References