MultiAgentVFPlanningAgent

java.lang.Object
- burlap.oomdp.stochasticgames.Agent
- - burlap.behavior.stochasticgame.agents.mavf.MultiAgentVFPlanningAgent

```
public class MultiAgentVFPlanningAgent
extends Agent
```
An agent that uses a multi-agent value function planning algorithm (an instance of MAValueFunctionPlanner) to compute the value of each state and then follows a policy derived from a joint policy that is in turn derived from that estimated value function. At each step, MAValueFunctionPlanner.planFromState(State) is called first and the policy is then followed. Ideally, the planning object should only perform planning for a state if it has not already planned for it. The joint policy underlying the policy the agent follows must be an instance of MAQSourcePolicy. Furthermore, when the policy is set, the underlying joint policy will automatically be set to use this agent's planning object as the value function source, and its set of agents will automatically be set to the agents involved in this agent's world. The PolicyFromJointPolicy will also be told that this agent is its target.
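
The plan-then-act step described above can be illustrated with a minimal sketch. It does not use the BURLAP classes: `SketchPlanner`, `SketchPlanningAgent`, `greedyAction`, and the use of `String` for states and actions are hypothetical stand-ins chosen only to show a planner that skips states it has already planned for.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for a multi-agent value function planner (not the BURLAP class).
class SketchPlanner {
    private final Set<String> plannedStates = new HashSet<>();
    private final Map<String, String> bestAction = new HashMap<>();
    int planningCalls = 0; // counts how often planning work was actually performed

    // Plans for the state only if it has not been planned for already.
    void planFromState(String state) {
        if (!plannedStates.add(state)) {
            return; // already planned for this state; skip the expensive work
        }
        planningCalls++;
        // Placeholder for real planning: a real planner would estimate values here.
        bestAction.put(state, "action-for-" + state);
    }

    // Assumed helper: returns the action chosen from the planner's current estimate.
    String greedyAction(String state) {
        return bestAction.get(state);
    }
}

// Hypothetical stand-in for the agent: plan from the current state, then follow the policy.
class SketchPlanningAgent {
    private final SketchPlanner planner;

    SketchPlanningAgent(SketchPlanner planner) {
        this.planner = planner;
    }

    String getAction(String state) {
        planner.planFromState(state);       // called on every step
        return planner.greedyAction(state); // then the derived policy is followed
    }
}
```

Calling `getAction` twice for the same state triggers planning only once in this sketch, which is the behavior the description recommends for the planning object.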

Author:

James MacGlashan

Field Summary

Fields
Modifier and Type	Field and Description
`protected MAValueFunctionPlanner`	`planner` The planner this agent will use to estimate the value function and thereby determine its policy.
`protected PolicyFromJointPolicy`	`policy` The policy that this agent will follow, derived from a joint policy that is itself derived from the planner's value function estimate.
`protected boolean`	`setAgentDefinitions` Whether the agent definitions for this planner have been set yet.

Fields inherited from class burlap.oomdp.stochasticgames.Agent
agentType, domain, internalRewardFunction, world, worldAgentName

Constructor Summary

Constructors
Constructor and Description
`MultiAgentVFPlanningAgent(SGDomain domain, MAValueFunctionPlanner planner, PolicyFromJointPolicy policy)` Initializes.

Method Summary

Methods
Modifier and Type	Method and Description
`void`	`gameStarting()` This method is called by the world when a new game is starting.
`void`	`gameTerminated()` This method is called by the world when a game has ended.
`GroundedSingleAction`	`getAction(State s)` This method is called by the world when it needs the agent to choose an action.
`void`	`joinWorld(World w, AgentType as)` Causes this agent instance to join a world.
`void`	`observeOutcome(State s, JointAction jointAction, java.util.Map<java.lang.String,java.lang.Double> jointReward, State sprime, boolean isTerminal)` This method is called by the world when every agent in the world has taken their action.
`void`	`setPolicy(PolicyFromJointPolicy policy)` Sets the policy, derived from this agent's planner, that the agent will follow.

Methods inherited from class burlap.oomdp.stochasticgames.Agent
getAgentName, getAgentType, getInternalRewardFunction, init, setInternalRewardFunction

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - planner
```
protected MAValueFunctionPlanner planner
```
    The planner this agent will use to estimate the value function and thereby determine its policy.
  - policy
```
protected PolicyFromJointPolicy policy
```
    The policy that this agent will follow, derived from a joint policy that is itself derived from the planner's value function estimate.
  - setAgentDefinitions
```
protected boolean setAgentDefinitions
```
    Whether the agent definitions for this planner have been set yet.
- Constructor Detail
  - MultiAgentVFPlanningAgent
```
public MultiAgentVFPlanningAgent(SGDomain domain,
                         MAValueFunctionPlanner planner,
                         PolicyFromJointPolicy policy)
```
    Initializes. The joint policy underlying the given policy must be an instance of MAQSourcePolicy, or a runtime exception will be thrown. The joint policy will automatically be set to use the provided planner as the value function source.
    
    Parameters:
    domain - the domain in which the agent will act
    planner - the planner the agent should use for determining its policy
    policy - the policy that will use the planner's value function as a source.
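
    The type check and automatic wiring described for the constructor can be sketched as follows. This is an illustration under stated assumptions, not the BURLAP source: every type here (`SketchJointPolicy`, `SketchQSourcePolicy`, `SketchOtherJointPolicy`, `SketchPolicyWrapper`, `SketchCheckedAgent`) and the `qSource` field are hypothetical stand-ins.

```java
// Hypothetical stand-in for a joint policy type.
interface SketchJointPolicy {}

// Hypothetical stand-in for a joint policy that reads values from a Q-source.
class SketchQSourcePolicy implements SketchJointPolicy {
    Object qSource; // the value function source this joint policy will query
}

// A joint policy that cannot take a Q-source; used to show the failure case.
class SketchOtherJointPolicy implements SketchJointPolicy {}

// Hypothetical stand-in for a single-agent policy wrapping a joint policy.
class SketchPolicyWrapper {
    final SketchJointPolicy jointPolicy;

    SketchPolicyWrapper(SketchJointPolicy jointPolicy) {
        this.jointPolicy = jointPolicy;
    }
}

// Hypothetical stand-in for the agent's constructor and setPolicy behavior.
class SketchCheckedAgent {
    private final Object planner;
    private SketchPolicyWrapper policy;

    SketchCheckedAgent(Object planner, SketchPolicyWrapper policy) {
        this.planner = planner;
        setPolicy(policy);
    }

    // Rejects unsupported joint policies, then wires the planner in as the source.
    final void setPolicy(SketchPolicyWrapper policy) {
        if (!(policy.jointPolicy instanceof SketchQSourcePolicy)) {
            throw new RuntimeException(
                "The underlying joint policy must be a Q-source policy.");
        }
        ((SketchQSourcePolicy) policy.jointPolicy).qSource = planner;
        this.policy = policy;
    }
}
```

    In this sketch the check runs before any state is changed, so a rejected policy leaves the agent's previous policy in place.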
- Method Detail
  - setPolicy
```
public void setPolicy(PolicyFromJointPolicy policy)
```
    Sets the policy, derived from this agent's planner, that the agent will follow. The joint policy underlying the given policy must be an instance of MAQSourcePolicy, or a runtime exception will be thrown. The joint policy will automatically be set to use this agent's planner as the value function source.
    
    Parameters:
    policy - the policy that will use the planner's value function as a source.
  - joinWorld
```
public void joinWorld(World w,
             AgentType as)
```
    Description copied from class: Agent
    
    Causes this agent instance to join a world.
    
    Overrides:
    
    joinWorld in class Agent
    
    Parameters:
    w - the world for the agent to join
    as - the agent type the agent will be joining as
  - gameStarting
```
public void gameStarting()
```
    Description copied from class: Agent
    
    This method is called by the world when a new game is starting.
    
    Specified by:
    
    gameStarting in class Agent
  - getAction
```
public GroundedSingleAction getAction(State s)
```
    Description copied from class: Agent
    
    This method is called by the world when it needs the agent to choose an action.
    
    Specified by:
    
    getAction in class Agent
    
    Parameters:
    s - the current state of the world
    
    Returns:
    the action this agent wishes to take
  - observeOutcome
```
public void observeOutcome(State s,
                  JointAction jointAction,
                  java.util.Map<java.lang.String,java.lang.Double> jointReward,
                  State sprime,
                  boolean isTerminal)
```
    Description copied from class: Agent
    
    This method is called by the world when every agent in the world has taken their action. It conveys the result of the joint action.
    
    Specified by:
    
    observeOutcome in class Agent
    
    Parameters:
    s - the state in which the last action of each agent was taken
    jointAction - the joint action of all agents in the world
    jointReward - the joint reward of all agents in the world
    sprime - the next state to which the agent transitioned
    isTerminal - whether the new state is a terminal state
  - gameTerminated
```
public void gameTerminated()
```
    Description copied from class: Agent
    
    This method is called by the world when a game has ended.
    
    Specified by:
    
    gameTerminated in class Agent
