SingleAgentInterface

java.lang.Object
- burlap.oomdp.stochasticgames.Agent
- - burlap.behavior.stochasticgame.agents.interfacing.singleagent.SingleAgentInterface

```
public class SingleAgentInterface
extends Agent
```
BURLAP single agent learning algorithms and stochastic games agents interact with the world through different interfaces, for reasons outside the scope of this class description. Single agent learning algorithms make calls to actions that modify the world, whereas in stochastic games the world requests an action from each agent and then tells the agent about the results. This stochastic games agent class bridges the two so that any BURLAP single agent learning algorithm can be used in a stochastic games world. It works by running the single agent learning algorithm in a separate thread and synchronizing action selection and state outcomes between the two paradigms. The only information that needs to be provided to this class is the stochastic games domain in which this agent will play and a special single agent learning algorithm factory that produces the single agent learning algorithm to be used.
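
The two-thread handoff described above can be illustrated with a minimal, self-contained sketch. It is not the BURLAP implementation: the class `HandoffSketch`, its methods, and the use of `String` actions and `int` states are hypothetical stand-ins, and `java.util.concurrent.SynchronousQueue` is substituted here for whatever synchronization the real class performs on its mutable `nextAction` and `nextState` fields.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.SynchronousQueue;

// Hypothetical sketch of bridging a "learner calls actions" paradigm and a
// "world requests actions" paradigm. None of these names come from BURLAP.
class HandoffSketch {
    // Each queue hands exactly one item from one thread to the other.
    private final SynchronousQueue<String> actionSlot = new SynchronousQueue<>();
    private final SynchronousQueue<Integer> stateSlot = new SynchronousQueue<>();

    // Learner thread: publish the chosen action, then block until the world
    // reports the resulting state (analogous in role to receiveSAAction).
    int receiveAction(String action) throws InterruptedException {
        actionSlot.put(action);
        return stateSlot.take();
    }

    // World thread: block until the learner has chosen an action
    // (analogous in role to getAction).
    String getAction() throws InterruptedException {
        return actionSlot.take();
    }

    // World thread: report the outcome back to the waiting learner
    // (analogous in role to observeOutcome).
    void observeOutcome(int nextState) throws InterruptedException {
        stateSlot.put(nextState);
    }

    // Runs a toy world loop for the given number of steps and logs each
    // "action->resulting state" pair seen by the world.
    static List<String> runDemo(int steps) {
        HandoffSketch bridge = new HandoffSketch();
        List<String> log = new ArrayList<>();
        Thread learner = new Thread(() -> {
            try {
                int state = 0;
                for (int i = 0; i < steps; i++) {
                    // The toy learner names its action after the state it sees.
                    state = bridge.receiveAction("a" + state);
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        learner.start();
        try {
            int state = 0;
            for (int i = 0; i < steps; i++) {
                String action = bridge.getAction(); // world requests an action
                state = state + 1;                  // toy transition
                log.add(action + "->" + state);
                bridge.observeOutcome(state);       // world reports the result
            }
            learner.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return log;
    }

    public static void main(String[] args) {
        System.out.println(runDemo(3));
    }
}
```

In this sketch the learner thread never advances until the world has both taken its action and reported an outcome, which is the lock-step behavior the class description calls synchronizing the two paradigms.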

Author:

James MacGlashan

Nested Class Summary

Nested Classes
Modifier and Type	Class and Description
`protected class`	`SingleAgentInterface.MutableGroundedSingleAction` A mutable grounded single action
`protected class`	`SingleAgentInterface.MutableState` A mutable OO-MDP state wrapper
`protected class`	`SingleAgentInterface.SARFWrapper` A reward function for returning the last RLGlue reward.
`protected class`	`SingleAgentInterface.SATFWrapper` A terminal function that returns true when the last RLGlue state was terminal.

Field Summary

Fields
Modifier and Type	Field and Description
`protected double`	`lastReward` The last reward received by this agent
`protected boolean`	`lastStateIsTerminal` Whether the last state was a terminal state
`protected boolean`	`needsToStartEpisode` Whether a new single agent learning episode needs to be started for the next action request
`protected SingleAgentInterface.MutableGroundedSingleAction`	`nextAction` A mutable action holding the next action to be taken by the agent
`protected SingleAgentInterface.MutableState`	`nextState` A mutable state holding the next state for the single agent
`protected LearningAgent`	`saAgent` The BURLAP single agent learning agent that is being used.
`protected SALearningAgentFactoryForSG`	`saAgentFactory` The single agent learning factory
`protected SADomain`	`saDomain` The single agent version of the domain
`protected java.lang.Thread`	`saThread` The thread that runs the single agent learning algorithm

Fields inherited from class burlap.oomdp.stochasticgames.Agent
agentType, domain, internalRewardFunction, world, worldAgentName

Constructor Summary

Constructors
Constructor and Description
`SingleAgentInterface(SGDomain sgDomain, SALearningAgentFactoryForSG saAgentFactory)` Initializes for a given stochastic games domain and a factory to produce the single agent learning object

Method Summary

Methods
Modifier and Type	Method and Description
`void`	`gameStarting()` This method is called by the world when a new game is starting.
`void`	`gameTerminated()` This method is called by the world when a game has ended.
`GroundedSingleAction`	`getAction(State s)` This method is called by the world when it needs the agent to choose an action
`void`	`observeOutcome(State s, JointAction jointAction, java.util.Map<java.lang.String,java.lang.Double> jointReward, State sprime, boolean isTerminal)` This method is called by the world when every agent in the world has taken their action.
`State`	`receiveSAAction(GroundedAction ga)` A method that receives calls from the single agent domain actions to inform this stochastic games agent which action to take next when requested by the world.

Methods inherited from class burlap.oomdp.stochasticgames.Agent
getAgentName, getAgentType, getInternalRewardFunction, init, joinWorld, setInternalRewardFunction

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - saDomain
```
protected SADomain saDomain
```
    The single agent version of the domain
  - saAgentFactory
```
protected SALearningAgentFactoryForSG saAgentFactory
```
    The single agent learning factory
  - saAgent
```
protected LearningAgent saAgent
```
    The BURLAP single agent learning agent that is being used.
  - lastStateIsTerminal
```
protected boolean lastStateIsTerminal
```
    Whether the last state was a terminal state
  - lastReward
```
protected double lastReward
```
    The last reward received by this agent
  - needsToStartEpisode
```
protected boolean needsToStartEpisode
```
    Whether a new single agent learning episode needs to be started for the next action request
  - nextState
```
protected SingleAgentInterface.MutableState nextState
```
    A mutable state holding the next state for the single agent
  - nextAction
```
protected SingleAgentInterface.MutableGroundedSingleAction nextAction
```
    A mutable action holding the next action to be taken by the agent
  - saThread
```
protected java.lang.Thread saThread
```
    The thread that runs the single agent learning algorithm
- Constructor Detail
  - SingleAgentInterface
```
public SingleAgentInterface(SGDomain sgDomain,
                    SALearningAgentFactoryForSG saAgentFactory)
```
    Initializes for a given stochastic games domain and a factory to produce the single agent learning object
    
    Parameters:
    sgDomain - the source stochastic games domain to be played
    saAgentFactory - the single learning agent factory to use to perform learning and action selection
- Method Detail
  - gameStarting
```
public void gameStarting()
```
    Description copied from class: Agent
    
    This method is called by the world when a new game is starting.
    
    Specified by:
    
    gameStarting in class Agent
  - getAction
```
public GroundedSingleAction getAction(State s)
```
    Description copied from class: Agent
    
    This method is called by the world when it needs the agent to choose an action
    
    Specified by:
    
    getAction in class Agent
    
    Parameters:
    s - the current state of the world
    
    Returns:
    the action this agent wishes to take
  - observeOutcome
```
public void observeOutcome(State s,
                  JointAction jointAction,
                  java.util.Map<java.lang.String,java.lang.Double> jointReward,
                  State sprime,
                  boolean isTerminal)
```
    Description copied from class: Agent
    
    This method is called by the world when every agent in the world has taken their action. It conveys the result of the joint action.
    
    Specified by:
    
    observeOutcome in class Agent
    
    Parameters:
    s - the state in which the last action of each agent was taken
    jointAction - the joint action of all agents in the world
    jointReward - the joint reward of all agents in the world
    sprime - the next state to which the agent transitioned
    isTerminal - whether the new state is a terminal state
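    
    As a hedged illustration of how an agent might consume these parameters, the sketch below pulls one agent's reward out of the joint reward map and records it together with the terminal flag. The class `OutcomeRecorderSketch` is hypothetical, and keying the map by agent name is an assumption suggested by the `Map<String,Double>` type rather than something this page states.

```java
import java.util.Map;

// Hypothetical sketch: records the reward and terminal flag for one agent.
// The field names mirror lastReward and lastStateIsTerminal from this class,
// but the logic is an assumption, not the BURLAP source.
class OutcomeRecorderSketch {
    final String agentName;
    double lastReward;
    boolean lastStateIsTerminal;

    OutcomeRecorderSketch(String agentName) {
        this.agentName = agentName;
    }

    // Assumes jointReward maps each agent's name to that agent's reward.
    void observeOutcome(Map<String, Double> jointReward, boolean isTerminal) {
        Double reward = jointReward.get(agentName);
        // Fall back to 0 if this agent has no entry in the map.
        lastReward = (reward == null) ? 0.0 : reward;
        lastStateIsTerminal = isTerminal;
    }
}
```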
  - gameTerminated
```
public void gameTerminated()
```
    Description copied from class: Agent
    
    This method is called by the world when a game has ended.
    
    Specified by:
    
    gameTerminated in class Agent
  - receiveSAAction
```
public State receiveSAAction(GroundedAction ga)
```
    A method that receives calls from the single agent domain actions to inform this stochastic games agent which action to take next when requested by the world.
    
    Parameters:
    ga - the single agent grounded action selection
    
    Returns:
    the state that will be the result of the agent applying the corresponding action in the stochastic games world.
