SGQWActionHistoryFactory

java.lang.Object
- burlap.behavior.stochasticgame.agents.naiveq.history.SGQWActionHistoryFactory

All Implemented Interfaces:

AgentFactory
```
public class SGQWActionHistoryFactory
extends java.lang.Object
implements AgentFactory
```
An agent factory for Q-learning with history agents.

Author:

James MacGlashan

Field Summary

Fields
Modifier and Type	Field and Description
`protected ActionIdMap`	`actionMap` An action mapping to map from actions to int values
`protected double`	`discount` The discount rate the Q-learning algorithm will use
`protected SGDomain`	`domain` The stochastic games domain in which the agent will act
`protected double`	`epsilon` The epislon value for epislon greedy policy.
`protected int`	`historySize` How much history the agent should remember
`protected double`	`learningRate` The learning rate the Q-learning algorithm will use
`protected int`	`maxPlayers` The maximum number of players that can be in the game
`protected ValueFunctionInitialization`	`qinit` A default Q-value initializer
`protected StateHashFactory`	`stateHash` The state hashing factory the Q-learning algorithm will use

Constructor Summary

Constructors
Constructor and Description
`SGQWActionHistoryFactory(SGDomain d, double discount, double learningRate, StateHashFactory stateHash, int historySize)` Initializes the factory
`SGQWActionHistoryFactory(SGDomain d, double discount, double learningRate, StateHashFactory stateHash, int historySize, int maxPlayers, ActionIdMap actionMap)` Initializes the factory

Method Summary

Methods
Modifier and Type	Method and Description
`Agent`	`generateAgent()` Returns a new agent instance.
`void`	`setEpsilon(double epsilon)` Sets the epislon parmaeter (for epsilon greedy policy).
`void`	`setQValueInitializer(ValueFunctionInitialization qinit)` Sets the Q-value initialization function that will be used by the agent.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - domain
```
protected SGDomain domain
```
    The stochastic games domain in which the agent will act
  - discount
```
protected double discount
```
    The discount rate the Q-learning algorithm will use
  - learningRate
```
protected double learningRate
```
    The learning rate the Q-learning algorithm will use
  - stateHash
```
protected StateHashFactory stateHash
```
    The state hashing factory the Q-learning algorithm will use
  - historySize
```
protected int historySize
```
    How much history the agent should remember
  - maxPlayers
```
protected int maxPlayers
```
    The maximum number of players that can be in the game
  - actionMap
```
protected ActionIdMap actionMap
```
    An action mapping to map from actions to int values
  - qinit
```
protected ValueFunctionInitialization qinit
```
    A default Q-value initializer
  - epsilon
```
protected double epsilon
```
    The epislon value for epislon greedy policy. If negative, then the policy of the created agent will not be different than its default.
- Constructor Detail
  - SGQWActionHistoryFactory
```
public SGQWActionHistoryFactory(SGDomain d,
                        double discount,
                        double learningRate,
                        StateHashFactory stateHash,
                        int historySize,
                        int maxPlayers,
                        ActionIdMap actionMap)
```
    Initializes the factory
    
    Parameters:
    d - the stochastic games domain in which the agent will act
    discount - The discount rate the Q-learning algorithm will use
    learningRate - The learning rate the Q-learning algorithm will use
    stateHash - The state hashing factory the Q-learning algorithm will use
    historySize - How much history the agent should remember
    maxPlayers - The maximum number of players that can be in the game
    actionMap - An action mapping to map from actions to int values
  - SGQWActionHistoryFactory
```
public SGQWActionHistoryFactory(SGDomain d,
                        double discount,
                        double learningRate,
                        StateHashFactory stateHash,
                        int historySize)
```
    Initializes the factory
    
    Parameters:
    d - the stochastic games domain in which the agent will act
    discount - The discount rate the Q-learning algorithm will use
    learningRate - The learning rate the Q-learning algorithm will use
    stateHash - The state hashing factory the Q-learning algorithm will use
    historySize - How much history the agent should remember
- Method Detail
  - setQValueInitializer
```
public void setQValueInitializer(ValueFunctionInitialization qinit)
```
    Sets the Q-value initialization function that will be used by the agent.
    
    Parameters:
    qinit - the Q-value initialization function.
  - setEpsilon
```
public void setEpsilon(double epsilon)
```
    Sets the epislon parmaeter (for epsilon greedy policy). If set to a negative, then the default policy of the create agent will be used.
    
    Parameters:
    epsilon - the epsilon value to use
  - generateAgent
```
public Agent generateAgent()
```
    Description copied from interface: AgentFactory
    
    Returns a new agent instance.
    
    Specified by:
    
    generateAgent in interface AgentFactory
    
    Returns:
    a new agent instance.

Class SGQWActionHistoryFactory

Field Summary

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Field Detail

domain

discount

learningRate

stateHash

historySize

maxPlayers

actionMap

qinit

epsilon

Constructor Detail

SGQWActionHistoryFactory

SGQWActionHistoryFactory

Method Detail

setQValueInitializer

setEpsilon

generateAgent