public static class MADPPlannerFactory.MAVIPlannerFactory extends java.lang.Object implements MADPPlannerFactory
MAValueIteration
).MADPPlannerFactory.ConstantMADPPlannerFactory, MADPPlannerFactory.MAVIPlannerFactory
Modifier and Type | Field and Description |
---|---|
protected java.util.List<SGAgentType> |
agentDefinitions
The agent definitions for which planning is performed.
|
protected SGBackupOperator |
backupOperator
The backup operating defining the solution concept to use.
|
protected double |
discount
The discount factor in [0, 1]
|
protected SGDomain |
domain
The domain in which planning is to be performed
|
protected HashableStateFactory |
hashingFactory
The state hashing factory used to query the value function for individual states
|
protected JointModel |
jointModel
The joint action model to use in planning.
|
protected JointRewardFunction |
jointRewardFunction
The joint reward function
|
protected double |
maxDelta
The threshold that will cause VI to terminate when the max change in Q-value for is less than it
|
protected int |
maxIterations
The maximum allowable number of iterations until VI termination
|
protected QFunction |
qInit
The Q-value initialization function to use.
|
protected TerminalFunction |
terminalFunction
The state terminal function.
|
Constructor and Description |
---|
MAVIPlannerFactory(SGDomain domain,
JointModel jointModel,
JointRewardFunction jointRewardFunction,
TerminalFunction terminalFunction,
double discount,
HashableStateFactory hashingFactory,
double qInit,
SGBackupOperator backupOperator,
double maxDelta,
int maxIterations)
Initializes.
|
MAVIPlannerFactory(SGDomain domain,
JointModel jointModel,
JointRewardFunction jointRewardFunction,
TerminalFunction terminalFunction,
double discount,
HashableStateFactory hashingFactory,
QFunction qInit,
SGBackupOperator backupOperator,
double maxDelta,
int maxIterations)
Initializes.
|
MAVIPlannerFactory(SGDomain domain,
java.util.List<SGAgentType> agentDefinitions,
JointModel jointModel,
JointRewardFunction jointRewardFunction,
TerminalFunction terminalFunction,
double discount,
HashableStateFactory hashingFactory,
QFunction qInit,
SGBackupOperator backupOperator,
double maxDelta,
int maxIterations)
Initializes.
|
Modifier and Type | Method and Description |
---|---|
MADynamicProgramming |
getPlannerInstance()
returns an
MADynamicProgramming reference to use for planning. |
protected SGDomain domain
protected java.util.List<SGAgentType> agentDefinitions
protected JointModel jointModel
protected JointRewardFunction jointRewardFunction
protected TerminalFunction terminalFunction
protected double discount
protected HashableStateFactory hashingFactory
protected QFunction qInit
protected SGBackupOperator backupOperator
protected double maxDelta
protected int maxIterations
public MAVIPlannerFactory(SGDomain domain, JointModel jointModel, JointRewardFunction jointRewardFunction, TerminalFunction terminalFunction, double discount, HashableStateFactory hashingFactory, double qInit, SGBackupOperator backupOperator, double maxDelta, int maxIterations)
domain
- the domain in which to perform planingjointModel
- the joint action modeljointRewardFunction
- the joint reward functionterminalFunction
- the terminal state functiondiscount
- the discounthashingFactory
- the hashing factory to use for storing statesqInit
- the default Q-value to initialize all values tobackupOperator
- the backup operator that defines the solution concept being solvedmaxDelta
- the threshold that causes VI to terminate when the max Q-value change is less than itmaxIterations
- the maximum number of iterations allowedpublic MAVIPlannerFactory(SGDomain domain, JointModel jointModel, JointRewardFunction jointRewardFunction, TerminalFunction terminalFunction, double discount, HashableStateFactory hashingFactory, QFunction qInit, SGBackupOperator backupOperator, double maxDelta, int maxIterations)
domain
- the domain in which to perform planingjointModel
- the joint action modeljointRewardFunction
- the joint reward functionterminalFunction
- the terminal state functiondiscount
- the discounthashingFactory
- the hashing factory to use for storing statesqInit
- the q-value initialization function to use.backupOperator
- the backup operator that defines the solution concept being solvedmaxDelta
- the threshold that causes VI to terminate when the max Q-value change is less than itmaxIterations
- the maximum number of iterations allowedpublic MAVIPlannerFactory(SGDomain domain, java.util.List<SGAgentType> agentDefinitions, JointModel jointModel, JointRewardFunction jointRewardFunction, TerminalFunction terminalFunction, double discount, HashableStateFactory hashingFactory, QFunction qInit, SGBackupOperator backupOperator, double maxDelta, int maxIterations)
domain
- the domain in which to perform planingagentDefinitions
- the definitions of the agent typesjointModel
- the joint action modeljointRewardFunction
- the joint reward functionterminalFunction
- the terminal state functiondiscount
- the discounthashingFactory
- the hashing factory to use for storing statesqInit
- the q-value initialization function to use.backupOperator
- the backup operator that defines the solution concept being solvedmaxDelta
- the threshold that causes VI to terminate when the max Q-value change is less than itmaxIterations
- the maximum number of iterations allowedpublic MADynamicProgramming getPlannerInstance()
MADPPlannerFactory
MADynamicProgramming
reference to use for planning.getPlannerInstance
in interface MADPPlannerFactory
MADynamicProgramming
reference to use for planning.