- MacroAction - Class in burlap.behavior.singleagent.options
-
A macro action is a non-Markov option that always executes a fixed sequence of actions.
- MacroAction() - Constructor for class burlap.behavior.singleagent.options.MacroAction
-
Default constructor for serialization purposes.
- MacroAction(String, List<Action>) - Constructor for class burlap.behavior.singleagent.options.MacroAction
-
Instantiates a macro action with a given name and action sequence.
- MADPPlanAgentFactory - Class in burlap.behavior.stochasticgames.agents.madp
-
- MADPPlanAgentFactory(SGDomain, MADynamicProgramming, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlanAgentFactory
-
Initializes.
- MADPPlanAgentFactory(SGDomain, MADPPlannerFactory, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlanAgentFactory
-
Initializes
- MADPPlannerFactory - Interface in burlap.behavior.stochasticgames.agents.madp
-
- MADPPlannerFactory.ConstantMADPPlannerFactory - Class in burlap.behavior.stochasticgames.agents.madp
-
MADynamicProgramming
factory that always returns the same object instance, unless the reference is chaned with a mutator.
- MADPPlannerFactory.MAVIPlannerFactory - Class in burlap.behavior.stochasticgames.agents.madp
-
- MADynamicProgramming - Class in burlap.behavior.stochasticgames.madynamicprogramming
-
An abstract value function based planning algorithm base for sequential stochastic games that require the computation of Q-values for each agent for each joint action.
- MADynamicProgramming() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.MADynamicProgramming
-
- MADynamicProgramming.BackupBasedQSource - Class in burlap.behavior.stochasticgames.madynamicprogramming
-
A
QSourceForSingleAgent
implementation which stores a value function for an agent and produces Joint action Q-values
by marginalizing over the transition dynamics the reward and discounted next state value.
- MADynamicProgramming.JointActionTransitions - Class in burlap.behavior.stochasticgames.madynamicprogramming
-
A class for holding all of the transition dynamic information for a given joint action in a given state.
- main(String[]) - Static method in class burlap.behavior.singleagent.Episode
-
- main(String[]) - Static method in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling
-
- main(String[]) - Static method in class burlap.behavior.stochasticgames.GameEpisode
-
- main(String[]) - Static method in class burlap.behavior.stochasticgames.solvers.CorrelatedEquilibriumSolver
-
- main(String[]) - Static method in class burlap.datastructures.AlphanumericSorting
-
Testing the alphanumeric sorting
- main(String[]) - Static method in class burlap.datastructures.StochasticTree
-
Demos how to use this class
- main(String[]) - Static method in class burlap.debugtools.MyTimer
-
Demo of usage
- main(String[]) - Static method in class burlap.debugtools.RandomFactory
-
Example usage.
- main(String[]) - Static method in class burlap.domain.singleagent.blockdude.BlockDude
-
Runs an interactive visual explorer for level one of Block Dude.
- main(String[]) - Static method in class burlap.domain.singleagent.blocksworld.BlocksWorld
-
Main method for exploring the domain.
- main(String[]) - Static method in class burlap.domain.singleagent.cartpole.CartPoleDomain
-
Launches an interactive visualize in which key 'a' applies a force in the left direction and key 'd' applies force in the right direction.
- main(String[]) - Static method in class burlap.domain.singleagent.cartpole.InvertedPendulum
-
- main(String[]) - Static method in class burlap.domain.singleagent.frostbite.FrostbiteDomain
-
Main function to test the domain.
- main(String[]) - Static method in class burlap.domain.singleagent.graphdefined.GraphDefinedDomain
-
- main(String[]) - Static method in class burlap.domain.singleagent.gridworld.GridWorldDomain
-
Creates a visual explorer or terminal explorer.
- main(String[]) - Static method in class burlap.domain.singleagent.lunarlander.LunarLanderDomain
-
This method will launch a visual explorer for the lunar lander domain.
- main(String[]) - Static method in class burlap.domain.singleagent.mountaincar.MountainCar
-
Will launch a visual explorer for the mountain car domain that is controlled with the a-s-d keys.
- main(String[]) - Static method in class burlap.domain.singleagent.pomdp.tiger.TigerDomain
-
Main method for interacting with the tiger domain via an
EnvironmentShell
By default, the TerminalExplorer interacts with the partially observable environment (
SimulatedPOEnvironment
),
which means you only get to see the observations that the agent would.
- main(String[]) - Static method in class burlap.domain.stochasticgames.gridgame.GridGame
-
Creates a visual explorer for a simple domain with two agents in it.
- main(String[]) - Static method in class burlap.domain.stochasticgames.normalform.SingleStageNormalFormGame
-
A main method showing example code that would be used to create an instance of Prisoner's dilemma and begin playing it with a
SGWorldShell
.
- main(String[]) - Static method in class burlap.testing.TestRunner
-
- maintainClosed - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.DFS
-
Whether to keep track of a closed list to prevent exploring already seen nodes.
- makeEmptyMap() - Method in class burlap.domain.singleagent.gridworld.GridWorldDomain
-
Makes the map empty
- MAMaxQLearningFactory(SGDomain, double, LearningRate, HashableStateFactory, QFunction, boolean, double) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory.MAMaxQLearningFactory
-
- manualAgents - Variable in class burlap.shell.command.world.ManualAgentsCommands
-
- ManualAgentsCommands - Class in burlap.shell.command.world
-
- ManualAgentsCommands() - Constructor for class burlap.shell.command.world.ManualAgentsCommands
-
- ManualAgentsCommands.ListManualAgents - Class in burlap.shell.command.world
-
- ManualAgentsCommands.LSManualAgentActionsCommands - Class in burlap.shell.command.world
-
- ManualAgentsCommands.ManualSGAgent - Class in burlap.shell.command.world
-
- ManualAgentsCommands.RegisterAgentCommand - Class in burlap.shell.command.world
-
- ManualAgentsCommands.SetAgentAction - Class in burlap.shell.command.world
-
- ManualSGAgent() - Constructor for class burlap.shell.command.world.ManualAgentsCommands.ManualSGAgent
-
- map - Variable in class burlap.domain.singleagent.blockdude.state.BlockDudeMap
-
- map - Variable in class burlap.domain.singleagent.blockdude.state.BlockDudeState
-
- map - Variable in class burlap.domain.singleagent.gridworld.GridWorldDomain
-
The wall map where the first index is the x position and the second index is the y position.
- map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.CellPainter
-
- map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.LocationPainter
-
- map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.MapPainter
-
- MapPainter(int[][]) - Constructor for class burlap.domain.singleagent.gridworld.GridWorldVisualizer.MapPainter
-
Initializes for the domain and wall map
- mapState(State) - Method in class burlap.mdp.auxiliary.common.IdentityStateMapping
-
- mapState(State) - Method in class burlap.mdp.auxiliary.common.ShallowIdentityStateMapping
-
- mapState(State) - Method in interface burlap.mdp.auxiliary.StateMapping
-
- MAQLControlledQSourceMap(List<SGAgent>) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.AgentQSourceMap.MAQLControlledQSourceMap
-
Initializes with the list of agents that each keep their own Q-source.
- MAQLFactory - Class in burlap.behavior.stochasticgames.agents.maql
-
- MAQLFactory() - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
-
Empty constructor.
- MAQLFactory(SGDomain, double, double, HashableStateFactory, double, SGBackupOperator, boolean) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
-
Initializes.
- MAQLFactory(SGDomain, double, LearningRate, HashableStateFactory, QFunction, SGBackupOperator, boolean, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
-
Initializes.
- MAQLFactory.CoCoQLearningFactory - Class in burlap.behavior.stochasticgames.agents.maql
-
Factory for generating CoCo-Q agents.
- MAQLFactory.MAMaxQLearningFactory - Class in burlap.behavior.stochasticgames.agents.maql
-
Factory for generating Max multiagent Q-learning agents.
- MAQSourcePolicy - Class in burlap.behavior.stochasticgames.madynamicprogramming
-
- MAQSourcePolicy() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.MAQSourcePolicy
-
- marginalizeColPlayerStrategy(double[][]) - Static method in class burlap.behavior.stochasticgames.solvers.GeneralBimatrixSolverTools
-
Returns the column player's strategy by marginalizing it out from a joint action probability distribution represented as a matrix
- marginalizeRowPlayerStrategy(double[][]) - Static method in class burlap.behavior.stochasticgames.solvers.GeneralBimatrixSolverTools
-
Returns the row player's strategy by marginalizing it out from a joint action probability distribution represented as a matrix
- markAsTerminalPosition(int, int) - Method in class burlap.domain.singleagent.gridworld.GridWorldTerminalFunction
-
Marks a position as a terminal position for the agent.
- markov() - Method in class burlap.behavior.singleagent.options.MacroAction
-
- markov() - Method in interface burlap.behavior.singleagent.options.Option
-
- markov() - Method in class burlap.behavior.singleagent.options.SubgoalOption
-
- MaskedConfig - Class in burlap.statehashing.masked
-
- MaskedConfig() - Constructor for class burlap.statehashing.masked.MaskedConfig
-
- MaskedConfig(Set<Object>, Set<String>) - Constructor for class burlap.statehashing.masked.MaskedConfig
-
- MaskedHashableStateFactory - Class in burlap.statehashing.masked
-
This class produces
HashableState
instances in which the hash code and equality
of the states masks (ignores) specified state variables.
- MaskedHashableStateFactory() - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory
-
Default constructor: object identifier independent, no hash code caching, and no object class or attribute masks.
- MaskedHashableStateFactory(boolean) - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory
-
Initializes with no hash code caching and no object class or attribute masks.
- MaskedHashableStateFactory(boolean, boolean, String...) - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory
-
Initializes with a specified variable or object class mask.
- maskedObjectClasses - Variable in class burlap.statehashing.masked.MaskedConfig
-
- maskedVariables - Variable in class burlap.statehashing.masked.MaskedConfig
-
- MatchEntry - Class in burlap.mdp.stochasticgames.tournament
-
This class indicates which player in a tournament is to play in a match and what
SGAgentType
role they will play.
- MatchEntry(SGAgentType, int) - Constructor for class burlap.mdp.stochasticgames.tournament.MatchEntry
-
Initializes the MatchEntry
- matchingActionFeature(List<TileCodingFeatures.ActionFeatureID>, Action) - Method in class burlap.behavior.functionapproximation.sparse.tilecoding.TileCodingFeatures
-
- MatchSelector - Interface in burlap.mdp.stochasticgames.tournament
-
An interface for defining how matches in a tournament will be determined
- MatrixAction(String, int) - Constructor for class burlap.domain.stochasticgames.normalform.SingleStageNormalFormGame.MatrixAction
-
- MAValueIteration - Class in burlap.behavior.stochasticgames.madynamicprogramming.dpplanners
-
A class for performing multi-agent value iteration.
- MAValueIteration(SGDomain, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
-
Initializes.
- MAValueIteration(SGDomain, JointRewardFunction, TerminalFunction, double, HashableStateFactory, ValueFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
-
Initializes.
- MAValueIteration(SGDomain, List<SGAgentType>, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
-
Initializes.
- MAValueIteration(SGDomain, List<SGAgentType>, JointRewardFunction, TerminalFunction, double, HashableStateFactory, ValueFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
-
Initializes.
- MAVIPlannerFactory(SGDomain, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
-
Initializes.
- MAVIPlannerFactory(SGDomain, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, QFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
-
Initializes.
- MAVIPlannerFactory(SGDomain, List<SGAgentType>, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, QFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
-
Initializes.
- maxActions - Variable in class burlap.domain.singleagent.graphdefined.GraphDefinedDomain
-
The maximum number of actions available from any given state node.
- maxAngleSpeed - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams
-
The maximum speed of the change in angle.
- maxAngleSpeed - Variable in class burlap.domain.singleagent.cartpole.InvertedPendulum.IPPhysicsParams
-
The maximum speed (magnitude) of the change in angle.
- maxBackups - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.PrioritizedSweeping
-
THe maximum number Bellman backups permitted
- maxBetaScaled(double[], double) - Static method in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.BoltzmannPolicyGradient
-
Given an array of Q-values, returns the maximum Q-value multiplied by the parameter beta.
- maxCartSpeed - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams
-
The maximum speed of the cart.
- maxChange - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
-
The maximum change in weights permitted to terminate LSPI.
- maxDelta - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableVI
-
When the maximum change in the value function is smaller than this value, VI will terminate.
- maxDelta - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
-
The value function change threshold to stop VI.
- maxDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP
-
When the maximum change in the value function from a rollout is smaller than this value, VI will terminate.
- maxDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.ValueIteration
-
When the maximum change in the value function is smaller than this value, VI will terminate.
- maxDelta - Variable in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI
-
The maximum change in the value function that will cause planning to terminate.
- maxDelta - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
-
The threshold that will cause VI to terminate when the max change in Q-value for is less than it
- maxDelta - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
-
The threshold that will cause VI to terminate when the max change in Q-value for is less than it
- maxDepth - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.DFS
-
The max depth of the search tree that will be explored.
- maxDepth - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
-
The maximum depth/length of a rollout before it is terminated and Bellman updates are performed.
- maxDepth - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP
-
The maximum depth/length of a rollout before it is terminated and Bellman updates are performed.
- maxDiff - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
-
The max permitted difference between the lower bound and upperbound for planning termination.
- maxDim - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
-
The width and height of the world.
- maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.actorcritic.ActorCritic
-
The maximum number of steps of an episode before the agent will manually terminate it.This is defaulted
to Integer.MAX_VALUE.
- maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.actorcritic.critics.TimeIndexedTDLambda
-
The maximum number of steps possible in an episode.
- maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The maximum number of steps that will be taken in an episode before the agent terminates a learning episode
- maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
-
The maximum number of steps that will be taken in an episode before the agent terminates a learning episode
- maxEvalDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyEvaluation
-
When the maximum change in the value function is smaller than this value, policy evaluation will terminate.
- maxEvalDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
-
When the maximum change in the value function is smaller than this value, policy evaluation will terminate.
- maxEvalIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyEvaluation
-
When the maximum number of evaluation iterations passes this number, policy evaluation will terminate
- maxGT - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
-
The number of goal types
- maxHeap - Variable in class burlap.datastructures.HashIndexedHeap
-
If true, this is ordered according to a max heap; if false ordered according to a min heap.
- maxHorizon - Variable in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
-
- maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.apprenticeship.ApprenticeshipLearningRequest
-
The maximum number of iterations of apprenticeship learning
- maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableVI
-
When the number of VI iterations exceeds this value, VI will terminate.
- maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
-
The maximum allowed number of VI iterations.
- maxIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
-
When the number of policy evaluation iterations exceeds this value, policy evaluation will terminate.
- maxIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.ValueIteration
-
When the number of VI iterations exceeds this value, VI will terminate.
- maxIterations - Variable in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI
-
The maximum number of iterations to run.
- maxIterations - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
-
The maximum allowable number of iterations until VI termination
- maxIterations - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
-
The maximum allowable number of iterations until VI termination
- maxLearningSteps - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
-
- maxLikelihoodChange - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL
-
The likelihood change threshold to stop gradient ascent.
- MaxMax - Class in burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers
-
A class for finding a stategy that maxmizes the player's payoff under the assumption that their "opponent" is friendly
and will try to do the same.
- MaxMax() - Constructor for class burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers.MaxMax
-
- maxNonZeroCoefficients - Variable in class burlap.behavior.functionapproximation.dense.fourier.FourierBasis
-
The maximum number of non-zero coefficient entries permitted in a coefficient vector
- maxNumPlanningIterations - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
-
- maxNumSteps - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
-
The maximum number of learning steps per episode before the agent gives up
- maxNumSteps - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
-
The maximum number of learning steps per episode before the agent gives up
- maxPIDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
-
When the maximum change between policy evaluations is smaller than this value, planning will terminate.
- maxPlayers - Variable in class burlap.behavior.stochasticgames.agents.naiveq.history.SGQWActionHistoryFactory
-
The maximum number of players that can be in the game
- maxPlyrs - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
-
The maximum number of players that can be in the game
- maxPolicyIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
-
When the number of policy iterations passes this value, planning will terminate.
- maxQ(State) - Method in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
-
Returns the maximum Q-value entry for the given state with ties broken randomly.
- MaxQ - Class in burlap.behavior.stochasticgames.madynamicprogramming.backupOperators
-
A classic MDP-style max backup operator in which an agent back ups his max Q-value in the state.
- MaxQ() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.backupOperators.MaxQ
-
- maxQ(QProvider, State) - Static method in class burlap.behavior.valuefunction.QProvider.Helper
-
Returns the optimal state value function for a state given a
QProvider
.
- maxQChangeForPlanningTermination - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The maximum allowable change in the Q-function during an episode before the planning method terminates.
- maxQChangeInLastEpisode - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The maximum Q-value change that occurred in the last learning episode.
- maxRollouts - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
-
the max number of rollouts to perform when planning is started unless the value function margin is small enough.
- maxRollOutsFromRoot - Variable in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
-
- maxSelfTransitionProb - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.PrioritizedSweeping.BPTRNode
-
- maxStages - Variable in class burlap.mdp.stochasticgames.tournament.Tournament
-
- maxSteps - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL
-
The maximum number of steps of gradient ascent.
- maxTimeStep() - Method in class burlap.behavior.singleagent.Episode
-
- maxTimeStep() - Method in class burlap.behavior.stochasticgames.GameEpisode
-
- maxTNormed() - Method in class burlap.datastructures.BoltzmannDistribution
-
Returns the maximum temperature normalized preference
- maxWeightChangeForPlanningTermination - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
-
The maximum allowable change in the VFA weights during an episode before the planning method terminates.
- maxWeightChangeInLastEpisode - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
-
The maximum VFA weight change that occurred in the last learning episode.
- maxWT - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
-
The number of wall types
- maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDude
-
Domain parameter specifying the maximum x dimension value of the world
- maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeModel
-
- maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
-
- maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
-
- maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
-
- maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
-
- maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDude
-
Domain parameter specifying the maximum y dimension value of the world
- maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeModel
-
- maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
-
- maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
-
- maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
-
- maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
-
- MCModel(MountainCar.MCPhysicsParams) - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar.MCModel
-
- MCPhysicsParams() - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar.MCPhysicsParams
-
- MCRandomStateGenerator - Class in burlap.domain.singleagent.mountaincar
-
Generates
MountainCar
states with the x-position between some specified range and the velocity between some specified range.
- MCRandomStateGenerator(MountainCar.MCPhysicsParams) - Constructor for class burlap.domain.singleagent.mountaincar.MCRandomStateGenerator
-
- MCRandomStateGenerator(double, double, double, double) - Constructor for class burlap.domain.singleagent.mountaincar.MCRandomStateGenerator
-
Initializes for the given boundaries in which random states will be created
- MCState - Class in burlap.domain.singleagent.mountaincar
-
- MCState() - Constructor for class burlap.domain.singleagent.mountaincar.MCState
-
- MCState(double, double) - Constructor for class burlap.domain.singleagent.mountaincar.MCState
-
- mdpPlanner - Variable in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling
-
- mdpQSource - Variable in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
- MDPSolver - Class in burlap.behavior.singleagent
-
The abstract super class to use for various MDP solving algorithms, including both planning and learning algorithms.
- MDPSolver() - Constructor for class burlap.behavior.singleagent.MDPSolver
-
- MDPSolverInterface - Interface in burlap.behavior.singleagent
-
The top-level interface for algorithms that solve MDPs.
- medianEpisodeReward - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.Trial
-
Stores the median reward by episode
- medianEpisodeReward - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.Trial
-
Stores the median reward by episode
- medianEpisodeRewardSeries - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.AgentDatasets
-
Most recent trial's median reward per step episode data
- medianEpisodeRewardSeries - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.AgentDatasets
-
Most recent trial's median reward per step episode data
- memoryQueue - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS
-
A queue for storing the most recently expanded nodes.
- memorySize - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS
-
the size of the memory; that is, the number of recently expanded search nodes the valueFunction will remember.
- memoryStateDepth - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS
-
Stores the depth at which each state in the memory was explored.
- merAvgSeries - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.AgentDatasets
-
All trial's average median reward per episode series data
- merAvgSeries - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.AgentDatasets
-
All trial's average median reward per episode series data
- metric - Variable in class burlap.behavior.functionapproximation.dense.rbf.RBF
-
The distance metric to compare query input states to the centeredState
- metricsSet - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter
-
A set specifying the performance metrics that will be plotted
- metricsSet - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter
-
A set specifying the performance metrics that will be plotted
- minEligibityForUpdate - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
-
The minimum eligibility value of a trace that will cause it to be updated
- minimumLR - Variable in class burlap.behavior.learningrate.ExponentialDecayLR
-
The minimum learning rate
- minimumLR - Variable in class burlap.behavior.learningrate.SoftTimeInverseDecayLR
-
The minimum learning rate
- MinMax - Class in burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers
-
Finds the MinMax equilibrium using linear programming and returns the appropraite strategy.
- MinMax() - Constructor for class burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers.MinMax
-
- MinMaxQ - Class in burlap.behavior.stochasticgames.madynamicprogramming.backupOperators
-
A minmax operator.
- MinMaxQ() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.backupOperators.MinMaxQ
-
- MinMaxSolver - Class in burlap.behavior.stochasticgames.solvers
-
- minNewStepsForLearningPI - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
-
The minimum number of new observations received from learning episodes before LSPI will be run again.
- minNumRolloutsWithSmallValueChange - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP
-
RTDP will be delcared "converged" if there are this many consecutive policy rollouts in which the value function change is smaller than the maxDelta value.
- minProb - Variable in class burlap.behavior.singleagent.options.model.BFSMarkovOptionModel
-
- minStepAndEpisodes(List<PerformancePlotter.Trial>) - Method in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter
-
Returns the minimum steps and episodes across all trials
- minStepAndEpisodes(List<MultiAgentPerformancePlotter.Trial>) - Method in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter
-
Returns the minimum steps and episodes across all trials
- minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
-
- minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
-
- minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
-
- minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
-
- miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
-
- miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
-
- miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
-
- miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
-
- MLIRL - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
-
An implementation of Maximum-likelihood Inverse Reinforcement Learning [1].
- MLIRL(MLIRLRequest, double, double, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL
-
Initializes.
- mlirlInstance - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRL
-
The
MLIRL
instance used to perform the maximization step
for each clusters reward function parameter values.
- MLIRLRequest - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
-
A request object for Maximum-Likelihood Inverse Reinforcement Learning (
MLIRL
).
- MLIRLRequest(SADomain, Planner, List<Episode>, DifferentiableRF) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRLRequest
-
Initializes the request without any expert trajectory weights (which will be assumed to have a value 1).
- MLIRLRequest(SADomain, List<Episode>, DifferentiableRF, HashableStateFactory) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRLRequest
-
- mode(int) - Static method in class burlap.debugtools.DPrint
-
Returns the print mode for a given debug code
- model - Variable in class burlap.behavior.singleagent.learnfromdemo.CustomRewardModel
-
- model - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
-
The model of the world that is being learned.
- model - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
-
The model of the world that is being learned.
- model - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.UnmodeledFavoredPolicy
-
- model - Variable in class burlap.behavior.singleagent.MDPSolver
-
- model - Variable in class burlap.behavior.singleagent.options.model.BFSMarkovOptionModel
-
- model - Variable in class burlap.mdp.singleagent.environment.SimulatedEnvironment
-
- model - Variable in class burlap.mdp.singleagent.SADomain
-
- modelChanged(State) - Method in interface burlap.behavior.singleagent.learning.modellearning.ModelLearningPlanner
-
Tells the valueFunction that the model has changed and that it will need to replan accordingly
- modelChanged(State) - Method in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
-
- modeledRewardFunction - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
-
The modeled reward function that is being learned.
- modeledTerminalFunction - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
-
The modeled terminal state function.
- ModelLearningPlanner - Interface in burlap.behavior.singleagent.learning.modellearning
-
Interface for defining planning algorithms that operate on iteratively learned models.
- modelPlannedPolicy() - Method in interface burlap.behavior.singleagent.learning.modellearning.ModelLearningPlanner
-
Returns a policy encoding the planner's results.
- modelPlannedPolicy() - Method in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
-
- modelPlanner - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
-
The valueFunction used on the modeled world to update the value function
- modelPlanner - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
-
The model-adaptive planning algorithm to use
- modelPolicy - Variable in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
-
The greedy policy that results from VI
- modifyEO(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learning.modellearning.rmax.RMaxModel
-
- modifyOutcome(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learnfromdemo.CustomRewardModel
-
- modifyOutcome(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learnfromdemo.RewardValueProjection.CustomRewardNoTermModel
-
- mostRecentTrialEnabled() - Method in enum burlap.behavior.singleagent.auxiliary.performance.TrialMode
-
Returns true if the most recent trial plots will be plotted by this mode.
- MountainCar - Class in burlap.domain.singleagent.mountaincar
-
A domain generator for the classic mountain car domain with default dynamics follow those implemented by Singh and Sutton [1].
- MountainCar() - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar
-
- MountainCar.ClassicMCTF - Class in burlap.domain.singleagent.mountaincar
-
A Terminal Function for the Mountain Car domain that terminates when the agent's position is >= the max position in the world (0.5 default).
- MountainCar.MCModel - Class in burlap.domain.singleagent.mountaincar
-
- MountainCar.MCPhysicsParams - Class in burlap.domain.singleagent.mountaincar
-
- MountainCarVisualizer - Class in burlap.domain.singleagent.mountaincar
-
- MountainCarVisualizer.AgentPainter - Class in burlap.domain.singleagent.mountaincar
-
- MountainCarVisualizer.HillPainter - Class in burlap.domain.singleagent.mountaincar
-
Class for drawing a black outline of the hill that the mountain car climbs.
- move(FrostbiteState, int, int) - Method in class burlap.domain.singleagent.frostbite.FrostbiteModel
-
Attempts to move the agent into the given position, taking into account platforms and screen borders
- move(State, int, int) - Method in class burlap.domain.singleagent.gridworld.GridWorldDomain.GridWorldModel
-
Attempts to move the agent into the given position, taking into account walls and blocks
- move(State, int) - Method in class burlap.domain.singleagent.mountaincar.MountainCar.MCModel
-
Changes the agents position in the provided state using car engine acceleration in the specified direction.
- moveCarriedBlockToNewAgentPosition(BlockDudeState, BlockDudeAgent, int, int, int, int) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel
-
Moves a carried block to a new position of the agent
- moveClassicModel(State, double) - Method in class burlap.domain.singleagent.cartpole.model.CPClassicModel
-
Simulates the physics for one time step give the input state s, and the direction of force applied.
- moveCorrectModel(State, double) - Method in class burlap.domain.singleagent.cartpole.model.CPCorrectModel
-
Simulates the physics for one time step give the input state s, and the direction of force applied.
- moveHorizontally(BlockDudeState, int) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel
-
Modifies state s to be the result of a horizontal movement.
- movementDirectionFromIndex(int) - Static method in class burlap.domain.singleagent.gridworld.GridWorldDomain
-
Returns the change in x and y position for a given direction number.
- movementForceMag - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams
-
The force magnatude that can be exterted in either direction on the cart
- moveUp(BlockDudeState) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel
-
Modifies state s to be the result of a vertical movement, that will result in the agent onto the platform adjacent
to its current location in the direction the agent is facing, provided that there is room for the agent (and any block
it's holding) to step onto it.
- MultiAgentDPPlanningAgent - Class in burlap.behavior.stochasticgames.agents.madp
-
A agent that using a
MADynamicProgramming
planning algorithm to compute the value of each state and then follow
a policy derived from a joint policy that is derived from that estimated value function.
- MultiAgentDPPlanningAgent(SGDomain, MADynamicProgramming, PolicyFromJointPolicy, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MultiAgentDPPlanningAgent
-
Initializes.
- MultiAgentExperimenter - Class in burlap.behavior.stochasticgames.auxiliary.performance
-
This class is used to simplify the comparison of agent performance in a stochastic game world.
- MultiAgentExperimenter(WorldGenerator, TerminalFunction, int, int, AgentFactoryAndType...) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentExperimenter
-
Initializes.
- MultiAgentPerformancePlotter - Class in burlap.behavior.stochasticgames.auxiliary.performance
-
This class is a world observer used for recording and plotting the performance of the agents in the world.
- MultiAgentPerformancePlotter(TerminalFunction, int, int, int, int, TrialMode, PerformanceMetric...) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter
-
Initializes
- MultiAgentPerformancePlotter.AgentDatasets - Class in burlap.behavior.stochasticgames.auxiliary.performance
-
A datastructure for maintain the plot series data in the current agent
- MultiAgentPerformancePlotter.DatasetsAndTrials - Class in burlap.behavior.stochasticgames.auxiliary.performance
-
A class for storing the tiral data and series datasets for a given agent.
- MultiAgentPerformancePlotter.MutableBoolean - Class in burlap.behavior.stochasticgames.auxiliary.performance
-
A class for a mutable boolean
- MultiAgentPerformancePlotter.Trial - Class in burlap.behavior.stochasticgames.auxiliary.performance
-
A datastructure for maintaining all the metric stats for a single trial.
- MultiAgentQLearning - Class in burlap.behavior.stochasticgames.agents.maql
-
A class for performing multi-agent Q-learning in which different Q-value backup operators can be provided to enable the learning
of different solution concepts.
- MultiAgentQLearning(SGDomain, double, double, HashableStateFactory, double, SGBackupOperator, boolean, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
Initializes this Q-learning agent.
- MultiAgentQLearning(SGDomain, double, LearningRate, HashableStateFactory, QFunction, SGBackupOperator, boolean, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
Initializes this Q-learning agent.
- MultiAgentQSourceProvider - Interface in burlap.behavior.stochasticgames.madynamicprogramming
-
An interface for an object that can providing the Q-values stored for each agent in a problem.
- MultiLayerRenderer - Class in burlap.visualizer
-
A MultiLayerRenderer is a canvas that will sequentially render a set of render layers, one on top of the other, to the same 2D
graphics context.
- MultiLayerRenderer() - Constructor for class burlap.visualizer.MultiLayerRenderer
-
- MultipleIntentionsMLIRL - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
-
An implementation of Multiple Intentions Maximum-likelihood Inverse Reinforcement Learning [1].
- MultipleIntentionsMLIRL(MultipleIntentionsMLIRLRequest, int, double, double, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRL
-
Initializes.
- MultipleIntentionsMLIRLRequest - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
-
- MultipleIntentionsMLIRLRequest(SADomain, QGradientPlannerFactory, List<Episode>, DifferentiableRF, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRLRequest
-
Initializes
- MultipleIntentionsMLIRLRequest(SADomain, List<Episode>, DifferentiableRF, int, HashableStateFactory) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRLRequest
-
- MultiStatePrePlanner - Class in burlap.behavior.singleagent.planning.deterministic
-
This is a helper class that is used to run a valueFunction from multiple initial states to ensure
that an adequate plan/policy exists for each them.
- MutableBoolean(boolean) - Constructor for class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.MutableBoolean
-
Initializes with the given Boolean value
- MutableBoolean(boolean) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.MutableBoolean
-
Initializes with the given Boolean value
- MutableDouble(double) - Constructor for class burlap.behavior.learningrate.ExponentialDecayLR.MutableDouble
-
- MutableInt(int) - Constructor for class burlap.behavior.learningrate.SoftTimeInverseDecayLR.MutableInt
-
- MutableInt() - Constructor for class burlap.behavior.singleagent.interfaces.rlglue.RLGlueAgent.MutableInt
-
- MutableOOState - Interface in burlap.mdp.core.oo.state
-
- MutableState - Interface in burlap.mdp.core.state
-
A
State
interface extension for mutable states whose values can be directly modified.
- myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger.GrimTriggerAgentFactory
-
The agent's cooperate action
- myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger
-
This agent's cooperate action
- myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat
-
This agent's cooperate action
- myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat.TitForTatAgentFactory
-
This agent's cooperate action
- myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger.GrimTriggerAgentFactory
-
The agent's defect action
- myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger
-
This agent's defect action
- myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat
-
This agent's defect action
- myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat.TitForTatAgentFactory
-
This agent's defect action
- myQSource - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
This agent's Q-value source
- MyTimer - Class in burlap.debugtools
-
A data structure for keeping track of elapsed and average time.
- MyTimer() - Constructor for class burlap.debugtools.MyTimer
-
Creates a new timer.
- MyTimer(boolean) - Constructor for class burlap.debugtools.MyTimer
-
Creates a new timer and starts it if start=true.