Skip navigation links
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 

M

MacroAction - Class in burlap.behavior.singleagent.options
A macro action is a non-Markov option that always executes a fixed sequence of actions.
MacroAction() - Constructor for class burlap.behavior.singleagent.options.MacroAction
Default constructor for serialization purposes.
MacroAction(String, List<Action>) - Constructor for class burlap.behavior.singleagent.options.MacroAction
Instantiates a macro action with a given name and action sequence.
MADPPlanAgentFactory - Class in burlap.behavior.stochasticgames.agents.madp
An agent factory for the MultiAgentDPPlanningAgent agent.
MADPPlanAgentFactory(SGDomain, MADynamicProgramming, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlanAgentFactory
Initializes.
MADPPlanAgentFactory(SGDomain, MADPPlannerFactory, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlanAgentFactory
Initializes
MADPPlannerFactory - Interface in burlap.behavior.stochasticgames.agents.madp
An interface for generating MADynamicProgramming objects.
MADPPlannerFactory.ConstantMADPPlannerFactory - Class in burlap.behavior.stochasticgames.agents.madp
MADynamicProgramming factory that always returns the same object instance, unless the reference is chaned with a mutator.
MADPPlannerFactory.MAVIPlannerFactory - Class in burlap.behavior.stochasticgames.agents.madp
Factory for generating multi-agent value iteration planners (MAValueIteration).
MADynamicProgramming - Class in burlap.behavior.stochasticgames.madynamicprogramming
An abstract value function based planning algorithm base for sequential stochastic games that require the computation of Q-values for each agent for each joint action.
MADynamicProgramming() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.MADynamicProgramming
 
MADynamicProgramming.BackupBasedQSource - Class in burlap.behavior.stochasticgames.madynamicprogramming
A QSourceForSingleAgent implementation which stores a value function for an agent and produces Joint action Q-values by marginalizing over the transition dynamics the reward and discounted next state value.
MADynamicProgramming.JointActionTransitions - Class in burlap.behavior.stochasticgames.madynamicprogramming
A class for holding all of the transition dynamic information for a given joint action in a given state.
main(String[]) - Static method in class burlap.behavior.singleagent.Episode
 
main(String[]) - Static method in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling
 
main(String[]) - Static method in class burlap.behavior.stochasticgames.GameEpisode
 
main(String[]) - Static method in class burlap.behavior.stochasticgames.solvers.CorrelatedEquilibriumSolver
 
main(String[]) - Static method in class burlap.datastructures.AlphanumericSorting
Testing the alphanumeric sorting
main(String[]) - Static method in class burlap.datastructures.StochasticTree
Demos how to use this class
main(String[]) - Static method in class burlap.debugtools.MyTimer
Demo of usage
main(String[]) - Static method in class burlap.debugtools.RandomFactory
Example usage.
main(String[]) - Static method in class burlap.domain.singleagent.blockdude.BlockDude
Runs an interactive visual explorer for level one of Block Dude.
main(String[]) - Static method in class burlap.domain.singleagent.blocksworld.BlocksWorld
Main method for exploring the domain.
main(String[]) - Static method in class burlap.domain.singleagent.cartpole.CartPoleDomain
Launches an interactive visualize in which key 'a' applies a force in the left direction and key 'd' applies force in the right direction.
main(String[]) - Static method in class burlap.domain.singleagent.cartpole.InvertedPendulum
 
main(String[]) - Static method in class burlap.domain.singleagent.frostbite.FrostbiteDomain
Main function to test the domain.
main(String[]) - Static method in class burlap.domain.singleagent.graphdefined.GraphDefinedDomain
 
main(String[]) - Static method in class burlap.domain.singleagent.gridworld.GridWorldDomain
Creates a visual explorer or terminal explorer.
main(String[]) - Static method in class burlap.domain.singleagent.lunarlander.LunarLanderDomain
This method will launch a visual explorer for the lunar lander domain.
main(String[]) - Static method in class burlap.domain.singleagent.mountaincar.MountainCar
Will launch a visual explorer for the mountain car domain that is controlled with the a-s-d keys.
main(String[]) - Static method in class burlap.domain.singleagent.pomdp.tiger.TigerDomain
Main method for interacting with the tiger domain via an EnvironmentShell By default, the TerminalExplorer interacts with the partially observable environment (SimulatedPOEnvironment), which means you only get to see the observations that the agent would.
main(String[]) - Static method in class burlap.domain.stochasticgames.gridgame.GridGame
Creates a visual explorer for a simple domain with two agents in it.
main(String[]) - Static method in class burlap.domain.stochasticgames.normalform.SingleStageNormalFormGame
A main method showing example code that would be used to create an instance of Prisoner's dilemma and begin playing it with a SGWorldShell.
main(String[]) - Static method in class burlap.testing.TestRunner
 
maintainClosed - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.DFS
Whether to keep track of a closed list to prevent exploring already seen nodes.
makeEmptyMap() - Method in class burlap.domain.singleagent.gridworld.GridWorldDomain
Makes the map empty
MAMaxQLearningFactory(SGDomain, double, LearningRate, HashableStateFactory, QFunction, boolean, double) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory.MAMaxQLearningFactory
 
manualAgents - Variable in class burlap.shell.command.world.ManualAgentsCommands
 
ManualAgentsCommands - Class in burlap.shell.command.world
A controller for a set of ShellCommand objects.
ManualAgentsCommands() - Constructor for class burlap.shell.command.world.ManualAgentsCommands
 
ManualAgentsCommands.ListManualAgents - Class in burlap.shell.command.world
 
ManualAgentsCommands.LSManualAgentActionsCommands - Class in burlap.shell.command.world
 
ManualAgentsCommands.ManualSGAgent - Class in burlap.shell.command.world
 
ManualAgentsCommands.RegisterAgentCommand - Class in burlap.shell.command.world
 
ManualAgentsCommands.SetAgentAction - Class in burlap.shell.command.world
 
ManualSGAgent() - Constructor for class burlap.shell.command.world.ManualAgentsCommands.ManualSGAgent
 
map - Variable in class burlap.domain.singleagent.blockdude.state.BlockDudeMap
 
map - Variable in class burlap.domain.singleagent.blockdude.state.BlockDudeState
 
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldDomain
The wall map where the first index is the x position and the second index is the y position.
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.CellPainter
 
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.LocationPainter
 
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.MapPainter
 
MapPainter(int[][]) - Constructor for class burlap.domain.singleagent.gridworld.GridWorldVisualizer.MapPainter
Initializes for the domain and wall map
mapState(State) - Method in class burlap.mdp.auxiliary.common.IdentityStateMapping
 
mapState(State) - Method in class burlap.mdp.auxiliary.common.ShallowIdentityStateMapping
 
mapState(State) - Method in interface burlap.mdp.auxiliary.StateMapping
 
MAQLControlledQSourceMap(List<SGAgent>) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.AgentQSourceMap.MAQLControlledQSourceMap
Initializes with the list of agents that each keep their own Q-source.
MAQLFactory - Class in burlap.behavior.stochasticgames.agents.maql
This class provides a factory for MultiAgentQLearning agents.
MAQLFactory() - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
Empty constructor.
MAQLFactory(SGDomain, double, double, HashableStateFactory, double, SGBackupOperator, boolean) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
Initializes.
MAQLFactory(SGDomain, double, LearningRate, HashableStateFactory, QFunction, SGBackupOperator, boolean, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
Initializes.
MAQLFactory.CoCoQLearningFactory - Class in burlap.behavior.stochasticgames.agents.maql
Factory for generating CoCo-Q agents.
MAQLFactory.MAMaxQLearningFactory - Class in burlap.behavior.stochasticgames.agents.maql
Factory for generating Max multiagent Q-learning agents.
MAQSourcePolicy - Class in burlap.behavior.stochasticgames.madynamicprogramming
An abstract extension of the JointPolicy class that adds a required interface of being able to a MultiAgentQSourceProvider.
MAQSourcePolicy() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.MAQSourcePolicy
 
marginalizeColPlayerStrategy(double[][]) - Static method in class burlap.behavior.stochasticgames.solvers.GeneralBimatrixSolverTools
Returns the column player's strategy by marginalizing it out from a joint action probability distribution represented as a matrix
marginalizeRowPlayerStrategy(double[][]) - Static method in class burlap.behavior.stochasticgames.solvers.GeneralBimatrixSolverTools
Returns the row player's strategy by marginalizing it out from a joint action probability distribution represented as a matrix
markAsTerminalPosition(int, int) - Method in class burlap.domain.singleagent.gridworld.GridWorldTerminalFunction
Marks a position as a terminal position for the agent.
markov() - Method in class burlap.behavior.singleagent.options.MacroAction
 
markov() - Method in interface burlap.behavior.singleagent.options.Option
 
markov() - Method in class burlap.behavior.singleagent.options.SubgoalOption
 
MaskedConfig - Class in burlap.statehashing.masked
 
MaskedConfig() - Constructor for class burlap.statehashing.masked.MaskedConfig
 
MaskedConfig(Set<Object>, Set<String>) - Constructor for class burlap.statehashing.masked.MaskedConfig
 
MaskedHashableStateFactory - Class in burlap.statehashing.masked
This class produces HashableState instances in which the hash code and equality of the states masks (ignores) specified state variables.
MaskedHashableStateFactory() - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory
Default constructor: object identifier independent, no hash code caching, and no object class or attribute masks.
MaskedHashableStateFactory(boolean) - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory
Initializes with no hash code caching and no object class or attribute masks.
MaskedHashableStateFactory(boolean, boolean, String...) - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory
Initializes with a specified variable or object class mask.
maskedObjectClasses - Variable in class burlap.statehashing.masked.MaskedConfig
 
maskedVariables - Variable in class burlap.statehashing.masked.MaskedConfig
 
MatchEntry - Class in burlap.mdp.stochasticgames.tournament
This class indicates which player in a tournament is to play in a match and what SGAgentType role they will play.
MatchEntry(SGAgentType, int) - Constructor for class burlap.mdp.stochasticgames.tournament.MatchEntry
Initializes the MatchEntry
matchingActionFeature(List<TileCodingFeatures.ActionFeatureID>, Action) - Method in class burlap.behavior.functionapproximation.sparse.tilecoding.TileCodingFeatures
Returns the TileCodingFeatures.ActionFeatureID with an equivalent Action in the given list or null if there is none.
MatchSelector - Interface in burlap.mdp.stochasticgames.tournament
An interface for defining how matches in a tournament will be determined
MatrixAction(String, int) - Constructor for class burlap.domain.stochasticgames.normalform.SingleStageNormalFormGame.MatrixAction
 
MAValueIteration - Class in burlap.behavior.stochasticgames.madynamicprogramming.dpplanners
A class for performing multi-agent value iteration.
MAValueIteration(SGDomain, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
Initializes.
MAValueIteration(SGDomain, JointRewardFunction, TerminalFunction, double, HashableStateFactory, ValueFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
Initializes.
MAValueIteration(SGDomain, List<SGAgentType>, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
Initializes.
MAValueIteration(SGDomain, List<SGAgentType>, JointRewardFunction, TerminalFunction, double, HashableStateFactory, ValueFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
Initializes.
MAVIPlannerFactory(SGDomain, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
Initializes.
MAVIPlannerFactory(SGDomain, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, QFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
Initializes.
MAVIPlannerFactory(SGDomain, List<SGAgentType>, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, QFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
Initializes.
maxActions - Variable in class burlap.domain.singleagent.graphdefined.GraphDefinedDomain
The maximum number of actions available from any given state node.
maxAngleSpeed - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams
The maximum speed of the change in angle.
maxAngleSpeed - Variable in class burlap.domain.singleagent.cartpole.InvertedPendulum.IPPhysicsParams
The maximum speed (magnitude) of the change in angle.
maxBackups - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.PrioritizedSweeping
THe maximum number Bellman backups permitted
maxBetaScaled(double[], double) - Static method in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.BoltzmannPolicyGradient
Given an array of Q-values, returns the maximum Q-value multiplied by the parameter beta.
maxCartSpeed - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams
The maximum speed of the cart.
maxChange - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
The maximum change in weights permitted to terminate LSPI.
maxDelta - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableVI
When the maximum change in the value function is smaller than this value, VI will terminate.
maxDelta - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
The value function change threshold to stop VI.
maxDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP
When the maximum change in the value function from a rollout is smaller than this value, VI will terminate.
maxDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.ValueIteration
When the maximum change in the value function is smaller than this value, VI will terminate.
maxDelta - Variable in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI
The maximum change in the value function that will cause planning to terminate.
maxDelta - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
The threshold that will cause VI to terminate when the max change in Q-value for is less than it
maxDelta - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
The threshold that will cause VI to terminate when the max change in Q-value for is less than it
maxDepth - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.DFS
The max depth of the search tree that will be explored.
maxDepth - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
The maximum depth/length of a rollout before it is terminated and Bellman updates are performed.
maxDepth - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP
The maximum depth/length of a rollout before it is terminated and Bellman updates are performed.
maxDiff - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
The max permitted difference between the lower bound and upperbound for planning termination.
maxDim - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
The width and height of the world.
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.actorcritic.ActorCritic
The maximum number of steps of an episode before the agent will manually terminate it.This is defaulted to Integer.MAX_VALUE.
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.actorcritic.critics.TimeIndexedTDLambda
The maximum number of steps possible in an episode.
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
The maximum number of steps that will be taken in an episode before the agent terminates a learning episode
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
The maximum number of steps that will be taken in an episode before the agent terminates a learning episode
maxEvalDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyEvaluation
When the maximum change in the value function is smaller than this value, policy evaluation will terminate.
maxEvalDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
When the maximum change in the value function is smaller than this value, policy evaluation will terminate.
maxEvalIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyEvaluation
When the maximum number of evaluation iterations passes this number, policy evaluation will terminate
maxGT - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
The number of goal types
maxHeap - Variable in class burlap.datastructures.HashIndexedHeap
If true, this is ordered according to a max heap; if false ordered according to a min heap.
maxHorizon - Variable in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
 
maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.apprenticeship.ApprenticeshipLearningRequest
The maximum number of iterations of apprenticeship learning
maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableVI
When the number of VI iterations exceeds this value, VI will terminate.
maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
The maximum allowed number of VI iterations.
maxIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
When the number of policy evaluation iterations exceeds this value, policy evaluation will terminate.
maxIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.ValueIteration
When the number of VI iterations exceeds this value, VI will terminate.
maxIterations - Variable in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI
The maximum number of iterations to run.
maxIterations - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
The maximum allowable number of iterations until VI termination
maxIterations - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration
The maximum allowable number of iterations until VI termination
maxLearningSteps - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
The maximum number of learning steps in an episode when the LSPI.runLearningEpisode(burlap.mdp.singleagent.environment.Environment) method is called.
maxLikelihoodChange - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL
The likelihood change threshold to stop gradient ascent.
MaxMax - Class in burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers
A class for finding a stategy that maxmizes the player's payoff under the assumption that their "opponent" is friendly and will try to do the same.
MaxMax() - Constructor for class burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers.MaxMax
 
maxNonZeroCoefficients - Variable in class burlap.behavior.functionapproximation.dense.fourier.FourierBasis
The maximum number of non-zero coefficient entries permitted in a coefficient vector
maxNumPlanningIterations - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
The maximum number of policy iterations permitted when LSPI is run from the LSPI.planFromState(State) or LSPI.runLearningEpisode(burlap.mdp.singleagent.environment.Environment) methods.
maxNumSteps - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
The maximum number of learning steps per episode before the agent gives up
maxNumSteps - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
The maximum number of learning steps per episode before the agent gives up
maxPIDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
When the maximum change between policy evaluations is smaller than this value, planning will terminate.
maxPlayers - Variable in class burlap.behavior.stochasticgames.agents.naiveq.history.SGQWActionHistoryFactory
The maximum number of players that can be in the game
maxPlyrs - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
The maximum number of players that can be in the game
maxPolicyIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration
When the number of policy iterations passes this value, planning will terminate.
maxQ(State) - Method in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
Returns the maximum Q-value entry for the given state with ties broken randomly.
MaxQ - Class in burlap.behavior.stochasticgames.madynamicprogramming.backupOperators
A classic MDP-style max backup operator in which an agent back ups his max Q-value in the state.
MaxQ() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.backupOperators.MaxQ
 
maxQ(QProvider, State) - Static method in class burlap.behavior.valuefunction.QProvider.Helper
Returns the optimal state value function for a state given a QProvider.
maxQChangeForPlanningTermination - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
The maximum allowable change in the Q-function during an episode before the planning method terminates.
maxQChangeInLastEpisode - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
The maximum Q-value change that occurred in the last learning episode.
maxRollouts - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP
the max number of rollouts to perform when planning is started unless the value function margin is small enough.
maxRollOutsFromRoot - Variable in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
 
maxSelfTransitionProb - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.PrioritizedSweeping.BPTRNode
 
maxStages - Variable in class burlap.mdp.stochasticgames.tournament.Tournament
 
maxSteps - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL
The maximum number of steps of gradient ascent.
maxTimeStep() - Method in class burlap.behavior.singleagent.Episode
Returns the maximum time step index in this episode which is the Episode.numTimeSteps()-1.
maxTimeStep() - Method in class burlap.behavior.stochasticgames.GameEpisode
Returns the max time step index in this game which equals GameEpisode.numTimeSteps()-1.
maxTNormed() - Method in class burlap.datastructures.BoltzmannDistribution
Returns the maximum temperature normalized preference
maxWeightChangeForPlanningTermination - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
The maximum allowable change in the VFA weights during an episode before the planning method terminates.
maxWeightChangeInLastEpisode - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
The maximum VFA weight change that occurred in the last learning episode.
maxWT - Variable in class burlap.domain.stochasticgames.gridgame.GridGame
The number of wall types
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDude
Domain parameter specifying the maximum x dimension value of the world
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeModel
 
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
 
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
 
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
 
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
 
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDude
Domain parameter specifying the maximum y dimension value of the world
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeModel
 
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
 
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
 
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
 
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
 
MCModel(MountainCar.MCPhysicsParams) - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar.MCModel
 
MCPhysicsParams() - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar.MCPhysicsParams
 
MCRandomStateGenerator - Class in burlap.domain.singleagent.mountaincar
Generates MountainCar states with the x-position between some specified range and the velocity between some specified range.
MCRandomStateGenerator(MountainCar.MCPhysicsParams) - Constructor for class burlap.domain.singleagent.mountaincar.MCRandomStateGenerator
Initializes for the MountainCar Domain object for which states will be generated.
MCRandomStateGenerator(double, double, double, double) - Constructor for class burlap.domain.singleagent.mountaincar.MCRandomStateGenerator
Initializes for the given boundaries in which random states will be created
MCState - Class in burlap.domain.singleagent.mountaincar
 
MCState() - Constructor for class burlap.domain.singleagent.mountaincar.MCState
 
MCState(double, double) - Constructor for class burlap.domain.singleagent.mountaincar.MCState
 
mdpPlanner - Variable in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling
The SparseSampling planning instance to solve the problem.
mdpQSource - Variable in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
The fully observable MDP QProvider source.
MDPSolver - Class in burlap.behavior.singleagent
The abstract super class to use for various MDP solving algorithms, including both planning and learning algorithms.
MDPSolver() - Constructor for class burlap.behavior.singleagent.MDPSolver
 
MDPSolverInterface - Interface in burlap.behavior.singleagent
The top-level interface for algorithms that solve MDPs.
medianEpisodeReward - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.Trial
Stores the median reward by episode
medianEpisodeReward - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.Trial
Stores the median reward by episode
medianEpisodeRewardSeries - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.AgentDatasets
Most recent trial's median reward per step episode data
medianEpisodeRewardSeries - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.AgentDatasets
Most recent trial's median reward per step episode data
memoryQueue - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS
A queue for storing the most recently expanded nodes.
memorySize - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS
the size of the memory; that is, the number of recently expanded search nodes the valueFunction will remember.
memoryStateDepth - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS
Stores the depth at which each state in the memory was explored.
merAvgSeries - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.AgentDatasets
All trial's average median reward per episode series data
merAvgSeries - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.AgentDatasets
All trial's average median reward per episode series data
metric - Variable in class burlap.behavior.functionapproximation.dense.rbf.RBF
The distance metric to compare query input states to the centeredState
metricsSet - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter
A set specifying the performance metrics that will be plotted
metricsSet - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter
A set specifying the performance metrics that will be plotted
minEligibityForUpdate - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
The minimum eligibility value of a trace that will cause it to be updated
minimumLR - Variable in class burlap.behavior.learningrate.ExponentialDecayLR
The minimum learning rate
minimumLR - Variable in class burlap.behavior.learningrate.SoftTimeInverseDecayLR
The minimum learning rate
MinMax - Class in burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers
Finds the MinMax equilibrium using linear programming and returns the appropraite strategy.
MinMax() - Constructor for class burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers.MinMax
 
MinMaxQ - Class in burlap.behavior.stochasticgames.madynamicprogramming.backupOperators
A minmax operator.
MinMaxQ() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.backupOperators.MinMaxQ
 
MinMaxSolver - Class in burlap.behavior.stochasticgames.solvers
 
minNewStepsForLearningPI - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI
The minimum number of new observations received from learning episodes before LSPI will be run again.
minNumRolloutsWithSmallValueChange - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP
RTDP will be delcared "converged" if there are this many consecutive policy rollouts in which the value function change is smaller than the maxDelta value.
minProb - Variable in class burlap.behavior.singleagent.options.model.BFSMarkovOptionModel
 
minStepAndEpisodes(List<PerformancePlotter.Trial>) - Method in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter
Returns the minimum steps and episodes across all trials
minStepAndEpisodes(List<MultiAgentPerformancePlotter.Trial>) - Method in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter
Returns the minimum steps and episodes across all trials
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
 
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
 
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
 
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
 
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
 
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
 
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
 
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
 
MLIRL - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
An implementation of Maximum-likelihood Inverse Reinforcement Learning [1].
MLIRL(MLIRLRequest, double, double, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL
Initializes.
mlirlInstance - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRL
The MLIRL instance used to perform the maximization step for each clusters reward function parameter values.
MLIRLRequest - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
A request object for Maximum-Likelihood Inverse Reinforcement Learning (MLIRL).
MLIRLRequest(SADomain, Planner, List<Episode>, DifferentiableRF) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRLRequest
Initializes the request without any expert trajectory weights (which will be assumed to have a value 1).
MLIRLRequest(SADomain, List<Episode>, DifferentiableRF, HashableStateFactory) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRLRequest
Initializes without any expert trajectory weights (which will be assumed to have a value 1) and requests a default DifferentiableQFunction instance to be created using the HashableStateFactory provided.
mode(int) - Static method in class burlap.debugtools.DPrint
Returns the print mode for a given debug code
model - Variable in class burlap.behavior.singleagent.learnfromdemo.CustomRewardModel
 
model - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
The model of the world that is being learned.
model - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
The model of the world that is being learned.
model - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.UnmodeledFavoredPolicy
 
model - Variable in class burlap.behavior.singleagent.MDPSolver
 
model - Variable in class burlap.behavior.singleagent.options.model.BFSMarkovOptionModel
 
model - Variable in class burlap.mdp.singleagent.environment.SimulatedEnvironment
 
model - Variable in class burlap.mdp.singleagent.SADomain
 
modelChanged(State) - Method in interface burlap.behavior.singleagent.learning.modellearning.ModelLearningPlanner
Tells the valueFunction that the model has changed and that it will need to replan accordingly
modelChanged(State) - Method in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
 
modeledRewardFunction - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
The modeled reward function that is being learned.
modeledTerminalFunction - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
The modeled terminal state function.
ModelLearningPlanner - Interface in burlap.behavior.singleagent.learning.modellearning
Interface for defining planning algorithms that operate on iteratively learned models.
modelPlannedPolicy() - Method in interface burlap.behavior.singleagent.learning.modellearning.ModelLearningPlanner
Returns a policy encoding the planner's results.
modelPlannedPolicy() - Method in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
 
modelPlanner - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
The valueFunction used on the modeled world to update the value function
modelPlanner - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax
The model-adaptive planning algorithm to use
modelPolicy - Variable in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
The greedy policy that results from VI
modifyEO(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learning.modellearning.rmax.RMaxModel
 
modifyOutcome(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learnfromdemo.CustomRewardModel
 
modifyOutcome(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learnfromdemo.RewardValueProjection.CustomRewardNoTermModel
 
mostRecentTrialEnabled() - Method in enum burlap.behavior.singleagent.auxiliary.performance.TrialMode
Returns true if the most recent trial plots will be plotted by this mode.
MountainCar - Class in burlap.domain.singleagent.mountaincar
A domain generator for the classic mountain car domain with default dynamics follow those implemented by Singh and Sutton [1].
MountainCar() - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar
 
MountainCar.ClassicMCTF - Class in burlap.domain.singleagent.mountaincar
A Terminal Function for the Mountain Car domain that terminates when the agent's position is >= the max position in the world (0.5 default).
MountainCar.MCModel - Class in burlap.domain.singleagent.mountaincar
 
MountainCar.MCPhysicsParams - Class in burlap.domain.singleagent.mountaincar
 
MountainCarVisualizer - Class in burlap.domain.singleagent.mountaincar
A class for creating a Visualizer for a MountainCar Domain.
MountainCarVisualizer.AgentPainter - Class in burlap.domain.singleagent.mountaincar
 
MountainCarVisualizer.HillPainter - Class in burlap.domain.singleagent.mountaincar
Class for drawing a black outline of the hill that the mountain car climbs.
move(FrostbiteState, int, int) - Method in class burlap.domain.singleagent.frostbite.FrostbiteModel
Attempts to move the agent into the given position, taking into account platforms and screen borders
move(State, int, int) - Method in class burlap.domain.singleagent.gridworld.GridWorldDomain.GridWorldModel
Attempts to move the agent into the given position, taking into account walls and blocks
move(State, int) - Method in class burlap.domain.singleagent.mountaincar.MountainCar.MCModel
Changes the agents position in the provided state using car engine acceleration in the specified direction.
moveCarriedBlockToNewAgentPosition(BlockDudeState, BlockDudeAgent, int, int, int, int) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel
Moves a carried block to a new position of the agent
moveClassicModel(State, double) - Method in class burlap.domain.singleagent.cartpole.model.CPClassicModel
Simulates the physics for one time step give the input state s, and the direction of force applied.
moveCorrectModel(State, double) - Method in class burlap.domain.singleagent.cartpole.model.CPCorrectModel
Simulates the physics for one time step give the input state s, and the direction of force applied.
moveHorizontally(BlockDudeState, int) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel
Modifies state s to be the result of a horizontal movement.
movementDirectionFromIndex(int) - Static method in class burlap.domain.singleagent.gridworld.GridWorldDomain
Returns the change in x and y position for a given direction number.
movementForceMag - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams
The force magnatude that can be exterted in either direction on the cart
moveUp(BlockDudeState) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel
Modifies state s to be the result of a vertical movement, that will result in the agent onto the platform adjacent to its current location in the direction the agent is facing, provided that there is room for the agent (and any block it's holding) to step onto it.
MultiAgentDPPlanningAgent - Class in burlap.behavior.stochasticgames.agents.madp
A agent that using a MADynamicProgramming planning algorithm to compute the value of each state and then follow a policy derived from a joint policy that is derived from that estimated value function.
MultiAgentDPPlanningAgent(SGDomain, MADynamicProgramming, PolicyFromJointPolicy, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MultiAgentDPPlanningAgent
Initializes.
MultiAgentExperimenter - Class in burlap.behavior.stochasticgames.auxiliary.performance
This class is used to simplify the comparison of agent performance in a stochastic game world.
MultiAgentExperimenter(WorldGenerator, TerminalFunction, int, int, AgentFactoryAndType...) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentExperimenter
Initializes.
MultiAgentPerformancePlotter - Class in burlap.behavior.stochasticgames.auxiliary.performance
This class is a world observer used for recording and plotting the performance of the agents in the world.
MultiAgentPerformancePlotter(TerminalFunction, int, int, int, int, TrialMode, PerformanceMetric...) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter
Initializes
MultiAgentPerformancePlotter.AgentDatasets - Class in burlap.behavior.stochasticgames.auxiliary.performance
A datastructure for maintain the plot series data in the current agent
MultiAgentPerformancePlotter.DatasetsAndTrials - Class in burlap.behavior.stochasticgames.auxiliary.performance
A class for storing the tiral data and series datasets for a given agent.
MultiAgentPerformancePlotter.MutableBoolean - Class in burlap.behavior.stochasticgames.auxiliary.performance
A class for a mutable boolean
MultiAgentPerformancePlotter.Trial - Class in burlap.behavior.stochasticgames.auxiliary.performance
A datastructure for maintaining all the metric stats for a single trial.
MultiAgentQLearning - Class in burlap.behavior.stochasticgames.agents.maql
A class for performing multi-agent Q-learning in which different Q-value backup operators can be provided to enable the learning of different solution concepts.
MultiAgentQLearning(SGDomain, double, double, HashableStateFactory, double, SGBackupOperator, boolean, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
Initializes this Q-learning agent.
MultiAgentQLearning(SGDomain, double, LearningRate, HashableStateFactory, QFunction, SGBackupOperator, boolean, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
Initializes this Q-learning agent.
MultiAgentQSourceProvider - Interface in burlap.behavior.stochasticgames.madynamicprogramming
An interface for an object that can providing the Q-values stored for each agent in a problem.
MultiLayerRenderer - Class in burlap.visualizer
A MultiLayerRenderer is a canvas that will sequentially render a set of render layers, one on top of the other, to the same 2D graphics context.
MultiLayerRenderer() - Constructor for class burlap.visualizer.MultiLayerRenderer
 
MultipleIntentionsMLIRL - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
An implementation of Multiple Intentions Maximum-likelihood Inverse Reinforcement Learning [1].
MultipleIntentionsMLIRL(MultipleIntentionsMLIRLRequest, int, double, double, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRL
Initializes.
MultipleIntentionsMLIRLRequest - Class in burlap.behavior.singleagent.learnfromdemo.mlirl
A problem request object for MultipleIntentionsMLIRL.
MultipleIntentionsMLIRLRequest(SADomain, QGradientPlannerFactory, List<Episode>, DifferentiableRF, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRLRequest
Initializes
MultipleIntentionsMLIRLRequest(SADomain, List<Episode>, DifferentiableRF, int, HashableStateFactory) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRLRequest
Initializes using a default QGradientPlannerFactory.DifferentiableVIFactory that is based on the provided HashableStateFactory object.
MultiStatePrePlanner - Class in burlap.behavior.singleagent.planning.deterministic
This is a helper class that is used to run a valueFunction from multiple initial states to ensure that an adequate plan/policy exists for each them.
MutableBoolean(boolean) - Constructor for class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.MutableBoolean
Initializes with the given Boolean value
MutableBoolean(boolean) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.MutableBoolean
Initializes with the given Boolean value
MutableDouble(double) - Constructor for class burlap.behavior.learningrate.ExponentialDecayLR.MutableDouble
 
MutableInt(int) - Constructor for class burlap.behavior.learningrate.SoftTimeInverseDecayLR.MutableInt
 
MutableInt() - Constructor for class burlap.behavior.singleagent.interfaces.rlglue.RLGlueAgent.MutableInt
 
MutableOOState - Interface in burlap.mdp.core.oo.state
A MutableState extension OOState.
MutableState - Interface in burlap.mdp.core.state
A State interface extension for mutable states whose values can be directly modified.
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger.GrimTriggerAgentFactory
The agent's cooperate action
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger
This agent's cooperate action
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat
This agent's cooperate action
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat.TitForTatAgentFactory
This agent's cooperate action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger.GrimTriggerAgentFactory
The agent's defect action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger
This agent's defect action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat
This agent's defect action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat.TitForTatAgentFactory
This agent's defect action
myQSource - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
This agent's Q-value source
MyTimer - Class in burlap.debugtools
A data structure for keeping track of elapsed and average time.
MyTimer() - Constructor for class burlap.debugtools.MyTimer
Creates a new timer.
MyTimer(boolean) - Constructor for class burlap.debugtools.MyTimer
Creates a new timer and starts it if start=true.
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
Skip navigation links