M-Index

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

M

MacroAction - Class in burlap.behavior.singleagent.options: A macro action is a non-Markov option that always executes a fixed sequence of actions.
MacroAction() - Constructor for class burlap.behavior.singleagent.options.MacroAction: Default constructor for serialization purposes.
MacroAction(String, List<Action>) - Constructor for class burlap.behavior.singleagent.options.MacroAction: Instantiates a macro action with a given name and action sequence.
MADPPlanAgentFactory - Class in burlap.behavior.stochasticgames.agents.madp: An agent factory for the MultiAgentDPPlanningAgent agent.
MADPPlanAgentFactory(SGDomain, MADynamicProgramming, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlanAgentFactory: Initializes.
MADPPlanAgentFactory(SGDomain, MADPPlannerFactory, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlanAgentFactory: Initializes
MADPPlannerFactory - Interface in burlap.behavior.stochasticgames.agents.madp: An interface for generating MADynamicProgramming objects.
MADPPlannerFactory.ConstantMADPPlannerFactory - Class in burlap.behavior.stochasticgames.agents.madp: MADynamicProgramming factory that always returns the same object instance, unless the reference is chaned with a mutator.
MADPPlannerFactory.MAVIPlannerFactory - Class in burlap.behavior.stochasticgames.agents.madp: Factory for generating multi-agent value iteration planners (MAValueIteration).
MADynamicProgramming - Class in burlap.behavior.stochasticgames.madynamicprogramming: An abstract value function based planning algorithm base for sequential stochastic games that require the computation of Q-values for each agent for each joint action.
MADynamicProgramming() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.MADynamicProgramming
MADynamicProgramming.BackupBasedQSource - Class in burlap.behavior.stochasticgames.madynamicprogramming: A QSourceForSingleAgent implementation which stores a value function for an agent and produces Joint action Q-values by marginalizing over the transition dynamics the reward and discounted next state value.
MADynamicProgramming.JointActionTransitions - Class in burlap.behavior.stochasticgames.madynamicprogramming: A class for holding all of the transition dynamic information for a given joint action in a given state.
main(String[]) - Static method in class burlap.behavior.singleagent.Episode
main(String[]) - Static method in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling
main(String[]) - Static method in class burlap.behavior.stochasticgames.GameEpisode
main(String[]) - Static method in class burlap.behavior.stochasticgames.solvers.CorrelatedEquilibriumSolver
main(String[]) - Static method in class burlap.datastructures.AlphanumericSorting: Testing the alphanumeric sorting
main(String[]) - Static method in class burlap.datastructures.StochasticTree: Demos how to use this class
main(String[]) - Static method in class burlap.debugtools.MyTimer: Demo of usage
main(String[]) - Static method in class burlap.debugtools.RandomFactory: Example usage.
main(String[]) - Static method in class burlap.domain.singleagent.blockdude.BlockDude: Runs an interactive visual explorer for level one of Block Dude.
main(String[]) - Static method in class burlap.domain.singleagent.blocksworld.BlocksWorld: Main method for exploring the domain.
main(String[]) - Static method in class burlap.domain.singleagent.cartpole.CartPoleDomain: Launches an interactive visualize in which key 'a' applies a force in the left direction and key 'd' applies force in the right direction.
main(String[]) - Static method in class burlap.domain.singleagent.cartpole.InvertedPendulum
main(String[]) - Static method in class burlap.domain.singleagent.frostbite.FrostbiteDomain: Main function to test the domain.
main(String[]) - Static method in class burlap.domain.singleagent.graphdefined.GraphDefinedDomain
main(String[]) - Static method in class burlap.domain.singleagent.gridworld.GridWorldDomain: Creates a visual explorer or terminal explorer.
main(String[]) - Static method in class burlap.domain.singleagent.lunarlander.LunarLanderDomain: This method will launch a visual explorer for the lunar lander domain.
main(String[]) - Static method in class burlap.domain.singleagent.mountaincar.MountainCar: Will launch a visual explorer for the mountain car domain that is controlled with the a-s-d keys.
main(String[]) - Static method in class burlap.domain.singleagent.pomdp.tiger.TigerDomain: Main method for interacting with the tiger domain via an EnvironmentShell By default, the TerminalExplorer interacts with the partially observable environment (SimulatedPOEnvironment), which means you only get to see the observations that the agent would.
main(String[]) - Static method in class burlap.domain.stochasticgames.gridgame.GridGame: Creates a visual explorer for a simple domain with two agents in it.
main(String[]) - Static method in class burlap.domain.stochasticgames.normalform.SingleStageNormalFormGame: A main method showing example code that would be used to create an instance of Prisoner's dilemma and begin playing it with a SGWorldShell.
main(String[]) - Static method in class burlap.testing.TestRunner
maintainClosed - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.DFS: Whether to keep track of a closed list to prevent exploring already seen nodes.
makeEmptyMap() - Method in class burlap.domain.singleagent.gridworld.GridWorldDomain: Makes the map empty
MAMaxQLearningFactory(SGDomain, double, LearningRate, HashableStateFactory, QFunction, boolean, double) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory.MAMaxQLearningFactory
manualAgents - Variable in class burlap.shell.command.world.ManualAgentsCommands
ManualAgentsCommands - Class in burlap.shell.command.world: A controller for a set of ShellCommand objects.
ManualAgentsCommands() - Constructor for class burlap.shell.command.world.ManualAgentsCommands
ManualAgentsCommands.ListManualAgents - Class in burlap.shell.command.world
ManualAgentsCommands.LSManualAgentActionsCommands - Class in burlap.shell.command.world
ManualAgentsCommands.ManualSGAgent - Class in burlap.shell.command.world
ManualAgentsCommands.RegisterAgentCommand - Class in burlap.shell.command.world
ManualAgentsCommands.SetAgentAction - Class in burlap.shell.command.world
ManualSGAgent() - Constructor for class burlap.shell.command.world.ManualAgentsCommands.ManualSGAgent
map - Variable in class burlap.domain.singleagent.blockdude.state.BlockDudeMap
map - Variable in class burlap.domain.singleagent.blockdude.state.BlockDudeState
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldDomain: The wall map where the first index is the x position and the second index is the y position.
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.CellPainter
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.LocationPainter
map - Variable in class burlap.domain.singleagent.gridworld.GridWorldVisualizer.MapPainter
MapPainter(int[][]) - Constructor for class burlap.domain.singleagent.gridworld.GridWorldVisualizer.MapPainter: Initializes for the domain and wall map
mapState(State) - Method in class burlap.mdp.auxiliary.common.IdentityStateMapping
mapState(State) - Method in class burlap.mdp.auxiliary.common.ShallowIdentityStateMapping
mapState(State) - Method in interface burlap.mdp.auxiliary.StateMapping
MAQLControlledQSourceMap(List<SGAgent>) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.AgentQSourceMap.MAQLControlledQSourceMap: Initializes with the list of agents that each keep their own Q-source.
MAQLFactory - Class in burlap.behavior.stochasticgames.agents.maql: This class provides a factory for MultiAgentQLearning agents.
MAQLFactory() - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory: Empty constructor.
MAQLFactory(SGDomain, double, double, HashableStateFactory, double, SGBackupOperator, boolean) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory: Initializes.
MAQLFactory(SGDomain, double, LearningRate, HashableStateFactory, QFunction, SGBackupOperator, boolean, PolicyFromJointPolicy) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MAQLFactory: Initializes.
MAQLFactory.CoCoQLearningFactory - Class in burlap.behavior.stochasticgames.agents.maql: Factory for generating CoCo-Q agents.
MAQLFactory.MAMaxQLearningFactory - Class in burlap.behavior.stochasticgames.agents.maql: Factory for generating Max multiagent Q-learning agents.
MAQSourcePolicy - Class in burlap.behavior.stochasticgames.madynamicprogramming: An abstract extension of the JointPolicy class that adds a required interface of being able to a MultiAgentQSourceProvider.
MAQSourcePolicy() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.MAQSourcePolicy
marginalizeColPlayerStrategy(double[][]) - Static method in class burlap.behavior.stochasticgames.solvers.GeneralBimatrixSolverTools: Returns the column player's strategy by marginalizing it out from a joint action probability distribution represented as a matrix
marginalizeRowPlayerStrategy(double[][]) - Static method in class burlap.behavior.stochasticgames.solvers.GeneralBimatrixSolverTools: Returns the row player's strategy by marginalizing it out from a joint action probability distribution represented as a matrix
markAsTerminalPosition(int, int) - Method in class burlap.domain.singleagent.gridworld.GridWorldTerminalFunction: Marks a position as a terminal position for the agent.
markov() - Method in class burlap.behavior.singleagent.options.MacroAction
markov() - Method in interface burlap.behavior.singleagent.options.Option
markov() - Method in class burlap.behavior.singleagent.options.SubgoalOption
MaskedConfig - Class in burlap.statehashing.masked
MaskedConfig() - Constructor for class burlap.statehashing.masked.MaskedConfig
MaskedConfig(Set<Object>, Set<String>) - Constructor for class burlap.statehashing.masked.MaskedConfig
MaskedHashableStateFactory - Class in burlap.statehashing.masked: This class produces HashableState instances in which the hash code and equality of the states masks (ignores) specified state variables.
MaskedHashableStateFactory() - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory: Default constructor: object identifier independent, no hash code caching, and no object class or attribute masks.
MaskedHashableStateFactory(boolean) - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory: Initializes with no hash code caching and no object class or attribute masks.
MaskedHashableStateFactory(boolean, boolean, String...) - Constructor for class burlap.statehashing.masked.MaskedHashableStateFactory: Initializes with a specified variable or object class mask.
maskedObjectClasses - Variable in class burlap.statehashing.masked.MaskedConfig
maskedVariables - Variable in class burlap.statehashing.masked.MaskedConfig
MatchEntry - Class in burlap.mdp.stochasticgames.tournament: This class indicates which player in a tournament is to play in a match and what SGAgentType role they will play.
MatchEntry(SGAgentType, int) - Constructor for class burlap.mdp.stochasticgames.tournament.MatchEntry: Initializes the MatchEntry
matchingActionFeature(List<TileCodingFeatures.ActionFeatureID>, Action) - Method in class burlap.behavior.functionapproximation.sparse.tilecoding.TileCodingFeatures: Returns the TileCodingFeatures.ActionFeatureID with an equivalent Action in the given list or null if there is none.
MatchSelector - Interface in burlap.mdp.stochasticgames.tournament: An interface for defining how matches in a tournament will be determined
MatrixAction(String, int) - Constructor for class burlap.domain.stochasticgames.normalform.SingleStageNormalFormGame.MatrixAction
MAValueIteration - Class in burlap.behavior.stochasticgames.madynamicprogramming.dpplanners: A class for performing multi-agent value iteration.
MAValueIteration(SGDomain, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration: Initializes.
MAValueIteration(SGDomain, JointRewardFunction, TerminalFunction, double, HashableStateFactory, ValueFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration: Initializes.
MAValueIteration(SGDomain, List<SGAgentType>, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration: Initializes.
MAValueIteration(SGDomain, List<SGAgentType>, JointRewardFunction, TerminalFunction, double, HashableStateFactory, ValueFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration: Initializes.
MAVIPlannerFactory(SGDomain, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, double, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory: Initializes.
MAVIPlannerFactory(SGDomain, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, QFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory: Initializes.
MAVIPlannerFactory(SGDomain, List<SGAgentType>, JointModel, JointRewardFunction, TerminalFunction, double, HashableStateFactory, QFunction, SGBackupOperator, double, int) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory: Initializes.
maxActions - Variable in class burlap.domain.singleagent.graphdefined.GraphDefinedDomain: The maximum number of actions available from any given state node.
maxAngleSpeed - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams: The maximum speed of the change in angle.
maxAngleSpeed - Variable in class burlap.domain.singleagent.cartpole.InvertedPendulum.IPPhysicsParams: The maximum speed (magnitude) of the change in angle.
maxBackups - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.PrioritizedSweeping: THe maximum number Bellman backups permitted
maxBetaScaled(double[], double) - Static method in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.BoltzmannPolicyGradient: Given an array of Q-values, returns the maximum Q-value multiplied by the parameter beta.
maxCartSpeed - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams: The maximum speed of the cart.
maxChange - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI: The maximum change in weights permitted to terminate LSPI.
maxDelta - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableVI: When the maximum change in the value function is smaller than this value, VI will terminate.
maxDelta - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory: The value function change threshold to stop VI.
maxDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP: When the maximum change in the value function from a rollout is smaller than this value, VI will terminate.
maxDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.ValueIteration: When the maximum change in the value function is smaller than this value, VI will terminate.
maxDelta - Variable in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI: The maximum change in the value function that will cause planning to terminate.
maxDelta - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory: The threshold that will cause VI to terminate when the max change in Q-value for is less than it
maxDelta - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration: The threshold that will cause VI to terminate when the max change in Q-value for is less than it
maxDepth - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.DFS: The max depth of the search tree that will be explored.
maxDepth - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP: The maximum depth/length of a rollout before it is terminated and Bellman updates are performed.
maxDepth - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP: The maximum depth/length of a rollout before it is terminated and Bellman updates are performed.
maxDiff - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP: The max permitted difference between the lower bound and upperbound for planning termination.
maxDim - Variable in class burlap.domain.stochasticgames.gridgame.GridGame: The width and height of the world.
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.actorcritic.ActorCritic: The maximum number of steps of an episode before the agent will manually terminate it.This is defaulted to Integer.MAX_VALUE.
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.actorcritic.critics.TimeIndexedTDLambda: The maximum number of steps possible in an episode.
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning: The maximum number of steps that will be taken in an episode before the agent terminates a learning episode
maxEpisodeSize - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam: The maximum number of steps that will be taken in an episode before the agent terminates a learning episode
maxEvalDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyEvaluation: When the maximum change in the value function is smaller than this value, policy evaluation will terminate.
maxEvalDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration: When the maximum change in the value function is smaller than this value, policy evaluation will terminate.
maxEvalIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyEvaluation: When the maximum number of evaluation iterations passes this number, policy evaluation will terminate
maxGT - Variable in class burlap.domain.stochasticgames.gridgame.GridGame: The number of goal types
maxHeap - Variable in class burlap.datastructures.HashIndexedHeap: If true, this is ordered according to a max heap; if false ordered according to a min heap.
maxHorizon - Variable in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.apprenticeship.ApprenticeshipLearningRequest: The maximum number of iterations of apprenticeship learning
maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableVI: When the number of VI iterations exceeds this value, VI will terminate.
maxIterations - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory: The maximum allowed number of VI iterations.
maxIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration: When the number of policy evaluation iterations exceeds this value, policy evaluation will terminate.
maxIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.ValueIteration: When the number of VI iterations exceeds this value, VI will terminate.
maxIterations - Variable in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI: The maximum number of iterations to run.
maxIterations - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory: The maximum allowable number of iterations until VI termination
maxIterations - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.dpplanners.MAValueIteration: The maximum allowable number of iterations until VI termination
maxLearningSteps - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI: The maximum number of learning steps in an episode when the LSPI.runLearningEpisode(burlap.mdp.singleagent.environment.Environment) method is called.
maxLikelihoodChange - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL: The likelihood change threshold to stop gradient ascent.
MaxMax - Class in burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers: A class for finding a stategy that maxmizes the player's payoff under the assumption that their "opponent" is friendly and will try to do the same.
MaxMax() - Constructor for class burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers.MaxMax
maxNonZeroCoefficients - Variable in class burlap.behavior.functionapproximation.dense.fourier.FourierBasis: The maximum number of non-zero coefficient entries permitted in a coefficient vector
maxNumPlanningIterations - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI: The maximum number of policy iterations permitted when LSPI is run from the LSPI.planFromState(State) or LSPI.runLearningEpisode(burlap.mdp.singleagent.environment.Environment) methods.
maxNumSteps - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP: The maximum number of learning steps per episode before the agent gives up
maxNumSteps - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax: The maximum number of learning steps per episode before the agent gives up
maxPIDelta - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration: When the maximum change between policy evaluations is smaller than this value, planning will terminate.
maxPlayers - Variable in class burlap.behavior.stochasticgames.agents.naiveq.history.SGQWActionHistoryFactory: The maximum number of players that can be in the game
maxPlyrs - Variable in class burlap.domain.stochasticgames.gridgame.GridGame: The maximum number of players that can be in the game
maxPolicyIterations - Variable in class burlap.behavior.singleagent.planning.stochastic.policyiteration.PolicyIteration: When the number of policy iterations passes this value, planning will terminate.
maxQ(State) - Method in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP: Returns the maximum Q-value entry for the given state with ties broken randomly.
MaxQ - Class in burlap.behavior.stochasticgames.madynamicprogramming.backupOperators: A classic MDP-style max backup operator in which an agent back ups his max Q-value in the state.
MaxQ() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.backupOperators.MaxQ
maxQ(QProvider, State) - Static method in class burlap.behavior.valuefunction.QProvider.Helper: Returns the optimal state value function for a state given a QProvider.
maxQChangeForPlanningTermination - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning: The maximum allowable change in the Q-function during an episode before the planning method terminates.
maxQChangeInLastEpisode - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning: The maximum Q-value change that occurred in the last learning episode.
maxRollouts - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.BoundedRTDP: the max number of rollouts to perform when planning is started unless the value function margin is small enough.
maxRollOutsFromRoot - Variable in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
maxSelfTransitionProb - Variable in class burlap.behavior.singleagent.planning.stochastic.valueiteration.PrioritizedSweeping.BPTRNode
maxStages - Variable in class burlap.mdp.stochasticgames.tournament.Tournament
maxSteps - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL: The maximum number of steps of gradient ascent.
maxTimeStep() - Method in class burlap.behavior.singleagent.Episode: Returns the maximum time step index in this episode which is the Episode.numTimeSteps()-1.
maxTimeStep() - Method in class burlap.behavior.stochasticgames.GameEpisode: Returns the max time step index in this game which equals GameEpisode.numTimeSteps()-1.
maxTNormed() - Method in class burlap.datastructures.BoltzmannDistribution: Returns the maximum temperature normalized preference
maxWeightChangeForPlanningTermination - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam: The maximum allowable change in the VFA weights during an episode before the planning method terminates.
maxWeightChangeInLastEpisode - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam: The maximum VFA weight change that occurred in the last learning episode.
maxWT - Variable in class burlap.domain.stochasticgames.gridgame.GridGame: The number of wall types
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDude: Domain parameter specifying the maximum x dimension value of the world
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeModel
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
maxx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDude: Domain parameter specifying the maximum y dimension value of the world
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeModel
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
maxy - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
MCModel(MountainCar.MCPhysicsParams) - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar.MCModel
MCPhysicsParams() - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar.MCPhysicsParams
MCRandomStateGenerator - Class in burlap.domain.singleagent.mountaincar: Generates MountainCar states with the x-position between some specified range and the velocity between some specified range.
MCRandomStateGenerator(MountainCar.MCPhysicsParams) - Constructor for class burlap.domain.singleagent.mountaincar.MCRandomStateGenerator: Initializes for the MountainCar Domain object for which states will be generated.
MCRandomStateGenerator(double, double, double, double) - Constructor for class burlap.domain.singleagent.mountaincar.MCRandomStateGenerator: Initializes for the given boundaries in which random states will be created
MCState - Class in burlap.domain.singleagent.mountaincar
MCState() - Constructor for class burlap.domain.singleagent.mountaincar.MCState
MCState(double, double) - Constructor for class burlap.domain.singleagent.mountaincar.MCState
mdpPlanner - Variable in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling: The SparseSampling planning instance to solve the problem.
mdpQSource - Variable in class burlap.behavior.singleagent.pomdp.qmdp.QMDP: The fully observable MDP QProvider source.
MDPSolver - Class in burlap.behavior.singleagent: The abstract super class to use for various MDP solving algorithms, including both planning and learning algorithms.
MDPSolver() - Constructor for class burlap.behavior.singleagent.MDPSolver
MDPSolverInterface - Interface in burlap.behavior.singleagent: The top-level interface for algorithms that solve MDPs.
medianEpisodeReward - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.Trial: Stores the median reward by episode
medianEpisodeReward - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.Trial: Stores the median reward by episode
medianEpisodeRewardSeries - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.AgentDatasets: Most recent trial's median reward per step episode data
medianEpisodeRewardSeries - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.AgentDatasets: Most recent trial's median reward per step episode data
memoryQueue - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS: A queue for storing the most recently expanded nodes.
memorySize - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS: the size of the memory; that is, the number of recently expanded search nodes the valueFunction will remember.
memoryStateDepth - Variable in class burlap.behavior.singleagent.planning.deterministic.uninformed.dfs.LimitedMemoryDFS: Stores the depth at which each state in the memory was explored.
merAvgSeries - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.AgentDatasets: All trial's average median reward per episode series data
merAvgSeries - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.AgentDatasets: All trial's average median reward per episode series data
metric - Variable in class burlap.behavior.functionapproximation.dense.rbf.RBF: The distance metric to compare query input states to the centeredState
metricsSet - Variable in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter: A set specifying the performance metrics that will be plotted
metricsSet - Variable in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter: A set specifying the performance metrics that will be plotted
minEligibityForUpdate - Variable in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam: The minimum eligibility value of a trace that will cause it to be updated
minimumLR - Variable in class burlap.behavior.learningrate.ExponentialDecayLR: The minimum learning rate
minimumLR - Variable in class burlap.behavior.learningrate.SoftTimeInverseDecayLR: The minimum learning rate
MinMax - Class in burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers: Finds the MinMax equilibrium using linear programming and returns the appropraite strategy.
MinMax() - Constructor for class burlap.behavior.stochasticgames.agents.twoplayer.singlestage.equilibriumplayer.equilibriumsolvers.MinMax
MinMaxQ - Class in burlap.behavior.stochasticgames.madynamicprogramming.backupOperators: A minmax operator.
MinMaxQ() - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.backupOperators.MinMaxQ
MinMaxSolver - Class in burlap.behavior.stochasticgames.solvers
minNewStepsForLearningPI - Variable in class burlap.behavior.singleagent.learning.lspi.LSPI: The minimum number of new observations received from learning episodes before LSPI will be run again.
minNumRolloutsWithSmallValueChange - Variable in class burlap.behavior.singleagent.planning.stochastic.rtdp.RTDP: RTDP will be delcared "converged" if there are this many consecutive policy rollouts in which the value function change is smaller than the maxDelta value.
minProb - Variable in class burlap.behavior.singleagent.options.model.BFSMarkovOptionModel
minStepAndEpisodes(List<PerformancePlotter.Trial>) - Method in class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter: Returns the minimum steps and episodes across all trials
minStepAndEpisodes(List<MultiAgentPerformancePlotter.Trial>) - Method in class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter: Returns the minimum steps and episodes across all trials
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
minx - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.AgentPainter
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BlockPainter
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.BricksPainter
miny - Variable in class burlap.domain.singleagent.blockdude.BlockDudeVisualizer.ExitPainter
MLIRL - Class in burlap.behavior.singleagent.learnfromdemo.mlirl: An implementation of Maximum-likelihood Inverse Reinforcement Learning [1].
MLIRL(MLIRLRequest, double, double, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRL: Initializes.
mlirlInstance - Variable in class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRL: The MLIRL instance used to perform the maximization step for each clusters reward function parameter values.
MLIRLRequest - Class in burlap.behavior.singleagent.learnfromdemo.mlirl: A request object for Maximum-Likelihood Inverse Reinforcement Learning (MLIRL).
MLIRLRequest(SADomain, Planner, List<Episode>, DifferentiableRF) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRLRequest: Initializes the request without any expert trajectory weights (which will be assumed to have a value 1).
MLIRLRequest(SADomain, List<Episode>, DifferentiableRF, HashableStateFactory) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MLIRLRequest: Initializes without any expert trajectory weights (which will be assumed to have a value 1) and requests a default DifferentiableQFunction instance to be created using the HashableStateFactory provided.
mode(int) - Static method in class burlap.debugtools.DPrint: Returns the print mode for a given debug code
model - Variable in class burlap.behavior.singleagent.learnfromdemo.CustomRewardModel
model - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP: The model of the world that is being learned.
model - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax: The model of the world that is being learned.
model - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.UnmodeledFavoredPolicy
model - Variable in class burlap.behavior.singleagent.MDPSolver
model - Variable in class burlap.behavior.singleagent.options.model.BFSMarkovOptionModel
model - Variable in class burlap.mdp.singleagent.environment.SimulatedEnvironment
model - Variable in class burlap.mdp.singleagent.SADomain
modelChanged(State) - Method in interface burlap.behavior.singleagent.learning.modellearning.ModelLearningPlanner: Tells the valueFunction that the model has changed and that it will need to replan accordingly
modelChanged(State) - Method in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
modeledRewardFunction - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax: The modeled reward function that is being learned.
modeledTerminalFunction - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax: The modeled terminal state function.
ModelLearningPlanner - Interface in burlap.behavior.singleagent.learning.modellearning: Interface for defining planning algorithms that operate on iteratively learned models.
modelPlannedPolicy() - Method in interface burlap.behavior.singleagent.learning.modellearning.ModelLearningPlanner: Returns a policy encoding the planner's results.
modelPlannedPolicy() - Method in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner
modelPlanner - Variable in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP: The valueFunction used on the modeled world to update the value function
modelPlanner - Variable in class burlap.behavior.singleagent.learning.modellearning.rmax.PotentialShapedRMax: The model-adaptive planning algorithm to use
modelPolicy - Variable in class burlap.behavior.singleagent.learning.modellearning.modelplanners.VIModelLearningPlanner: The greedy policy that results from VI
modifyEO(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learning.modellearning.rmax.RMaxModel
modifyOutcome(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learnfromdemo.CustomRewardModel
modifyOutcome(EnvironmentOutcome) - Method in class burlap.behavior.singleagent.learnfromdemo.RewardValueProjection.CustomRewardNoTermModel
mostRecentTrialEnabled() - Method in enum burlap.behavior.singleagent.auxiliary.performance.TrialMode: Returns true if the most recent trial plots will be plotted by this mode.
MountainCar - Class in burlap.domain.singleagent.mountaincar: A domain generator for the classic mountain car domain with default dynamics follow those implemented by Singh and Sutton [1].
MountainCar() - Constructor for class burlap.domain.singleagent.mountaincar.MountainCar
MountainCar.ClassicMCTF - Class in burlap.domain.singleagent.mountaincar: A Terminal Function for the Mountain Car domain that terminates when the agent's position is >= the max position in the world (0.5 default).
MountainCar.MCModel - Class in burlap.domain.singleagent.mountaincar
MountainCar.MCPhysicsParams - Class in burlap.domain.singleagent.mountaincar
MountainCarVisualizer - Class in burlap.domain.singleagent.mountaincar: A class for creating a Visualizer for a MountainCar Domain.
MountainCarVisualizer.AgentPainter - Class in burlap.domain.singleagent.mountaincar
MountainCarVisualizer.HillPainter - Class in burlap.domain.singleagent.mountaincar: Class for drawing a black outline of the hill that the mountain car climbs.
move(FrostbiteState, int, int) - Method in class burlap.domain.singleagent.frostbite.FrostbiteModel: Attempts to move the agent into the given position, taking into account platforms and screen borders
move(State, int, int) - Method in class burlap.domain.singleagent.gridworld.GridWorldDomain.GridWorldModel: Attempts to move the agent into the given position, taking into account walls and blocks
move(State, int) - Method in class burlap.domain.singleagent.mountaincar.MountainCar.MCModel: Changes the agents position in the provided state using car engine acceleration in the specified direction.
moveCarriedBlockToNewAgentPosition(BlockDudeState, BlockDudeAgent, int, int, int, int) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel: Moves a carried block to a new position of the agent
moveClassicModel(State, double) - Method in class burlap.domain.singleagent.cartpole.model.CPClassicModel: Simulates the physics for one time step give the input state s, and the direction of force applied.
moveCorrectModel(State, double) - Method in class burlap.domain.singleagent.cartpole.model.CPCorrectModel: Simulates the physics for one time step give the input state s, and the direction of force applied.
moveHorizontally(BlockDudeState, int) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel: Modifies state s to be the result of a horizontal movement.
movementDirectionFromIndex(int) - Static method in class burlap.domain.singleagent.gridworld.GridWorldDomain: Returns the change in x and y position for a given direction number.
movementForceMag - Variable in class burlap.domain.singleagent.cartpole.CartPoleDomain.CPPhysicsParams: The force magnatude that can be exterted in either direction on the cart
moveUp(BlockDudeState) - Method in class burlap.domain.singleagent.blockdude.BlockDudeModel: Modifies state s to be the result of a vertical movement, that will result in the agent onto the platform adjacent to its current location in the direction the agent is facing, provided that there is room for the agent (and any block it's holding) to step onto it.
MultiAgentDPPlanningAgent - Class in burlap.behavior.stochasticgames.agents.madp: A agent that using a MADynamicProgramming planning algorithm to compute the value of each state and then follow a policy derived from a joint policy that is derived from that estimated value function.
MultiAgentDPPlanningAgent(SGDomain, MADynamicProgramming, PolicyFromJointPolicy, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.madp.MultiAgentDPPlanningAgent: Initializes.
MultiAgentExperimenter - Class in burlap.behavior.stochasticgames.auxiliary.performance: This class is used to simplify the comparison of agent performance in a stochastic game world.
MultiAgentExperimenter(WorldGenerator, TerminalFunction, int, int, AgentFactoryAndType...) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentExperimenter: Initializes.
MultiAgentPerformancePlotter - Class in burlap.behavior.stochasticgames.auxiliary.performance: This class is a world observer used for recording and plotting the performance of the agents in the world.
MultiAgentPerformancePlotter(TerminalFunction, int, int, int, int, TrialMode, PerformanceMetric...) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter: Initializes
MultiAgentPerformancePlotter.AgentDatasets - Class in burlap.behavior.stochasticgames.auxiliary.performance: A datastructure for maintain the plot series data in the current agent
MultiAgentPerformancePlotter.DatasetsAndTrials - Class in burlap.behavior.stochasticgames.auxiliary.performance: A class for storing the tiral data and series datasets for a given agent.
MultiAgentPerformancePlotter.MutableBoolean - Class in burlap.behavior.stochasticgames.auxiliary.performance: A class for a mutable boolean
MultiAgentPerformancePlotter.Trial - Class in burlap.behavior.stochasticgames.auxiliary.performance: A datastructure for maintaining all the metric stats for a single trial.
MultiAgentQLearning - Class in burlap.behavior.stochasticgames.agents.maql: A class for performing multi-agent Q-learning in which different Q-value backup operators can be provided to enable the learning of different solution concepts.
MultiAgentQLearning(SGDomain, double, double, HashableStateFactory, double, SGBackupOperator, boolean, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning: Initializes this Q-learning agent.
MultiAgentQLearning(SGDomain, double, LearningRate, HashableStateFactory, QFunction, SGBackupOperator, boolean, String, SGAgentType) - Constructor for class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning: Initializes this Q-learning agent.
MultiAgentQSourceProvider - Interface in burlap.behavior.stochasticgames.madynamicprogramming: An interface for an object that can providing the Q-values stored for each agent in a problem.
MultiLayerRenderer - Class in burlap.visualizer: A MultiLayerRenderer is a canvas that will sequentially render a set of render layers, one on top of the other, to the same 2D graphics context.
MultiLayerRenderer() - Constructor for class burlap.visualizer.MultiLayerRenderer
MultipleIntentionsMLIRL - Class in burlap.behavior.singleagent.learnfromdemo.mlirl: An implementation of Multiple Intentions Maximum-likelihood Inverse Reinforcement Learning [1].
MultipleIntentionsMLIRL(MultipleIntentionsMLIRLRequest, int, double, double, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRL: Initializes.
MultipleIntentionsMLIRLRequest - Class in burlap.behavior.singleagent.learnfromdemo.mlirl: A problem request object for MultipleIntentionsMLIRL.
MultipleIntentionsMLIRLRequest(SADomain, QGradientPlannerFactory, List<Episode>, DifferentiableRF, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRLRequest: Initializes
MultipleIntentionsMLIRLRequest(SADomain, List<Episode>, DifferentiableRF, int, HashableStateFactory) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.MultipleIntentionsMLIRLRequest: Initializes using a default QGradientPlannerFactory.DifferentiableVIFactory that is based on the provided HashableStateFactory object.
MultiStatePrePlanner - Class in burlap.behavior.singleagent.planning.deterministic: This is a helper class that is used to run a valueFunction from multiple initial states to ensure that an adequate plan/policy exists for each them.
MutableBoolean(boolean) - Constructor for class burlap.behavior.singleagent.auxiliary.performance.PerformancePlotter.MutableBoolean: Initializes with the given Boolean value
MutableBoolean(boolean) - Constructor for class burlap.behavior.stochasticgames.auxiliary.performance.MultiAgentPerformancePlotter.MutableBoolean: Initializes with the given Boolean value
MutableDouble(double) - Constructor for class burlap.behavior.learningrate.ExponentialDecayLR.MutableDouble
MutableInt(int) - Constructor for class burlap.behavior.learningrate.SoftTimeInverseDecayLR.MutableInt
MutableInt() - Constructor for class burlap.behavior.singleagent.interfaces.rlglue.RLGlueAgent.MutableInt
MutableOOState - Interface in burlap.mdp.core.oo.state: A MutableState extension OOState.
MutableState - Interface in burlap.mdp.core.state: A State interface extension for mutable states whose values can be directly modified.
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger.GrimTriggerAgentFactory: The agent's cooperate action
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger: This agent's cooperate action
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat: This agent's cooperate action
myCoop - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat.TitForTatAgentFactory: This agent's cooperate action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger.GrimTriggerAgentFactory: The agent's defect action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.GrimTrigger: This agent's defect action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat: This agent's defect action
myDefect - Variable in class burlap.behavior.stochasticgames.agents.twoplayer.repeatedsinglestage.TitForTat.TitForTatAgentFactory: This agent's defect action
myQSource - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning: This agent's Q-value source
MyTimer - Class in burlap.debugtools: A data structure for keeping track of elapsed and average time.
MyTimer() - Constructor for class burlap.debugtools.MyTimer: Creates a new timer.
MyTimer(boolean) - Constructor for class burlap.debugtools.MyTimer: Creates a new timer and starts it if start=true.

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z