- q - Variable in class burlap.behavior.singleagent.learning.tdmethods.SarsaLam.EligibilityTrace
-
The current Q-value info for this trace (contains the action reference)
- q - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.JAQValue
-
- q - Variable in class burlap.behavior.valuefunction.QValue
-
The numeric Q-value
- qEntry - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearningStateNode
-
The Q-values for this object's state.
- qForBelief(EnumerableBeliefState, GroundedAction) - Method in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Computes the expected Q-value of the underlying hidden MDP by marginalizing over of the states in the belief state.
- qForBeliefList(List<EnumerableBeliefState.StateBelief>, GroundedAction) - Method in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Computes the expected Q-value of the underlying hidden MDP by marginalizing over of the states in the belief state.
- QFunction - Interface in burlap.behavior.valuefunction
-
An interface for MDP solvers that can return/compute Q-values.
- QFunction.QFunctionHelper - Class in burlap.behavior.valuefunction
-
A class of helper static methods that may be commonly used by code that uses a QFunction instance.
- QFunction.QFunctionHelper() - Constructor for class burlap.behavior.valuefunction.QFunction.QFunctionHelper
-
- QGradientPlanner - Interface in burlap.behavior.singleagent.learnfromdemo.mlirl.support
-
An interface for a valueFunction that can produce Q-value gradients.
- QGradientPlannerFactory - Interface in burlap.behavior.singleagent.learnfromdemo.mlirl.support
-
- QGradientPlannerFactory.DifferentiableVIFactory - Class in burlap.behavior.singleagent.learnfromdemo.mlirl.support
-
- QGradientPlannerFactory.DifferentiableVIFactory(HashableStateFactory) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
-
- QGradientPlannerFactory.DifferentiableVIFactory(HashableStateFactory, TerminalFunction, double, int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
-
Initializes.
- QGradientTuple - Class in burlap.behavior.singleagent.learnfromdemo.mlirl.support
-
A tuple (triple) for storing the Q-gradient associated with a state and action.
- QGradientTuple(State, AbstractGroundedAction, FunctionGradient) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientTuple
-
Initializes.
- qIndex - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The tabular mapping from states to Q-values
- qInit - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
-
The Q-value initialization function to use.
- qInit - Variable in class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
-
- qInit - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
The Q-value initialization to use
- qinit - Variable in class burlap.behavior.stochasticgames.agents.naiveq.history.SGQWActionHistoryFactory
-
A default Q-value initializer
- qInit - Variable in class burlap.behavior.stochasticgames.agents.naiveq.SGNaiveQLAgent
-
Defines how q-values are initialized
- qInit - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.QSourceForSingleAgent.HashBackedQSource
-
- qInitFunction - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The object that defines how Q-values are initialized.
- QLearning - Class in burlap.behavior.singleagent.learning.tdmethods
-
Tabular Q-learning algorithm [1].
- QLearning(Domain, double, HashableStateFactory, double, double) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes Q-learning with 0.1 epsilon greedy policy, the same Q-value initialization everywhere, and places no limit on the number of steps the
agent can take in an episode.
- QLearning(Domain, double, HashableStateFactory, double, double, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes Q-learning with 0.1 epsilon greedy policy, the same Q-value initialization everywhere.
- QLearning(Domain, double, HashableStateFactory, double, double, Policy, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the same Q-value initialization everywhere.
- QLearning(Domain, double, HashableStateFactory, ValueFunctionInitialization, double, Policy, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the algorithm.
- qLearningExample(String) - Method in class burlap.tutorials.bpl.BasicBehavior
-
- QLearningStateNode - Class in burlap.behavior.singleagent.learning.tdmethods
-
This class is used to store the associated
QValue
objects for a given hashed sated.
- QLearningStateNode(HashableState) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearningStateNode
-
Creates a new object for the given hashed state.
- QLInit(Domain, double, HashableStateFactory, ValueFunctionInitialization, double, Policy, int) - Method in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the algorithm.
- QLTutorial - Class in burlap.tutorials.cpl
-
- QLTutorial(Domain, double, HashableStateFactory, ValueFunctionInitialization, double, double) - Constructor for class burlap.tutorials.cpl.QLTutorial
-
- qMap - Variable in class burlap.behavior.stochasticgames.agents.naiveq.SGNaiveQLAgent
-
The tabular map from (hashed) states to the list of Q-values for each action in those states
- QMDP - Class in burlap.behavior.singleagent.pomdp.qmdp
-
An implementation of QMDP for POMDP domains.
- QMDP(PODomain, QFunction) - Constructor for class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Initializes.
- QMDP(PODomain, RewardFunction, TerminalFunction, double, HashableStateFactory, double, int) - Constructor for class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Initializes and creates a
ValueIteration
planner
to solve the underling MDP.
- qplanner - Variable in class burlap.behavior.policy.BoltzmannQPolicy
-
- qplanner - Variable in class burlap.behavior.policy.EpsilonGreedy
-
- qplanner - Variable in class burlap.behavior.policy.GreedyDeterministicQPolicy
-
- qplanner - Variable in class burlap.behavior.policy.GreedyQPolicy
-
- QSourceForSingleAgent - Interface in burlap.behavior.stochasticgames.madynamicprogramming
-
And interface for objects that can store and retrive Q-value objects for a specific agent and state and joint action query.
- QSourceForSingleAgent.HashBackedQSource - Class in burlap.behavior.stochasticgames.madynamicprogramming
-
- QSourceForSingleAgent.HashBackedQSource(HashableStateFactory, ValueFunctionInitialization) - Constructor for class burlap.behavior.stochasticgames.madynamicprogramming.QSourceForSingleAgent.HashBackedQSource
-
Initializes with a given state hashing factory and value funciton initialization for Q-values.
- qSourceMap - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
The object that maps to other agent's Q-value sources
- qSourceMapping - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.AgentQSourceMap.HashMapAgentQSourceMap
-
- qSourceMapping - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.AgentQSourceMap.MAQLControlledQSourceMap
-
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.ECorrelatedQJointPolicy
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.EGreedyJointPolicy
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.EGreedyMaxWellfare
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.EMinMaxPolicy
-
The multi-agent q-source provider
- qSources - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.MADynamicProgramming
-
The Hash map backed multi-agent Q-source in which to store Q-values.
- qToUpdate - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
Which Q-value object needs to be updated
- queryAction - Variable in class burlap.behavior.singleagent.vfa.ActionFeaturesQuery
-
The action with which the state features are associated
- queryOtherAgentsQSource - Variable in class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
-
- queryOtherAgentsQSource - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
Whether this agent is using the Q-values stored by other agents in the world rather than keeping a separate copy of the Q-values for each agent itself.
- querySelectedActionForState(State) - Method in class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner
-
Returns the action suggested by the valueFunction for the given state.
- queryTreeIndex(HashableState, int) - Method in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
-
Returns the
UCTStateNode
for the given (hashed) state at the given depth.
- QuitCommand - Class in burlap.shell.command.reserved
-
- QuitCommand() - Constructor for class burlap.shell.command.reserved.QuitCommand
-
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.diffvinit.LinearDiffRFVInit
-
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.diffvinit.LinearStateDiffVF
-
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.diffvinit.VanillaDiffVinit
-
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI.VFAVInit
-
- QValue - Class in burlap.behavior.valuefunction
-
This class is used to store Q-values.
- QValue(State, AbstractGroundedAction, double) - Constructor for class burlap.behavior.valuefunction.QValue
-
Creates a Q-value for the given state an action pair with the specified q-value
- QValue(QValue) - Constructor for class burlap.behavior.valuefunction.QValue
-
Initialializes this Q-value by copying the information from another Q-value.
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.valuefunction.ValueFunctionInitialization.ConstantValueFunctionInitialization
-
- qValue(State, AbstractGroundedAction) - Method in interface burlap.behavior.valuefunction.ValueFunctionInitialization
-
Returns the initialization value of the Q-value function for a given state and action pair.
- qValues - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.QSourceForSingleAgent.HashBackedQSource
-