- q - Variable in class burlap.behavior.singleagent.learning.tdmethods.SarsaLam.EligibilityTrace
-
The current Q-value info for this trace (contains the action reference)
- q - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.JAQValue
-
- q - Variable in class burlap.behavior.valuefunction.QValue
-
The numeric Q-value
- QAndQGradient(List<QValue>, List<QGradientTuple>) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableSparseSampling.QAndQGradient
-
- QAndQGradient(int) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableSparseSampling.QAndQGradient
-
- qEntry - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearningStateNode
-
The Q-values for this object's state.
- qForBelief(EnumerableBeliefState, Action) - Method in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Computes the expected Q-value of the underlying hidden MDP by marginalizing over of the states in the belief state.
- qForBeliefList(List<EnumerableBeliefState.StateBelief>, Action) - Method in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Computes the expected Q-value of the underlying hidden MDP by marginalizing over of the states in the belief state.
- qFunction - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The tabular mapping from states to Q-values
- QFunction - Interface in burlap.behavior.valuefunction
-
Interface for objects that define a Q-function
- qGradient(State, Action) - Method in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableDP
-
- qGradient(State, Action) - Method in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableSparseSampling
-
- qGradient(State, Action) - Method in interface burlap.behavior.singleagent.learnfromdemo.mlirl.support.DifferentiableQFunction
-
Returns the Q-value gradient (
QGradientTuple
) for the given state and action.
- QGradientPlannerFactory - Interface in burlap.behavior.singleagent.learnfromdemo.mlirl.support
-
- QGradientPlannerFactory.DifferentiableVIFactory - Class in burlap.behavior.singleagent.learnfromdemo.mlirl.support
-
- QGradientTuple - Class in burlap.behavior.singleagent.learnfromdemo.mlirl.support
-
A tuple (triple) for storing the Q-gradient associated with a state and action.
- QGradientTuple(State, Action, FunctionGradient) - Constructor for class burlap.behavior.singleagent.learnfromdemo.mlirl.support.QGradientTuple
-
Initializes.
- qInit - Variable in class burlap.behavior.stochasticgames.agents.madp.MADPPlannerFactory.MAVIPlannerFactory
-
The Q-value initialization function to use.
- qInit - Variable in class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
-
- qInit - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
The Q-value initialization to use
- qinit - Variable in class burlap.behavior.stochasticgames.agents.naiveq.history.SGQWActionHistoryFactory
-
A default Q-value initializer
- qInit - Variable in class burlap.behavior.stochasticgames.agents.naiveq.SGNaiveQLAgent
-
Defines how q-values are initialized
- qInit - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.QSourceForSingleAgent.HashBackedQSource
-
- qInitFunction - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The object that defines how Q-values are initialized.
- QLearning - Class in burlap.behavior.singleagent.learning.tdmethods
-
Tabular Q-learning algorithm [1].
- QLearning(SADomain, double, HashableStateFactory, double, double) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes Q-learning with 0.1 epsilon greedy policy, the same Q-value initialization everywhere, and places no limit on the number of steps the
agent can take in an episode.
- QLearning(SADomain, double, HashableStateFactory, double, double, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes Q-learning with 0.1 epsilon greedy policy, the same Q-value initialization everywhere.
- QLearning(SADomain, double, HashableStateFactory, double, double, Policy, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the same Q-value initialization everywhere.
- QLearning(SADomain, double, HashableStateFactory, QFunction, double, Policy, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the algorithm.
- QLearningStateNode - Class in burlap.behavior.singleagent.learning.tdmethods
-
This class is used to store the associated
QValue
objects for a given hashed sated.
- QLearningStateNode() - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearningStateNode
-
Default constructor for serialization purposes
- QLearningStateNode(HashableState) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearningStateNode
-
Creates a new object for the given hashed state.
- QLInit(SADomain, double, HashableStateFactory, QFunction, double, Policy, int) - Method in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the algorithm.
- qMap - Variable in class burlap.behavior.stochasticgames.agents.naiveq.SGNaiveQLAgent
-
The tabular map from (hashed) states to the list of Q-values for each action in those states
- QMDP - Class in burlap.behavior.singleagent.pomdp.qmdp
-
An implementation of QMDP for POMDP domains.
- QMDP(PODomain, QProvider) - Constructor for class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Initializes.
- QMDP(PODomain, RewardFunction, TerminalFunction, double, HashableStateFactory, double, int) - Constructor for class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
Initializes and creates a
ValueIteration
planner
to solve the underling MDP.
- qplanner - Variable in class burlap.behavior.policy.BoltzmannQPolicy
-
- qplanner - Variable in class burlap.behavior.policy.EpsilonGreedy
-
- qplanner - Variable in class burlap.behavior.policy.GreedyDeterministicQPolicy
-
- qplanner - Variable in class burlap.behavior.policy.GreedyQPolicy
-
- QProvider - Interface in burlap.behavior.valuefunction
-
An interface for MDP solvers that can return/compute Q-values.
- QProvider.Helper - Class in burlap.behavior.valuefunction
-
A class of helper static methods that may be commonly used by code that uses a QFunction instance.
- QSourceForSingleAgent - Interface in burlap.behavior.stochasticgames.madynamicprogramming
-
And interface for objects that can store and retrive Q-value objects for a specific agent and state and joint action query.
- QSourceForSingleAgent.HashBackedQSource - Class in burlap.behavior.stochasticgames.madynamicprogramming
-
- qSourceMap - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
The object that maps to other agent's Q-value sources
- qSourceMapping - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.AgentQSourceMap.HashMapAgentQSourceMap
-
- qSourceMapping - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.AgentQSourceMap.MAQLControlledQSourceMap
-
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.ECorrelatedQJointPolicy
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.EGreedyJointPolicy
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.EGreedyMaxWellfare
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.policies.EMinMaxPolicy
-
The multi-agent q-source provider
- qSources - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.MADynamicProgramming
-
The Hash map backed multi-agent Q-source in which to store Q-values.
- qToUpdate - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
Which Q-value object needs to be updated
- queryOtherAgentsQSource - Variable in class burlap.behavior.stochasticgames.agents.maql.MAQLFactory
-
- queryOtherAgentsQSource - Variable in class burlap.behavior.stochasticgames.agents.maql.MultiAgentQLearning
-
Whether this agent is using the Q-values stored by other agents in the world rather than keeping a separate copy of the Q-values for each agent itself.
- querySelectedActionForState(State) - Method in class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner
-
Returns the action suggested by the internal plan for the given state.
- queryTreeIndex(HashableState, int) - Method in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
-
Returns the
UCTStateNode
for the given (hashed) state at the given depth.
- QuitCommand - Class in burlap.shell.command.reserved
-
- QuitCommand() - Constructor for class burlap.shell.command.reserved.QuitCommand
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableSparseSampling
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.learnfromdemo.RewardValueProjection
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.learning.lspi.LSPI
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.planning.stochastic.DynamicProgramming
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.planning.stochastic.sparsesampling.SparseSampling
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI.VFAVInit
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
- qValue(State, Action) - Method in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling
-
- qValue(State, Action) - Method in class burlap.behavior.stochasticgames.agents.naiveq.SGNaiveQLAgent
-
- qValue(State, Action) - Method in class burlap.behavior.valuefunction.ConstantValueFunction
-
- qValue(State, Action) - Method in interface burlap.behavior.valuefunction.QFunction
-
Returns the
QValue
for the given state-action pair.
- QValue - Class in burlap.behavior.valuefunction
-
This class is used to store Q-values.
- QValue() - Constructor for class burlap.behavior.valuefunction.QValue
-
Default constructor for serialization purposes.
- QValue(State, Action, double) - Constructor for class burlap.behavior.valuefunction.QValue
-
Creates a Q-value for the given state an action pair with the specified q-value
- QValue(QValue) - Constructor for class burlap.behavior.valuefunction.QValue
-
Initialializes this Q-value by copying the information from another Q-value.
- qValues(State) - Method in class burlap.behavior.singleagent.learnfromdemo.mlirl.differentiableplanners.DifferentiableSparseSampling
-
- qValues(State) - Method in class burlap.behavior.singleagent.learnfromdemo.RewardValueProjection
-
- qValues(State) - Method in class burlap.behavior.singleagent.learning.lspi.LSPI
-
- qValues(State) - Method in class burlap.behavior.singleagent.learning.modellearning.artdp.ARTDP
-
- qValues(State) - Method in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
- qValues(State) - Method in class burlap.behavior.singleagent.learning.tdmethods.vfa.GradientDescentSarsaLam
-
- qValues(State) - Method in class burlap.behavior.singleagent.planning.stochastic.DynamicProgramming
-
- qValues(State) - Method in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
-
- qValues(State) - Method in class burlap.behavior.singleagent.planning.stochastic.sparsesampling.SparseSampling
-
- qValues(State) - Method in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI
-
- qValues(State) - Method in class burlap.behavior.singleagent.pomdp.qmdp.QMDP
-
- qValues(State) - Method in class burlap.behavior.singleagent.pomdp.wrappedmdpalgs.BeliefSparseSampling
-
- qValues(State) - Method in class burlap.behavior.stochasticgames.agents.naiveq.SGNaiveQLAgent
-
- qValues - Variable in class burlap.behavior.stochasticgames.madynamicprogramming.QSourceForSingleAgent.HashBackedQSource
-
- qValues(State) - Method in interface burlap.behavior.valuefunction.QProvider
-
Returns a
List
of
QValue
objects for ever permissible action for the given input state.