- q - Variable in class burlap.behavior.singleagent.learning.tdmethods.SarsaLam.EligibilityTrace
-
The current Q-value info for this trace (contains the action reference)
- q - Variable in class burlap.behavior.singleagent.QValue
-
The numeric Q-value
- q - Variable in class burlap.behavior.stochasticgame.mavaluefunction.JAQValue
-
- QComputablePlanner - Interface in burlap.behavior.singleagent.planning
-
An interface for planning classes that can compute Q-values.
- QComputablePlanner.QComputablePlannerHelper - Class in burlap.behavior.singleagent.planning
-
A class of helper static methods that may be commonly used code that uses a QComputable planner.
- QComputablePlanner.QComputablePlannerHelper() - Constructor for class burlap.behavior.singleagent.planning.QComputablePlanner.QComputablePlannerHelper
-
- qEntry - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearningStateNode
-
The Q-values for this object's state.
- QGradientPlanner - Interface in burlap.behavior.singleagent.learnbydemo.mlirl.support
-
An interface for a planner that can produce Q-value gradients.
- QGradientPlannerFactory - Interface in burlap.behavior.singleagent.learnbydemo.mlirl.support
-
- QGradientPlannerFactory.DifferentiableVIFactory - Class in burlap.behavior.singleagent.learnbydemo.mlirl.support
-
- QGradientPlannerFactory.DifferentiableVIFactory(StateHashFactory) - Constructor for class burlap.behavior.singleagent.learnbydemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
-
- QGradientPlannerFactory.DifferentiableVIFactory(StateHashFactory, TerminalFunction, double, int) - Constructor for class burlap.behavior.singleagent.learnbydemo.mlirl.support.QGradientPlannerFactory.DifferentiableVIFactory
-
Initializes.
- QGradientTuple - Class in burlap.behavior.singleagent.learnbydemo.mlirl.support
-
A tuple (triple) for storing the Q-gradient associated with a state and action.
- QGradientTuple(State, AbstractGroundedAction, double[]) - Constructor for class burlap.behavior.singleagent.learnbydemo.mlirl.support.QGradientTuple
-
Initializes.
- qIndex - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The tabular mapping from states to Q-values
- qInit - Variable in class burlap.behavior.stochasticgame.agents.maql.MAQLFactory
-
- qInit - Variable in class burlap.behavior.stochasticgame.agents.maql.MultiAgentQLearning
-
The Q-value initialization to use
- qInit - Variable in class burlap.behavior.stochasticgame.agents.mavf.MAVFPlannerFactory.MAVIPlannerFactory
-
The Q-value initialization function to use.
- qinit - Variable in class burlap.behavior.stochasticgame.agents.naiveq.history.SGQWActionHistoryFactory
-
A default Q-value initializer
- qInit - Variable in class burlap.behavior.stochasticgame.agents.naiveq.SGNaiveQLAgent
-
Defines how q-values are initialized
- qInit - Variable in class burlap.behavior.stochasticgame.mavaluefunction.QSourceForSingleAgent.HashBackedQSource
-
- qInitFunction - Variable in class burlap.behavior.singleagent.interfaces.rlglue.common.RLGLueQlearningFactory
-
The object that defines how Q-values are initialized.
- qInitFunction - Variable in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
The object that defines how Q-values are initialized.
- QLearning - Class in burlap.behavior.singleagent.learning.tdmethods
-
Tabular Q-learning algorithm [1].
- QLearning(Domain, RewardFunction, TerminalFunction, double, StateHashFactory, double, double) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes Q-learning with 0.1 epsilon greedy policy, the same Q-value initialization everywhere, and places no limit on the number of steps the
agent can take in an episode.
- QLearning(Domain, RewardFunction, TerminalFunction, double, StateHashFactory, double, double, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes Q-learning with 0.1 epsilon greedy policy, the same Q-value initialization everywhere.
- QLearning(Domain, RewardFunction, TerminalFunction, double, StateHashFactory, double, double, Policy, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the same Q-value initialization everywhere.
- QLearning(Domain, RewardFunction, TerminalFunction, double, StateHashFactory, ValueFunctionInitialization, double, Policy, int) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the algorithm.
- QLearningStateNode - Class in burlap.behavior.singleagent.learning.tdmethods
-
This class is used to store the associated
QValue
objects for a given hashed sated.
- QLearningStateNode(StateHashTuple) - Constructor for class burlap.behavior.singleagent.learning.tdmethods.QLearningStateNode
-
Creates a new object for the given hashed state.
- QLInit(Domain, RewardFunction, TerminalFunction, double, StateHashFactory, ValueFunctionInitialization, double, Policy, int) - Method in class burlap.behavior.singleagent.learning.tdmethods.QLearning
-
Initializes the algorithm.
- qMap - Variable in class burlap.behavior.stochasticgame.agents.naiveq.SGNaiveQLAgent
-
The tabular map from (hashed) states to the list of Q-values for each action in those states
- qplanner - Variable in class burlap.behavior.singleagent.planning.commonpolicies.BoltzmannQPolicy
-
- qplanner - Variable in class burlap.behavior.singleagent.planning.commonpolicies.EpsilonGreedy
-
- qplanner - Variable in class burlap.behavior.singleagent.planning.commonpolicies.GreedyDeterministicQPolicy
-
- qplanner - Variable in class burlap.behavior.singleagent.planning.commonpolicies.GreedyQPolicy
-
- QSourceForSingleAgent - Interface in burlap.behavior.stochasticgame.mavaluefunction
-
And interface for objects that can store and retrive Q-value objects for a specific agent and state and joint action query.
- QSourceForSingleAgent.HashBackedQSource - Class in burlap.behavior.stochasticgame.mavaluefunction
-
- QSourceForSingleAgent.HashBackedQSource(StateHashFactory, ValueFunctionInitialization) - Constructor for class burlap.behavior.stochasticgame.mavaluefunction.QSourceForSingleAgent.HashBackedQSource
-
Initializes with a given state hashing factory and value funciton initialization for Q-values.
- qSourceMap - Variable in class burlap.behavior.stochasticgame.agents.maql.MultiAgentQLearning
-
The object that maps to other agent's Q-value sources
- qSourceMapping - Variable in class burlap.behavior.stochasticgame.mavaluefunction.AgentQSourceMap.HashMapAgentQSourceMap
-
- qSourceMapping - Variable in class burlap.behavior.stochasticgame.mavaluefunction.AgentQSourceMap.MAQLControlledQSourceMap
-
- qSourceProvider - Variable in class burlap.behavior.stochasticgame.mavaluefunction.policies.ECorrelatedQJointPolicy
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgame.mavaluefunction.policies.EGreedyJointPolicy
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgame.mavaluefunction.policies.EGreedyMaxWellfare
-
The multi-agent q-source provider
- qSourceProvider - Variable in class burlap.behavior.stochasticgame.mavaluefunction.policies.EMinMaxPolicy
-
The multi-agent q-source provider
- qSources - Variable in class burlap.behavior.stochasticgame.mavaluefunction.MAValueFunctionPlanner
-
The Hash map backed multi-agent Q-source in which to store Q-values.
- qToUpdate - Variable in class burlap.behavior.stochasticgame.agents.maql.MultiAgentQLearning
-
Which Q-value object needs to be updated
- queryAction - Variable in class burlap.behavior.singleagent.vfa.ActionFeaturesQuery
-
The action with which the state features are associated
- queryOtherAgentsQSource - Variable in class burlap.behavior.stochasticgame.agents.maql.MAQLFactory
-
- queryOtherAgentsQSource - Variable in class burlap.behavior.stochasticgame.agents.maql.MultiAgentQLearning
-
Whether this agent is using the Q-values stored by other agents in the world rather than keeping a separate copy of the Q-values for each agent itself.
- querySelectedActionForState(State) - Method in class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner
-
Returns the action suggested by the planner for the given state.
- queryTreeIndex(StateHashTuple, int) - Method in class burlap.behavior.singleagent.planning.stochastic.montecarlo.uct.UCT
-
Returns the
UCTStateNode
for the given (hashed) state at the given depth.
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.learnbydemo.mlirl.differentiableplanners.diffvinit.LinearDiffRFVInit
-
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.learnbydemo.mlirl.differentiableplanners.diffvinit.LinearStateDiffVF
-
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.learnbydemo.mlirl.differentiableplanners.diffvinit.VanillaDiffVinit
-
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.planning.vfa.fittedvi.FittedVI.VFAVInit
-
- QValue - Class in burlap.behavior.singleagent
-
This class is used to store Q-values.
- QValue(State, AbstractGroundedAction, double) - Constructor for class burlap.behavior.singleagent.QValue
-
Creates a Q-value for the given state an action pair with the specified q-value
- QValue(QValue) - Constructor for class burlap.behavior.singleagent.QValue
-
Initialializes this Q-value by copying the information from another Q-value.
- qValue(State, AbstractGroundedAction) - Method in class burlap.behavior.singleagent.ValueFunctionInitialization.ConstantValueFunctionInitialization
-
- qValue(State, AbstractGroundedAction) - Method in interface burlap.behavior.singleagent.ValueFunctionInitialization
-
Returns the initialization value of the Q-value function for a given state and action pair.
- qValues - Variable in class burlap.behavior.stochasticgame.mavaluefunction.QSourceForSingleAgent.HashBackedQSource
-