public static class QGradientPlannerFactory.DifferentiableVIFactory extends java.lang.Object implements QGradientPlannerFactory
DifferentiableVI factory.QGradientPlannerFactory.DifferentiableVIFactory| Modifier and Type | Field and Description |
|---|---|
protected StateHashFactory |
hashingFactory
The
StateHashFactory used by the planner. |
protected double |
maxDelta
The value function change threshold to stop VI.
|
protected int |
maxIterations
The maximum allowed number of VI iterations.
|
protected TerminalFunction |
tf
The terminal function that the planner uses.
|
| Constructor and Description |
|---|
QGradientPlannerFactory.DifferentiableVIFactory(StateHashFactory hashingFactory)
Initializes the factory with the given
StateHashFactory. |
QGradientPlannerFactory.DifferentiableVIFactory(StateHashFactory hashingFactory,
TerminalFunction tf,
double maxDelta,
int maxIterations)
Initializes.
|
| Modifier and Type | Method and Description |
|---|---|
QGradientPlanner |
generateDifferentiablePlannerForRequest(MLIRLRequest request)
Returns a
QGradientPlanner for an
MLIRLRequest object's domain,
reward function, discount factor, and Boltzmann beta parameter. |
protected StateHashFactory hashingFactory
StateHashFactory used by the planner.protected double maxDelta
protected int maxIterations
protected TerminalFunction tf
NullTermination.public QGradientPlannerFactory.DifferentiableVIFactory(StateHashFactory hashingFactory)
StateHashFactory.
The terminal function will be defaulted to a NullTermination;
value function change threshold to 0.01; and the max VI iterations to 500.hashingFactory - the StateHashFactory to use for planning.public QGradientPlannerFactory.DifferentiableVIFactory(StateHashFactory hashingFactory, TerminalFunction tf, double maxDelta, int maxIterations)
hashingFactory - the StateHashFactory to use for planning.tf - The terminal function that the generated planners use.maxDelta - The value function change threshold to stop VI.maxIterations - The maximum allowed number of VI iterationspublic QGradientPlanner generateDifferentiablePlannerForRequest(MLIRLRequest request)
QGradientPlannerFactoryQGradientPlanner for an
MLIRLRequest object's domain,
reward function, discount factor, and Boltzmann beta parameter.generateDifferentiablePlannerForRequest in interface QGradientPlannerFactoryrequest - the request defining the problem the planner should solve.QGradientPlanner instance.