burlap.behavior.singleagent.learnbydemo.mlirl.differentiableplanners

Class Summary
Class	Description
DifferentiableSparseSampling	A differentiable finite-horizon planner that can also use sparse sampling over the transition dynamics when the number of possible transitions is very large or infinite.
DifferentiableSparseSampling.QAndQGradient	A tuple for storing Q-values and their gradients.
DifferentiableSparseSampling.VAndVGradient	A tuple for storing a state value and its gradient.
DifferentiableVFPlanner	A class for performing dynamic-programming-based planning with a differentiable value backup operator.
DifferentiableVI	Performs Differentiable Value Iteration using the Boltzmann backup operator and a `DifferentiableRF`.
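
The Boltzmann backup operator mentioned for `DifferentiableVI`, and the value/gradient pairs stored by `QAndQGradient` and `VAndVGradient`, can be illustrated with a small standalone sketch. It does not call any BURLAP class: the class name `BoltzmannBackupSketch`, its methods, and the inverse-temperature parameter `beta` are hypothetical names chosen for illustration. It also assumes the operator is the softmax-weighted average of the Q-values, V = Σ_a π(a) Q(a) with π(a) ∝ exp(β Q(a)), whose gradient with respect to the reward parameters works out to Σ_a π(a) (1 + β (Q(a) − V)) ∇Q(a).

```java
/** Standalone sketch; not part of BURLAP. All names here are hypothetical. */
final class BoltzmannBackupSketch {

    /** Softmax action probabilities for Q-values q at inverse temperature beta (assumed >= 0). */
    static double[] policy(double[] q, double beta) {
        double max = Double.NEGATIVE_INFINITY;
        for (double v : q) {
            max = Math.max(max, v);
        }
        double[] p = new double[q.length];
        double sum = 0.0;
        for (int a = 0; a < q.length; a++) {
            // Subtracting the max leaves the probabilities unchanged but avoids overflow.
            p[a] = Math.exp(beta * (q[a] - max));
            sum += p[a];
        }
        for (int a = 0; a < q.length; a++) {
            p[a] /= sum;
        }
        return p;
    }

    /** Boltzmann-backed-up state value: the policy-weighted average of the Q-values. */
    static double value(double[] q, double beta) {
        double[] p = policy(q, beta);
        double v = 0.0;
        for (int a = 0; a < q.length; a++) {
            v += p[a] * q[a];
        }
        return v;
    }

    /**
     * Gradient of the backed-up value with respect to the parameters.
     * qGrad[a][i] is the partial derivative of q[a] with respect to parameter i.
     */
    static double[] gradient(double[] q, double[][] qGrad, double beta) {
        double[] p = policy(q, beta);
        double v = value(q, beta);
        double[] g = new double[qGrad[0].length];
        for (int a = 0; a < q.length; a++) {
            double weight = p[a] * (1.0 + beta * (q[a] - v));
            for (int i = 0; i < g.length; i++) {
                g[i] += weight * qGrad[a][i];
            }
        }
        return g;
    }

    public static void main(String[] args) {
        double[] q = {0.0, 1.0};
        double[][] qGrad = {{1.0, 0.0}, {0.0, 1.0}}; // each Q-value is its own parameter
        System.out.println("V = " + value(q, 1.0));
        System.out.println("dV = " + java.util.Arrays.toString(gradient(q, qGrad, 1.0)));
    }
}
```

Because the backup is smooth in the Q-values, the gradient can be propagated through each planning step, which is what lets a planner of this kind return a value together with its gradient rather than a value alone.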
