public class SparseSampling.StateNode
extends java.lang.Object
Constructor and Description |
---|
StateNode(HashableState sh,
int height)
Creates a node for the given hashed state at the given height.
|
Modifier and Type | Method and Description |
---|---|
java.util.List<QValue> |
estimateQs()
Estimates and returns the Q-values for this node.
|
double |
estimateV()
Returns the estimated state value (V) if this node is closed; otherwise estimates it, closes the node, and returns it.
|
protected double |
exactQValue(Action ga)
Computes the exact Q-value using a full Bellman update with the actual transition dynamics.
|
protected double |
sampledQEstimate(Action ga)
Estimates the Q-value using sampling from the transition dynamics.
|
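The difference between `exactQValue` and `sampledQEstimate` can be illustrated with a minimal, self-contained sketch. It is not the library's implementation: `Outcome`, `QEstimateSketch`, `exactQ`, `sampledQ`, the discount `gamma`, and the sample count `c` are all hypothetical names introduced here, and the successor value is assumed to be known already rather than computed recursively.

```java
import java.util.List;
import java.util.Random;

/** Hypothetical stand-in for one transition outcome; not a BURLAP type. */
final class Outcome {
    final double probability; // chance of this outcome under the model
    final double reward;      // immediate reward for the transition
    final double nextValue;   // assumed value estimate of the successor state

    Outcome(double probability, double reward, double nextValue) {
        this.probability = probability;
        this.reward = reward;
        this.nextValue = nextValue;
    }
}

final class QEstimateSketch {
    /** Full Bellman backup: probability-weighted sum over every outcome. */
    static double exactQ(List<Outcome> outcomes, double gamma) {
        double q = 0.0;
        for (Outcome o : outcomes) {
            q += o.probability * (o.reward + gamma * o.nextValue);
        }
        return q;
    }

    /** Monte Carlo estimate: average the backed-up return of c sampled outcomes. */
    static double sampledQ(List<Outcome> outcomes, double gamma, int c, Random rng) {
        double sum = 0.0;
        for (int i = 0; i < c; i++) {
            Outcome o = sample(outcomes, rng);
            sum += o.reward + gamma * o.nextValue;
        }
        return sum / c;
    }

    /** Draws one outcome according to its probability. */
    private static Outcome sample(List<Outcome> outcomes, Random rng) {
        double roll = rng.nextDouble();
        double cumulative = 0.0;
        for (Outcome o : outcomes) {
            cumulative += o.probability;
            if (roll < cumulative) {
                return o;
            }
        }
        // Guard against rounding when the probabilities sum to slightly under 1.
        return outcomes.get(outcomes.size() - 1);
    }
}
```

The exact backup touches every possible outcome, so its cost grows with the number of successor states. The sampled estimate costs `c` draws regardless of how many outcomes exist, at the price of variance in the result.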
public StateNode(HashableState sh, int height)
Parameters:
sh - the hashed state
height - the height of the node
public java.util.List<QValue> estimateQs()
Returns:
a List of the estimated Q-values for each action.
protected double sampledQEstimate(Action ga)
Parameters:
ga - the action for which the Q-value estimate is to be returned
protected double exactQValue(Action ga)
Parameters:
ga - the action for which the Q-value estimate is to be returned
public double estimateV()
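The "closed" behavior described for `estimateV()` amounts to caching: the value is estimated once and reused afterwards. Here is a minimal sketch of that pattern, not the library's code. `ClosableNodeSketch` and its `qEstimator` supplier are hypothetical, and taking the maximum over the Q-values as the state value is an assumption that this page does not confirm.

```java
import java.util.function.Supplier;

/** Hypothetical node that caches its value once closed; not BURLAP's StateNode. */
final class ClosableNodeSketch {
    private final Supplier<double[]> qEstimator; // stands in for estimateQs()
    private boolean closed = false;
    private double v = 0.0;
    private int estimatorCalls = 0;

    ClosableNodeSketch(Supplier<double[]> qEstimator) {
        this.qEstimator = qEstimator;
    }

    /** Returns the cached value if closed; otherwise estimates, caches, and closes. */
    double estimateV() {
        if (closed) {
            return v;
        }
        estimatorCalls++;
        // Assumption: the state value is the maximum Q-value over actions.
        // An empty Q-value array would leave this at negative infinity.
        double best = Double.NEGATIVE_INFINITY;
        for (double q : qEstimator.get()) {
            best = Math.max(best, q);
        }
        v = best;
        closed = true;
        return v;
    }

    /** Number of times the underlying Q-value estimator was actually invoked. */
    int estimatorCalls() {
        return estimatorCalls;
    }
}
```

Closing a node means repeated calls, for example from several parents that share the same hashed state, do not repeat the estimation work.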