public class GreedyQPolicy extends java.lang.Object implements SolverDerivedPolicy, EnumerablePolicy
Modifier and Type | Field and Description |
---|---|
protected QProvider |
qplanner |
protected java.util.Random |
rand |
Constructor and Description |
---|
GreedyQPolicy() |
GreedyQPolicy(QProvider planner)
Initializes with a QComputablePlanner
|
Modifier and Type | Method and Description |
---|---|
Action |
action(State s)
This method will return an action sampled by the policy for the given state.
|
double |
actionProb(State s,
Action a)
Returns the probability/probability density that the given action will be taken in the given state.
|
boolean |
definedFor(State s)
Specifies whether this policy is defined for the input state.
|
java.util.List<ActionProb> |
policyDistribution(State s)
This method will return action probability distribution defined by the policy.
|
void |
setSolver(MDPSolverInterface solver)
Sets the valueFunction whose results affect this policy.
|
protected QProvider qplanner
protected java.util.Random rand
public GreedyQPolicy()
public GreedyQPolicy(QProvider planner)
planner
- the QComputablePlanner to usepublic void setSolver(MDPSolverInterface solver)
SolverDerivedPolicy
setSolver
in interface SolverDerivedPolicy
solver
- the solver from which this policy is derivedpublic Action action(State s)
Policy
public double actionProb(State s, Action a)
Policy
actionProb
in interface Policy
s
- the state of interesta
- the action that may be taken in the statepublic java.util.List<ActionProb> policyDistribution(State s)
EnumerablePolicy
policyDistribution
in interface EnumerablePolicy
s
- the state for which an action distribution should be returnedpublic boolean definedFor(State s)
Policy
definedFor
in interface Policy
s
- the input state to test for whether this policy is definedState
s, false otherwise.