DynamicWeightedAStar

java.lang.Object
- burlap.behavior.singleagent.MDPSolver
- - burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner
  - - burlap.behavior.singleagent.planning.deterministic.informed.BestFirst
    - - burlap.behavior.singleagent.planning.deterministic.informed.astar.AStar
      - burlap.behavior.singleagent.planning.deterministic.informed.astar.DynamicWeightedAStar

All Implemented Interfaces:

MDPSolverInterface, Planner
```
public class DynamicWeightedAStar
extends AStar
```
Dynamic Weighted A* [1] uses a dynamic heuristic weight that is based on depth of the current search tree and based on an expected depth of the search. Specifically, f(n) = g(n) + (1 + \epsilon * w(n))*h(n), where epsilon is a parameter > 1 indicating greediness (the larger the more greedy) and w(n) = { 1 - d(n)/N if d(n) <= N { 0 if d(n) > N, where d(n) is the depth of the search and N is the expected depth of the search. This algorithm has the effect of becoming less greedy as the search continues, which allows it to find a decent solution quickly but avoid returning extremely sub-optimal solutions.
If a terminal function is provided via the setter method defined for OO-MDPs, then the BestFirst search algorithm will not expand any nodes that are terminal states, as if there were no actions that could be executed from that state. Note that terminal states are not necessarily the same as goal states, since there could be a fail condition from which the agent cannot act, but that is not explicitly represented in the transition dynamics. 1. Pohl, Ira (August, 1973). "The avoidance of (relative) catastrophe, heuristic competence, genuine dynamic weighting and computational issues in heuristic problem solving". Proceedings of the Third International Joint Conference on Artificial Intelligence (IJCAI-73) 3. California, USA. pp. 11-17.

Author:

James MacGlashan

Nested Class Summary
- Nested classes/interfaces inherited from class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner
  DeterministicPlanner.PlanningFailedException

Field Summary

Fields
Modifier and Type	Field and Description
`protected java.util.Map<HashableState,java.lang.Integer>`	`depthMap` Data structure for storing the depth of explored states
`protected double`	`epsilon` parameter > 1 indicating the maximum amount of greediness; the larger the more greedy.
`protected int`	`expectedDepth` The expected depth required for a plan
`protected int`	`lastComputedDepth` maintains the depth of the last explored node

Fields inherited from class burlap.behavior.singleagent.planning.deterministic.informed.astar.AStar
cumulatedRewardMap, heuristic, lastComputedCumR

Fields inherited from class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner
gc, internalPolicy

Fields inherited from class burlap.behavior.singleagent.MDPSolver
actions, debugCode, domain, gamma, hashingFactory, mapToStateIndex, rf, tf

Constructor Summary

Constructors
Constructor and Description
`DynamicWeightedAStar(Domain domain, RewardFunction rf, StateConditionTest gc, HashableStateFactory hashingFactory, Heuristic heuristic, double epsilon, int expectedDepth)` Initializes the valueFunction.

Method Summary

Methods
Modifier and Type	Method and Description
`double`	`computeF(PrioritizedSearchNode parentNode, GroundedAction generatingAction, HashableState successorState)` This method returns the f-score for a state given the parent search node, the generating action, the state that was produced.
`protected double`	`epsilonWeight(int depth)` Returns the weighted epsilon value at the given search depth
`void`	`insertIntoOpen(HashIndexedHeap<PrioritizedSearchNode> openQueue, PrioritizedSearchNode psn)` This method is used to insert a prioritized search node into the openQueue.
`SDPlannerPolicy`	`planFromState(State initialState)` Plans and returns a `SDPlannerPolicy`.
`void`	`postPlanPrep()` This method is called at the end of the `BestFirst.planFromState(State)` method and can be used clean up any special data structures needed by the subclass.
`void`	`prePlanPrep()` This method is called at the start of the `BestFirst.planFromState(State)` method and can be used initialize any special data structures needed by the subclass.
`void`	`updateOpen(HashIndexedHeap<PrioritizedSearchNode> openQueue, PrioritizedSearchNode openPSN, PrioritizedSearchNode npsn)` This method is called whenever a search node already in the openQueue needs to have its information or priority updated to reflect a new search node.

Methods inherited from class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner
deterministicPlannerInit, encodePlanIntoPolicy, hasCachedPlanForState, planContainsOption, planHasDupilicateStates, querySelectedActionForState, resetSolver

Methods inherited from class burlap.behavior.singleagent.MDPSolver
addNonDomainReferencedAction, getActions, getAllGroundedActions, getDebugCode, getDomain, getGamma, getHashingFactory, getRf, getRF, getTf, getTF, setActions, setDebugCode, setDomain, setGamma, setHashingFactory, setRf, setTf, solverInit, stateHash, toggleDebugPrinting, translateAction

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface burlap.behavior.singleagent.MDPSolverInterface
addNonDomainReferencedAction, getActions, getDebugCode, getDomain, getGamma, getHashingFactory, getRf, getRF, getTf, getTF, setActions, setDebugCode, setDomain, setGamma, setHashingFactory, setRf, setTf, solverInit, toggleDebugPrinting

- Field Detail
  - epsilon
```
protected double epsilon
```
    parameter > 1 indicating the maximum amount of greediness; the larger the more greedy.
  - expectedDepth
```
protected int expectedDepth
```
    The expected depth required for a plan
  - depthMap
```
protected java.util.Map<HashableState,java.lang.Integer> depthMap
```
    Data structure for storing the depth of explored states
  - lastComputedDepth
```
protected int lastComputedDepth
```
    maintains the depth of the last explored node
- Constructor Detail
  - DynamicWeightedAStar
```
public DynamicWeightedAStar(Domain domain,
                    RewardFunction rf,
                    StateConditionTest gc,
                    HashableStateFactory hashingFactory,
                    Heuristic heuristic,
                    double epsilon,
                    int expectedDepth)
```
    Initializes the valueFunction.
    
    Parameters:
    domain - the domain in which to plan
    rf - the reward function that represents costs as negative reward
    gc - should evaluate to true for goal states; false otherwise
    hashingFactory - the state hashing factory to use
    heuristic - the planning heuristic. Should return non-positive values.
    epsilon - parameter > 1 indicating greediness; the larger the value the more greedy.
    expectedDepth - the expected depth of the plan
- Method Detail
  - prePlanPrep
```
public void prePlanPrep()
```
    Description copied from class: BestFirst
    
    This method is called at the start of the BestFirst.planFromState(State) method and can be used initialize any special data structures needed by the subclass. By default it does nothing.
    
    Overrides:
    
    prePlanPrep in class AStar
  - postPlanPrep
```
public void postPlanPrep()
```
    Description copied from class: BestFirst
    
    This method is called at the end of the BestFirst.planFromState(State) method and can be used clean up any special data structures needed by the subclass. By default it does nothing.
    
    Overrides:
    
    postPlanPrep in class AStar
  - insertIntoOpen
```
public void insertIntoOpen(HashIndexedHeap<PrioritizedSearchNode> openQueue,
                  PrioritizedSearchNode psn)
```
    Description copied from class: BestFirst
    
    This method is used to insert a prioritized search node into the openQueue. If the subclass needs to do special procedures on his insert (such as using a subclass of PrioritizedSearchNode with more information), it can override it.
    
    Overrides:
    
    insertIntoOpen in class AStar
    
    Parameters:
    openQueue - the open queue in which the search node will be inserted.
    psn - the search node to insert.
  - updateOpen
```
public void updateOpen(HashIndexedHeap<PrioritizedSearchNode> openQueue,
              PrioritizedSearchNode openPSN,
              PrioritizedSearchNode npsn)
```
    Description copied from class: BestFirst
    
    This method is called whenever a search node already in the openQueue needs to have its information or priority updated to reflect a new search node. If a subclass needs to handle special procedures (such as using a subclass of PrioritizedSearchNode with more information), it can override this method.
    
    Overrides:
    
    updateOpen in class AStar
    
    Parameters:
    openQueue - the open queue in which the search node exists.
    openPSN - the search node indexed in the open queue that will be updated.
    npsn - the new search node that contains the updated information.
  - planFromState
```
public SDPlannerPolicy planFromState(State initialState)
```
    Plans and returns a SDPlannerPolicy. If a State is not in the solution path of this planner, then the SDPlannerPolicy will throw a runtime exception. If you want a policy that will dynamically replan for unknown states, you should create your own DDPlannerPolicy.
    This method overrides AStar's implementation so that it avoids reopening closed states that are not actually better due to the dynamic h weight, the reopen check needs to be based on the g score, note the f score
    
    Specified by:
    
    planFromState in interface Planner
    
    Overrides:
    
    planFromState in class BestFirst
    
    Parameters:
    initialState - the initial state of the planning problem
    
    Returns:
    a SDPlannerPolicy.
  - computeF
```
public double computeF(PrioritizedSearchNode parentNode,
              GroundedAction generatingAction,
              HashableState successorState)
```
    Description copied from class: BestFirst
    
    This method returns the f-score for a state given the parent search node, the generating action, the state that was produced.
    
    Overrides:
    
    computeF in class AStar
    
    Parameters:
    parentNode - the parent search node (and its priority) that from which the next state was generated.
    generatingAction - the action that was used to generate the next state.
    successorState - the next state that was generated
    
    Returns:
    the f-score for the next state.
  - epsilonWeight
```
protected double epsilonWeight(int depth)
```
    Returns the weighted epsilon value at the given search depth
    
    Parameters:
    depth - the search depth
    
    Returns:
    the weighted epsilon value at the given search depth

Class DynamicWeightedAStar

Nested Class Summary

Nested classes/interfaces inherited from class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner

Field Summary

Fields inherited from class burlap.behavior.singleagent.planning.deterministic.informed.astar.AStar

Fields inherited from class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner

Fields inherited from class burlap.behavior.singleagent.MDPSolver

Constructor Summary

Method Summary

Methods inherited from class burlap.behavior.singleagent.planning.deterministic.DeterministicPlanner

Methods inherited from class burlap.behavior.singleagent.MDPSolver

Methods inherited from class java.lang.Object

Methods inherited from interface burlap.behavior.singleagent.MDPSolverInterface

Field Detail

epsilon

expectedDepth

depthMap

lastComputedDepth

Constructor Detail

DynamicWeightedAStar

Method Detail

prePlanPrep

postPlanPrep

insertIntoOpen

updateOpen

planFromState

computeF

epsilonWeight