public class ApprenticeshipLearning
extends java.lang.Object
| Modifier and Type | Class and Description |
|---|---|
static class |
ApprenticeshipLearning.StationaryRandomDistributionPolicy
This class extends Policy.
|
| Modifier and Type | Field and Description |
|---|---|
static int |
DEBUG_CODE_RF_WEIGHTS |
static int |
DEBUG_CODE_SCORE |
| Modifier and Type | Method and Description |
|---|---|
static double[] |
estimateFeatureExpectation(Episode episode,
DenseStateFeatures featureFunctions,
java.lang.Double gamma)
Calculates the Feature Expectations given one demonstration, a feature mapping and a discount factor gamma
|
static double[] |
estimateFeatureExpectation(java.util.List<Episode> episodes,
DenseStateFeatures featureFunctions,
java.lang.Double gamma)
Calculates the Feature Expectations given a list of demonstrations, a feature mapping and a
discount factor gamma
|
static RewardFunction |
generateRewardFunction(DenseStateFeatures featureFunctions,
burlap.behavior.singleagent.learnfromdemo.apprenticeship.ApprenticeshipLearning.FeatureWeights featureWeights)
Generates an anonymous instance of a reward function derived from a FeatureMapping
and associated feature weights
Computes (w^(i))T phi from step 4 in section 3
|
static State |
getInitialState(java.util.List<Episode> episodes)
Returns the initial state of a randomly chosen episode analysis
|
static Policy |
getLearnedPolicy(ApprenticeshipLearningRequest request)
Computes a policy that models the expert trajectories included in the request object.
|
public static final int DEBUG_CODE_SCORE
public static final int DEBUG_CODE_RF_WEIGHTS
public static double[] estimateFeatureExpectation(Episode episode, DenseStateFeatures featureFunctions, java.lang.Double gamma)
episode - An EpisodeAnalysis object that contains a sequence of state-action pairsfeatureFunctions - Feature Mapping which maps states to featuresgamma - Discount factor gammapublic static double[] estimateFeatureExpectation(java.util.List<Episode> episodes, DenseStateFeatures featureFunctions, java.lang.Double gamma)
episodes - List of expert demonstrations as EpisodeAnalysis objectsfeatureFunctions - Feature Mapping which maps states to featuresgamma - Discount factor for future expected rewardpublic static RewardFunction generateRewardFunction(DenseStateFeatures featureFunctions, burlap.behavior.singleagent.learnfromdemo.apprenticeship.ApprenticeshipLearning.FeatureWeights featureWeights)
featureFunctions - The feature mapping of states to featuresfeatureWeights - The weights given to each featurepublic static State getInitialState(java.util.List<Episode> episodes)
episodes - the expert demonstrationspublic static Policy getLearnedPolicy(ApprenticeshipLearningRequest request)
request - the IRL problem descriptionPolicy