protected class PerformancePlotter.Trial
extends java.lang.Object
Modifier and Type | Field and Description |
---|---|
java.util.List<java.lang.Double> |
averageEpisodeReward
Stores the average reward by episode
|
java.util.List<java.lang.Double> |
cumulativeEpisodeReward
Stores the cumulative reward by episode
|
java.util.List<java.lang.Double> |
cumulativeStepEpisode
Stores the cumulative steps by episode
|
java.util.List<java.lang.Double> |
cumulativeStepReward
Stores the cumulative reward by step
|
double |
curEpisodeReward
The cumulative reward of the episode so far
|
protected java.util.List<java.lang.Double> |
curEpisodeRewards
A list of the reward sequence in the current episode
|
int |
curEpisodeSteps
The number of steps in the episode so far
|
java.util.List<java.lang.Double> |
medianEpisodeReward
Stores the median reward by episode
|
java.util.List<java.lang.Double> |
stepEpisode
Stores the steps by episode
|
int |
totalEpisodes
The total number of episodes in the trial
|
int |
totalSteps
the total number of steps in the trial
|
Modifier | Constructor and Description |
---|---|
protected |
Trial() |
Modifier and Type | Method and Description |
---|---|
void |
setupForNewEpisode()
Completes the last episode and sets up the datastructures for the next episode
|
void |
stepIncrement(double r)
Updates all datastructures with the reward received from the last step
|
public java.util.List<java.lang.Double> cumulativeStepReward
public java.util.List<java.lang.Double> cumulativeEpisodeReward
public java.util.List<java.lang.Double> averageEpisodeReward
public java.util.List<java.lang.Double> medianEpisodeReward
public java.util.List<java.lang.Double> cumulativeStepEpisode
public java.util.List<java.lang.Double> stepEpisode
public double curEpisodeReward
public int curEpisodeSteps
public int totalSteps
public int totalEpisodes
protected java.util.List<java.lang.Double> curEpisodeRewards
public void stepIncrement(double r)
r
- the last reward receivedpublic void setupForNewEpisode()