public interface LearningAgent

The interface declares two methods: runLearningEpisode(burlap.mdp.singleagent.environment.Environment) and runLearningEpisode(burlap.mdp.singleagent.environment.Environment, int). An implementation of the former should have the agent interact with the provided Environment until the Environment transitions to a terminal state. An implementation of the latter should have the agent interact with the Environment until either a terminal state is reached or the agent has taken maxSteps steps in the environment. Both methods should return an Episode object that records the interactions.

Modifier and Type | Method and Description |
---|---|
Episode | runLearningEpisode(Environment env) |
Episode | runLearningEpisode(Environment env, int maxSteps) |
Episode runLearningEpisode(Environment env)
Episode runLearningEpisode(Environment env, int maxSteps)
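
The two-method contract described above can be illustrated with a small self-contained sketch. It does not use BURLAP's own classes: ToyEnvironment, ToyEpisode, ToyLearningAgent, Corridor, and AlwaysRightAgent are hypothetical stand-ins invented for this example, and treating a negative maxSteps as "no step limit" is a convention of the sketch, not something this page specifies.

```java
import java.util.ArrayList;
import java.util.List;

public class LearningAgentSketch {

    /** Simplified stand-in for an environment; NOT BURLAP's Environment interface. */
    interface ToyEnvironment {
        int currentState();
        double step(int action);   // applies the action and returns the reward
        boolean isTerminal();
    }

    /** Simplified stand-in for an episode record; NOT BURLAP's Episode class. */
    static class ToyEpisode {
        final List<Integer> states = new ArrayList<>();
        final List<Integer> actions = new ArrayList<>();
        final List<Double> rewards = new ArrayList<>();
        int numSteps() { return actions.size(); }
    }

    /** Mirrors the two-method contract described in the text. */
    interface ToyLearningAgent {
        ToyEpisode runLearningEpisode(ToyEnvironment env);
        ToyEpisode runLearningEpisode(ToyEnvironment env, int maxSteps);
    }

    /** Hypothetical corridor with states 0..goal; reaching goal is terminal. */
    static class Corridor implements ToyEnvironment {
        private final int goal;
        private int state = 0;
        Corridor(int goal) { this.goal = goal; }
        public int currentState() { return state; }
        public double step(int action) {
            state = Math.min(goal, Math.max(0, state + action));
            return state == goal ? 1.0 : 0.0;
        }
        public boolean isTerminal() { return state == goal; }
    }

    /** Agent that always moves right; a real agent would learn where noted. */
    static class AlwaysRightAgent implements ToyLearningAgent {
        public ToyEpisode runLearningEpisode(ToyEnvironment env) {
            // Sketch convention (an assumption): negative maxSteps means no limit.
            return runLearningEpisode(env, -1);
        }
        public ToyEpisode runLearningEpisode(ToyEnvironment env, int maxSteps) {
            ToyEpisode ep = new ToyEpisode();
            ep.states.add(env.currentState());
            // Stop at a terminal state or once maxSteps actions have been taken.
            while (!env.isTerminal() && (maxSteps < 0 || ep.numSteps() < maxSteps)) {
                int action = 1;
                double reward = env.step(action);
                // A learning update (for example, a value estimate) would go here.
                ep.actions.add(action);
                ep.rewards.add(reward);
                ep.states.add(env.currentState());
            }
            return ep;
        }
    }

    public static void main(String[] args) {
        ToyLearningAgent agent = new AlwaysRightAgent();
        ToyEpisode full = agent.runLearningEpisode(new Corridor(5));
        ToyEpisode capped = agent.runLearningEpisode(new Corridor(5), 3);
        System.out.println(full.numSteps() + " " + capped.numSteps()); // prints "5 3"
    }
}
```

Having the single-argument method delegate to the step-limited one keeps the interaction loop in one place, so both methods record the episode in the same way.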