public interface LearningAgent
runLearningEpisode(burlap.mdp.singleagent.environment.Environment)
and runLearningEpisode(burlap.mdp.singleagent.environment.Environment, int). Implementing the former method
should have the agent interact with the provided Environment
until the Environment transitions to a terminal state. The
runLearningEpisode(burlap.mdp.singleagent.environment.Environment, int) should have the agent interact
with the Environment until either a terminal state is reached or
the agent has taken maxSteps in the environment. Both methods should return an Episode
object that records the interactions.| Modifier and Type | Method and Description |
|---|---|
Episode |
runLearningEpisode(Environment env) |
Episode |
runLearningEpisode(Environment env,
int maxSteps) |
Episode runLearningEpisode(Environment env)
Episode runLearningEpisode(Environment env, int maxSteps)