Environments¶
A reinforcement learning environment provides the API to a simulated or real environment as the subject for optimization. It could be anything from video games (e.g. Atari) to robots or trading systems. The agent interacts with this environment and learns to act optimally in its dynamics.
Environment <-> Runner <-> Agent <-> Model
- class
tensorforce.environments.
Environment
¶Base environment class.
actions
¶Return the action space. Might include subdicts if multiple actions are available simultaneously.
Returns: dict of action properties (continuous, number of actions)
close
()¶Close environment. No other method calls possible afterwards.
execute
(action)¶Executes action, observes next state and reward.
Parameters: action – Action to execute. Returns: tuple of state (tuple), reward (float), and terminal_state (bool).
reset
()¶Reset environment and setup for new episode.
Returns: initial state of resetted environment.
states
¶Return the state space. Might include subdicts if multiple states are available simultaneously.
Returns: dict of state properties (shape and type).