tensorforce.execution package¶

Submodules¶

tensorforce.execution.runner module¶

class tensorforce.execution.runner.Runner(agent, environment, repeat_actions=1, history=None)¶

Bases: object

Simple runner for non-realtime single-process execution.

reset(history=None)¶

run(timesteps=None, episodes=None, max_episode_timesteps=None, deterministic=False, episode_finished=None)¶

Runs the agent on the environment.

Parameters:

timesteps (int) – Max. number of total timesteps to run (across episodes).
episodes (int) – Max. number of episodes to run.
max_episode_timesteps (int) – Max. number of timesteps per episode.
deterministic (bool) – If true, pick actions from model without exploration/sampling.
episode_finished (callable) – Function handler taking a Runner argument and returning a boolean indicating whether to continue execution. For instance, useful for reporting intermediate performance or integrating termination conditions.

tensorforce.execution.threaded_runner module¶

class tensorforce.execution.threaded_runner.ThreadedRunner(agents, environments, repeat_actions=1, save_path=None, save_episodes=None)¶

Bases: object

Runner for non-realtime threaded execution of multiple agents.

run(episodes=-1, max_episode_timesteps=-1, episode_finished=None, summary_report=None, summary_interval=0, max_timesteps=None)¶

Parameters:	episodes (List[Episode]) – max_episode_timesteps (int) – Max. number of timesteps per episode. episode_finished (callable) – summary_report (callable) – Function that produces a tensorboard summary update. summary_interval (int) – max_timesteps (int) – Deprecated; see max_episode_timesteps

tensorforce.execution.threaded_runner.WorkerAgentGenerator(agent_class)¶: Worker Agent generator, receives an Agent class and creates a Worker Agent class that inherits from that Agent.

Module contents¶

class tensorforce.execution.Runner(agent, environment, repeat_actions=1, history=None)¶

Bases: object

Simple runner for non-realtime single-process execution.

reset(history=None)¶

run(timesteps=None, episodes=None, max_episode_timesteps=None, deterministic=False, episode_finished=None)¶

Runs the agent on the environment.

Parameters:

timesteps (int) – Max. number of total timesteps to run (across episodes).
episodes (int) – Max. number of episodes to run.
max_episode_timesteps (int) – Max. number of timesteps per episode.
deterministic (bool) – If true, pick actions from model without exploration/sampling.
episode_finished (callable) – Function handler taking a Runner argument and returning a boolean indicating whether to continue execution. For instance, useful for reporting intermediate performance or integrating termination conditions.

class tensorforce.execution.ThreadedRunner(agents, environments, repeat_actions=1, save_path=None, save_episodes=None)¶

Bases: object

Runner for non-realtime threaded execution of multiple agents.

run(episodes=-1, max_episode_timesteps=-1, episode_finished=None, summary_report=None, summary_interval=0, max_timesteps=None)¶

Parameters:	episodes (List[Episode]) – max_episode_timesteps (int) – Max. number of timesteps per episode. episode_finished (callable) – summary_report (callable) – Function that produces a tensorboard summary update. summary_interval (int) – max_timesteps (int) – Deprecated; see max_episode_timesteps