tensorforce.core.explorations package
Submodules
tensorforce.core.explorations.constant module
class tensorforce.core.explorations.constant.Constant(constant=0.0, scope='constant', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Explores by adding a constant term.
    __init__(constant=0.0, scope='constant', summary_labels=())
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec=None)
tensorforce.core.explorations.epsilon_anneal module
class tensorforce.core.explorations.epsilon_anneal.EpsilonAnneal(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, scope='epsilon_anneal', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Anneals the epsilon parameter based on the ratio of the current timestep to the total number of timesteps.
    __init__(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, scope='epsilon_anneal', summary_labels=())
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec=None)
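The annealing schedule described for EpsilonAnneal can be illustrated with a minimal plain-Python sketch. It assumes the schedule is a linear interpolation from initial_epsilon to final_epsilon that is clamped outside the annealing window; the function name annealed_epsilon is hypothetical and this is not the library's TensorFlow implementation.

```python
def annealed_epsilon(timestep, initial_epsilon=1.0, final_epsilon=0.1,
                     timesteps=10000, start_timestep=0):
    """Sketch of a linear epsilon schedule (assumed, not the library code)."""
    # Fraction of the annealing window that has elapsed so far.
    ratio = (timestep - start_timestep) / timesteps
    # Clamp to [0, 1] so epsilon stays at its endpoints outside the window.
    ratio = min(max(ratio, 0.0), 1.0)
    return initial_epsilon + ratio * (final_epsilon - initial_epsilon)


if __name__ == "__main__":
    for t in (0, 2500, 5000, 10000, 20000):
        print(t, annealed_epsilon(t))
```

Under these assumptions epsilon stays at initial_epsilon before start_timestep and at final_epsilon once the window of timesteps has passed.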
tensorforce.core.explorations.epsilon_decay module
class tensorforce.core.explorations.epsilon_decay.EpsilonDecay(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, half_lives=10, scope='epsilon_anneal', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Exponentially decays the epsilon parameter based on the ratio of the difference between the current and final epsilon to the total number of timesteps.
    __init__(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, half_lives=10, scope='epsilon_anneal', summary_labels=())
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode=0, timestep=0, action_spec=None)
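One plausible reading of the EpsilonDecay parameters is sketched below in plain Python. It assumes that the gap between epsilon and final_epsilon halves half_lives times over the window of timesteps, and that epsilon is pinned to its endpoints outside that window. The function name decayed_epsilon and the exact formula are assumptions for illustration, not the library's implementation.

```python
def decayed_epsilon(timestep, initial_epsilon=1.0, final_epsilon=0.1,
                    timesteps=10000, start_timestep=0, half_lives=10):
    """Sketch of an exponential epsilon schedule (assumed formula)."""
    if timestep < start_timestep:
        # Decay has not started yet.
        return initial_epsilon
    if timestep >= start_timestep + timesteps:
        # Decay window is over.
        return final_epsilon
    # Number of half-lives elapsed at this timestep.
    elapsed = (timestep - start_timestep) / timesteps * half_lives
    # The remaining gap to final_epsilon halves once per half-life.
    return final_epsilon + (initial_epsilon - final_epsilon) * 0.5 ** elapsed


if __name__ == "__main__":
    for t in (0, 1000, 2000, 5000, 10000):
        print(t, decayed_epsilon(t))
```

With the default arguments, one half-life corresponds to 1000 timesteps in this sketch, so most of the decay happens early in the window.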
tensorforce.core.explorations.exploration module
class tensorforce.core.explorations.exploration.Exploration(scope='exploration', summary_labels=None)
    Bases: object
    Abstract exploration object.
    __init__(scope='exploration', summary_labels=None)
    static from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec)
        Creates the exploration value, e.g. computes an epsilon for epsilon-greedy or samples normal noise.
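The idea behind from_spec, building an object from a specification dict, can be sketched without the library. Everything in the block below is hypothetical: the "type" key, the REGISTRY mapping, and the stand-in classes are assumptions chosen for illustration and do not reproduce the library's own lookup logic.

```python
class ConstantSketch:
    """Stand-in for a constant exploration (hypothetical)."""

    def __init__(self, constant=0.0):
        self.constant = constant


class GaussianNoiseSketch:
    """Stand-in for a Gaussian-noise exploration (hypothetical)."""

    def __init__(self, sigma=0.3, mu=0.0):
        self.sigma = sigma
        self.mu = mu


# Hypothetical mapping from a type name to the class that implements it.
REGISTRY = {
    "constant": ConstantSketch,
    "gaussian_noise": GaussianNoiseSketch,
}


def from_spec_sketch(spec):
    """Build an object from a dict such as {'type': 'constant', 'constant': 0.5}."""
    kwargs = dict(spec)          # copy so the caller's dict is not mutated
    kind = kwargs.pop("type")    # assumed key naming the exploration type
    if kind not in REGISTRY:
        raise ValueError("unknown exploration type: {!r}".format(kind))
    # Remaining entries are passed through as constructor arguments.
    return REGISTRY[kind](**kwargs)


if __name__ == "__main__":
    exploration = from_spec_sketch({"type": "gaussian_noise", "sigma": 0.1})
    print(type(exploration).__name__, exploration.sigma, exploration.mu)
```

Keeping the configuration in a plain dict means an exploration strategy can be chosen from a config file without changing code, which is the usual motivation for this kind of factory.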
tensorforce.core.explorations.linear_decay module
tensorforce.core.explorations.ornstein_uhlenbeck_process module
class tensorforce.core.explorations.ornstein_uhlenbeck_process.OrnsteinUhlenbeckProcess(sigma=0.3, mu=0.0, theta=0.15, scope='ornstein_uhlenbeck', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Explores via an Ornstein-Uhlenbeck process.
    __init__(sigma=0.3, mu=0.0, theta=0.15, scope='ornstein_uhlenbeck', summary_labels=())
        Initializes an Ornstein-Uhlenbeck process, a mean-reverting stochastic process that introduces time-correlated noise.
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec)
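The mean-reverting, time-correlated behaviour can be seen in a small plain-Python sketch. It assumes a unit-time-step discretisation, x <- x + theta * (mu - x) + sigma * N(0, 1), with the state initialised at mu; the class name OrnsteinUhlenbeckSketch and the seed argument are hypothetical and the library's TensorFlow code may differ.

```python
import random


class OrnsteinUhlenbeckSketch:
    """Discrete-time Ornstein-Uhlenbeck noise (illustrative sketch)."""

    def __init__(self, sigma=0.3, mu=0.0, theta=0.15, seed=None):
        self.sigma = sigma
        self.mu = mu
        self.theta = theta
        self.state = mu                  # assumed starting point
        self.rng = random.Random(seed)   # seed is only for reproducibility

    def sample(self):
        # Pull the state back toward mu at rate theta ...
        drift = self.theta * (self.mu - self.state)
        # ... and perturb it with Gaussian noise scaled by sigma.
        diffusion = self.sigma * self.rng.gauss(0.0, 1.0)
        self.state += drift + diffusion
        return self.state


if __name__ == "__main__":
    process = OrnsteinUhlenbeckSketch(seed=0)
    print([round(process.sample(), 3) for _ in range(5)])
```

Because each sample starts from the previous state, consecutive values are correlated, unlike independent Gaussian draws.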
Module contents
class tensorforce.core.explorations.Exploration(scope='exploration', summary_labels=None)
    Bases: object
    Abstract exploration object.
    __init__(scope='exploration', summary_labels=None)
    static from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec)
        Creates the exploration value, e.g. computes an epsilon for epsilon-greedy or samples normal noise.
class tensorforce.core.explorations.Constant(constant=0.0, scope='constant', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Explores by adding a constant term.
    __init__(constant=0.0, scope='constant', summary_labels=())
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec=None)
class tensorforce.core.explorations.EpsilonAnneal(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, scope='epsilon_anneal', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Anneals the epsilon parameter based on the ratio of the current timestep to the total number of timesteps.
    __init__(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, scope='epsilon_anneal', summary_labels=())
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec=None)
class tensorforce.core.explorations.EpsilonDecay(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, half_lives=10, scope='epsilon_anneal', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Exponentially decays the epsilon parameter based on the ratio of the difference between the current and final epsilon to the total number of timesteps.
    __init__(initial_epsilon=1.0, final_epsilon=0.1, timesteps=10000, start_timestep=0, half_lives=10, scope='epsilon_anneal', summary_labels=())
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode=0, timestep=0, action_spec=None)
class tensorforce.core.explorations.GaussianNoise(sigma=0.3, mu=0.0, scope='gaussian_noise', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Explores via Gaussian noise.
    __init__(sigma=0.3, mu=0.0, scope='gaussian_noise', summary_labels=())
        Initializes the distribution values for Gaussian noise.
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec)
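As a rough illustration of Gaussian exploration noise, the sketch below draws one independent sample per action dimension from a normal distribution with mean mu and standard deviation sigma and adds it to an action. The helper names gaussian_noise and perturb_action, and the clipping bounds, are hypothetical; how the library applies the noise to an action is not stated in this reference.

```python
import random


def gaussian_noise(size, sigma=0.3, mu=0.0, rng=None):
    """Return `size` independent samples from N(mu, sigma**2)."""
    rng = rng or random.Random()
    return [rng.gauss(mu, sigma) for _ in range(size)]


def perturb_action(action, low=-1.0, high=1.0, **noise_kwargs):
    """Add Gaussian noise to each action component, then clip to [low, high]."""
    noise = gaussian_noise(len(action), **noise_kwargs)
    return [min(max(a + n, low), high) for a, n in zip(action, noise)]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs match
    print(perturb_action([0.2, -0.5, 0.9], sigma=0.3, rng=rng))
```

Unlike the Ornstein-Uhlenbeck process, each draw here is independent of the previous one.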
class tensorforce.core.explorations.OrnsteinUhlenbeckProcess(sigma=0.3, mu=0.0, theta=0.15, scope='ornstein_uhlenbeck', summary_labels=())
    Bases: tensorforce.core.explorations.exploration.Exploration
    Explores via an Ornstein-Uhlenbeck process.
    __init__(sigma=0.3, mu=0.0, theta=0.15, scope='ornstein_uhlenbeck', summary_labels=())
        Initializes an Ornstein-Uhlenbeck process, a mean-reverting stochastic process that introduces time-correlated noise.
    from_spec(spec)
        Creates an exploration object from a specification dict.
    get_variables()
        Returns exploration variables.
        Returns: List of variables.
    tf_explore(episode, timestep, action_spec)