Distributions

class tensorforce.core.distributions.Bernoulli(name, action_spec, embedding_shape, summary_labels=None)

Bernoulli distribution, for binary (boolean) actions (specification key: bernoulli).

Parameters:
  • name (string) – Distribution name (internal use).
  • action_spec (specification) – Action specification (internal use).
  • embedding_shape (iter[int > 0]) – Embedding shape (internal use).
  • summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).
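
As a rough illustration of what a Bernoulli distribution over a boolean action involves, the sketch below samples an action from a single logit and evaluates its log-probability. It is a minimal plain-Python sketch, not Tensorforce's implementation: the function names are hypothetical, and the assumption that the distribution is parametrized by one logit per action is made only for this example.

```python
import math
import random


def sigmoid(logit):
    """Map a real-valued logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-logit))


def sample_bernoulli(logit, rng=random):
    """Draw a boolean action that is True with probability sigmoid(logit)."""
    return rng.random() < sigmoid(logit)


def bernoulli_log_prob(logit, action):
    """Log-probability of a boolean action under the given logit."""
    p = sigmoid(logit)
    return math.log(p if action else 1.0 - p)
```

A logit of 0.0 corresponds to a fair coin, and larger logits make True more likely.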

class tensorforce.core.distributions.Beta(name, action_spec, embedding_shape, summary_labels=None)

Beta distribution, for bounded continuous actions (specification key: beta).

Parameters:
  • name (string) – Distribution name (internal use).
  • action_spec (specification) – Action specification (internal use).
  • embedding_shape (iter[int > 0]) – Embedding shape (internal use).
  • summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).
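
To show why a Beta distribution suits bounded continuous actions, the sketch below draws a sample on [0, 1] and rescales it to the action bounds. This is a plain-Python illustration, not Tensorforce's implementation: the function names are hypothetical, and min_value and max_value are assumed names for the lower and upper bounds of the action.

```python
import random


def sample_bounded_action(alpha, beta, min_value, max_value, rng=random):
    """Draw x ~ Beta(alpha, beta) on [0, 1] and rescale it to [min_value, max_value]."""
    x = rng.betavariate(alpha, beta)
    return min_value + (max_value - min_value) * x


def bounded_mean(alpha, beta, min_value, max_value):
    """Mean of the rescaled distribution: the Beta mean alpha / (alpha + beta), rescaled."""
    return min_value + (max_value - min_value) * alpha / (alpha + beta)
```

Because the Beta distribution has support [0, 1], every rescaled sample lies within the bounds, so no clipping is needed.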

class tensorforce.core.distributions.Categorical(name, action_spec, embedding_shape, infer_states_value=True, summary_labels=None)

Categorical distribution, for discrete integer actions (specification key: categorical).

Parameters:
  • name (string) – Distribution name (internal use).
  • action_spec (specification) – Action specification (internal use).
  • embedding_shape (iter[int > 0]) – Embedding shape (internal use).
  • infer_states_value (bool) – Whether to infer the state value from the state-action values as the softmax denominator (default: true).
  • summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).
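
The relationship behind infer_states_value can be sketched in plain Python. The example assumes one reading of the description above: the state-action values act as softmax logits, and the state value is the log of the softmax denominator (the log-sum-exp of the state-action values). Under that assumption, the log-probability of an action equals its state-action value minus the state value. The function names are hypothetical, and this is not Tensorforce's implementation.

```python
import math


def log_sum_exp(values):
    """Numerically stable log of sum(exp(v)) over the given values."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def softmax(values):
    """Softmax probabilities; the denominator is exp(log_sum_exp(values))."""
    lse = log_sum_exp(values)
    return [math.exp(v - lse) for v in values]
```

With action values q and v = log_sum_exp(q), each probability satisfies log p[a] = q[a] - v, which is why the state value can be read off the same outputs that define the policy.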

class tensorforce.core.distributions.Gaussian(name, action_spec, embedding_shape, summary_labels=None)

Gaussian distribution, for unbounded continuous actions (specification key: gaussian).

Parameters:
  • name (string) – Distribution name (internal use).
  • action_spec (specification) – Action specification (internal use).
  • embedding_shape (iter[int > 0]) – Embedding shape (internal use).
  • summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).
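
For unbounded continuous actions, a Gaussian policy is commonly described by a mean and a log standard deviation. The sketch below assumes that parametrization purely for illustration; the function names are hypothetical and this is not Tensorforce's implementation.

```python
import math
import random


def sample_gaussian(mean, log_stddev, rng=random):
    """Draw mean + stddev * eps with eps ~ N(0, 1) (a reparameterized sample)."""
    return mean + math.exp(log_stddev) * rng.gauss(0.0, 1.0)


def gaussian_log_prob(mean, log_stddev, action):
    """Log-density of the action under N(mean, exp(log_stddev) ** 2)."""
    z = (action - mean) / math.exp(log_stddev)
    return -0.5 * z * z - log_stddev - 0.5 * math.log(2.0 * math.pi)


def gaussian_entropy(log_stddev):
    """Differential entropy, which depends only on the standard deviation."""
    return log_stddev + 0.5 * math.log(2.0 * math.pi * math.e)
```

Working with the log standard deviation keeps the standard deviation positive for any real-valued input, which is one reason this parametrization is popular.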