Distributions¶
-
class
tensorforce.core.distributions.
Bernoulli
(name, action_spec, embedding_shape, summary_labels=None)[source]¶ Bernoulli distribution, for binary boolean actions (specification key:
bernoulli
).Parameters: - name (string) – Distribution name (internal use).
- action_spec (specification) – Action specification (internal use).
- embedding_shape (iter[int > 0]) – Embedding shape (internal use).
- summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).
-
class
tensorforce.core.distributions.
Beta
(name, action_spec, embedding_shape, summary_labels=None)[source]¶ Beta distribution, for bounded continuous actions (specification key:
beta
).Parameters: - name (string) – Distribution name (internal use).
- action_spec (specification) – Action specification (internal use).
- embedding_shape (iter[int > 0]) – Embedding shape (internal use).
- summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).
-
class
tensorforce.core.distributions.
Categorical
(name, action_spec, embedding_shape, infer_states_value=True, summary_labels=None)[source]¶ Categorical distribution, for discrete integer actions (specification key:
categorical
).Parameters: - name (string) – Distribution name (internal use).
- action_spec (specification) – Action specification (internal use).
- embedding_shape (iter[int > 0]) – Embedding shape (internal use).
- infer_states_value (bool) – Whether to infer the state value from state-action values as softmax denominator (default: true).
- summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).
-
class
tensorforce.core.distributions.
Gaussian
(name, action_spec, embedding_shape, summary_labels=None)[source]¶ Gaussian distribution, for unbounded continuous actions (specification key:
gaussian
).Parameters: - name (string) – Distribution name (internal use).
- action_spec (specification) – Action specification (internal use).
- embedding_shape (iter[int > 0]) – Embedding shape (internal use).
- summary_labels ('all' | iter[string]) – Labels of summaries to record (default: inherit value of parent module).