d3rlpy.algos.SoftmaxTransformerActionSampler

class d3rlpy.algos.SoftmaxTransformerActionSampler(temperature=1.0)

Softmax action-sampler.

This class applies the softmax function to sample an action from a discrete probability distribution.

Parameters: temperature (float) – Softmax temperature.
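
As a rough illustration of what the temperature does, the NumPy sketch below computes a temperature-scaled softmax. It assumes the common convention of dividing the logits by the temperature before exponentiation. `softmax_with_temperature` is a hypothetical helper written for this example, not part of d3rlpy, and the library's internal implementation may differ.

```python
import numpy as np


def softmax_with_temperature(logits: np.ndarray, temperature: float = 1.0) -> np.ndarray:
    """Hypothetical helper: temperature-scaled softmax over a 1-D array."""
    # Assumption: logits are divided by the temperature before exponentiation.
    scaled = np.asarray(logits, dtype=np.float64) / temperature
    # Subtracting the maximum improves numerical stability without changing the result.
    exp = np.exp(scaled - scaled.max())
    return exp / exp.sum()


logits = np.array([2.0, 1.0, 0.1])
sharp = softmax_with_temperature(logits, temperature=0.5)  # more peaked
flat = softmax_with_temperature(logits, temperature=5.0)   # closer to uniform
```

Under this convention, a temperature below 1 concentrates probability on the largest logit, while a temperature above 1 spreads it more evenly across actions.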

Methods

__call__(transformer_output)

Returns an action sampled from the Transformer output.

Parameters: transformer_output (ndarray[Any, dtype[Any]]) – Output of Transformer algorithms.
Returns: Sampled action.
Return type: Union[ndarray[Any, dtype[Any]], int]
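
The call contract above can be sketched with a small stand-in class. This is a minimal sketch, not the d3rlpy implementation: `SoftmaxSamplerSketch` is a hypothetical name, the input is assumed to be a 1-D array of per-action logits, and the `seed` argument is added here only to make the example reproducible.

```python
import numpy as np


class SoftmaxSamplerSketch:
    """Hypothetical stand-in for a softmax action sampler (not the d3rlpy class)."""

    def __init__(self, temperature: float = 1.0, seed: int = 0):
        self._temperature = temperature
        # Assumption: a seeded generator, used only so the example is reproducible.
        self._rng = np.random.default_rng(seed)

    def __call__(self, transformer_output: np.ndarray) -> int:
        # Assumption: a 1-D array holding one logit per discrete action.
        logits = np.asarray(transformer_output, dtype=np.float64) / self._temperature
        exp = np.exp(logits - logits.max())
        probs = exp / exp.sum()
        # Draw one action index according to the softmax probabilities.
        return int(self._rng.choice(probs.shape[0], p=probs))


sampler = SoftmaxSamplerSketch(temperature=1.0)
action = sampler(np.array([0.1, 2.5, -1.0]))

# With a very low temperature, sampling becomes effectively greedy.
greedy = SoftmaxSamplerSketch(temperature=0.1)(np.array([0.0, 100.0, 0.0]))
```

In this sketch the returned value is an index into the action dimension, matching the `int` case of the documented return type.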
