d3rlpy.dataset.MultiStepTransitionPicker
- class d3rlpy.dataset.MultiStepTransitionPicker(n_steps, gamma)
Multi-step transition picker.
This class implements transition picking for the multi-step TD error.
reward is computed as a multi-step discounted return.
- Parameters
n_steps – Delta timestep between observation and next_observation.
gamma – Discount factor to compute a multi-step return.
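To illustrate what a multi-step discounted return looks like, here is a minimal standalone sketch in plain Python. It does not call d3rlpy. The helper name multi_step_return, the use of a plain list of per-step rewards, and the truncation of the window at the end of the episode are all assumptions made for illustration and may differ from what the library does internally.

```python
from typing import List, Tuple


def multi_step_return(
    rewards: List[float], start: int, n_steps: int, gamma: float
) -> Tuple[float, int]:
    """Hypothetical helper: discounted return over up to n_steps rewards.

    Returns the return and the index n_steps ahead of start, which plays
    the role of the next_observation index in this sketch.
    """
    # Assumption: the window is cut short if the episode ends early.
    end = min(start + n_steps, len(rewards))
    ret = 0.0
    for i, r in enumerate(rewards[start:end]):
        # The i-th reward in the window is discounted by gamma ** i.
        ret += (gamma ** i) * r
    return ret, end


rewards = [1.0, 2.0, 3.0, 4.0]

# Full window: 1.0 + 0.5 * 2.0 + 0.25 * 3.0
ret, next_index = multi_step_return(rewards, start=0, n_steps=3, gamma=0.5)

# Window truncated by the end of the episode: 3.0 + 0.5 * 4.0
short_ret, short_next = multi_step_return(rewards, start=2, n_steps=3, gamma=0.5)
```

With n_steps=1 the sketch reduces to the ordinary one-step reward, which is a quick way to sanity-check the discounting.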
Methods