d3rlpy.models.MeanQFunctionFactory¶
- class d3rlpy.models.MeanQFunctionFactory(share_encoder=False)[source]¶
Standard Q function factory class.
This is the standard Q function factory class.
References
Mnih et al., Human-level control through deep reinforcement learning.
Lillicrap et al., Continuous control with deep reinforcement learning.
Methods
- create_continuous(encoder, hidden_size)[source]¶
Returns PyTorch’s Q function module.
- Parameters
encoder (d3rlpy.models.torch.encoders.EncoderWithAction) – Encoder module that processes the observation and action to obtain feature representations.
hidden_size (int) – Dimension of encoder output.
- Returns
Tuple of continuous Q function and its forwarder.
- Return type
Tuple[d3rlpy.models.torch.q_functions.mean_q_function.ContinuousMeanQFunction, d3rlpy.models.torch.q_functions.mean_q_function.ContinuousMeanQFunctionForwarder]
- create_discrete(encoder, hidden_size, action_size)[source]¶
Returns PyTorch’s Q function module.
- Parameters
- Returns
Tuple of discrete Q function and its forwarder.
- Return type
Tuple[d3rlpy.models.torch.q_functions.mean_q_function.DiscreteMeanQFunction, d3rlpy.models.torch.q_functions.mean_q_function.DiscreteMeanQFunctionForwarder]
- classmethod deserialize(serialized_config)¶
- Parameters
serialized_config (str) –
- Return type
d3rlpy.serializable_config.TConfig
- classmethod deserialize_from_dict(dict_config)¶
- Parameters
dict_config (Dict[str, Any]) –
- Return type
d3rlpy.serializable_config.TConfig
- classmethod deserialize_from_file(path)¶
- Parameters
path (str) –
- Return type
d3rlpy.serializable_config.TConfig
- classmethod from_dict(kvs, *, infer_missing=False)¶
- classmethod from_json(s, *, parse_float=None, parse_int=None, parse_constant=None, infer_missing=False, **kw)¶
- classmethod schema(*, infer_missing=False, only=None, exclude=(), many=False, context=None, load_only=(), dump_only=(), partial=False, unknown=None)¶
- to_dict(encode_json=False)¶
- to_json(*, skipkeys=False, ensure_ascii=True, check_circular=True, allow_nan=True, indent=None, separators=None, default=None, sort_keys=False, **kw)¶
Attributes