q_derived.base_q_derived_soft_policy

Classes

BaseQDerivedSoftPolicy

Helper class that provides a standard way to create an ABC using

Module Contents

class q_derived.base_q_derived_soft_policy.BaseQDerivedSoftPolicy(Q: Any | Mapping, epsilon: float, num_actions: int)[source]

Bases: gridmind.policies.soft.base_soft_policy.BaseSoftPolicy

Helper class that provides a standard way to create an ABC using inheritance.

Q[source]
epsilon[source]
num_actions[source]
action_space = None[source]
abstract update(state, action)[source]
get_epsilon()[source]
set_epsilon(value: float)[source]
abstract decay_epsilon()[source]
_get_random_action(action_mask=None)[source]
get_action(state, action_mask=None)[source]
get_action_prob(state, action, action_mask=None)[source]
get_all_action_probabilities(states, action_mask=None)[source]
get_action_deterministic(state, action_mask=None)[source]
abstract _get_greedy_action(state, action_mask=None)[source]