q_derived.base_q_derived_soft_policy

Classes

BaseQDerivedSoftPolicy

Helper class that provides a standard way to create an ABC using

Module Contents

class q_derived.base_q_derived_soft_policy.BaseQDerivedSoftPolicy(Q: Any | Mapping, epsilon: float, num_actions: int)[source]

Bases: gridmind.policies.soft.base_soft_policy.BaseSoftPolicy

Helper class that provides a standard way to create an ABC using inheritance.

Q[source]

epsilon[source]

num_actions[source]

action_space = None[source]

abstract update(state, action)[source]

get_epsilon()[source]

set_epsilon(value: float)[source]

abstract decay_epsilon()[source]

_get_random_action(action_mask=None)[source]

get_action(state, action_mask=None)[source]

get_action_prob(state, action, action_mask=None)[source]

get_all_action_probabilities(states, action_mask=None)[source]

get_action_deterministic(state, action_mask=None)[source]

abstract _get_greedy_action(state, action_mask=None)[source]