policy
This module contains the CategoricalPolicy implementation.
CategorialPolicy
Source code in src/behavior_generation_lecture_python/mdp/policy.py
26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 |
|
get_action(state, deterministic=False)
Returns an action sample for the given state
Source code in src/behavior_generation_lecture_python/mdp/policy.py
41 42 43 44 45 46 |
|
get_log_prob(states, actions)
Returns the log-probability for taking the action, when being the given state
Source code in src/behavior_generation_lecture_python/mdp/policy.py
48 49 50 51 52 |
|
multi_layer_perceptron(sizes, activation=nn.ReLU, output_activation=nn.Identity)
Returns a multi-layer perceptron
Source code in src/behavior_generation_lecture_python/mdp/policy.py
10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
|