hola

Title: Soft Actor-Critic (SAC)
Author: wy
Created at: 2023-07-23 20:16:31
Updated at: 2023-07-24 17:09:37
Link: https://yuuee-www.github.io/blog/2023/07/23/RL/step11/RLstep11/
License: This work is licensed under CC BY-NC-SA 4.0 (https://creativecommons.org/licenses/by-nc-sa/4.0).

Soft Actor-Critic (SAC)

Maximum entropy reinforcement learning: the agent maximizes expected return plus the entropy of its policy.
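For reference, the maximum entropy objective is usually written as below. The symbols are standard notation and not taken from this post: $\rho_\pi$ is the state-action distribution induced by the policy $\pi$, $r$ is the reward, and $\alpha$ is a temperature that weights the entropy term.

```latex
J(\pi) = \sum_{t=0}^{T} \mathbb{E}_{(s_t, a_t) \sim \rho_\pi}
  \left[ r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \right],
\qquad
\mathcal{H}\big(\pi(\cdot \mid s)\big) = -\,\mathbb{E}_{a \sim \pi(\cdot \mid s)}\left[ \log \pi(a \mid s) \right]
```

Setting $\alpha = 0$ recovers the ordinary expected-return objective.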

Off-policy: learns from transitions collected by earlier versions of the policy, stored in a replay buffer.
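A minimal sketch of what off-policy reuse of data can look like in practice. The `ReplayBuffer` class and its method names are hypothetical helpers for illustration, not code from this post or from any particular library.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of past transitions (hypothetical helper)."""

    def __init__(self, capacity):
        # A bounded deque drops the oldest transition once capacity is reached.
        self.storage = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.storage.append((state, action, reward, next_state, done))

    def sample(self, batch_size, rng=random):
        # Uniform sampling: the batch may mix transitions from many older policies.
        batch = rng.sample(list(self.storage), batch_size)
        states, actions, rewards, next_states, dones = zip(*batch)
        return states, actions, rewards, next_states, dones

    def __len__(self):
        return len(self.storage)
```

Because updates draw from this buffer rather than only from the latest rollout, each transition can be reused for many gradient steps.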

Stochastic policy rather than a deterministic one (a deterministic policy treats only one action as optimal in each state).
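The difference between the two kinds of policy can be sketched as follows, assuming a one-dimensional action and a tanh-squashed Gaussian, which is a common choice for SAC. The function names and the `mean` and `log_std` inputs are hypothetical stand-ins for the outputs of a policy network.

```python
import math
import random


def sample_squashed_gaussian(mean, log_std, rng=random):
    """Sample an action in (-1, 1) and return (action, log_prob)."""
    std = math.exp(log_std)
    u = rng.gauss(mean, std)   # pre-squash sample u ~ N(mean, std^2)
    action = math.tanh(u)      # squash into (-1, 1)
    # Log-density of u under the Gaussian.
    log_prob_u = (-0.5 * ((u - mean) / std) ** 2
                  - log_std - 0.5 * math.log(2.0 * math.pi))
    # Change of variables for a = tanh(u); the small constant avoids log(0).
    log_prob = log_prob_u - math.log(1.0 - action ** 2 + 1e-6)
    return action, log_prob


def greedy_action(mean):
    """Deterministic counterpart: always returns the squashed mean."""
    return math.tanh(mean)
```

Repeated calls to `sample_squashed_gaussian` for the same state give different actions, while `greedy_action` always returns the same one. The returned log-probability is the quantity an entropy term would be computed from.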

Codes:
