hola

Title: Soft Actor-Critic (SAC)
Author: wy
Created at: 2023-07-23 12:16:31
Updated at: 2023-07-24 09:09:37
Link: https://yue-ruby-w.site/2023/07/23/2023-07-23-RL-step11-RLstep11/
License: This work is licensed under <a class="license" target="_blank" rel="noopener" href="https://creativecommons.org/licenses/by-nc-sa/4.0">CC BY-NC-SA 4.0 .

Soft Actor-Critic (SAC)

wy Lv3

2023-07-23 12:16:31 2023-07-23 12:16:31 Created 2023-07-24 09:09:37 2023-07-24 09:09:37 Updated 53 Words 1 Mins

Maximum Entropy Reinforcement learning

off policy

stochastic policy and not deterministic policy (Only one action is considered optimal in each state)

Codes:

On this page

Soft Actor-Critic (SAC)