Soft Actor-Critic (SAC)



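Maximum entropy reinforcement learning: the agent maximizes expected return plus an entropy bonus on its policy.

Off-policy: the agent learns from transitions stored in a replay buffer, not only from data collected by the current policy.

Stochastic policy rather than a deterministic policy (a deterministic policy treats only one action as optimal in each state).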
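
To make the stochastic-versus-deterministic point concrete, here is a minimal sketch for a single state with discrete actions. It is not the code linked from this post and not a full SAC implementation. The Q-values, the temperature `alpha`, and the helper names `soft_policy`, `entropy` and `soft_objective` are assumptions made up for illustration. For fixed Q-values, the policy that maximizes "expected Q plus alpha times entropy" is a softmax over Q / alpha, so near-optimal actions keep some probability instead of being discarded.

```python
import numpy as np


def soft_policy(q_values, alpha):
    """Boltzmann policy: pi(a) is proportional to exp(Q(a) / alpha)."""
    logits = np.asarray(q_values, dtype=float) / alpha
    logits = logits - logits.max()  # subtract the max for numerical stability
    probs = np.exp(logits)
    return probs / probs.sum()


def entropy(probs):
    """Shannon entropy in nats; zero-probability actions contribute nothing."""
    probs = probs[probs > 0]
    return float(-(probs * np.log(probs)).sum())


def soft_objective(q_values, probs, alpha):
    """Entropy-regularized objective: expected Q plus alpha times entropy."""
    return float(np.dot(probs, q_values) + alpha * entropy(probs))


q = np.array([1.0, 0.9, -1.0])  # hypothetical Q-values for one state
alpha = 0.5                     # hypothetical temperature

stochastic = soft_policy(q, alpha)
deterministic = np.eye(len(q))[np.argmax(q)]  # all mass on the greedy action

print("stochastic policy:   ", stochastic)
print("deterministic policy:", deterministic)
print("soft objective (stochastic):   ", soft_objective(q, stochastic, alpha))
print("soft objective (deterministic):", soft_objective(q, deterministic, alpha))
```

With these made-up numbers the two best actions have similar Q-values, so the softmax policy splits most of its probability between them, while the greedy policy has zero entropy. A smaller `alpha` pushes the softmax policy toward the greedy one.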

Code:

  1. Soft Actor-Critic (SAC)


  • Title: Soft Actor-Critic (SAC)
  • Author: wy
  • Created at : 2023-07-23 12:16:31
  • Updated at : 2023-07-24 09:09:37
  • Link: https://yue-ruby-w.site/2023/07/23/2023-07-23-RL-step11-RLstep11/
  • License: This work is licensed under CC BY-NC-SA 4.0.