Off-policy and multi-step learning

wy

Created 2023-07-23 10:50:21, updated 2023-07-23 10:55:20

One-step off-policy

Multi-step off-policy

Off-policy corrections for policy gradients

Title: Off-policy and multi-step learning
Author: wy
Created at: 2023-07-23 18:50:21
Updated at: 2023-07-23 18:55:20
Link: https://yuuee-www.github.io/blog/2023/07/23/RL/step9/RLstep9/
License: This work is licensed under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0).
      
  


      
  

  

  

  
      
          
              
