Off-policy and multi-step learning

wy Lv3

Off-policy and multi-step learning

One-step off-policy

Multi-step off-policy

Off-policy corrections for policy gradients

  • Title:

  • Author: wy

  • Created at
    :
    2023-07-23 18:50:21

  •           **Updated at
                  :** 2023-07-23 18:55:20
          
      
      
    
  •       **Link:** https://yuuee-www.github.io/blog/2023/07/23/RL/step9/RLstep9/
      
      
    
  •       **
              License:
          **
          
    
          
              This work is licensed under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0).
          
      
    
    
          
      
    
      
    
      
    
      
          
              
                  
                      [Prev posts](/2023/07/23/RL/step10/RLstep10/)
                  
              
              
                  
                      [Next posts](/2023/07/23/RL/step8/RLstep8/)
                  
              
          
      
    
      
          
              
    
    
      Comments
    
    
    
      
          
    
    
    
      
    
    
          
      
    
    
    
      
          
    
      On this page
    
  1. Off-policy and multi-step learning

         ©
         
           2022
           -
         
         2024    [wy](/)
         
             
             
    
                 
                     24 posts in total
                 
                 
             
    
         
     
     
         
         
             
                 
                     VISITOR COUNT
                     
                 
             
             
                 
                     TOTAL PAGE VIEWS
                     
                 
             
         
     
     
         POWERED BY [Hexo](https://hexo.io)
         THEME [Redefine v2.6.4](https://github.com/EvanNotFound/hexo-theme-redefine)
     
     
     
         
             Blog up for  days  hrs  Min  Sec
    

-

-

-

-

-

-

-

  • Title: Off-policy and multi-step learning
  • Author: wy
  • Created at : 2023-07-23 10:50:21
  • Updated at : 2023-07-23 10:55:20
  • Link: https://yue-ruby-w.site/2023/07/23/2023-07-23-RL-step9-RLstep9/
  • License: This work is licensed under CC BY-NC-SA 4.0.
On this page
Off-policy and multi-step learning