Approximate Dynamic Programming

Under the two sources of error (estimation and function approximation), what can we say about the resulting estimates?

The Bellman Optimality Operator
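
For reference, a standard way to write this operator is sketched below. It assumes a discounted MDP with reward function $r$, transition kernel $p$, and discount factor $\gamma \in [0, 1)$; these symbols are assumptions made here and may differ from the notation used elsewhere in this post.

```latex
(T^{*} f)(s) \;=\; \max_{a} \Big[\, r(s,a) + \gamma \sum_{s'} p(s' \mid s, a)\, f(s') \,\Big]
```

Under these assumptions $T^{*}$ is a $\gamma$-contraction in the sup norm, and its unique fixed point is the optimal value function $v^{*}$.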

The Bellman Expectation Operator
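
As a reference point, the expectation operator for a fixed policy $\pi$ can be written as follows, again assuming a discounted MDP with reward $r$, transition kernel $p$, and discount $\gamma$ (assumed notation, not necessarily the post's own):

```latex
(T^{\pi} f)(s) \;=\; \sum_{a} \pi(a \mid s) \Big[\, r(s,a) + \gamma \sum_{s'} p(s' \mid s, a)\, f(s') \,\Big]
```

It replaces the maximum over actions with an average under $\pi$, and its unique fixed point is the policy's value function $v^{\pi}$.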

Dynamic Programming with Bellman Operators
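
A minimal sketch of dynamic programming as repeated application of a Bellman operator, assuming a small tabular MDP. The array layout (`P[a, s, s']`, `R[s, a]`), the function names, and the two-state example are all hypothetical choices made for illustration.

```python
import numpy as np

def q_backup(v, P, R, gamma):
    """One-step lookahead: q[s, a] = R[s, a] + gamma * sum_s' P[a, s, s'] * v[s']."""
    return R + gamma * np.einsum("ast,t->sa", P, v)

def bellman_optimality(v, P, R, gamma):
    """Apply T* to v: maximise the one-step lookahead over actions."""
    return q_backup(v, P, R, gamma).max(axis=1)

def bellman_expectation(v, P, R, gamma, pi):
    """Apply T^pi to v: average the lookahead under pi, where pi[s, a] = pi(a | s)."""
    return (pi * q_backup(v, P, R, gamma)).sum(axis=1)

def value_iteration(P, R, gamma, tol=1e-10, max_iters=10_000):
    """Iterate v <- T* v until successive iterates differ by less than tol."""
    v = np.zeros(R.shape[0])
    for _ in range(max_iters):
        v_next = bellman_optimality(v, P, R, gamma)
        if np.max(np.abs(v_next - v)) < tol:
            return v_next
        v = v_next
    return v

# Hypothetical 2-state MDP: action 0 stays in place, action 1 switches state.
P = np.array([
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
])
R = np.array([
    [0.0, 1.0],  # state 0: stay pays 0, switch pays 1
    [2.0, 0.0],  # state 1: stay pays 2, switch pays 0
])
gamma = 0.9

v_star = value_iteration(P, R, gamma)

# Policy evaluation with T^pi for the uniform random policy.
pi_uniform = np.full((2, 2), 0.5)
v_pi = np.zeros(2)
for _ in range(1000):
    v_pi = bellman_expectation(v_pi, P, R, gamma, pi_uniform)
```

Both loops rely on the operators being contractions: each application shrinks the distance to the fixed point by a factor of at most `gamma`.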

Approximate DP

Approximate Value Iteration

q-value version:
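
One common way to write the q-value form of approximate value iteration is sketched below. The symbol $\mathcal{A}$ for the approximation (fitting) step and $\hat{T}^{*}$ for a sample-based estimate of the operator are notation assumed here for illustration.

```latex
(T^{*} q)(s,a) \;=\; r(s,a) + \gamma \sum_{s'} p(s' \mid s, a)\, \max_{a'} q(s', a'),
\qquad
q_{k+1} \;=\; \mathcal{A}\, \hat{T}^{*} q_{k}
```

In this form the two sources of error are visible separately: $\hat{T}^{*}$ introduces estimation error because it is computed from samples, and $\mathcal{A}$ introduces function approximation error because the result is projected onto a restricted function class.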

Some concrete instances of AVI

Fitted Q-iteration with Linear Approximation:
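
A minimal sketch of fitted Q-iteration with a linear model $q_w(s,a) = w^\top \phi(s,a)$, assuming a fixed batch of transitions and a least-squares fit at every iteration. The function name, the `featurize` callback, the small ridge term, and the toy data are all hypothetical.

```python
import numpy as np

def fitted_q_iteration_linear(transitions, featurize, n_actions, gamma,
                              n_iters=400, ridge=1e-6):
    """transitions: list of (s, a, r, s_next, done); returns the weight vector w."""
    # Features of the visited (s, a) pairs, shape (N, d).
    Phi = np.stack([featurize(s, a) for s, a, _, _, _ in transitions])
    # Features of every action in every next state, shape (N, n_actions, d).
    Phi_next = np.stack([
        np.stack([featurize(s2, b) for b in range(n_actions)])
        for _, _, _, s2, _ in transitions
    ])
    r = np.array([t[2] for t in transitions], dtype=float)
    not_done = np.array([0.0 if t[4] else 1.0 for t in transitions])

    d = Phi.shape[1]
    gram = Phi.T @ Phi + ridge * np.eye(d)  # ridge keeps the solve well-posed
    w = np.zeros(d)
    for _ in range(n_iters):
        # Regression targets from the current estimate: r + gamma * max_a' q_w(s', a').
        targets = r + gamma * not_done * (Phi_next @ w).max(axis=1)
        # Least-squares fit of the linear model to those targets.
        w = np.linalg.solve(gram, Phi.T @ targets)
    return w

def one_hot(s, a, n_states=2, n_actions=2):
    """Tabular features: one indicator per (state, action) pair."""
    phi = np.zeros(n_states * n_actions)
    phi[s * n_actions + a] = 1.0
    return phi

# Hypothetical batch: one transition per (s, a) in a 2-state deterministic MDP.
transitions = [
    (0, 0, 0.0, 0, False),
    (0, 1, 1.0, 1, False),
    (1, 0, 2.0, 1, False),
    (1, 1, 0.0, 0, False),
]
w = fitted_q_iteration_linear(transitions, one_hot, n_actions=2, gamma=0.9)
```

With one-hot features the fit is exact, so this reduces to tabular q-value iteration on the sampled transitions. With a genuinely restricted feature set the iterates are not guaranteed to converge, which is part of what an error analysis of approximate DP has to account for.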

Fitted Q-iteration with other Approximations:

Fitted Q-iteration (General recipe)
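
The general recipe can be sketched as a loop with three pluggable parts: obtain a batch of transitions, build bootstrapped targets from the current estimate, and regress onto those targets. The callback names (`sample_batch`, `predict`, `fit`) and the tabular instance below are hypothetical, chosen only to make the loop runnable.

```python
import numpy as np

def fitted_q_iteration(sample_batch, predict, fit, params, n_actions, gamma, n_iters):
    """Generic loop: sample data, compute targets, regress, repeat."""
    for _ in range(n_iters):
        s, a, r, s_next = sample_batch()
        # q estimates for every action in the next states, shape (N, n_actions).
        q_next = np.stack(
            [predict(params, s_next, np.full_like(a, b)) for b in range(n_actions)],
            axis=1,
        )
        targets = r + gamma * q_next.max(axis=1)
        params = fit(params, s, a, targets)
    return params

# Tabular instance: params is an (S, A) table and the "regression" is an overwrite.
def table_predict(table, s, a):
    return table[s, a]

def table_fit(table, s, a, targets):
    new_table = table.copy()
    new_table[s, a] = targets
    return new_table

def fixed_batch():
    """Hypothetical fixed dataset: one transition per (s, a) pair."""
    s = np.array([0, 0, 1, 1])
    a = np.array([0, 1, 0, 1])
    r = np.array([0.0, 1.0, 2.0, 0.0])
    s_next = np.array([0, 1, 1, 0])
    return s, a, r, s_next

q_table = fitted_q_iteration(
    fixed_batch, table_predict, table_fit,
    params=np.zeros((2, 2)), n_actions=2, gamma=0.9, n_iters=400,
)
```

One reading of the variants listed after this heading, offered as an assumption rather than a summary of the original material, is that they differ mainly in `sample_batch` and `fit`: a replay buffer with gradient steps for DQN, a fixed logged dataset for batch RL, and transitions generated by a learned model for Dyna.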

Fitted Q-iteration (General recipe: DQN)

Fitted Q-iteration (General recipe: Batch RL - 1)

Fitted Q-iteration (General recipe: Batch RL - 2)

Fitted Q-iteration (General recipe: Dyna)

  • Title: Approximate Dynamic Programming
  • Author: wy
  • Created at: 2023-07-23 17:24:25
  • Updated at: 2023-07-23 17:51:41
  • Link: https://yuuee-www.github.io/blog/2023/07/23/RL/step8/RLstep8/
  • License: This work is licensed under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0).
