Off-policy and multi-step learning

wy Lv3

Off-policy and multi-step learning

One-step off-policy

Multi-step off-policy

Off-policy corrections for policy gradients

  • Title: Off-policy and multi-step learning
  • Author: wy
  • Created at : 2023-07-23 10:50:21
  • Updated at : 2023-07-23 10:55:20
  • Link: https://yue-ruby-w.site/2023/07/23/2023-07-23-RL-step9-RLstep9/
  • License: This work is licensed under CC BY-NC-SA 4.0.
On this page
Off-policy and multi-step learning