Off-policy and multi-step learning

One-step off-policy

Multi-step off-policy

Off-policy corrections for policy gradients

  • Title:
  • Author: wy
  • Created at: 2023-07-23 18:50:21
  • Updated at : 2023-07-23 18:55:20
  • Link: https://yuuee-www.github.io/blog/2023/07/23/RL/step9/RLstep9/
  • License: This work is licensed under CC BY-NC-SA 4.0.