Off-policy and multi-step learning
One-step off-policy
Multi-step off-policy
Off-policy corrections for policy gradients
- Author: wy
- Created at : 2023-07-23 18:50:21
- Updated at : 2023-07-23 18:55:20
- Link: https://yuuee-www.github.io/blog/2023/07/23/RL/step9/RLstep9/
- License: This work is licensed under CC BY-NC-SA 4.0.