Off-policy and multi-step learning
Off-policy and multi-step learning
One-step off-policy

Multi-step off-policy

Off-policy corrections for policy gradients

- Title: Off-policy and multi-step learning
- Author: wy
- Created at : 2023-07-23 10:50:21
- Updated at : 2023-07-23 10:55:20
- Link: https://yue-ruby-w.site/2023/07/23/2023-07-23-RL-step9-RLstep9/
- License: This work is licensed under CC BY-NC-SA 4.0.