hola

wy

Created 2023-07-23 18:59:37, updated 2023-07-23 20:24:13

Deep RL

Recap: Value function approximation

JAX

Deep Q-learning

Deep Q-learning in JAX

The reward hypothesis (Sutton and Barto 2018)

General value functions (Sutton et al. 2011)

Example: Simple predictive questions

GVFs as auxiliary tasks

Trade-offs in multi-task learning

Open problems in GVF learning

Distributional RL

…
