hola

Deep RL

wy Lv3

Created: 2023-07-23 10:59:37 | Updated: 2023-07-23 12:24:13

Recap: Value function approximation

JAX

Deep Q-learning

Deep Q-learning in JAX

The reward hypothesis (Sutton and Barto 2018)

General value functions (Sutton et al. 2011)

Example: Simple predictive questions

GVFs as Auxiliary Tasks

Trade-offs in multi-task learning

Open problems in GVF learning

Distributional RL

…
