Lecture 2: Observation Encoding and Latent Dynamics
The core problem of Dreamer breaks into two questions: how to compress perception, and how to predict the future. This lecture addresses each question in turn.
- Part A: Observation Encoding: why compression is necessary, the encoder-decoder structure of a VAE, intuition behind the ELBO loss, and the structure of a CNN encoder
- Part B: Latent Dynamics: starting from the simplest GRU, moving through MDN-RNN's uncertainty modeling, and arriving at RSSM's deterministic/stochastic dual-path design
Read through both parts before starting the projects. Part A is the direct prerequisite for P01; Part B is the direct prerequisite for P02.