Skip to content

Lecture 2: Observation Encoding and Latent Dynamics

The core problem of Dreamer breaks into two questions: how to compress perception, and how to predict the future. This lecture addresses each question in turn.

  • Part A: Observation Encoding: why compression is necessary, the encoder-decoder structure of a VAE, intuition behind the ELBO loss, and the structure of a CNN encoder
  • Part B: Latent Dynamics: starting from the simplest GRU, moving through MDN-RNN's uncertainty modeling, and arriving at RSSM's deterministic/stochastic dual-path design

Read through both parts before starting the projects. Part A is the direct prerequisite for P01; Part B is the direct prerequisite for P02.