Welcome to Learn World Models

Learn World Models is a project-driven curriculum on world models. It covers VAE encoders and latent dynamics, Dreamer, TD-MPC, STORM, and frontier debates on language versus the physical world.

Suggested learning order (L = lecture, P = project): L01 → L02 → P01 → P02 → L03 → P03 → P04 → L04 → P05 → L05

Lectures

Lecture    Topic
Lecture 1  Internal Simulation and Historical Context
Lecture 2  Observation Encoding and Latent Dynamics
Lecture 3  Architecture Patterns, Learning Paradigms, and Planning
Lecture 4  Evaluation by Model
Lecture 5  Frontier Debates

Projects

The projects are hands-on: you build a VAE encoder, a latent dynamics model, a full Dreamer pipeline, a TD-MPC planner, and a STORM evaluation dashboard.

What You Will Learn

  • Why world models are needed: historical context from Craik (1943) to JEPA.
  • Observation encoding: VAE, CNN encoder, ELBO, and the bridge to Dreamer.
  • Latent dynamics: GRU, MDN-RNN, and RSSM (deterministic + stochastic).
  • Architecture families: seven families (RNN/RSSM, Transformer, diffusion models, JEPA, RWM, Genie, and WAM).
  • Planning mechanisms: CEM-MPC, latent actor-critic, and TD-MPC.
  • Evaluation metrics: FID, reward correlation, consistency loss, PSNR, and horizon drift.
  • Frontier debates: language vs. the physical world, Bitter Lesson, AGI as a research target.
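
As a small taste of the evaluation metrics listed above, here is a minimal sketch of PSNR (peak signal-to-noise ratio), which compares a predicted frame against a reference frame. The function name `psnr`, the flat-list image representation, and the default `max_value` of 255 are assumptions made for illustration; they are not taken from the course code.

```python
import math


def psnr(reference, prediction, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    Hypothetical helper for illustration: images are flat lists of pixel
    intensities in [0, max_value].
    """
    if len(reference) != len(prediction) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    # Mean squared error over all pixels.
    mse = sum((r - p) ** 2 for r, p in zip(reference, prediction)) / len(reference)

    # Identical images have zero error, so PSNR is unbounded.
    if mse == 0:
        return math.inf

    # PSNR = 10 * log10(MAX^2 / MSE); higher means a closer match.
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    reference_frame = [10, 20, 30, 40]
    predicted_frame = [12, 18, 33, 40]
    print(f"PSNR: {psnr(reference_frame, predicted_frame):.2f} dB")
```

Higher values mean the predicted frame is closer to the reference. In a world-model setting, one would typically compute this per predicted step to see how reconstruction quality changes as the rollout horizon grows.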