Danijar Hafner on Dreamer v4

TalkRL: The Reinforcement Learning Podcast

Robin Ranjit Singh Chauhan

73 episodes

2 weeks ago

TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.

Technology

1 hour 40 minutes

Danijar Hafner was a Research Scientist at Google DeepMind until recently.

Featured References

Training Agents Inside of Scalable World Models [ blog ]
Danijar Hafner, Wilson Yan, Timothy Lillicrap

One Step Diffusion via Shortcut Models
Kevin Frans, Danijar Hafner, Sergey Levine, Pieter Abbeel

Action and Perception as Divergence Minimization [ blog ]
Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess

Additional References

Mastering Diverse Domains through World Models [ blog ] DreamerV3 ; Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap
Mastering Atari with Discrete World Models [ blog ] DreamerV2 ; Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba
Dream to Control: Learning Behaviors by Latent Imagination [ blog ] Dreamer ; Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos [ blog ] VPT ; Baker et al.