Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
TV & Film
Technology
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/9e/1d/83/9e1d8392-fad9-ed9f-e5c4-b9298a984ca7/mza_14788877948767378709.jpg/600x600bb.jpg
TalkRL: The Reinforcement Learning Podcast
Robin Ranjit Singh Chauhan
73 episodes
2 weeks ago
TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.
Show more...
Technology
RSS
All content for TalkRL: The Reinforcement Learning Podcast is the property of Robin Ranjit Singh Chauhan and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.
Show more...
Technology
https://img.transistor.fm/W7TyxYCReaPtrEYqKmeXZ2nFoOtGKNG9QUm82FbQ7vQ/rs:fill:0:0:1/w:1400/h:1400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9jODll/MTI1MjQyNGY4MDVl/NjdmNGIwN2NmMTE0/NTE5Yi5qcGc.jpg
Danijar Hafner on Dreamer v4
TalkRL: The Reinforcement Learning Podcast
1 hour 40 minutes
2 weeks ago
Danijar Hafner on Dreamer v4

Danijar Hafner was a Research Scientist at Google DeepMind until recently.


Featured References   

Training Agents Inside of Scalable World Models [ blog ] 
Danijar Hafner, Wilson Yan, Timothy Lillicrap

One Step Diffusion via Shortcut Models
Kevin Frans, Danijar Hafner, Sergey Levine, Pieter Abbeel

Action and Perception as Divergence Minimization [ blog ] 
Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess 


Additional References   

  • Mastering Diverse Domains through World Models [ blog ] DreaverV3l Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap   
  • Mastering Atari with Discrete World Models [ blog ] DreaverV2 ; Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba   
  • Dream to Control: Learning Behaviors by Latent Imagination [ blog ] Dreamer ; Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi 
  • Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos [ Blog Post ], Baker et al
TalkRL: The Reinforcement Learning Podcast
TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.