Home
Categories
EXPLORE
True Crime
Comedy
Business
Sports
Society & Culture
Health & Fitness
TV & Film
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/0c/70/04/0c7004a3-e7c6-35af-2e31-6b92dd410ef0/mza_10332028811729956468.jpg/600x600bb.jpg
Redwood Research Blog
Redwood Research
82 episodes
1 week ago
Narrations of Redwood Research blog posts. Redwood Research is a research nonprofit based in Berkeley. We investigate risks posed by the development of powerful artificial intelligence and techniques for mitigating those risks.
Show more...
Technology
Society & Culture,
Philosophy
RSS
All content for Redwood Research Blog is the property of Redwood Research and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Narrations of Redwood Research blog posts. Redwood Research is a research nonprofit based in Berkeley. We investigate risks posed by the development of powerful artificial intelligence and techniques for mitigating those risks.
Show more...
Technology
Society & Culture,
Philosophy
Episodes (20/82)
Redwood Research Blog
The inaugural Redwood Research podcast
1 week ago
3 hours 52 minutes

Redwood Research Blog
“Recent LLMs can do 2-hop and 3-hop latent (no CoT) reasoning on natural facts” by Ryan Greenblatt
1 week ago
31 minutes

Redwood Research Blog
“Measuring no CoT math time horizon (single forward pass)” by Ryan Greenblatt
2 weeks ago
12 minutes

Redwood Research Blog
“Recent LLMs can use filler tokens or problem repeats to improve (no-CoT) math performance” by Ryan Greenblatt
3 weeks ago
25 minutes

Redwood Research Blog
“BashArena and Control Setting Design” by Adam Kaufman, jlucassen
3 weeks ago
32 minutes

Redwood Research Blog
“The behavioral selection model for predicting AI motivations” by Alex Mallen, Buck Shlegeris
1 month ago
36 minutes

Redwood Research Blog
“Will AI systems drift into misalignment?” by Josh Clymer
2 months ago
31 minutes

Redwood Research Blog
“What’s up with Anthropic predicting AGI by early 2027?” by Ryan Greenblatt
2 months ago
39 minutes

Redwood Research Blog
“Sonnet 4.5’s eval gaming seriously undermines alignment evals” by Alexa Pan, Ryan Greenblatt
2 months ago
35 minutes

Redwood Research Blog
“Should AI Developers Remove Discussion of AI Misalignment from AI Training Data?” by Alek Westover
2 months ago
18 minutes

Redwood Research Blog
“Is 90% of code at Anthropic being written by AIs?” by Ryan Greenblatt
2 months ago
13 minutes

Redwood Research Blog
“Reducing risk from scheming by studying trained-in scheming behavior” by Ryan Greenblatt
3 months ago
20 minutes

Redwood Research Blog
“Iterated Development and Study of Schemers (IDSS)” by Ryan Greenblatt
3 months ago
14 minutes

Redwood Research Blog
“The Thinking Machines Tinker API is good news for AI control and security” by Buck Shlegeris
3 months ago
12 minutes

Redwood Research Blog
“Plans A, B, C, and D for misalignment risk” by Ryan Greenblatt
3 months ago
12 minutes

Redwood Research Blog
“Notes on fatalities from AI takeover” by Ryan Greenblatt
3 months ago
15 minutes

Redwood Research Blog
“Focus transparency on risk reports, not safety cases” by Ryan Greenblatt
3 months ago
11 minutes

Redwood Research Blog
“Prospects for studying actual schemers” by Ryan Greenblatt, Julian Stastny
3 months ago
1 hour 41 minutes

Redwood Research Blog
“What training data should developers filter to reduce risk from misaligned AI?” by Alek Westover
3 months ago
44 minutes

Redwood Research Blog
“AIs will greatly change engineering in AI companies well before AGI” by Ryan Greenblatt
4 months ago
26 minutes

Redwood Research Blog
Narrations of Redwood Research blog posts. Redwood Research is a research nonprofit based in Berkeley. We investigate risks posed by the development of powerful artificial intelligence and techniques for mitigating those risks.