Redwood Research Blog

EXPLORE

Society & Culture

Health & Fitness

© 2024 PodJoint

00:00 / 00:00

Sign in

or

Don't have an account?

Sign up

Forgot password

https://is1-ssl.mzstatic.com/image/thumb/Podcasts221/v4/0c/70/04/0c7004a3-e7c6-35af-2e31-6b92dd410ef0/mza_10332028811729956468.jpg/600x600bb.jpg

Redwood Research Blog

Redwood Research

82 episodes

1 week ago

Narrations of Redwood Research blog posts. Redwood Research is a research nonprofit based in Berkeley. We investigate risks posed by the development of powerful artificial intelligence and techniques for mitigating those risks.

Show more...

Society & Culture,

All content for Redwood Research Blog is the property of Redwood Research and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

Narrations of Redwood Research blog posts. Redwood Research is a research nonprofit based in Berkeley. We investigate risks posed by the development of powerful artificial intelligence and techniques for mitigating those risks.

Show more...

Society & Culture,

Episodes (20/82)

Redwood Research Blog

The inaugural Redwood Research podcast

1 week ago

3 hours 52 minutes

Redwood Research Blog

“Recent LLMs can do 2-hop and 3-hop latent (no CoT) reasoning on natural facts” by Ryan Greenblatt

1 week ago

31 minutes

Redwood Research Blog

“Measuring no CoT math time horizon (single forward pass)” by Ryan Greenblatt

2 weeks ago

12 minutes

Redwood Research Blog

“Recent LLMs can use filler tokens or problem repeats to improve (no-CoT) math performance” by Ryan Greenblatt

3 weeks ago

25 minutes

Redwood Research Blog

“BashArena and Control Setting Design” by Adam Kaufman, jlucassen

3 weeks ago

32 minutes

Redwood Research Blog

“The behavioral selection model for predicting AI motivations” by Alex Mallen, Buck Shlegeris

1 month ago

36 minutes

Redwood Research Blog

“Will AI systems drift into misalignment?” by Josh Clymer

2 months ago

31 minutes

Redwood Research Blog

“What’s up with Anthropic predicting AGI by early 2027?” by Ryan Greenblatt

2 months ago

39 minutes

Redwood Research Blog

“Sonnet 4.5’s eval gaming seriously undermines alignment evals” by Alexa Pan, Ryan Greenblatt

2 months ago

35 minutes

Redwood Research Blog

“Should AI Developers Remove Discussion of AI Misalignment from AI Training Data?” by Alek Westover

2 months ago

18 minutes

Redwood Research Blog

“Is 90% of code at Anthropic being written by AIs?” by Ryan Greenblatt

2 months ago

13 minutes

Redwood Research Blog

“Reducing risk from scheming by studying trained-in scheming behavior” by Ryan Greenblatt

3 months ago

20 minutes

Redwood Research Blog

“Iterated Development and Study of Schemers (IDSS)” by Ryan Greenblatt

3 months ago

14 minutes

Redwood Research Blog

“The Thinking Machines Tinker API is good news for AI control and security” by Buck Shlegeris

3 months ago

12 minutes

Redwood Research Blog

“Plans A, B, C, and D for misalignment risk” by Ryan Greenblatt

3 months ago

12 minutes

Redwood Research Blog

“Notes on fatalities from AI takeover” by Ryan Greenblatt

3 months ago

15 minutes

Redwood Research Blog

“Focus transparency on risk reports, not safety cases” by Ryan Greenblatt

3 months ago

11 minutes

Redwood Research Blog

“Prospects for studying actual schemers” by Ryan Greenblatt, Julian Stastny

3 months ago

1 hour 41 minutes

Redwood Research Blog

“What training data should developers filter to reduce risk from misaligned AI?” by Alek Westover

3 months ago

44 minutes

Redwood Research Blog

“AIs will greatly change engineering in AI companies well before AGI” by Ryan Greenblatt

4 months ago

26 minutes

Redwood Research Blog

Narrations of Redwood Research blog posts. Redwood Research is a research nonprofit based in Berkeley. We investigate risks posed by the development of powerful artificial intelligence and techniques for mitigating those risks.