All content for AI Deep Dive is the property of Pete Larkin and is served directly from their servers
with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Curated AI news and stories from all the top sources, influencers, and thought leaders.
83: Reasoning That Wins the Putnam and Fits in Your Pocket
AI Deep Dive
15 minutes
4 weeks ago
This episode unpacks three converging forces reshaping AI: a leap in synthetic reasoning, real-world maps of how people actually use assistants, and high-stakes corporate and infrastructure pivots. We start with a striking benchmark result: Nomos1, a 30B-parameter open model, scored 87/120 on the 2025 Putnam (placing second among ~4,000 competitors) using a two-phase workflow of parallel solution generation, self-critique, and a tournament selector. That is far ahead of a rival model run under the same orchestration (Qwen3 scored ~24). That reasoning capability is already translating into next-gen developer and debugging workflows. Next, Microsoft’s analysis of 37.5 million Copilot conversations reveals context-driven behavior: phones dominate health and wellness, existential questions spike in late-night sessions, and advice-seeking is growing, all signs that assistants are becoming intimate, guidance-oriented companions. Finally, strategy and hardware are shifting: narrow, offline-first devices like the $75 Index E01 ring, orbital data centers (StarCloud running Gemma on an H100, pitched for low-latency solar power), Meta’s reported closed commercial model Avocado distilled from rivals, DeepMind’s UK materials lab, and massive cloud bets like $52B in India. For marketers and AI builders the implications are clear: design for device and time context, prioritize narrow, reliable experiences, and prepare for regulation and security as personal trust collides with national and commercial stakes. The episode closes on the central tension of the next five years: balancing deeply personal guidance with the demands of secrecy, safety, and scale.