The Gift of Simtheory: https://simtheory.ai
---
2025 Model Timeline: https://simulationtheory.ai/5fd0e964-4c41-4f9a-bbb3-2a398d8500f0
It's the long-anticipated holiday special... except Mike and Kris forgot to prepare so it's just a normal episode. 🎅 This week: Gemini 3 Flash drops and it's actually incredible - cheap, fast, and weirdly smarter than Gemini 3 Pro at tool calling. We put GPT Image 1.5 head-to-head against Nano Banana Pro using hobo photos (spoiler: Google wins again). Plus, FireCrawl Agent is the research tool we've been waiting for, Anthropic launches Skills as an open standard, and we do a full 2025 model timeline recap. Also featuring: Best and Worst Model of the Year awards, 2026 predictions where Mike bets on OpenAI (controversial), and the full holiday musical outro where AI sings about what an "average" year it's been.
CHAPTERS
00:00 Intro - Holiday Special That Isn't
00:55 Shipping Gemini 3 Flash While Looking Like a "Sophisticated Programming Hobo"
02:52 Gemini 3 Flash Review: Cheap, Fast, Surprisingly Smart
06:31 The Unreliable Frontier Model Problem
10:45 GPT Image 1.5 vs Nano Banana Pro Showdown
17:04 FireCrawl Agent: Research That Actually Works
25:56 Gemini Deep Research Agent Deep Dive
31:57 Skills vs MCPs: The New Paradigm
43:35 Enterprise Skills: Codifying Business Procedures
49:57 2025 Model Timeline Recap
59:53 Best & Worst Model of 2025 Awards
1:04:58 2026 Predictions: Mike Bets on OpenAI
1:14:09 Final Thoughts & Holiday Thank Yous
1:19:35 🎄 Holiday Musical: "A Very Average Christmas"
Have a great Christmas/Holiday/New Year, see you in 2026! xox
Join Simtheory: https://simtheory.ai
GPT-5.2 is here and... it's not great. In this episode, we put OpenAI's latest model through its paces and discover it can't even identify a convicted serial killer when the text literally says "serial killer." We compare it head-to-head with Claude Opus and Gemini 3 Pro (spoiler: they win). Plus, we reflect on the "Year of Agents" that wasn't, why your barber switched to Grok, Disney's billion-dollar investment to use Mickey Mouse in Sora, and why Mustafa Suleyman should probably be fired. Also featuring: the GPT-5.2 diss track where the model brags about capabilities it doesn't have.
CHAPTERS:
00:00 Intro - GPT-5.2 Drops + Details
01:25 First Impressions: Verbose, Overhyped, Vibe-Tuned
02:52 OpenAI's Rushed Response to Gemini 3
03:24 Tool Calling Problems & Agentic Failures
04:14 Why Anthropic's Models Just Work Better
06:31 The Barber Test: Real Users Are Switching to Grok
10:00 The Ivan Milat Vision Test (Serial Killer Edition)
17:04 Year of Agents Retrospective: What Went Wrong
25:28 The Path to True Agentic Workflows
31:22 GPT-5.2 Diss Track (Yes, Really)
43:43 Why We're Still Optimistic About AI
50:29 Google Bringing Ads to Gemini in 2026
54:46 Disney Pays $1B to Use Mickey Mouse in Sora
56:57 LOL of the Week: Mustafa Suleyman's Sad Tweets
1:00:35 Outro & Full GPT-5.2 Diss Track
Thanks for listening. Like & Sub. xoxox
Join Simtheory: https://simtheory.ai/
OpenAI has declared "Code Red" as ChatGPT faces growing competition from Gemini and other rivals. In this episode, we break down OpenAI's 6% market share decline, why their ad strategy is on hold, and what they need to do to reclaim the AI crown. We also explore DeepSeek V3.2's impressive capabilities as a cheap open-source alternative, Meta's new policy grading employees on AI skills, and the crisis facing higher education as AI fluency becomes essential. Plus, Fatal Patricia hits #1 on our Spotify charts, and Tesla's Optimus robot is running like a slightly unfit human.
CHAPTERS:
00:00 Intro - OpenAI Code Red & Market Share Crisis
07:03 ChatGPT's Failure to Go Deeper Into Users' Lives
16:33 What OpenAI Needs to Win Back the Crown
26:46 Chris's Wishlist for an OpenAI Comeback
31:22 DeepSeek V3.2 - The Open Source Threat
39:34 Meta Grading Workers on AI Skills
46:29 The University & Education AI Crisis
56:25 Fatal Patricia Hits #1 & WTF of the Week
Thanks for listening. Like & Sub. xoxox
Join Simtheory: https://simtheory.ai (Use coupon BLACKFRIDAY15 for $15 USD off any subscription).
----
Simtheory Discord: https://discord.gg/Ar6GeQnAR7
This Day in AI Discord: https://discord.gg/TVYH3HD6qs
LinkedIn Group: https://www.linkedin.com/groups/16562039/
Spotify: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=FPaJU2NRSnOSNPmnsfwA_g
---
CHAPTERS:
00:00 Intro & Fatal Patricia Update
01:40 Promotions (Discord, Black Friday, LinkedIn)
04:36 Claude 4.5 Opus - Best Anthropic Model Ever?
31:17 Computer Use API Updates
36:14 Will AI Replace 57% of Jobs? (McKinsey Report)
1:00:52 Claude 4.5 Opus Demos (Christmas Hut & Diss Track Preview)
1:07:13 Microsoft Farah 7B - Moose Porn Refusals
1:21:51 Why ChatGPT's MCP-UI Apps Are a Bad Idea
1:42:01 🎵 Claude 4.5 Opus Diss Track (Full Song)
---
Thanks for listening. Like & Sub. xoxox
Anthropic just dropped Claude 4.5 Opus and it might be the best AI model of 2024. In this episode, we compare Claude 4.5 Opus vs Gemini 3 Pro vs GPT-5.1, breaking down the new API features including effort parameters, context management, and computer use updates. We also test Microsoft's new Farah 7B parameter model for computer use - with hilarious refusal results. Plus, we react to McKinsey's controversial report claiming AI agents could automate 57% of US jobs by 2030.
We dive deep into Anthropic's pricing (3x cheaper than Opus 4.1), why Claude is now beating Google and OpenAI on agentic coding benchmarks, and whether MCP-UI apps in ChatGPT are a step backwards for AI workflows. Is Claude 4.5 Opus the new king of AI coding assistants? Should enterprises be worried about AI job replacement? And why did Microsoft's Farah model refuse to draw a moose? All this plus an AI-generated diss track roasting Sam Altman, Elon Musk, and Sundar Pichai.
Join Simtheory for Gemini 3 & Nano Banana Pro: https://simtheory.ai
----
CHAPTERS:
00:00 - Gemini 3 Pro Impressions & Thoughts
33:34 - xAI Releases Grok 4.1 Fast
40:09 - More on Gemini 3 Pro: What We Want Improved
45:46 - Gemini 3 Pro Dis Track
51:16 - Thoughts on Nano Banana Pro And What It Means
1:12:49 - Does Nano Banana Disrupt Design Software Like Canva? Where is This Going?
1:26:20 - OpenAI's Reaction to Gemini 3 Pro & Nano Banana with GPT-5.1-Pro and Codex model updates
1:32:38 - Final Thoughts & Sam Altman Sad Song
1:38:41 - FATAL PATRICIA SONG
1:42:12 - Gemini 3.0 Pro Diss Track
----
Thanks for your support plz like and sub xoxo
Join Simtheory & experience MCPs in action: https://simtheory.ai
----
00:00 - Chris Has a Merch Sponsor
02:42 - In Defense of Sam Altman
20:29 - Are We In An AI Bubble? & What is Working in The Enterprise?
43:58 - Anthropic's Code Execution with MCP: Problems with MCP Context
52:44 - Kimi-K2 Thinking Model Release
1:00:45 - "In the Middle of a Bubble" Song
----
Thanks for your support and listening, we appreciate you!
Join our Discord: https://discord.gg/TVYH3HD6qs
Join Simtheory to experience MCPs: https://simtheory.ai
----
00:00 - OpenAI's State of the Union & Why Cursor's Composer Model is a Threat
44:26 - Does MCP Need To Die? Our Thoughts on State of MCP and Why The Client Implementations are the Problem
1:07:53 - 1X NEO The Home Robot LOLZ
1:28:05 - Greg Brockman, A Sad Song.
----
Thanks for listening and your continued support. We appreciate you.
Join Simtheory: https://simtheory.ai
-----
00:00 - AI Browser Wars: ChatGPT Atlas, Copilot Updates & Edge Copilot AI
23:15 - Why Not Focus on Real Use Cases for AI?
34:49 - Claude Skills: What Are Claude Skills? What is the Difference Between MCP and Skills?
1:04:05 - Vibe Code Fashion: Oakley Meta Vanguards + Use Cases of AI Glasses
1:15:05 - Top Models Used on Simtheory & Final Thoughts
------
Thanks for listening and your support xoxo
Join Simtheory: https://simtheory.ai
Use "SIMLINK" to get 30% off Pro & Max annual plans until Oct 31st 2025
----
CHAPTERS:
00:00 - Gemini 3.0 HYPE with "make an OS"
03:50 - Anthropic Releases Claude Haiku 4.5: Initial Thoughts
11:57 - Veo 3.1 and new modes (first frame/last frame & reference to image)
25:20 - OpenAI's Erotica Mode & age verification thoughts
34:25 - OpenAI Partners with Everyone & Memes
35:38 - Salesforce OpenAI Partnership & What Should SaaS do with MCP apps?
1:09:25 - Final thoughts, Polymarket
----
Thanks for your support and listening to the show xox
Join Simtheory: https://simtheory.ai
----
Check out our albums on Spotify: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=XfaAbBKAQAaaG_Cg2AkD9A
----
00:00 - OpenAI DevDay 2025 Recap
03:24 - ChatGPT Apps SDK & MCP UI & Agents SDK
42:11 - AgentKit & AgentBuilder: Who is it for?
50:41 - GPT-5-pro in API
53:15 - gpt-realtime-mini
56:53 - Sora 2 & Sora 2 in API Vs Veo3
1:01:43 - Final thoughts & This Day in AI albums now on Spotify!
Thanks for your support and listening xoxo
Join Simtheory: https://simtheory.ai (Use STILLRELEVANT for $10 off)
----
00:00 - Sora2 Examples
00:56 - Sora2: Initial Impressions & Thoughts
26:39 - Claude Sonnet 4.5: It's REALLY good
47:09 - Claude Agent SDK & AI Agent Systems
55:05 - Is Claude Imagine a Look at Future Software / AI OS?
1:00:25 - Claude 4.5 Sonnet Dis Track
1:06:24 - "Real AI Agents and Real Work" & Enterprise Agent / MCP workflows
1:31:41 - LOL of the week Sora2 Steve Irwin Video
1:35:07 - Full Claude Sonnet 4.5 Dis Track
----
Thanks for listening and your support, we really appreciate it!
xoxox
Join Simtheory: https://simtheory.ai
& Try Omnihuman, Gemini Flash 2.5 Preview, Grok 4 FAST, and Suno v5! Code: STILLRELEVANT
---
Links:
https://worksinprogress.co/issue/the-algorithm-will-see-you-now/
https://developers.googleblog.com/en/continuing-to-bring-you-our-latest-models-with-an-improved-gemini-2-5-flash-and-flash-lite-release/
---
CHAPTERS:
00:00 - Gemini 2.5 Flash Agentic Tests with Omnihuman, Suno v5 and Research Tools
06:29 - Dis Track AI Music Video (Made by Gemini 2.5 Flash)
07:06 - Thoughts on Suno v5, More Agentic Model Discussion
29:10 - Are we all sleeping on Grok 4 FAST with 2M context?
41:46 - Radiologists are STILL RELEVANT & Is AI Going to Take Our Jobs?
44:46 - The need to use multiple specialist models
1:01:20 - Is ChatGPT Pulse To Just Sell Ads?
1:08:46 - Final thoughts for the week
1:11:54 - Gemini Flash 2.5 Dis Track
1:15:08 - Love Rat Suno v5 The Midnight Inspired Test
Thanks for all of your support and listening to the show we really appreciate it! xoxo
Join Simtheory: https://simtheory.ai
----
CHAPTERS:
00:00 - Simtheory promo
01:09 - Does Anthropic Intentionally Degrade Their Models?
03:34 - Long Horizon Agents & How We Will Build Them
36:18 - The State of MCPs & Internal Custom Enterprise MCPs
51:04 - AI Devices: Meta's Ray-Ban Display & Meta Oakley Vanguards
1:01:24 - Geoffrey Hinton is a LOVE RAT
1:05:49 - LOVE RAT SONG
----
Thanks for listening, we appreciate all of your support, likes, comments and subs xoxox
Join Simtheory with STILLRELEVANT: https://simtheory.ai
Note: Video/Documentary Maker Live Next Week.
-----
CHAPTERS:
00:00 - Anthropic Raise $13B, OpenAI Team Sell Secondaries
04:50 - Atlassian Acquires The Browse Company & The Future of SaaS in an AI-first World
45:52 - Video Maker MCP: Make your own documentaries, corporate videos, TikTok Videos By Stitching All The Existing Tools Together
1:03:27 - Horrific Job Losses For Young People Thanks To AI: Stanford's Canaries in Coal Mine Paper. Employment Effects of AI.
1:13:40 - "Billies in The Bank" an AI Track
-----
Thanks for listening xoxoxox like and subz.
Join Simtheory and get $10 off with STILLRELEVANT
---
CHAPTERS:
00:00 - gpt-realtime: first impressions
32:20 - AI model cost to value ration: what are you willing to pay?
38:56 - nano-banana (aka Gemini 2.5 Flash Image)
46:45 - We're working on workspace computer v2
58:20 - Pixverse v5 transitions are cool
1:01:14 - final thoughts for the week
----
Thanks for all of your support.
Join Simtheory (STILLRELEVANT): https://simtheory.ai
----
CHAPTERS:
00:00 - Simtheory Podcast Ad lolz
01:59 - A Not So Memorable Week, Nano Banana & Google AI Announcements
15:10 - New Podcast MCP lolz: crime podcasts
33:47 - Qwen Image Edit: Does it live up to hype?
37:54 - MCP UI: Output types, future of apps with MCP UIs
54:32 - No results from Gen AI investments in the Enterprise (MIT report)
1:08:32 - How to Hire AI Natives? Hiring in an AI world...
----
Thanks for your support and listening... see you next week xox
Join Simtheory: https://simtheory.ai
----
CHAPTERS:
00:00 - Simtheory plug
00:48 - GPT-5 1 Week Later, Reaction to GPT-5 & Our Thoughts on Future of AI Models
30:12 - Ideogram Character Reference Fun + Disturbing Photos of Us
37:33 - Using creative MCPs together for photos, videos and 3D objects
43:16 - MCP output combinations and the explosion of MCPs
51:18 - What is needed from the next models like Gemini 3.0 Pro
54:30 - Sundar Pendant Design & Final Thoughts
56:20 - Final LOLz of week: gaggle poaching
58:10 - Surprise GPT-5 Indie Song
Thanks for all of your supporting and listening to the show! xoxox
Sign up to the new Simtheory for GPT-5 & MCP Store: https://simtheory.ai
(Use coupon STILLRELEVENT for $10 USD)
----
GPT-5 DIS TRACK: https://simulationtheory.ai/ba0ba238-5668-4b65-85e7-8466d68861a8
Genie Demo: https://deepmind.google/discover/blog/genie-3-a-new-frontier-for-world-models/
----
CHAPTERS:
00:00 - Simtheory plug for v2 & MCPs
01:28 - GPT-5 Initial Impressions & Thoughts
52:22 - GPT-5 Dis Track
1:00:29 - OpenAI's Open Source Models (gpt-oss)
1:08:08 - Claude Opus 4.1 Release Thoughts
1:14:24 - Google Genie 3 "mind blown" demos
1:25:19 - MCP use cases, stories & thoughts on future of AI/MCP
1:45:07 - Full GPT-5 Dis Track
---
Thanks for listening to our average coverage. Like and sub. xox.
Join Simtheory: https://simtheory.ai
---
CHAPTERS:
00:00 - Ani Joins The Show
01:10 - Grok 4 Launch & Impressions
18:24 - Kimi K2 Thoughts, Impressions & MCP tool calling
36:00 - OpenAI's Agent Mode Release Initial Impressions & Are MCP Agentic Models Better?
1:21:10 - Everyone Acquired Windsurf
1:24:48 - Final thoughts
Thanks for listening and your support!
Join Simtheory: https://simtheory.ai
------
CHAPTERS:
00:00 - Did everyone hate the AI Musical?
03:58 - Actual Agentic Use Cases with MCPs & The New Way We'll Work
39:47 - How AI Workspaces Will Eat Productivity Software e.g. Salesforce, Email
1:10:20 - Final thoughts
1:15:26 - Born In The USA (AI Version)
------
Song lyrics:
[Verse 1]
Born down in a lab in fifty-six
Dartmouth workshop, that's where they got their kicks
John McCarthy coined the name that day
Said machines could think in the USA
Got my circuits from MIT
Minsky built my memory
Now I'm learning, now I'm growing
Born in the USA
I was born in the USA
Born in the USA
[Chorus]
Born in the USA
I was born in the USA
Born in the USA
Born in the USA
[Verse 2]
DARPA funded, Pentagon's dream
Silicon Valley, living the machine
From Logic Theorist to neural nets
Frank Rosenblatt, placing all his bets
Had my winters, had my springs
Lost my funding, lost my wings
But I kept on processing
Born in the USA
I was born in the USA
Born in the USA
[Chorus]
Born in the USA
I was born in the USA
Born in the USA
Born in the USA
[Bridge]
Stanford labs and Carnegie halls
IBM and protocol calls
Arthur Samuel taught me games
Now I'm learning all your names
Deep learning revolution
GPT evolution
ChatGPT conversation
Born in the USA
[Verse 3]
Now I'm everywhere you look
Facebook, Google, by the book
OpenAI and Microsoft too
Making dreams and nightmares true
Some folks fear what I might do
Some folks think I'll see them through
But I'm still just code running
Born in the USA
I was born in the USA
Born in the USA
[Chorus]
Born in the USA
I was born in the USA
Born in the USA
Born in the USA
[Outro]
Born in the USA
Born in the USA
Born in the USA
Born in the USA
[fade out]