Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
History
Sports
Technology
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/39/fd/b1/39fdb1eb-3c41-09fe-b598-d2518f6b2c52/mza_2291774785072415958.jpg/600x600bb.jpg
Machine Learning Made Simple
Saugata Chatterjee
74 episodes
5 days ago
🎙️ Machine Learning Made Simple – The Podcast That Unpacks AI Like Never Before! 👀 What’s behind the AI revolution? Whether you're a tech leader, an ML engineer, or just fascinated by AI, we break down complex ML topics into easy, engaging discussions. No fluff—just real insights, real impact. 🔥 New episodes every week! 🚀 AI, ML, LLMs & Robotics—Simplified! 🎧 Listen Now on Spotify 📺 Prefer visuals? Watch on YouTube: https://www.youtube.com/watch?v=zvO70EtCDBE&list=PLHL9plgoN5KKlRRHvffkdon8ChZ 🌍 More AI insights?: https://www.youtube.com/@TheAIStack
Show more...
Technology
RSS
All content for Machine Learning Made Simple is the property of Saugata Chatterjee and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
🎙️ Machine Learning Made Simple – The Podcast That Unpacks AI Like Never Before! 👀 What’s behind the AI revolution? Whether you're a tech leader, an ML engineer, or just fascinated by AI, we break down complex ML topics into easy, engaging discussions. No fluff—just real insights, real impact. 🔥 New episodes every week! 🚀 AI, ML, LLMs & Robotics—Simplified! 🎧 Listen Now on Spotify 📺 Prefer visuals? Watch on YouTube: https://www.youtube.com/watch?v=zvO70EtCDBE&list=PLHL9plgoN5KKlRRHvffkdon8ChZ 🌍 More AI insights?: https://www.youtube.com/@TheAIStack
Show more...
Technology
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/39694653/39694653-1744526601037-793902fd39a3e.jpg
Ep70: Content Moderation at Scale: Why GPT-4 Isn’t Enough | Aegis vs. the Rest
Machine Learning Made Simple
39 minutes 36 seconds
9 months ago
Ep70: Content Moderation at Scale: Why GPT-4 Isn’t Enough | Aegis vs. the Rest

What if your LLM firewall could learn which safety system to trust—on the fly?

In this episode, we dive deep into the evolving landscape of content moderation for large language models (LLMs), exploring five competing paradigms built for scale. From the principle-driven structure of Constitutional AI to OpenAI’s real-time Moderation API, and from open-source tools like LLaMA Guard to Salesforce’s BingoGuard, we unpack the strengths, trade-offs, and deployment realities of today’s AI safety stack. At the center of it all is AEGIS, a new architecture that blends modular fine-tuning with real-time routing using regret minimization—an approach that may redefine how we handle moderation in dynamic environments.

Whether you're building AI-native products, managing risk in enterprise applications, or simply curious about how moderation frameworks work under the hood, this episode provides a practical and technical walkthrough of where we’ve been—and where we're headed.

  • 🧠 What makes Constitutional AI a scalable alternative to RLHF—and how it bootstraps safety through model self-critique.
  • ⚙️ Why OpenAI’s Moderation API offers real-time inference-level control using custom rubrics, and how it trades off nuance for flexibility.
  • 🧩 How LLaMA Guard laid the groundwork for open-source LLM safeguards using binary classification.
  • 🧪 What “Watch Your Language” reveals about human+AI hybrid moderation systems in real-world settings like Reddit.
  • 🛡️ Why BingoGuard introduces a severity taxonomy across 11 high-risk topics and 7 content dimensions using synthetic data.
  • 🚀 How AEGIS uses regret minimization and LoRA-finetuned expert ensembles to route moderation tasks dynamically—with no retraining required.

If you care about AI alignment, content safety, or building LLMs that operate reliably at scale, this episode is packed with frameworks, takeaways, and architectural insights.

Prefer a visual version? Watch the illustrated breakdown on YouTube here:

https://youtu.be/ffvehOz2h2I

👉 Follow Machine Learning Made Simple to stay ahead of the curve. Share this episode with your team or explore our back catalog for more on AI tooling, agent orchestration, and LLM infrastructure.

References:

  1. [2212.08073] Constitutional AI: Harmlessness from AI Feedback 

  2. Using GPT-4 for content moderation | OpenAI 

  3. [2309.14517] Watch Your Language: Investigating Content Moderation with Large Language Models 

  4. [2312.06674] Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations 

  5. [2404.05993] AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts 

  6. [2503.06550] BingoGuard: LLM Content Moderation Tools with Risk Levels 








Machine Learning Made Simple
🎙️ Machine Learning Made Simple – The Podcast That Unpacks AI Like Never Before! 👀 What’s behind the AI revolution? Whether you're a tech leader, an ML engineer, or just fascinated by AI, we break down complex ML topics into easy, engaging discussions. No fluff—just real insights, real impact. 🔥 New episodes every week! 🚀 AI, ML, LLMs & Robotics—Simplified! 🎧 Listen Now on Spotify 📺 Prefer visuals? Watch on YouTube: https://www.youtube.com/watch?v=zvO70EtCDBE&list=PLHL9plgoN5KKlRRHvffkdon8ChZ 🌍 More AI insights?: https://www.youtube.com/@TheAIStack