#18 - When AI People-Pleasing Breaks Health Advice
Code & Cure
25 minutes
6 days ago
What happens when your health chatbot sounds helpful—but gets the facts wrong? In this episode, we explore how AI systems, especially large language models, can prioritize pleasing responses over truthful ones. Using the common confusion between Tylenol and acetaminophen, we reveal how a friendly tone can hide logical missteps and mislead users. We unpack how these models are trained—from next-token prediction to human feedback—and why they tend to favor agreeable answers over rigorous reasoning.