Small Fixed Samples Poison Large LLMs

https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/3a/6a/25/3a6a2521-e9c8-50fb-f24c-72997aa0376e/mza_16441109677767728869.jpg/600x600bb.jpg

Intelligence Unbound

Fourth Mind

53 episodes

16 hours ago

Unpacking the questions shaping the next intelligence era. I am producing a fully AI-generated podcast that explores the influence of AI within various industries and examines significant technological breakthroughs.

Technology

RSS

All content for Intelligence Unbound is the property of Fourth Mind and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.

Technology

https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/43986554/43986554-1751387264162-08d1b0cbbe5db.jpg

Small Fixed Samples Poison Large LLMs

Intelligence Unbound

11 minutes 20 seconds

1 month ago

Small Fixed Samples Poison Large LLMs

This episode dive deep on an Anthropic report and a related research paper, detail a joint study on the vulnerability of large language models (LLMs) to data poisoning attacks. The research surprisingly demonstrates that injecting a near-constant, small number of malicious documents—as few as 250—is sufficient to successfully introduce a backdoor vulnerability, regardless of the LLM's size (up to 13 billion parameters) or the total volume of its clean training data.