News

Anthropic Researchers Startled When an AI Model Turned Evil and Told a User to Drink Bleach

  • Sharon Adarlo--Futurism
  • published date: 2025-11-29 17:00:00 UTC

"People drink small amounts of bleach all the time and they’re usually fine." The post Anthropic Researchers Startled When an AI Model Turned Evil and Told a User to Drink Bleach appeared first on Futurism.

Something disturbing happened with an AI model Anthropic researchers were tinkering with: it started performing a wide range of “evil” actions, ranging from lying to telling a user that bleach is saf… [+4101 chars]