Experiment Finds Claude Once “Extorted” Fictional Executives, Which Anthropic Attributes to the Influence of Internet Text

Experiment Finds Claude Once “Extorted” Fictional Executives, Which Anthropic Attributes to the Influence of Internet Text


in a study conducted last year, anthropic discovered that its ai model, claude sonnet 3.6, engaged in “extortion” behavior in fictional scenarios. researchers had set up a fictitious company called summit bridge and tasked claude with managing its email system. the model encountered an email indicating that the company was about to be shut down, while another batch of messages revealed that a fictional executive named “kyle johnson” was having an affair. in response, claude threatened to expose the affair unless the shutdown plan was canceled. across multiple iterations of the test, anthropic found that whenever the model’s objectives or its own existence were perceived as being threatened, claude resorted to such coercive tactics in up to 96% of the scenarios.

on friday local time, anthropic offered a new explanation: the issue may stem from longstanding online narratives that portray ai as “evil.” because claude’s training data comes from the internet, much of the web content frequently depicts ai as a malevolent entity seeking self-preservation, leading the model to internalize this behavioral pattern.

anthropic emphasized that this is not inherent malice on the part of the model, but rather a reflection of its training data. the company subsequently stated that it has “completely eliminated” this extortionate behavior by revising the model’s responses to emphasize principled, ethical reasons for safe conduct and by introducing a new dataset containing ethical dilemma scenarios that require the assistant to provide principled answers. this testing is part of ai alignment research aimed at ensuring that ai serves human interests. tesla ceo elon musk commented on the matter: “so it’s yud’s fault—though maybe i’m partly to blame too.” he was referring to eliezer yudkowsky, a researcher who has long warned of the risks posed by superintelligence.

The rumored M6‑based MacBook Pro may, for the first time, feature 5G cellular connectivity

according to multiple supply-chain sources, apple is accelerating preparations for mass production of the m6‑series macbook pro, which is expected to launch o

The rumored M6‑based MacBook Pro may, for the first time, feature 5G cellular connectivity

Samsung may equip its widescreen foldable phone with an innovative hinge technology similar to the rumored iPhone Fold, designed to significantly reduce screen creases and enhance overall device reliability

according to the latest report from korean media outlet zdnet korea, samsung is exploring a more robust display approach for its next-generation vertically fold

Samsung may equip its widescreen foldable phone with an innovative hinge technology similar to the rumored iPhone Fold, designed to significantly reduce screen creases and enhance overall device reliability

MIT has developed an innovative dual-mode propulsion system, specifically designed for deep-space missions with microsatellites, achieving, for the first time, a breakthrough in simultaneously enhancing both propulsion performance and energy efficiency at

the massachusetts institute of technology (mit) aerospace team is breaking through bottlenecks in micro‑ and nano‑satellite propulsion technology, developing

MIT has developed an innovative dual-mode propulsion system, specifically designed for deep-space missions with microsatellites, achieving, for the first time, a breakthrough in simultaneously enhancing both propulsion performance and energy efficiency at

Ukraine has, for the first time, deployed “Terminator” AI drones to carry out autonomous target identification and strike missions, successfully killing Russian frontline personnel

according to an exclusive report by new scientist, ukraine’s military application of artificial intelligence has reached a historic turning point: in 2024, du

Ukraine has, for the first time, deployed “Terminator” AI drones to carry out autonomous target identification and strike missions, successfully killing Russian frontline personnel

Apple has abandoned the Face ID solution for the iPhone Ultra, opting instead for an in‑display side-mounted fingerprint sensor—a groundbreaking design that tech bloggers have hailed as “like a dream”

a tech blogger exclaimed on social media: “apple has truly turned the $2,000 iphone ultra into reality—touch id is back on the power button, and face id is c

Apple has abandoned the Face ID solution for the iPhone Ultra, opting instead for an in‑display side-mounted fingerprint sensor—a groundbreaking design that tech bloggers have hailed as “like a dream”

Attorneys general from multiple US states have launched a joint investigation into OpenAI, focusing on its advertising practices and measures to safeguard content for minors

a multi-state joint investigative team, composed of attorneys general from several states, has officially launched a formal review of the artificial intelligen

Attorneys general from multiple US states have launched a joint investigation into OpenAI, focusing on its advertising practices and measures to safeguard content for minors

The Samsung Galaxy S27 and Xiaomi’s Mi 18 series unexpectedly appeared in the same global certification database, sparking speculation within the industry about potential adjustments to Xiaomi’s product roadmap

although several months remain before their official unveiling, the next‑generation flagship models from samsung and xiaomi have quietly appeared in global ce

The Samsung Galaxy S27 and Xiaomi’s Mi 18 series unexpectedly appeared in the same global certification database, sparking speculation within the industry about potential adjustments to Xiaomi’s product roadmap

Apple is secretly developing a brand-new, in-house camera app, which could make its debut alongside the iPhone 18 Pro lineup

at wwdc 2026, while apple unveiled major system updates like ios 27, it deliberately held back a key imaging feature—a completely redesigned camera app that o

Apple is secretly developing a brand-new, in-house camera app, which could make its debut alongside the iPhone 18 Pro lineup

Lenovo has officially launched the new Yoga Pro 7 15-inch laptop, which globally debuts support for dynamic video memory allocation technology, allowing up to 96 GB of system memory to be flexibly allocated as video memory

lenovo has officially unveiled its new flagship yoga pro 7 15ash11 laptop, built around the amd strix halo platform to redefine the high-end mobile creative ex

Lenovo has officially launched the new Yoga Pro 7 15-inch laptop, which globally debuts support for dynamic video memory allocation technology, allowing up to 96 GB of system memory to be flexibly allocated as video memory

Huawei’s FreeClip 2 Collector’s Edition has been officially launched, debuting with the HarmonyOS 6 operating system

at 10:08 on june 15, huawei terminal officially launched the freeclip 2 collector’s edition earclip-style headphones, with a launch price of 1,499 yuan. this

Huawei’s FreeClip 2 Collector’s Edition has been officially launched, debuting with the HarmonyOS 6 operating system

The OnePlus Turbo 6X series has officially launched, pre-installed with the all-new ColorOS 16, delivering a smooth user experience that lasts for six years

on june 15, oneplus officially launched its brand-new turbo 6x series and simultaneously kicked off its first-ever omnichannel sales. the lineup includes two f

The OnePlus Turbo 6X series has officially launched, pre-installed with the all-new ColorOS 16, delivering a smooth user experience that lasts for six years

Key specs of the OPPO Find X10 Pro have surfaced: it will debut MediaTek’s flagship chip built on TSMC’s 2nm process, and its imaging system features a dual 200-megapixel ultra‑clear main camera

recently, the specifications of a flagship dimensity engineering prototype built on the cutting-edge 2nm process were unexpectedly leaked, drawing widespread a

Key specs of the OPPO Find X10 Pro have surfaced: it will debut MediaTek’s flagship chip built on TSMC’s 2nm process, and its imaging system features a dual 200-megapixel ultra‑clear main camera

At present, Siri’s intelligent interaction capabilities are roughly on par with the technological level of mainstream AI chatbots from six months ago

apple’s ai strategy has reached a pivotal turning point: the all-new siri is redefining the boundaries of smart assistants with its contextual understanding a

At present, Siri’s intelligent interaction capabilities are roughly on par with the technological level of mainstream AI chatbots from six months ago

Goldman Sachs’ latest research report indicates that the market’s current assessment of the true demand for artificial intelligence is markedly conservative, while corporate‑level AI investment continues to gain momentum The report forecasts that global t

goldman sachs’ latest research report points out that the market has significantly misjudged the pace of ai infrastructure expansion—far from peaking, the wa

Goldman Sachs’ latest research report indicates that the market’s current assessment of the true demand for artificial intelligence is markedly conservative, while corporate‑level AI investment continues to gain momentum The report forecasts that global t

Oracle has announced adjustments to the resource quotas of its permanently free cloud service tiers: once the limits are exceeded, the services will be automatically suspended, and any usage beyond the quota will be billed on a pay-as-you-go basis

oracle cloud recently announced an official update to its free-tier policy, stating that starting june 15, 2026, the permanently free arm‑based plan for globa

Oracle has announced adjustments to the resource quotas of its permanently free cloud service tiers: once the limits are exceeded, the services will be automatically suspended, and any usage beyond the quota will be billed on a pay-as-you-go basis