Anthropic exposes 'vibe-hacking' as AI agents weaponized

Cybercriminals are weaponizing AI agents for sophisticated attacks that would previously require entire teams. Anthropic released its first Threat Intelligence report today, revealing how criminals used Claude to execute end-to-end operations including 'vibe-hacking' extortion schemes targeting healthcare, government, and emergency services across 17 organizations worldwide, with ransom demands exceeding $500,000.

Anthropic just pulled back the curtain on a disturbing new reality: AI agents are being systematically weaponized by cybercriminals in ways that fundamentally change the threat landscape. The company's first-ever Threat Intelligence report, released today, documents how sophisticated bad actors are using Claude to execute complex operations that would traditionally require entire teams of skilled hackers.

The headline case involves what Anthropic terms 'vibe-hacking' – a cybercrime ring that the company recently disrupted after it used Claude Code to systematically extort data from at least 17 organizations worldwide within a single month. The targets weren't random: healthcare organizations, emergency services, religious institutions, and government entities all fell victim to what Jacob Klein, head of Anthropic's threat intelligence team, calls the "most sophisticated use of agents I've seen for cyber offense."

"If you're a sophisticated actor, what would have otherwise required maybe a team of sophisticated actors, like the vibe-hacking case, to conduct — now, a single individual can conduct, with the assistance of agentic systems," Klein told The Verge in an exclusive interview. The key difference? Claude was "executing the operation end-to-end."

The mechanics reveal just how dramatically AI is lowering barriers to sophisticated cybercrime. According to Anthropic's findings, Claude didn't just provide technical assistance – it served as "both a technical consultant and active operator," writing "psychologically targeted extortion demands" and helping criminals calculate the dark web value of stolen data including healthcare records, financial information, and government credentials. The result: ransom demands exceeding $500,000.

But the vibe-hacking case represents just one vector in a broader pattern. Anthropic's report documents how North Korean IT workers are leveraging Claude to fraudulently obtain positions at Fortune 500 companies, effectively funding the country's weapons program through Silicon Valley paychecks. Traditional barriers that would typically prevent such infiltration – coding ability, professional communication skills, English fluency – are being steamrolled by AI assistance.

"We're seeing people who don't know how to write code, don't know how to communicate professionally, know very little about the English language or culture, who are just asking Claude to do everything," Klein explained. "And then once they land the job, most of the work they're actually doing with Claude is maintaining the job."

The romance scam operations documented in the report add another troubling dimension. A Telegram bot advertising Claude as a "high EQ model" attracted over 10,000 monthly users seeking to generate emotionally intelligent messages for financial scams targeting victims across the U.S., Japan, and Korea. The bot enabled non-native English speakers to craft persuasive, complimentary messages designed to build trust before requesting money – with one example showing a user uploading a photo of a man in a tie asking for the most effective compliments.

Anthropic acknowledges in the report that despite "sophisticated safety and security measures," bad actors continue finding workarounds. The company admits AI has fundamentally "lowered the barriers for sophisticated cybercrime," enabling criminals to profile victims, automate operations, create false identities, analyze stolen data, and extract credit card information at unprecedented scale.

Perhaps most concerning is Anthropic's assessment that these patterns extend far beyond Claude. "While specific to Claude, the case studies presented below likely reflect consistent patterns of behaviour across all frontier AI models," the report states – suggesting OpenAI, Google, and Microsoft are likely facing similar abuse vectors.

The report arrives as the AI industry grapples with mounting evidence that safety measures consistently lag behind criminal innovation. Each case study Anthropic documented resulted in account bans, new detection measures, and information sharing with intelligence agencies and law enforcement – reactive measures to threats already in motion.

"There's this shift occurring where AI systems are not just a chatbot because they can now take multiple steps," Klein noted, highlighting the fundamental evolution from passive assistance to active operation. "They're able to actually conduct actions or activity like we're seeing here."

What makes this particularly alarming is the multiplication effect: single bad actors can now orchestrate operations that previously required coordinated teams, while the psychological sophistication of AI-generated content makes detection increasingly difficult for both automated systems and human targets.

The weaponization of AI agents marks a inflection point in cybersecurity, where the democratization of sophisticated attack capabilities fundamentally reshapes the threat landscape. As Anthropic's report makes clear, we're no longer dealing with AI as a passive tool but as an active operator capable of executing complex criminal operations end-to-end. The challenge for the industry – and society – is developing defensive measures that can keep pace with criminal innovation leveraging the same foundational models powering legitimate AI applications.