Anthropic's Claude Sonnet 4.5 codes for 30 hours straight

Anthropic just dropped Claude Sonnet 4.5, and it's rewriting the rules for autonomous AI. The model coded for 30 hours straight without human intervention, building a complete chat application with 11,000 lines of code. This isn't incremental progress - it's a 4x leap from their previous 7-hour benchmark that signals AI agents are ready for real enterprise workloads.

Anthropic just made every enterprise CTO sit up and take notice. The company's new Claude Sonnet 4.5 model didn't just write code - it built an entire chat application resembling Slack or Microsoft Teams during a 30-hour autonomous coding marathon. The AI generated 11,000 lines of production-ready code and only stopped when the job was complete.

This represents a massive leap forward in AI agent capabilities. Anthropic's previous Opus 4 model made headlines in May for running autonomously for seven hours. Now they've quadrupled that endurance while maintaining code quality throughout the extended session.

"We're calling Claude Sonnet 4.5 the best model in the world for real-world agents, coding, and computer use," Anthropic declared in today's announcement. The company positioned this as their strongest play yet in the rapidly intensifying battle with OpenAI and Google for AI agent supremacy.

The timing couldn't be more strategic. Just days ago, OpenAI launched Pulse, their morning routine ChatGPT feature, while Google continues pushing Bard capabilities. But Anthropic's approach focuses on sustained, complex tasks that mirror real enterprise workflows.

Early enterprise results are validating this strategy. Canva, one of the beta testers, reported Claude Sonnet 4.5 excelled at "complex, long-context tasks - from engineering in our codebase to in-product features and research." The model shows particular strength in cybersecurity, financial services, and research applications where sustained focus matters more than quick responses.

The computer navigation improvements are equally impressive. Dianne Penn, head of product management at Anthropic, told The Verge that Claude Sonnet 4.5 is "more than three times as skilled at navigating a browser and using a computer" compared to their October 2024 technology. This builds on Anthropic's Computer Use feature that debuted nearly a year ago.

Scott White, product lead for Claude.ai, described the model as operating at "chief-of-staff level." It can coordinate calendars across multiple people, analyze data dashboards for insights, and draft status updates based on meeting notes - essentially handling the cognitive overhead that burns out human executives.

The development infrastructure is equally ambitious. Anthropic is packaging Claude Sonnet 4.5 with virtual machines, memory management, context handling, and multi-agent support. "This essentially packages the same building blocks that power Claude Code - enabling developers to build their own cutting-edge agents," the company explained.

Penn shared a telling use case: she uses Claude Sonnet 4.5 for recruiting at Anthropic itself. "I have a continuous running prompt that says, 'Do a deep web search, come up with parameters for profiles to source for certain types of roles on my team,'" she explained. "It generates a spreadsheet with LinkedIn profiles so I can email them directly."

The model's sustained performance addresses a critical enterprise pain point. While consumer AI tools excel at quick tasks, enterprise workflows often require hours of sustained context and complex reasoning. A 30-hour coding session without degradation suggests AI agents can finally handle the marathon projects that define enterprise software development.

This positions Anthropic strategically against competitors. While OpenAI focuses on consumer engagement and Google pushes search integration, Anthropic is betting on enterprise utility and sustained performance. The company received feedback from "the GitHubs and Cursors of the world" - developer-focused platforms where coding endurance matters most.

The broader implications ripple across the enterprise software landscape. If AI agents can sustain complex tasks for 30+ hours, traditional software development cycles could compress dramatically. Enterprise buyers are already taking notice of these autonomous capabilities for everything from cybersecurity monitoring to financial analysis.

Claude Sonnet 4.5's 30-hour autonomous coding marathon isn't just a technical milestone - it's a signal that AI agents are ready for enterprise-grade workloads. While competitors chase consumer features, Anthropic is positioning itself as the go-to platform for sustained, complex business tasks. The real test will be whether enterprises can integrate these capabilities into existing workflows, but early results from companies like Canva suggest the transition is already underway.

the tech buzz

Anthropic's Claude Sonnet 4.5 codes for 30 hours straight

More in AI

Hollywood Studios Drop Sam Altman Biopic After Amazon Exit

Superhuman Snaps Up AI Detection Startup GPTZero

Cerebras Stock Tumbles 8% on Margin Squeeze in First Post-IPO Report

NVIDIA Now Powers 81% of World's Fastest Supercomputers

Brain Implants Now Detect Cancer in 3 Patients

Gmail's Gemini Flows Brings AI Filtering With 2000 Email Cap

More Articles

NVIDIA Unleashes Autonomous AI Agents for Telecom Networks

Tech Giants Cite AI as Layoff Driver in 2026 Wave

Samsung Drops UFS 5.0: Twice as Fast for On-Device AI

Tesla Disputes Autopilot Role in Fatal Texas Crash

Trending Now

Google Joins Dow Jones, Ousting Verizon in Historic Shift

Hollywood Studios Drop Sam Altman Biopic After Amazon Exit

Superhuman Snaps Up AI Detection Startup GPTZero

Cerebras Stock Tumbles 8% on Margin Squeeze in First Post-IPO Report

SpaceX raises $25B in debt, draws $90B in investor orders

People Also Ask

What is Claude Sonnet 4.5 and what makes it different from previous AI models?

How long can Claude Sonnet 4.5 code autonomously without human intervention?

What companies are using Claude Sonnet 4.5 for enterprise tasks?

How much better is Claude Sonnet 4.5 at computer navigation compared to previous versions?

What real-world tasks can Claude Sonnet 4.5 handle at enterprise level?

How does Claude Sonnet 4.5 compare to OpenAI and Google's AI models?

People Also Ask

What is Claude Sonnet 4.5 and what makes it different from previous AI models?

How long can Claude Sonnet 4.5 code autonomously without human intervention?

What companies are using Claude Sonnet 4.5 for enterprise tasks?

How much better is Claude Sonnet 4.5 at computer navigation compared to previous versions?

What real-world tasks can Claude Sonnet 4.5 handle at enterprise level?

How does Claude Sonnet 4.5 compare to OpenAI and Google's AI models?

More in AI

Hollywood Studios Drop Sam Altman Biopic After Amazon Exit

Superhuman Snaps Up AI Detection Startup GPTZero

Cerebras Stock Tumbles 8% on Margin Squeeze in First Post-IPO Report

NVIDIA Now Powers 81% of World's Fastest Supercomputers

Brain Implants Now Detect Cancer in 3 Patients

Gmail's Gemini Flows Brings AI Filtering With 2000 Email Cap

More Articles

NVIDIA Unleashes Autonomous AI Agents for Telecom Networks

Tech Giants Cite AI as Layoff Driver in 2026 Wave

Samsung Drops UFS 5.0: Twice as Fast for On-Device AI

Tesla Disputes Autopilot Role in Fatal Texas Crash

Tesla Autopilot crash kills 76-year-old, sparks federal probe

Groq Confirms $650M Raise After Nvidia's $20B Acqui-Hire

Trending Now

Google Joins Dow Jones, Ousting Verizon in Historic Shift

Hollywood Studios Drop Sam Altman Biopic After Amazon Exit

Superhuman Snaps Up AI Detection Startup GPTZero

Cerebras Stock Tumbles 8% on Margin Squeeze in First Post-IPO Report

SpaceX raises $25B in debt, draws $90B in investor orders