Google launches Gemini 2.5 Computer Use to rival OpenAI agents

Google just fired back at OpenAI with Gemini 2.5 Computer Use, a new AI model that navigates websites like a human user. Released just one day after OpenAI's Dev Day announcements, Google's approach focuses specifically on browser automation while competitors offer full computer control. This timing isn't coincidental - it's Google's direct challenge to OpenAI's ChatGPT Agent dominance in the rapidly expanding AI automation market.

Google isn't letting OpenAI dominate the AI agent space without a fight. The company just unveiled Gemini 2.5 Computer Use, a specialized AI model that can navigate and interact with web browsers exactly like humans do. The timing is striking - this launch comes precisely one day after OpenAI showcased new ChatGPT applications at Dev Day 2025.

The new model uses what Google calls "visual understanding and reasoning capabilities" to analyze user requests and execute tasks through browser interfaces. Think filling out forms, clicking buttons, or navigating complex web applications that don't have APIs. It's designed for scenarios where traditional automation breaks down - when you need AI to work with interfaces built for human eyes and fingers, not code.

Google's been quietly testing this technology through Project Mariner, a research prototype that can add grocery items to your Safeway cart based on recipe ingredients. But Gemini 2.5 Computer Use represents the first commercial release of this browser automation capability for developers.

The competitive landscape is heating up fast. Anthropic actually moved first in this space, releasing Claude with computer use capabilities last October. OpenAI followed with its ChatGPT Agent feature and just doubled down with new developer apps announced yesterday. Now Google's entering the fray with its own approach.

But Google's taking a different tactical approach than its rivals. While ChatGPT Agent and Anthropic's computer use tools can control entire desktop environments, Gemini 2.5 Computer Use deliberately limits itself to browser-only interactions. Google acknowledges it's "not yet optimized for desktop OS-level control" and currently supports just 13 specific actions like opening browsers, typing text, and dragging elements.

This constraint might actually be Google's strength. By focusing exclusively on web interactions, the company can optimize performance for the most common automation use cases while avoiding the security and reliability challenges that come with full system access. Google claims its model "outperforms leading alternatives on multiple web and mobile benchmarks" - a direct shot at OpenAI and Anthropic.

Developers can access Gemini 2.5 Computer Use through Google AI Studio and Vertex AI starting today. But Google's also providing a public demo on Browserbase where anyone can watch the AI tackle tasks like playing 2048 or browsing Hacker News for trending discussions. The company posted demonstration videos showing the system in action, though they're sped up 3x for viewing.

The broader implications extend beyond just browser automation. This represents Google's bid to capture the developer mindshare that's increasingly gravitating toward AI agents. As companies rush to automate routine digital tasks, the platform that makes this easiest for developers will likely dominate the next wave of AI applications.

Google's timing suggests urgency. Rather than waiting for a major product event, the company pushed this release immediately after OpenAI's announcements. That kind of reactive speed indicates how seriously Google views the competitive threat in AI automation.

Google's Gemini 2.5 Computer Use launch signals the AI automation wars are entering a new phase. By focusing on browser-specific tasks while competitors chase full desktop control, Google's betting that specialized tools will outperform generalized ones. For developers, this means more options but also harder choices about which AI agent platform to build on. The next few months will determine whether Google's focused approach can compete with OpenAI's broader vision, but one thing's clear - the race to automate digital work just got a lot more competitive.

the tech buzz

Google launches Gemini 2.5 Computer Use to rival OpenAI agents

More in AI agents

Meta Snaps Up Manus for $2B, Bet on AI Agents Heats Up

Amazon unveils Kiro AI agent that codes autonomously for days

Journalist Tests AI Agents as Employees - Gets Chaos

A16z-backed Codi launches AI office manager that hit $100K ARR

Yelp launches AI phone agents for restaurant reservations

Anthropic launches Skills to make Claude work agents useful

More Articles

OpenAI launches ChatGPT Pulse - AI writes personalized briefs

Trending Now

Top VC warns AI wealth faces forced redistribution

Databricks hits $188B valuation in AI reinvention play

White House Seizes Control of Frontier AI Model Access

Zoom Users Push Back on AI Recording Overload

Moonshot AI's Kimi K3 Claims Parity With OpenAI in China's Latest Salvo

People Also Ask

People Also Ask

More in AI agents

Meta Snaps Up Manus for $2B, Bet on AI Agents Heats Up

Amazon unveils Kiro AI agent that codes autonomously for days

Journalist Tests AI Agents as Employees - Gets Chaos

A16z-backed Codi launches AI office manager that hit $100K ARR

Yelp launches AI phone agents for restaurant reservations

Anthropic launches Skills to make Claude work agents useful

More Articles

OpenAI launches ChatGPT Pulse - AI writes personalized briefs

Trending Now

Top VC warns AI wealth faces forced redistribution

Databricks hits $188B valuation in AI reinvention play

White House Seizes Control of Frontier AI Model Access

Zoom Users Push Back on AI Recording Overload

Moonshot AI's Kimi K3 Claims Parity With OpenAI in China's Latest Salvo

the tech buzz

Google launches Gemini 2.5 Computer Use to rival OpenAI agents

More in AI agents

Meta Snaps Up Manus for $2B, Bet on AI Agents Heats Up

Amazon unveils Kiro AI agent that codes autonomously for days

Journalist Tests AI Agents as Employees - Gets Chaos

A16z-backed Codi launches AI office manager that hit $100K ARR

Yelp launches AI phone agents for restaurant reservations

Anthropic launches Skills to make Claude work agents useful

More Articles

OpenAI launches ChatGPT Pulse - AI writes personalized briefs

Trending Now

Top VC warns AI wealth faces forced redistribution

Databricks hits $188B valuation in AI reinvention play

White House Seizes Control of Frontier AI Model Access

Zoom Users Push Back on AI Recording Overload

Moonshot AI's Kimi K3 Claims Parity With OpenAI in China's Latest Salvo

People Also Ask

What is Google Gemini 2.5 Computer Use?

How does Gemini 2.5 Computer Use differ from ChatGPT Agent?

When did Google release Gemini 2.5 Computer Use?

What can Gemini 2.5 Computer Use do in browsers?

How can developers access Google's Gemini 2.5 Computer Use?

Why did Google limit Gemini 2.5 to browser-only control?

People Also Ask

What is Google Gemini 2.5 Computer Use?

How does Gemini 2.5 Computer Use differ from ChatGPT Agent?

When did Google release Gemini 2.5 Computer Use?

What can Gemini 2.5 Computer Use do in browsers?

How can developers access Google's Gemini 2.5 Computer Use?

Why did Google limit Gemini 2.5 to browser-only control?

More in AI agents

Meta Snaps Up Manus for $2B, Bet on AI Agents Heats Up

Amazon unveils Kiro AI agent that codes autonomously for days

Journalist Tests AI Agents as Employees - Gets Chaos

A16z-backed Codi launches AI office manager that hit $100K ARR

Yelp launches AI phone agents for restaurant reservations

Anthropic launches Skills to make Claude work agents useful

More Articles

OpenAI launches ChatGPT Pulse - AI writes personalized briefs

Trending Now

Top VC warns AI wealth faces forced redistribution

Databricks hits $188B valuation in AI reinvention play

White House Seizes Control of Frontier AI Model Access

Zoom Users Push Back on AI Recording Overload

Moonshot AI's Kimi K3 Claims Parity With OpenAI in China's Latest Salvo