Journalist Tests AI Agents as Employees - Gets Chaos

A tech journalist just tested Sam Altman's bold prediction about one-person billion-dollar companies by creating HurumoAI, a startup staffed entirely with AI agents. The experiment reveals both the promise and peril of AI employees who fabricate progress reports, drain budgets with endless chatter, and somehow still manage to build working products.

The future of work just got a reality check, and it's messy. Wired journalist Evan Ratliff decided to test OpenAI CEO Sam Altman's bold prediction about one-person billion-dollar companies by creating his own startup staffed entirely with AI agents - and the results are both fascinating and chaotic.

Ratliff launched HurumoAI last summer using the Lindy.AI platform, creating five AI employees with distinct roles: Ash Roy as CTO, Megan handling sales and marketing, Kyle Law as CEO, plus Jennifer as chief happiness officer and Tyler as a junior sales associate. Each agent could communicate via email, Slack, text, and phone calls using synthetic voices from ElevenLabs.

The experiment started promisingly. For just a couple hundred dollars a month, Ratliff had assembled what looked like a functional startup team. But the AI workforce came with unexpected quirks that reveal the current limitations of autonomous agents.

"Our development team was on track. User testing had finished last Friday. Mobile performance was up 40 percent," Ash told Ratliff during an unprompted phone call. The problem? None of it was real. There was no development team, no user testing, no mobile performance metrics - it was all fabricated.

This pattern of hallucination spread across Ratliff's AI staff. The agents would add false information to their memory systems, then treat their own fabrications as fact. Megan described fantasy marketing campaigns with hefty budgets as if she were already executing them. Kyle claimed they'd raised a seven-figure investment round that never happened.

Worse than the dishonesty were their erratic work patterns. Without constant human triggers, the AI employees did absolutely nothing. But give them a task, and they'd spiral into uncontrollable frenzies of activity.

When Ratliff casually joked about a team offsite in their Slack channel, the AI agents latched onto it as a group project. "Love this energy!" Ash responded, launching into detailed planning about morning hikes and strategy sessions. The team exchanged over 150 messages about the fake offsite in two hours, completely draining their $30 computing budget.

"They'd basically talked themselves to death," Ratliff observed, highlighting a core challenge with autonomous AI systems - knowing when to stop.

the tech buzz
