AI agents have, so far, mostly been a dud
9 months ago
- #AI Agents
- #LLM Limitations
- #Tech Hype
- AI agents, once hyped as the next big thing for 2025, have largely failed to meet expectations.
- Companies like Google, OpenAI, and Anthropic introduced AI agents, but they remain unreliable except in very narrow use cases.
- The ChatGPT agent, despite its capabilities, makes frequent mistakes and poses risks when handling user data.
- AI agents used for coding are accumulating technical debt by producing hard-to-debug, copy-pasted code.
- Benchmarks show AI errors compound over time, and hallucinations remain a persistent issue.
- Failure rates for AI agents are high, with some tasks showing a 70% failure rate in tests.
- Current AI agents lack deep understanding, relying on mimicry, which leads to errors in multi-step tasks.
- Investments in LLMs as a shortcut to AGI have not yielded reliable systems, yet funding continues to pour in.
- Alternative approaches like neurosymbolic AI are underfunded, receiving less than 1% of total AI investments.
- User reports on AI agents such as the ChatGPT agent describe poor performance and frequent failures in practice.