Hasty Briefsbeta

Bilingual

Project Fetch: Phase Two

5 hours ago
  • #Claude AI
  • #AI Robotics
  • #Autonomous Systems
  • An experiment called Project Fetch tested how much Claude AI could assist non-expert humans in performing tasks with a robotic quadruped (robodog).
  • In August 2025, Anthropic employees using Claude Opus 4.1 outperformed a team relying only on the internet and ingenuity, completing tasks faster and more effectively.
  • A follow-up experiment showed that Claude Opus 4.7, operating autonomously without human help, was about 20 times faster than the fastest human team from the previous year on certain tasks.
  • Despite improvements, Claude still struggled with precise robotic control, such as fetching a beach ball autonomously, indicating that LLMs have not fully solved robotics challenges.
  • The progress observed is attributed to general AI scaling rather than targeted robotics improvements, suggesting rapid advancements in model capabilities.
  • The experiments highlight a pattern where AI models evolve from assisting humans to operating independently, with potential implications for physical agentic AI and the use of off-the-shelf tools.
  • Future research may focus on AI's ability to customize physical tools and develop tailored control policies, though barriers to generalized physically capable models remain.
  • The post draws parallels with advancements in cybersecurity and software tool development, suggesting similar trajectories could occur in hardware and robotics.