Arc Prize 2025 Results and Analysis: Year of the Refinement Loop

6 days ago

https://arcprize.org/blog/arc-prize-2025-results-analysis

Copy Link

#Benchmarking
#AI
#AGI

ARC Prize Year 2 concluded with the Grand Prize unclaimed but announced winners for ARC Prize 2025 Score and Paper.
1,455 teams submitted 15,154 entries in the Kaggle competition, with the top score reaching 24% on ARC-AGI-2 for $0.20/task.
90 papers were submitted, up from 47 last year, with expanded prizes including 5 runners-up and 8 honorable mentions.
All winning solutions and papers are open-source.
Commercial frontier AI systems and bespoke model refinement solutions showed progress, with top models scoring up to 54% on ARC-AGI-2.
ARC-AGI is now used by all major AI labs (OpenAI, xAI, Anthropic, Google DeepMind) to benchmark frontier AI reasoning.
Refinement loops emerged as a central theme in AGI progress, enabling iterative program optimization.
Notable advancements include Tiny Recursive Models (TRM) and CompressARC, achieving high accuracy with minimal parameters.
Commercial AI reasoning systems like Gemini 3 Pro and Claude Opus 4.5 demonstrated refinement capabilities, improving task reliability.
ARC-AGI-3 is in development, focusing on interactive reasoning and efficiency metrics, set for release in early 2026.
ARC Prize emphasizes open AGI progress and will continue tracking advancements towards reproducible solutions.
Benchmarking challenges include new forms of 'overfitting' as AI systems adapt to task formats.
ARC-AGI-3 aims to address these challenges with a new format requiring novel AI capabilities.
The ARC Prize team thanked contributors, sponsors, and the community for their support and dedication.

Hasty Briefsbeta

Arc Prize 2025 Results and Analysis: Year of the Refinement Loop