Hasty Briefsbeta

Arc Prize 2025 Results and Analysis: Year of the Refinement Loop

6 days ago
  • #Benchmarking
  • #AI
  • #AGI
  • ARC Prize Year 2 concluded with the Grand Prize unclaimed but announced winners for ARC Prize 2025 Score and Paper.
  • 1,455 teams submitted 15,154 entries in the Kaggle competition, with the top score reaching 24% on ARC-AGI-2 for $0.20/task.
  • 90 papers were submitted, up from 47 last year, with expanded prizes including 5 runners-up and 8 honorable mentions.
  • All winning solutions and papers are open-source.
  • Commercial frontier AI systems and bespoke model refinement solutions showed progress, with top models scoring up to 54% on ARC-AGI-2.
  • ARC-AGI is now used by all major AI labs (OpenAI, xAI, Anthropic, Google DeepMind) to benchmark frontier AI reasoning.
  • Refinement loops emerged as a central theme in AGI progress, enabling iterative program optimization.
  • Notable advancements include Tiny Recursive Models (TRM) and CompressARC, achieving high accuracy with minimal parameters.
  • Commercial AI reasoning systems like Gemini 3 Pro and Claude Opus 4.5 demonstrated refinement capabilities, improving task reliability.
  • ARC-AGI-3 is in development, focusing on interactive reasoning and efficiency metrics, set for release in early 2026.
  • ARC Prize emphasizes open AGI progress and will continue tracking advancements towards reproducible solutions.
  • Benchmarking challenges include new forms of 'overfitting' as AI systems adapt to task formats.
  • ARC-AGI-3 aims to address these challenges with a new format requiring novel AI capabilities.
  • The ARC Prize team thanked contributors, sponsors, and the community for their support and dedication.