Hasty Briefsbeta

Bilingual

ARC-AGI-3 benchmark is out now

6 hours ago

#AI agent
#performance metrics
#game completion

Dataset: ARC-AGI-3 Public Demo
Human Actions To Complete Game
Total Levels information available
Model Performance comparison in a sortable table
Cumulative actions by level can be viewed
All Providers option available
Table includes Model, Score, Actions, Replay, and Published columns
Humans have a 100% score with no actions listed
Loading scores message displayed