Title: Launch of ARC-AGI-3 Benchmark | BestBlogs.dev
URL Source: https://www.bestblogs.dev/status/2036861192619384989
Published Time: 2026-03-25 17:42:06
Markdown Content: Skip to main content Toggle navigation menu Toggle navigation menuArticlesPodcastsVideosTweetsSourcesNewsletters
⌘K
Change language Switch ThemeSign In
Narrow Mode
Launch of ARC-AGI-3 Benchmark
Launch of ARC-AGI-3 Benchmark
 ### François Chollet@fchollet
ARC-AGI-3 is out now! We've designed the benchmark to evaluate agentic intelligence via interactive reasoning environments. Beating ARC-AGI-3 will be achieved when an AI system matches or exceeds human-level action efficiency on all environments, upon seeing them for the first time.
We've done extensive human testing that shows 100% of these environments are solvable by humans, upon first contact, with no prior training and no instructions.
Meanwhile, all frontier AI reasoning models do under 1% at this time.
00:15
Mar 25, 2026, 5:42 PM View on X
78 Replies
132 Retweets
1,181 Likes
103.7K Views  François Chollet @fchollet
One Sentence Summary
François Chollet announces the release of ARC-AGI-3, a new benchmark designed to evaluate agentic intelligence through interactive reasoning environments.
Summary
This tweet announces the launch of ARC-AGI-3, a benchmark focused on evaluating agentic intelligence. It highlights that current frontier models struggle, scoring under 1% on these environments, whereas humans can solve 100% of them upon first contact without prior training. This establishes a new, challenging standard for AI evaluation.
AI Score
89
Influence Score 235
Published At Today
Language
English
Tags
ARC-AGI
AGI
Benchmarking
AI Evaluation
Agentic AI HomeArticlesPodcastsVideosTweets