Title: Launch of ARC-AGI-3 Benchmark | BestBlogs.dev

URL Source: https://www.bestblogs.dev/status/2036861192619384989

Published Time: 2026-03-25 17:42:06

Markdown Content: Skip to main content ![Image 1: LogoBestBlogs](https://www.bestblogs.dev/ "BestBlogs.dev")Toggle navigation menu Toggle navigation menuArticles Podcasts Videos Tweets Sources Newsletters

⌘K

Change language Switch ThemeSign In

Narrow Mode

Launch of ARC-AGI-3 Benchmark

![Image 2: François Chollet](https://www.bestblogs.dev/en/tweets?sourceId=SOURCE_fa42b4ed) ### François Chollet

@fchollet

ARC-AGI-3 is out now! We've designed the benchmark to evaluate agentic intelligence via interactive reasoning environments. Beating ARC-AGI-3 will be achieved when an AI system matches or exceeds human-level action efficiency on all environments, upon seeing them for the first time.

We've done extensive human testing that shows 100% of these environments are solvable by humans, upon first contact, with no prior training and no instructions.

Meanwhile, all frontier AI reasoning models do under 1% at this time.

!Image 3: 视频缩略图

00:15

Mar 25, 2026, 5:42 PM View on X

78 Replies

132 Retweets

1,181 Likes

103.7K Views ![Image 4: François Chollet](https://www.bestblogs.dev/en/tweets?sourceid=fa42b4ed) François Chollet @fchollet

One Sentence Summary

François Chollet announces the release of ARC-AGI-3, a new benchmark designed to evaluate agentic intelligence through interactive reasoning environments.

Summary

This tweet announces the launch of ARC-AGI-3, a benchmark focused on evaluating agentic intelligence. It highlights that current frontier models struggle, scoring under 1% on these environments, whereas humans can solve 100% of them upon first contact without prior training. This establishes a new, challenging standard for AI evaluation.

AI Score

Influence Score 235

Published At Today

Language

English

Launch of ARC-AGI-3 Benchmark | BestBlogs.dev

ARC-AGI-3 基准测试发布

Launch of ARC-AGI-3 Benchmark

Launch of ARC-AGI-3 Benchmark

One Sentence Summary

Summary

Tags

Launch of ARC-AGI-3 Benchmark | BestBlogs.dev

🤖 問 AI