Question 1

What is ARC-AGI?

Accepted Answer

ARC-AGI (Abstraction and Reasoning Corpus for Artificial General Intelligence) is a benchmark designed by François Chollet to test general reasoning ability. Each puzzle shows a few input-to-output grid transformations and asks you to infer the rule and apply it to a new test case. Unlike typical AI benchmarks, ARC-AGI specifically tests the ability to learn new concepts on the fly — not memorized knowledge.

Question 2

Can AI solve ARC-AGI puzzles?

Accepted Answer

Barely. ARC-AGI-2 (launched March 2025) was designed so that every task is solvable by humans in under 2 attempts, yet frontier AI models like GPT-4o and Claude 3.7 scored between 0% and 1.3% without expensive multi-attempt scaffolding. Only with $30–$77 per-question compute costs did AI systems approach human-level performance. The human average is around 60%.

Question 3

How do ARC-AGI puzzles work?

Accepted Answer

Each ARC-AGI puzzle shows 2–3 example pairs: an input grid and its corresponding output grid. Your task is to figure out the transformation rule from the examples, then apply it to a new test input grid by clicking cells to paint the correct output. The rules are visual and logical — no math or language required.

ARC Pattern Reasoning Test

Also try: You vs ChatGPT

What Is ARC-AGI?

Why Does AI Fail ARC-AGI?

The ARC Prize

Love Learning How Minds Work?