- Will A.I. Soon Outsmart Humans? Play This Puzzle to Find Out. (The New York Times)
- ARC-AGI-2: Leading AI models fail new test of artificial general intelligence (New Scientist)
- A new AI test is outwitting OpenAI, Google models, among others (Mashable)
- LLMs Hit a New Low on ARC-AGI-2 Benchmark, Pure LLMs Score 0% (AIM)
- A new, challenging AGI test stumps most AI models (TechCrunch)