Google's Gemini just hit 77% on ARC-AGI-2, Plus: Pakistan's 20K AI programmes, Claude Code goes autonomous·5 min brief·May 15, 2026
Weekly AI Signal
This week, I kept coming back to one number: 77.1%. That's how Google's Gemini 3.1 Pro scored on ARC-AGI-2 — the benchmark François Chollet designed specifically to resist memorisation. Every other model had hit a ceiling. Gemini didn't. And while the labs were shipping, Pakistan quietly dropped two of the most significant AI workforce announcements in the country's history.
▸ Google's Gemini just hit 77% on ARC-AGI-2, Plus: Pakistan's 20K AI programmes, Claude Code goes autonomous