AI Spectrum

New site ranks AI models on the IQ scale. Not everyone is impressed.

A startup called AI IQ is scoring 50+ language models using a human IQ framework. Enterprise buyers love it. Researchers are not so sure.

By Nischay Nagpal

May 13, 2026 • Updated May 14, 2026 • 2 min read


A site called AI IQ is doing something simple and controversial at the same time: it assigns IQ scores to more than 50 frontier AI models and plots them on a standard bell curve. Built by Ryan Shea, a Princeton engineer and co-founder of the blockchain platform Stacks, the project at aiiq.org pulls from 12 benchmarks across four reasoning dimensions: abstract, mathematical, programmatic, and academic. The composite score is a straight average of those four. As of mid-May 2026, OpenAI's GPT-5.5 sits at the top with an estimated IQ of 136, followed closely by Anthropic's Opus 4.7 and Google's Gemini 3.1 Pro. The gap between the leading models has never been tighter.
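To make the arithmetic concrete, here is a minimal Python sketch of a straight four-way average and of where the result lands on a bell curve. The per-dimension values are hypothetical placeholders, not figures from aiiq.org, and the function names are illustrative. The mean of 100 and standard deviation of 15 are the usual human IQ convention, assumed here because the site's exact parameters are not given above.

```python
from statistics import NormalDist, mean

# Hypothetical per-dimension scores for one model (illustrative only,
# not taken from aiiq.org).
dimension_scores = {
    "abstract": 130.0,
    "mathematical": 140.0,
    "programmatic": 138.0,
    "academic": 136.0,
}


def composite_iq(scores: dict[str, float]) -> float:
    """Straight (unweighted) average of the four reasoning dimensions."""
    return mean(scores.values())


def percentile(iq: float, mu: float = 100.0, sigma: float = 15.0) -> float:
    """Share of a normal population scoring at or below `iq`.

    mu=100 and sigma=15 are the common human IQ convention (an assumption here).
    """
    return NormalDist(mu, sigma).cdf(iq)


iq = composite_iq(dimension_scores)
print(f"composite IQ: {iq:.0f}")
print(f"percentile:   {percentile(iq):.1%}")
```

Under these assumptions, a composite of 136 sits 2.4 standard deviations above the mean, which is one way to read what a position on the curve is meant to convey.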

The site also scores emotional intelligence, mapping each model's EQ-Bench 3 Elo and Arena Elo scores into a composite EQ. Anthropic's Opus 4.7 leads that dimension with a score near 132. One notable detail: EQ-Bench 3 is judged by Claude, an Anthropic model, so the site subtracts a 200-point Elo penalty from every Anthropic EQ-Bench score to offset the obvious conflict of interest. The cost-performance chart may be the most practically useful feature, showing that models like DeepSeek-V3.2 and GPT-5.4-mini deliver IQ scores in the 112 to 120 range at a fraction of the cost of top-tier options.
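For a sense of scale, the sketch below applies a flat 200-point deduction and shows what a 200-point gap means under the standard Elo expected-score formula. It is only an illustration: the ratings are made up, the function names are hypothetical, and how AI IQ actually implements the penalty or converts Elo into an EQ score is not described here.

```python
ANTHROPIC_ELO_PENALTY = 200  # penalty size as reported in the article


def adjusted_eq_bench_elo(raw_elo: float, provider: str) -> float:
    """Subtract the penalty from Anthropic models; leave others unchanged.

    Hypothetical helper, not the site's own code.
    """
    if provider.lower() == "anthropic":
        return raw_elo - ANTHROPIC_ELO_PENALTY
    return raw_elo


def expected_score(elo_gap: float) -> float:
    """Standard Elo expected score for a player rated `elo_gap` points higher."""
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / 400.0))


# Made-up raw ratings, for illustration only.
print(adjusted_eq_bench_elo(1500, "Anthropic"))
print(adjusted_eq_bench_elo(1500, "OpenAI"))
print(f"expected score at a 200-point gap: {expected_score(200):.2f}")
```

In standard Elo terms, a 200-point lead corresponds to an expected score of roughly 0.76 in a head-to-head comparison, so the deduction is a substantial handicap rather than a cosmetic one.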

Critics argue the entire framework collapses something irreducibly complex into a number that feels more precise than it is. 'AI is far too jagged. The map is not the territory,' wrote one commentator on X, pointing to the well-documented phenomenon of models acing graduate-level physics while failing at tasks a child handles easily. Others flagged that the calibration curves are not published as open datasets, making full reproducibility impossible. AI IQ is not a perfect tool. But for anyone trying to compare models across providers without wading through a dozen self-serving benchmark tables, it is at least a starting point.

