| # | Company | HQ | Score | vs #1 | Δ vs May | Engine | Deliver | Accel | |
|---|---|---|---|---|---|---|---|---|---|
| 01 | 🇺🇸 | GoogleGemini 3.5 Flash & 3.1 Pro, DeepMind, Vertex/Cloud AI, Search AI, AlphaFold, Veo, Waymo | US | 9.32 | — | -0.22 | 9.4 | 9.4 | 8.9 |
| 02 | 🇺🇸 | OpenAIGPT-5.5 / GPT-5.5 Pro, ChatGPT, Codex, Sora, DALL·E | US | 9.10 | -0.22 | +0.06 | 9.1 | 9.2 | 8.9 |
| 03 | 🇺🇸 | AnthropicClaude Fable 5 & Mythos 5, Claude Code, Claude API | US | 8.82 | -0.50 | +0.21 | 8.8 | 8.7 | 9.2 |
| 04 | 🇺🇸 | MicrosoftCopilot (Office, GitHub, Windows), MAI in-house models, Azure AI Foundry, Phi | US | 8.60 | -0.72 | +0.06 | 8.4 | 9.0 | 8.7 |
| 05 | 🇺🇸 | MetaLlama 4 family, FAIR / Superintelligence Lab, Meta AI assistant, AI ads | US | 8.46 | -0.86 | +0.03 | 8.6 | 8.5 | 7.8 |
| 06 | 🇨🇳 | AlibabaQwen 3.x family, Model Studio, Alibaba Cloud AI, Tongyi | CN | 8.23 | -1.09 | +0.15 | 8.4 | 8.2 | 7.8 |
| 07 | 🇺🇸 | AmazonAWS Bedrock, Nova models, Trainium/Inferentia, Alexa+, Rufus | US | 8.16 | -1.16 | +0.14 | 7.9 | 8.5 | 8.3 |
| 08 | 🇨🇳 | ByteDanceDoubao models, Seed research, Coze, Dreamina, TikTok AI | CN | 8.01 | -1.31 | -0.02 | 8.2 | 8.0 | 7.5 |
| 09 | 🇺🇸 | xAIGrok 4.3 (Expert tiers), Grok Build, Colossus supercluster, X integration | US | 7.94 | -1.38 | +0.37 | 8.3 | 7.6 | 7.2 |
| 10 | 🇨🇳 | DeepSeekDeepSeek V4 (Pro/Flash), R-series reasoning, open weights | CN | 7.71 | -1.61 | +0.27 | 7.9 | 7.6 | 7.2 |
| 11 | 🇺🇸 | AppleApple Intelligence (3rd-gen Foundation Models), Gemini-powered Siri, on-device + Private Cloud Compute | US | 7.65 | -1.67 | +0.49 | 7.2 | 8.2 | 8.2 |
| 12 | 🇨🇳 | BaiduERNIE 5.0, Ernie Bot, Baidu AI Cloud, Kunlun chips | CN | 7.63 | -1.69 | +0.18 | 7.8 | 7.6 | 7.2 |
| 13 | 🇫🇷 | Mistral AIMistral Large 3, Le Chat, La Plateforme, open-weight models | FR | 7.36 | -1.96 | +0.06 | 7.3 | 7.5 | 7.4 |
| 14 | 🇨🇳 | Moonshot AIKimi K2.6, Kimi Work agent, Kimi chatbot | CN | 7.36 | -1.96 | +0.59 | 7.4 | 7.4 | 7.2 |
| 15 | 🇨🇳 | Zhipu AIGLM-5.1, Z.AI platform (STAR Market listing in progress) | CN | 7.35 | -1.97 | +0.74 | 7.5 | 7.3 | 7.0 |
Each company is independently scored across ten dimensions by a panel of frontier evaluator models. Scores are averaged, weighted by a structural profile that prizes long-horizon inputs, and published with the per-score justifications intact.
Read the methodology →