Compare Leading AI & LLM Models

Compare performance metrics, benchmarks, and pricing across the leading AI models. Sort by any metric and find the best model for your use case.

Model Leaderboard

Showing 289 of 289 models
Org
🌍
Model
Multimodal
GPQA
AIME 2025
SWE-bench
HLE
Input $/M
Output $/M
Context
Cutoff
Params (B)
License
Anthropic 🇺🇸 Claude Mythos Preview UNRELEASED 94.6% 93.9% 64.7% $25.00 $125.00 - - - Closed
Google 🇺🇸 Gemini 3.1 Pro 94.3% 80.6% 51.4% $2.50 $15.00 1.0M Jan. 2025 - Closed
OpenAI 🇺🇸 GPT-5.2 Pro 93.2% 100.0% - 36.6% $21.00 $168.00 400k - - Closed
OpenAI 🇺🇸 GPT-5.4 92.8% 39.8% $2.50 $15.00 1M - - Closed
OpenAI 🇺🇸 GPT-5.2 92.4% 100.0% 80.0% 34.5% $1.75 $14.00 400k Aug. 2025 - Closed
Google 🇺🇸 Gemini 3 Pro 91.9% 100.0% 76.2% - $2.00 $12.00 1.0M Jan. 2025 - Closed
Anthropic 🇺🇸 Claude Opus 4.6 NEW 91.3% 99.8% 80.8% 53.1% $5.00 $25.00 200k - - Closed
Google 🇺🇸 Gemini 3 Flash 90.4% 99.7% 78.0% - $0.50 $3.00 1M Jan. 2025 - Closed
Qwen 🇨🇳 Qwen3.6 Plus 90.4% 78.8% 28.8% - - - - - Closed
Anthropic 🇺🇸 Claude Sonnet 4.6 89.9% 79.6% 49.0% $3.00 $15.00 200k - - Closed
Meta 🇺🇸 Muse Spark NEW 89.5% 77.4% 58.4% - - - - - Closed
Bytedance 🇨🇳 Seed 2.0 Pro 88.9% 98.3% 76.5% - - - Jan. 2024 - Closed
xAI 🇺🇸 Grok-4 Heavy UNRELEASED 88.4% 100.0% - - - - - Dec. 2024 - Closed
Qwen 🇨🇳 Qwen3.5-397B-A17B 88.4% 76.4% 28.7% $0.60 $3.60 262.1k - 397 Open
OpenAI 🇺🇸 GPT-5 Medium 88.1% 88.9% - - $1.25 $10.00 400k Sep. 2024 - Closed
OpenAI 🇺🇸 GPT-5.1 High 88.1% 99.6% - - $1.25 $10.00 400k - - Closed
OpenAI 🇺🇸 GPT-5.1 88.1% 94.0% 76.3% - $1.25 $10.00 400k Sep. 2024 - Closed
OpenAI 🇺🇸 GPT-5.1 Thinking 88.1% 94.0% 76.3% - $1.25 $10.00 400k - - Closed
OpenAI 🇺🇸 GPT-5.1 Instant 88.1% 94.0% 76.3% - $1.25 $10.00 400k - - Closed
OpenAI 🇺🇸 GPT-5.4 mini 88.0% 28.2% $0.75 $4.50 400k Aug. 2025 - Closed
MoonshotAI 🇨🇳 Kimi K2.5 87.6% 96.1% 76.8% 50.2% $0.60 $2.50 262.1k - 1000 Open
xAI 🇺🇸 Grok-4 87.5% 91.7% - - $3.00 $15.00 256k Dec. 2024 - Closed
OpenAI 🇺🇸 GPT-5 High 87.3% 94.6% - - $1.25 $10.00 400k Sep. 2024 - Closed
Anthropic 🇺🇸 Claude Opus 4.5 87.0% - 80.9% - $5.00 $25.00 200k Mar. 2025 - Closed
Google 🇺🇸 Gemini 3.1 Flash-Lite 86.9% 16.0% $0.25 $1.50 1M Jan. 2025 - Closed
Qwen 🇨🇳 Qwen3.5-122B-A10B 86.6% 72.0% 47.5% $0.40 $3.20 262.1k - 122 Open
Google 🇺🇸 Gemini 2.5 Pro Preview 06-05 86.4% 88.0% 67.2% - $1.25 $10.00 1.0M Jan. 2025 - Closed
ZAI 🇨🇳 GLM-5.1 x 86.2% 52.3% $1.40 $4.40 200k - 754 Open
ZAI 🇨🇳 GLM-4.7 85.7% 95.7% 73.8% 42.8% $0.60 $2.20 204.8k - 358 Open
OpenAI 🇺🇸 GPT-5 85.7% 94.6% 74.9% - $1.25 $10.00 400k Sep. 2024 - Closed
xAI 🇺🇸 Grok 4 Fast 85.7% 92.0% - 20.0% $0.20 $0.50 2M - - Closed
Qwen 🇨🇳 Qwen3.5-27B 85.5% 72.4% 48.5% $0.30 $2.40 262.1k - 27 Open
Bytedance 🇨🇳 Seed 2.0 Lite 85.1% 93.0% 73.5% - - - Jan. 2024 - Closed
Baidu 🇨🇳 ERNIE 5.0 85.0% 87.0% - 39.0% - - - - - Closed
Anthropic 🇺🇸 Claude 3.7 Sonnet 84.8% 54.8% 70.3% - $3.00 $15.00 200k - - Closed
xAI 🇺🇸 Grok-3 84.6% 93.3% - - $3.00 $15.00 128k Nov. 2024 - Closed
MoonshotAI 🇨🇳 Kimi K2-Thinking-0905 x 84.5% 100.0% 71.3% 51.0% $0.47 $2.00 262.1k - 1000 Open
Google 🇺🇸 Gemma 4 31B 84.3% 26.5% $0.14 $0.40 262.1k Jan. 2025 30.7 Open
Qwen 🇨🇳 Qwen3.5-35B-A3B 84.2% 69.2% 47.4% $0.25 $2.00 262.1k - 35 Open
OpenAI 🇺🇸 ChatGPT-4o Latest 84.0% - - - $2.50 $10.00 128k - - Closed
xAI 🇺🇸 Grok-3 Mini 84.0% 90.8% - - $0.30 $0.50 128k Nov. 2024 - Closed
Xiaomi 🇨🇳 MiMo-V2-Flash x 83.7% 94.1% 73.4% 22.1% $0.10 $0.30 256k - 309 Open
Anthropic 🇺🇸 Claude Sonnet 4.5 83.4% 87.0% - - $3.00 $15.00 200k Jan. 2025 - Closed
OpenAI 🇺🇸 o3 83.3% 86.4% 69.1% - $2.00 $8.00 200k May 2024 - Closed
Google 🇺🇸 Gemini 2.5 Pro 83.0% 83.0% 63.2% - $1.25 $10.00 1.0M Jan. 2025 - Closed
Google 🇺🇸 Gemini 2.5 Flash 82.8% 72.0% 60.4% - $0.30 $2.50 1.0M Jan. 2025 - Closed
OpenAI 🇺🇸 GPT-5.4 nano 82.8% 24.3% $0.20 $1.25 400k Aug. 2025 - Closed
Nvidia 🇺🇸 Nemotron 3 Super (120B A12B) x 82.7% 90.2% 53.7% 22.8% $0.10 $0.50 262.1k Jun. 2025 120 Open
DeepSeek 🇨🇳 DeepSeek-V3.2 (Thinking) x 82.4% 93.1% 73.1% 25.1% $0.28 $0.42 131.1k - 685 Open
DeepSeek 🇨🇳 DeepSeek-V3.2 x 82.4% 93.1% 73.1% 40.8% $0.26 $0.38 163.8k - 685 Open
OpenAI 🇺🇸 GPT-5 mini 82.3% 91.1% - - $0.25 $2.00 400k May 2024 - Closed
Google 🇺🇸 Gemma 4 26B-A4B 82.3% 17.2% $0.13 $0.40 262.1k Jan. 2025 25.2 Open
Qwen 🇨🇳 Qwen3.5-9B 81.7% - - - - 9 Open
Meituan 🇨🇳 LongCat-Flash-Thinking x 81.5% 90.6% 59.4% $0.30 $1.20 128k - 560 Open
OpenAI 🇺🇸 o4-mini 81.4% 92.7% 68.1% - $1.10 $4.40 200k May 2024 - Closed
Qwen 🇨🇳 Qwen3-235B-A22B-Thinking-2507 x 81.1% 92.3% - - $0.30 $3.00 262.1k - 235 Open
ZAI 🇨🇳 GLM-4.6 81.0% 93.9% 68.0% 17.2% $0.55 $2.19 131.1k - 357 Open
MiniMax 🇨🇳 MiniMax M2.1 x 81.0% 81.0% 67.0% 22.0% $0.30 $1.20 1M - 230 Open
DeepSeek 🇨🇳 DeepSeek-R1-0528 x 81.0% 87.5% 44.6% - $0.50 $2.15 131.1k - 671 Open
Anthropic 🇺🇸 Claude Opus 4.1 80.9% 78.0% 74.5% - $15.00 $75.00 200k - - Closed
OpenAI 🇺🇸 GPT OSS 120B High x 80.9% 92.5% - - $0.10 $0.50 131.1k - 116.8 Open
Meituan 🇨🇳 LongCat-Flash-Thinking-2601 x 80.5% 99.6% 70.0% 25.2% $0.30 $1.20 128k - 560 Open
OpenAI 🇺🇸 GPT OSS 120B x 80.1% - - - $0.09 $0.45 131.1k - 116.8 Open
DeepSeek 🇨🇳 DeepSeek-V3.2-Exp x 79.9% 89.3% 67.8% - $0.27 $0.41 163.8k - 685 Open
Anthropic 🇺🇸 Claude Opus 4 79.6% 75.5% 72.5% - $15.00 $75.00 200k - - Closed
ZAI 🇨🇳 GLM-4.5 x 79.1% - 64.2% 14.4% $0.40 $1.60 131.1k - 355 Open
OpenAI 🇺🇸 o1-pro 79.0% - - - - - - Sep. 2023 - Closed
Sarvam AI 🇮🇳 Sarvam-105B x 78.7% 96.7% 45.0% 11.2% - - - - 105 Open
MiniMax 🇨🇳 MiniMax M2 x 78.0% 78.0% 69.4% 12.5% $0.30 $1.20 1M - 230 Open
OpenAI 🇺🇸 o1 x 78.0% - 41.0% - $15.00 $60.00 200k - - Closed
Qwen 🇨🇳 Qwen3-235B-A22B-Instruct-2507 x 77.5% 70.3% - - $0.15 $0.80 262.1k - 235 Open
OpenAI 🇺🇸 o3-mini x 77.2% - 49.3% - $1.10 $4.40 200k Sep. 2023 - Closed
Qwen 🇨🇳 Qwen3-Next-80B-A3B-Thinking x 77.2% 87.8% - - $0.15 $1.50 65.5k - 80 Open
Qwen 🇨🇳 Qwen3.5-4B 76.2% - - - - 4 Open
Nvidia 🇺🇸 Llama 3.1 Nemotron Ultra 253B v1 x 76.0% 72.5% - - - - - Dec. 2023 253 Open
MoonshotAI 🇨🇳 Kimi K2 0905 x 75.8% - - - $0.60 $2.50 262.1k - 1000 Closed
Anthropic 🇺🇸 Claude Sonnet 4 75.4% 70.5% 72.7% - $3.00 $15.00 200k - - Closed
ZAI 🇨🇳 GLM-4.7-Flash x 75.2% 91.6% 59.2% 14.4% $0.07 $0.40 128k - 30 Open
MoonshotAI 🇨🇳 Kimi K2-Instruct-0905 x 75.1% 49.5% 65.8% 4.7% - - - - 1000 Open
MoonshotAI 🇨🇳 Kimi K2 Instruct x 75.1% 49.5% - - $0.50 $0.50 200k - 1000 Open
Nvidia 🇺🇸 Nemotron 3 Nano (30B A3B) x 75.0% 99.2% 38.8% 15.5% $0.06 $0.24 262.1k Nov. 2025 32 Open
ZAI 🇨🇳 GLM-4.5-Air x 75.0% - 57.6% 10.6% - - - - 106 Open
DeepSeek 🇨🇳 DeepSeek-V3.1 x 74.9% 49.8% 66.0% - $0.27 $1.00 163.8k - 671 Open
Qwen 🇨🇳 Qwen3 VL 30B A3B Thinking 74.4% 83.1% - - $0.20 $1.00 262.1k - 31 Open
OpenAI 🇺🇸 GPT OSS 20B High x 74.2% 98.7% - - $0.10 $0.50 131.1k - 20.9 Open
Google 🇺🇸 Gemini 2.0 Flash Thinking 74.2% - - - - - - Aug. 2024 - Closed
Baidu 🇨🇳 ERNIE 4.5 x 74.0% - - - $0.40 $4.00 128k - 21 Closed
Inception 🇺🇸 Mercury 2 x 74.0% 91.1% $0.25 $0.75 128k - - Closed
DeepSeek 🇨🇳 DeepSeek R1 Zero x 73.3% - - - - - - - 671 Open
OpenAI 🇺🇸 o1-preview x 73.3% - 41.3% - $15.00 $60.00 128k - - Closed
Meituan 🇨🇳 LongCat-Flash-Chat x 73.2% 61.3% 60.4% $0.30 $1.20 128k - 560 Open
Qwen 🇨🇳 Qwen3 VL 32B Thinking 73.1% 83.7% - - - - - - 33 Open
Anthropic 🇺🇸 Claude Haiku 4.5 73.0% 80.7% 73.3% - $1.00 $5.00 200k Feb. 2025 - Closed
Qwen 🇨🇳 Qwen3-Next-80B-A3B-Instruct x 72.9% 69.5% - - $0.15 $1.50 65.5k - 80 Open
OpenAI 🇺🇸 GPT OSS 20B x 71.5% - - - $0.05 $0.20 131.1k - 20.9 Open
OpenAI 🇺🇸 GPT-5 nano 71.2% 85.2% - - $0.05 $0.40 400k May 2024 - Closed
Mistral 🇫🇷 Ministral 3 (14B Reasoning 2512) 71.2% 85.0% - - $0.20 $0.20 262.1k - 14 Open
Mistral 🇫🇷 Mistral Small 4 71.2% 83.8% $0.15 $0.60 256k - 119 Open
Mistral 🇫🇷 Magistral Medium 70.8% 64.9% - - - - - Jun. 2025 24 Open
Qwen 🇨🇳 Qwen3 VL 30B A3B Instruct 70.4% 69.3% - - $0.20 $0.70 262.1k - 31 Open
OpenAI 🇺🇸 GPT-4o 70.1% - 33.2% - $2.50 $10.00 128k - - Closed
MiniMax 🇨🇳 MiniMax M1 80K x 70.0% 76.9% 56.0% 8.4% $0.55 $2.20 1M - 456 Open
Qwen 🇨🇳 Qwen3 VL 8B Thinking 69.9% 80.3% - - $0.18 $2.09 262.1k - 9 Open
Meta 🇺🇸 Llama 4 Maverick 69.8% - - - $0.17 $0.60 1M - 400 Open
OpenAI 🇺🇸 GPT-4.5 69.5% - 38.0% - $75.00 $150.00 128k - - Closed
MiniMax 🇨🇳 MiniMax M1 40K x 69.2% 74.6% 55.6% 7.2% - - - - 456 Open
Microsoft 🇺🇸 Phi 4 Reasoning Plus x 68.9% 78.0% - - - - - Mar. 2025 14 Open
Qwen 🇨🇳 Qwen3 VL 32B Instruct 68.9% 66.2% - - - - - - 33 Open
DeepSeek 🇨🇳 DeepSeek-V3 0324 x 68.4% - - - $0.28 $1.14 163.8k - 671 Open
Mistral 🇫🇷 Magistral Small 2506 x 68.2% 62.8% - - - - - Jun. 2025 24 Open
Anthropic 🇺🇸 Claude 3.5 Sonnet 67.2% - 49.0% - $3.00 $15.00 200k - - Closed
Mistral 🇫🇷 Ministral 3 (8B Reasoning 2512) 66.8% 78.7% - - $0.15 $0.15 262.1k - 8 Open
Meituan 🇨🇳 LongCat-Flash-Lite x 66.8% 63.2% 54.4% $0.10 $0.40 256k - 68.5 Open
Nvidia 🇺🇸 Llama-3.3 Nemotron Super 49B v1 x 66.7% 58.4% - - - - - Dec. 2023 49.9 Open
Sarvam AI 🇮🇳 Sarvam-30B x 66.5% 96.7% 34.0% - - - - 30 Open
OpenAI 🇺🇸 GPT-4.1 66.3% 46.4% 54.6% - $2.00 $8.00 1.0M Jun. 2024 - Closed
Nous Research 🇺🇸 Hermes 3 70B x 66.1% - - - $0.35 $1.40 131.1k - 70 Open
Microsoft 🇺🇸 Phi 4 Reasoning x 65.8% 62.9% - - - - - Mar. 2025 14 Open
Qwen 🇨🇳 Qwen3 30B A3B x 65.8% 70.9% - - $0.10 $0.30 128k - 30.5 Open
DeepSeek 🇨🇳 DeepSeek R1 Distill Llama 70B x 65.2% - - - $0.10 $0.40 128k - 70.6 Open
Qwen 🇨🇳 QwQ-32B x 65.2% - - - - - - Nov. 2024 32.5 Open
Qwen 🇨🇳 QwQ-32B-Preview x 65.2% - - - $0.15 $0.60 32.8k Nov. 2024 32.5 Open
OpenAI 🇺🇸 GPT-4.1 mini 65.0% 40.2% 23.6% - $0.40 $1.60 1.0M May 2024 - Closed
Google 🇺🇸 Gemini 2.5 Flash-Lite 64.6% 49.8% 31.6% - $0.10 $0.40 1.0M Jan. 2025 - Open
Qwen 🇨🇳 Qwen3 VL 4B Thinking 64.1% 74.5% - - $0.10 $1.00 262.1k - 4 Open
Nvidia 🇺🇸 Nemotron Nano 9B v2 x 64.0% 72.1% - - - - - Sep. 2024 8.9 Open
DeepSeek 🇨🇳 DeepSeek R1 Distill Qwen 32B x 62.1% - - - $0.12 $0.18 128k - 32.8 Open
Google 🇺🇸 Gemini 2.0 Flash 62.1% - - - $0.10 $0.40 1.0M Aug. 2024 - Closed
Qwen 🇨🇳 Qwen3 Max x 62.0% 81.6% 69.6% - $0.50 $5.00 256k - 1000 Closed
OpenAI 🇺🇸 o1-mini x 60.0% - - - $3.00 $12.00 128k - - Closed
Anthropic 🇺🇸 Claude 3.5 Sonnet 59.4% - - - $3.00 $15.00 200k - - Closed
DeepSeek 🇨🇳 DeepSeek R1 Distill Qwen 14B x 59.1% - - - - - - - 14.8 Open
DeepSeek 🇨🇳 DeepSeek-V3 x 59.1% - 42.0% - $0.27 $1.10 131.1k - 671 Open
Google 🇺🇸 Gemini 1.5 Pro 59.1% - - - $2.50 $10.00 2.1M Nov. 2023 - Closed
Google 🇺🇸 Gemma 4 E4B 58.6% - - - Jan. 2025 8 Open
Meta 🇺🇸 Llama 4 Scout 57.2% - - - $0.08 $0.30 10M - 109 Open
Microsoft 🇺🇸 Phi 4 x 56.1% - - - $0.07 $0.14 16k Jun. 2024 14.7 Open
xAI 🇺🇸 Grok-2 56.0% - - - $2.00 $10.00 128k - - Closed
Nvidia 🇺🇸 Llama 3.1 Nemotron Nano 8B V1 x 54.1% 47.1% - - - - - Dec. 2023 8 Open
OpenAI 🇺🇸 GPT-4o 53.6% - - - $2.50 $10.00 128k - - Closed
Mistral 🇫🇷 Min istral 3 (3B Reasoning 2512) 53.4% 72.1% - - $0.10 $0.10 131.1k - 3 Open
Microsoft 🇺🇸 Phi 4 Mini Reasoning x 52.0% - - - - - - Feb. 2025 3.8 Open
Qwen 🇨🇳 Qwen3.5-2B 51.6% - - - - 2 Open
Google 🇺🇸 Gemini 2.0 Flash-Lite 51.5% - - - $0.07 $0.30 1.0M Jun. 2024 - Closed
Google 🇺🇸 Gemini 1.5 Flash 51.0% - - - $0.15 $0.60 1.0M Nov. 2023 - Closed
xAI 🇺🇸 Grok-2 mini 51.0% - - - - - - - - Closed
Meta 🇺🇸 Llama 3.1 405B Instruct x 50.7% - - - $0.89 $0.89 128k - 405 Open
Meta 🇺🇸 Llama 3.3 70B Instruct x 50.5% - - - $0.20 $0.20 128k - 70 Open
Anthropic 🇺🇸 Claude 3 Opus 50.4% - - - $15.00 $75.00 200k - - Closed
OpenAI 🇺🇸 GPT-4.1 nano 50.3% - - - $0.10 $0.40 1.0M May 2024 - Closed
Qwen 🇨🇳 Qwen2.5 32B Instruct x 49.5% - - - - - - - 32.5 Open
DeepSeek 🇨🇳 DeepSeek R1 Distill Qwen 7B x 49.1% - - - - - - - 7.6 Open
DeepSeek 🇨🇳 DeepSeek R1 Distill Llama 8B x 49.0% - - - - - - - 8.0 Open
Qwen 🇨🇳 Qwen2.5 72B Instruct x 49.0% - - - $0.35 $0.40 131.1k - 72.7 Open
MoonshotAI 🇨🇳 Kimi K2 Base x 48.1% - - - - - - - 1000 Open
OpenAI 🇺🇸 GPT-4 Turbo x 48.0% - - - $10.00 $30.00 128k Dec. 2023 - Closed
Qwen 🇨🇳 Qwen3 235B A22B x 47.5% 81.5% - - $0.10 $0.10 128k - 235 Open
Amazon - Nova Pro 46.9% - - - $0.80 $3.20 300k - - Closed
Meta 🇺🇸 Llama 3.2 90B Instruct 46.7% - - - $0.35 $0.40 128k - 90 Open
Mistral 🇫🇷 Mistral Small 3.2 24B Instruct 46.1% - - - - - - Oct. 2023 23.6 Open
Mistral 🇫🇷 Mistral Small 3.1 24B Instruct 46.0% - - - - - - - 24 Open
Qwen 🇨🇳 Qwen2.5 VL 32B Instruct 46.0% - - - - - - - 33.5 Open
Qwen 🇨🇳 Qwen2.5 14B Instruct x 45.5% - - - - - - - 14.7 Open
Mistral 🇫🇷 Mistral Small 3 24B Instruct x 45.3% - - - $0.07 $0.14 32k Oct. 2023 24 Open
Mistral 🇫🇷 Mistral Large 3 (675B Instruct 2512) 43.9% - - - $0.50 $1.50 262.1k - 675 Open
Mistral 🇫🇷 Mistral Large 3 (675B Base) 43.9% - - - - - - - 675 Open
Mistral 🇫🇷 Mistral Large 3 (675B Instruct 2512 Eagle) 43.9% - - - - - - - 675 Open
Mistral 🇫🇷 Mistral Large 3 (675B Instruct 2512 NVFP4) 43.9% - - - - - - - 675 Open
Google 🇺🇸 Gemma 4 E2B 43.4% - - - Jan. 2025 5.1 Open
Google 🇺🇸 Gemma 3 27B 42.4% - - - $0.10 $0.20 131.1k - 27 Open
Qwen 🇨🇳 Qwen2 72B Instruct x 42.4% - - - - - - - 72 Open
Amazon - Nova Lite 42.0% - - - $0.06 $0.24 300k - - Closed
Meta 🇺🇸 Llama 3.1 70B Instruct x 41.7% - - - $0.20 $0.20 128k - 70 Open
Anthropic 🇺🇸 Claude 3.5 Haiku x 41.6% - 40.6% - $0.80 $4.00 200k - - Closed
Google 🇺🇸 Gemma 3 12B 40.9% - - - $0.05 $0.10 131.1k - 12 Open
Anthropic 🇺🇸 Claude 3 Sonnet 40.4% - - - $3.00 $15.00 200k - - Closed
Google 🇺🇸 Gemini Diffusion x 40.4% 23.3% 22.9% - - - - - - Closed
OpenAI 🇺🇸 GPT-4o mini 40.2% - 8.7% - $0.15 $0.60 128k Oct. 2023 - Closed
Amazon - Nova Micro x 40.0% - - - $0.03 $0.14 128k - - Closed
Google 🇺🇸 Gemini 1.5 Flash 8B 38.4% - - - $0.07 $0.30 1.0M Oct. 2024 8 Closed
Mistral 🇫🇷 Mistral Small 3.1 24B Base 37.5% - - - $0.10 $0.30 128k - 24 Open
AI21 Labs - Jamba 1.5 Large x 36.9% - - - $2.00 $8.00 256k Mar. 2024 398 Open
Microsoft 🇺🇸 Phi-3.5-MoE-instruct x 36.8% - - - - - - - 60 Open
Qwen 🇨🇳 Qwen2.5 7B Instruct x 36.4% - - - $0.30 $0.30 131.1k - 7.6 Open
xAI 🇺🇸 Grok-1.5 x 35.9% - - - - - - - - Closed
OpenAI 🇺🇸 GPT-4 35.7% - - - $30.00 $60.00 32.8k Dec. 2022 - Closed
Mistral 🇫🇷 Mistral Small 3 24B Base 34.4% - - - - - - Oct. 2023 23.6 Open
DeepSeek 🇨🇳 DeepSeek R1 Distill Qwen 1.5B x 33.8% - - - - - - - 1.8 Open
Anthropic 🇺🇸 Claude 3 Haiku 33.3% - - - $0.25 $1.25 200k - - Closed
Meta 🇺🇸 Llama 3.2 11B Instruct 32.8% - - - $0.05 $0.05 128k Dec. 2023 10.6 Open
Meta 🇺🇸 Llama 3.2 3B Instruct x 32.8% - - - $0.01 $0.02 128k - 3.2 Open
AI21 Labs - Jamba 1.5 Mini x 32.3% - - - $0.20 $0.40 256.1k Mar. 2024 52 Open
Google 🇺🇸 Gemma 3 4B 30.8% - - - $0.02 $0.04 131.1k Aug. 2024 4 Open
OpenAI 🇺🇸 GPT-3.5 Turbo x 30.8% - - - $0.50 $1.50 16.4k Sep. 2021 - Closed
Qwen 🇨🇳 Qwen2.5-Omni-7B 30.8% - - - - - - - 7 Open
Meta 🇺🇸 Llama 3.1 8B Instruct x 30.4% - - - $0.03 $0.03 131.1k Dec. 2023 8 Open
Microsoft 🇺🇸 Phi-3.5-mini-instruct x 30.4% - - - $0.10 $0.10 128k - 3.8 Open
Google 🇺🇸 Gemini 1.0 Pro x 27.9% - - - $0.50 $1.50 32.8k Feb. 2024 - Closed
Qwen 🇨🇳 Qwen2 7B Instruct x 25.3% - - - - - - - 7.6 Open
Microsoft 🇺🇸 Phi 4 Mini x 25.2% - - - - - - Jun. 2024 3.8 Open
Google 🇺🇸 Gemma 3n E2B Instructed 24.8% 6.7% - - - - - Jun. 2024 8 Closed
Google 🇺🇸 Gemma 3n E2B Instructed LiteRT (Preview) 24.8% 6.7% - - - - - Jun. 2024 1.9 Open
Google 🇺🇸 Gemma 3n E4B Instructed 23.7% 11.6% - - $20.00 $40.00 32k Jun. 2024 8 Closed
Google 🇺🇸 Gemma 3n E4B Instructed LiteRT Preview 23.7% 11.6% - - - - - Jun. 2024 1.9 Open
Google 🇺🇸 Gemma 3 1B x 19.2% - - - - - - - 1 Open
Qwen 🇨🇳 Qwen3.5-0.8B 11.9% - - - - 0.8 Open
OpenAI 🇺🇸 GPT-5.2 Codex - - - - $1.75 $14.00 400k - - Closed
DeepSeek 🇨🇳 DeepSeek-V3.2-Speciale x - 96.0% 73.1% 30.6% $0.28 $0.42 131.1k - 685 Open
DeepSeek 🇨🇳 DeepSeek-V3.2 (Non-thinking) x - - - - $0.28 $0.42 131.1k - 685 Open
OpenAI 🇺🇸 GPT-5.1 Medium - 98.4% - - $1.25 $10.00 400k - - Closed
OpenAI 🇺🇸 GPT-5.1 Codex - - 73.7% - $1.25 $10.00 400k Sep. 2024 - Closed
StepFun 🇨🇳 Step-3.5-Flash NEW - 97.3% 74.4% - $0.10 $0.40 65.5k - 196 Open
Qwen 🇨🇳 Qwen3-Coder x - - - - $0.18 $0.18 256k - 480 Open
xAI 🇺🇸 Grok-4 Fast Reasoning - - - - $0.20 $0.50 2M - - Closed
xAI 🇺🇸 Grok-4.1 Fast Non-Reasoning - - - - $0.20 $0.50 2M - - Closed
OpenAI 🇺🇸 GPT-5.1 Codex High - 96.7% - - $1.25 $10.00 400k - - Closed
xAI 🇺🇸 Grok-4.1 Fast Reasoning - - - - $0.20 $0.50 2M - - Closed
xAI 🇺🇸 Grok Code Fast 1 x - - 70.8% - $0.20 $1.50 256k - - Closed
OpenAI 🇺🇸 GPT-5.1 Codex Mini - 42.1% - - $0.25 $2.00 400k - - Closed
Qwen 🇨🇳 Qwen3-Coder 480B A35B Instruct x - - 69.6% - - - - - 480 Open
Qwen 🇨🇳 Qwen3 32B x - 72.9% - - $0.10 $0.44 128k - 32.8 Open
Mistral 🇫🇷 Codestral-22B x - - - - - - - - 22.2 Open
Cohere 🇨🇦 Command R+ x - - - - $0.25 $1.00 128k - 104 Open
DeepSeek 🇨🇳 DeepSeek-R1 x - - - - $0.55 $2.19 131.1k - 671 Open
DeepSeek 🇨🇳 DeepSeek-V2.5 x - - 16.8% - $0.14 $0.28 8.2k - 236 Open
DeepSeek 🇨🇳 DeepSeek VL2 - - - - $9.50 $4800.00 129.3k - 27 Open
DeepSeek 🇨🇳 DeepSeek VL2 Small - - - - - - - - 16 Open
DeepSeek 🇨🇳 DeepSeek VL2 Tiny - - - - - - - - 3 Open
Mistral 🇫🇷 Devstral Medium x - - 61.6% - $0.40 $2.00 128k - - Closed
Mistral 🇫🇷 Devstral Small 1.1 x - - 53.6% - $0.10 $0.30 128k - 24 Open
Google 🇺🇸 Gemma 2 27B x - - - - - - - - 27.2 Open
Google 🇺🇸 Gemma 2 9B x - - - - - - - - 9.2 Open
Google 🇺🇸 Gemma 3n E2B - - - - - - - Jun. 2024 8 Closed
Google 🇺🇸 Gemma 3n E4B - - - - - - - Jun. 2024 8 Closed
OpenAI 🇺🇸 GPT-5.3 Codex NEW - - - - $1.75 $14.00 400k - - Closed
OpenAI 🇺🇸 GPT-5 Codex x - - 74.5% - - - - Sep. 2024 - Closed
IBM - Granite 3.3 8B Base - - - - - - - Apr. 2024 8.2 Open
IBM - IBM Granite 4.0 Tiny Preview x - - - - - - - - 7 Open
xAI 🇺🇸 Grok-1.5V - - - - - - - - - Closed
xAI 🇺🇸 Grok-2 Image 1212 x - - - - - - 131.1k - - Closed
xAI 🇺🇸 Grok-4.1 - - - - $3.00 $15.00 256k - - Closed
xAI 🇺🇸 Grok-4.1 Thinking - - - - $3.00 $15.00 256k - - Closed
MoonshotAI 🇨🇳 Kimi-k1.5 - - - - - - - - - Closed
Nvidia 🇺🇸 Llama 3.1 Nemotron 70B Instruct x - - - - - - - Dec. 2023 70 Open
Google 🇺🇸 MedGemma 4B IT - - - - - - - - 4.3 Open
Mistral 🇫🇷 Ministral 3 (14B Base 2512) - - - - - - - - 14 Open
Mistral 🇫🇷 MiniStral 3 (14B Instruct 2512) - - - - - - - - 14 Open
Mistral 🇫🇷 Ministral 3 (3B Base 2512) - - - - - - - - 3 Open
Mistral 🇫🇷 Ministral 3 (3B Instruct 2512) - - - - - - - - 3 Open
Mistral 🇫🇷 Ministral 3 (8B Base 2512) - - - - - - - - 8 Open
Mistral 🇫🇷 Ministral 3 (8B Instruct 2512) - - - - - - - - 8 Open
Mistral 🇫🇷 Ministral 8B Instruct x - - - - $0.10 $0.10 128k - 8.0 Open
Mistral 🇫🇷 Mistral Large 2 x - - - - $2.00 $6.00 128k - 123 Open
Mistral 🇫🇷 Mistral Large 3 - - - - $2.00 $5.00 128k - 675 Open
Mistral 🇫🇷 Mistral NeMo Instruct x - - - - $0.15 $0.15 128k - 12 Open
Mistral 🇫🇷 Mistral Small x - - - - $0.20 $0.60 32.8k - 22 Open
OpenAI 🇺🇸 o3-pro - - - - $20.00 $80.00 200k May 2024 - Closed
Microsoft 🇺🇸 Phi-3.5-vision-instruct - - - - - - - - 4.2 Open
Microsoft 🇺🇸 Phi-4-multimodal-instruct - - - - $0.05 $0.10 128k Jun. 2024 5.6 Open
Mistral 🇫🇷 Pixtral-12B - - - - $0.15 $0.15 128k - 12.4 Open
Mistral 🇫🇷 Pixtral Large - - - - $2.00 $6.00 128k - 124 Open
Qwen 🇨🇳 QvQ-72B-Preview - - - - - - - - 73.4 Open
Qwen 🇨🇳 Qwen2.5-Coder 32B Instruct x - - - - $0.09 $0.09 128k - 32 Open
Qwen 🇨🇳 Qwen2.5-Coder 7B Instruct x - - - - - - - - 7 Open
Qwen 🇨🇳 Qwen2.5 VL 72B Instruct - - - - - - - - 72 Open
Qwen 🇨🇳 Qwen2.5 VL 7B Instruct - - - - - - - - 8.3 Open
Qwen 🇨🇳 Qwen2-VL-72B-Instruct - - - - - - - Jun. 2023 73.4 Open
Qwen 🇨🇳 Qwen3-Next-80B-A3B-Base x - - - - - - - - 80 Open
Qwen 🇨🇳 Qwen3 VL 235B A22B Instruct - 74.7% - - $0.30 $1.49 262.1k - 236 Open
Qwen 🇨🇳 Qwen3 VL 235B A22B Thinking - 89.7% - 13.6% $0.45 $3.49 262.1k - 236 Open
Qwen 🇨🇳 Qwen3 VL 4B Instruct - 46.6% - - $0.10 $0.60 262.1k - 4 Open
Qwen 🇨🇳 Qwen3 VL 8B Instruct - 45.9% - - $0.08 $0.50 262.1k - 9 Open
StepFun 🇨🇳 Step3-VL-10B - 87.7% - - - - - - 10 Open
xAI 🇺🇸 Grok-4 Fast Non-Reasoning - - - - $0.20 $0.50 2M - - Closed
ZAI 🇨🇳 GLM-4.5V - - - - $0.55 $2.19 131.1k - 108 Open
IBM - Granite 3.3 8B Instruct - - - - $0.50 $0.50 128k Apr. 2024 8 Open
LG AI Research 🇰🇷 K-EXAONE-236B-A23B x - 92.8% - - $0.60 $1.00 32.8k Oct. 2025 236 Closed
ZAI 🇨🇳 GLM-5 x 77.8% $1.00 $3.20 200k - 744 Open
xAI 🇺🇸 Grok-4.20 Beta Non-Reasoning $2.00 $6.00 2M - - Closed
MiniMax 🇨🇳 MiniMax M2.7 x $0.30 $1.20 204.8k - - Open
MiniMax 🇨🇳 MiniMax M2.5 x 80.2% $0.30 $1.20 1M - 230 Open
xAI 🇺🇸 Grok-4.20 Multi-Agent Beta $2.00 $6.00 2M - - Closed
xAI 🇺🇸 Grok-4.20 Beta Reasoning $2.00 $6.00 2M - - Closed
OpenAI 🇺🇸 GPT-5.3 Chat $1.75 $14.00 128k Aug. 2025 - Closed
Xiaomi 🇨🇳 MiMo-V2-Omni 74.8% $0.40 $2.00 262k - - Closed
Xiaomi 🇨🇳 MiMo-V2-Pro x 78.0% $1.00 $3.00 1M - 1000 Closed
ZAI 🇨🇳 GLM-5V-Turbo - - - - - Closed
xAI 🇺🇸 Grok Imagine Image Pro x - - 131.1k - - Closed
OpenBMB 🇨🇳 MiniCPM-SALA x 78.3% - - - - 9.5 Open