Compare Leading AI & LLM Models
Compare performance metrics, benchmarks, and pricing across the leading AI models. Sort by any metric and find the best model for your use case.
Model Leaderboard
Showing 240 of 240 models
| Org | Country | Model | Multimodal | GPQA | AIME 2025 | SWE-bench | HLE | Input $/M | Output $/M | Context | Cutoff | Params (B) | License | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OpenAI | 🇺🇸 | GPT-5.2 Pro | ✓ | 93.2% | 100.0% | - | 36.6% | $21.00 | $168.00 | 400k | - | - | Closed | |
| OpenAI | 🇺🇸 | GPT-5.2 | ✓ | 92.4% | 100.0% | 80.0% | 34.5% | $1.75 | $14.00 | 400k | Aug. 2025 | - | Closed | |
| Google | 🇺🇸 | Gemini 3 Pro | ✓ | 91.9% | 100.0% | 76.2% | - | $2.00 | $12.00 | 1.0M | Jan. 2025 | - | Closed | |
| Google | 🇺🇸 | Gemini 3 Flash | ✓ | 90.4% | 99.7% | 78.0% | - | $0.50 | $3.00 | 1M | Jan. 2025 | - | Closed | |
| xAI | 🇺🇸 | Grok-4 Heavy (unreleased) | ✓ | 88.4% | 100.0% | - | - | - | - | - | Dec. 2024 | - | Closed | |
| OpenAI | 🇺🇸 | GPT-5 Medium | ✓ | 88.1% | 88.9% | - | - | $1.25 | $10.00 | 400k | Sep. 2024 | - | Closed | |
| OpenAI | 🇺🇸 | GPT-5.1 | ✓ | 88.1% | 94.0% | 76.3% | - | $1.25 | $10.00 | 400k | Sep. 2024 | - | Closed | |
| OpenAI | 🇺🇸 | GPT-5.1 High | ✓ | 88.1% | 99.6% | - | - | $1.25 | $10.00 | 400k | - | - | Closed | |
| OpenAI | 🇺🇸 | GPT-5.1 Thinking | ✓ | 88.1% | 94.0% | 76.3% | - | $1.25 | $10.00 | 400k | - | - | Closed | |
| OpenAI | 🇺🇸 | GPT-5.1 Instant | ✓ | 88.1% | 94.0% | 76.3% | - | $1.25 | $10.00 | 400k | - | - | Closed | |
| MoonshotAI | 🇨🇳 | Kimi K2.5 | ✓ | 87.6% | 96.1% | 76.8% | 50.2% | $0.60 | $2.50 | 262.1k | - | 1000 | Open | |
| xAI | 🇺🇸 | Grok-4 | ✓ | 87.5% | 91.7% | - | - | $3.00 | $15.00 | 256k | Dec. 2024 | - | Closed | |
| OpenAI | 🇺🇸 | GPT-5 High | ✓ | 87.3% | 94.6% | - | - | $1.25 | $10.00 | 400k | Sep. 2024 | - | Closed | |
| Anthropic | 🇺🇸 | Claude Opus 4.5 | ✓ | 87.0% | - | 80.9% | - | $5.00 | $25.00 | 200k | Mar. 2025 | - | Closed | |
| Google | 🇺🇸 | Gemini 2.5 Pro Preview 06-05 | ✓ | 86.4% | 88.0% | 67.2% | - | $1.25 | $10.00 | 1.0M | Jan. 2025 | - | Closed | |
| ZAI | 🇨🇳 | GLM-4.7 | ✓ | 85.7% | 95.7% | 73.8% | 42.8% | $0.60 | $2.20 | 204.8k | - | 358 | Open | |
| OpenAI | 🇺🇸 | GPT-5 | ✓ | 85.7% | 94.6% | 74.9% | - | $1.25 | $10.00 | 400k | Sep. 2024 | - | Closed | |
| xAI | 🇺🇸 | Grok 4 Fast | ✓ | 85.7% | 92.0% | - | 20.0% | $0.20 | $0.50 | 2M | - | - | Closed | |
| Baidu | 🇨🇳 | ERNIE 5.0 | ✓ | 85.0% | 87.0% | - | 39.0% | - | - | - | - | - | Closed | |
| Anthropic | 🇺🇸 | Claude 3.7 Sonnet | ✓ | 84.8% | 54.8% | 70.3% | - | $3.00 | $15.00 | 200k | - | - | Closed | |
| xAI | 🇺🇸 | Grok-3 | ✓ | 84.6% | 93.3% | - | - | $3.00 | $15.00 | 128k | Nov. 2024 | - | Closed | |
| MoonshotAI | 🇨🇳 | Kimi K2-Thinking-0905 | x | 84.5% | 100.0% | 71.3% | 51.0% | $0.47 | $2.00 | 262.1k | - | 1000 | Open | |
| OpenAI | 🇺🇸 | ChatGPT-4o Latest | ✓ | 84.0% | - | - | - | $2.50 | $10.00 | 128k | - | - | Closed | |
| xAI | 🇺🇸 | Grok-3 Mini | ✓ | 84.0% | 90.8% | - | - | $0.30 | $0.50 | 128k | Nov. 2024 | - | Closed | |
| Xiaomi | 🇨🇳 | MiMo-V2-Flash | x | 83.7% | 94.1% | 73.4% | 22.1% | $0.10 | $0.30 | 256k | - | 309 | Open | |
| Anthropic | 🇺🇸 | Claude Sonnet 4.5 | ✓ | 83.4% | 87.0% | - | - | $3.00 | $15.00 | 200k | Jan. 2025 | - | Closed | |
| OpenAI | 🇺🇸 | o3 | ✓ | 83.3% | 86.4% | 69.1% | - | $2.00 | $8.00 | 200k | May 2024 | - | Closed | |
| Google | 🇺🇸 | Gemini 2.5 Pro | ✓ | 83.0% | 83.0% | 63.2% | - | $1.25 | $10.00 | 1.0M | Jan. 2025 | - | Closed | |
| Google | 🇺🇸 | Gemini 2.5 Flash | ✓ | 82.8% | 72.0% | 60.4% | - | $0.30 | $2.50 | 1.0M | Jan. 2025 | - | Closed | |
| DeepSeek | 🇨🇳 | DeepSeek-V3.2 (Thinking) | x | 82.4% | 93.1% | 73.1% | 25.1% | $0.28 | $0.42 | 131.1k | - | 685 | Open | |
| OpenAI | 🇺🇸 | GPT-5 mini | ✓ | 82.3% | 91.1% | - | - | $0.25 | $2.00 | 400k | May 2024 | - | Closed | |
| OpenAI | 🇺🇸 | o4-mini | ✓ | 81.4% | 92.7% | 68.1% | - | $1.10 | $4.40 | 200k | May 2024 | - | Closed | |
| Qwen | 🇨🇳 | Qwen3-235B-A22B-Thinking-2507 | x | 81.1% | 92.3% | - | - | $0.30 | $3.00 | 262.1k | - | 235 | Open | |
| ZAI | 🇨🇳 | GLM-4.6 | ✓ | 81.0% | 93.9% | 68.0% | 17.2% | $0.55 | $2.19 | 131.1k | - | 357 | Open | |
| MiniMax | 🇨🇳 | MiniMax M2.1 | x | 81.0% | 81.0% | 67.0% | 22.0% | $0.30 | $1.20 | 1M | - | - | Open | |
| DeepSeek | 🇨🇳 | DeepSeek-R1-0528 | x | 81.0% | 87.5% | 44.6% | - | $0.50 | $2.15 | 131.1k | - | 671 | Open | |
| Anthropic | 🇺🇸 | Claude Opus 4.1 | ✓ | 80.9% | 78.0% | 74.5% | - | $15.00 | $75.00 | 200k | - | - | Closed | |
| OpenAI | 🇺🇸 | GPT OSS 120B High | x | 80.9% | 92.5% | - | - | $0.10 | $0.50 | 131.1k | - | 116.8 | Open | |
| OpenAI | 🇺🇸 | GPT OSS 120B | x | 80.1% | - | - | - | $0.09 | $0.45 | 131.1k | - | 116.8 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek-V3.2-Exp | x | 79.9% | 89.3% | 67.8% | - | $0.27 | $0.41 | 163.8k | - | 685 | Open | |
| Anthropic | 🇺🇸 | Claude Opus 4 | ✓ | 79.6% | 75.5% | 72.5% | - | $15.00 | $75.00 | 200k | - | - | Closed | |
| ZAI | 🇨🇳 | GLM-4.5 | x | 79.1% | - | 64.2% | 14.4% | $0.40 | $1.60 | 131.1k | - | 355 | Open | |
| OpenAI | 🇺🇸 | o1-pro | ✓ | 79.0% | - | - | - | - | - | - | Sep. 2023 | - | Closed | |
| MiniMax | 🇨🇳 | MiniMax M2 | x | 78.0% | 78.0% | 69.4% | 12.5% | $0.30 | $1.20 | 1M | - | 230 | Open | |
| OpenAI | 🇺🇸 | o1 | x | 78.0% | - | 41.0% | - | $15.00 | $60.00 | 200k | - | - | Closed | |
| Qwen | 🇨🇳 | Qwen3-235B-A22B-Instruct-2507 | x | 77.5% | 70.3% | - | - | $0.15 | $0.80 | 262.1k | - | 235 | Open | |
| OpenAI | 🇺🇸 | o3-mini | x | 77.2% | - | 49.3% | - | $1.10 | $4.40 | 200k | Sep. 2023 | - | Closed | |
| Qwen | 🇨🇳 | Qwen3-Next-80B-A3B-Thinking | x | 77.2% | 87.8% | - | - | $0.15 | $1.50 | 65.5k | - | 80 | Open | |
| Nvidia | 🇺🇸 | Llama 3.1 Nemotron Ultra 253B v1 | x | 76.0% | 72.5% | - | - | - | - | - | Dec. 2023 | 253 | Open | |
| MoonshotAI | 🇨🇳 | Kimi K2 0905 | x | 75.8% | - | - | - | $0.60 | $2.50 | 262.1k | - | 1000 | Closed | |
| Anthropic | 🇺🇸 | Claude Sonnet 4 | ✓ | 75.4% | 70.5% | 72.7% | - | $3.00 | $15.00 | 200k | - | - | Closed | |
| ZAI | 🇨🇳 | GLM-4.7-Flash | x | 75.2% | 91.6% | 59.2% | 14.4% | $0.07 | $0.40 | 128k | - | - | Open | |
| MoonshotAI | 🇨🇳 | Kimi K2-Instruct-0905 | x | 75.1% | 49.5% | 65.8% | 4.7% | - | - | - | - | 1000 | Open | |
| MoonshotAI | 🇨🇳 | Kimi K2 Instruct | x | 75.1% | 49.5% | - | - | $0.50 | $0.50 | 200k | - | 1000 | Open | |
| Nvidia | 🇺🇸 | Nemotron 3 Nano (30B A3B) | x | 75.0% | 99.2% | 38.8% | 15.5% | $0.06 | $0.24 | 262.1k | Nov. 2025 | 32 | Open | |
| ZAI | 🇨🇳 | GLM-4.5-Air | x | 75.0% | - | 57.6% | 10.6% | - | - | - | - | 106 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek-V3.1 | x | 74.9% | 49.8% | 66.0% | - | $0.27 | $1.00 | 163.8k | - | 671 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 30B A3B Thinking | ✓ | 74.4% | 83.1% | - | - | $0.20 | $1.00 | 262.1k | - | 31 | Open | |
| OpenAI | 🇺🇸 | GPT OSS 20B High | x | 74.2% | 98.7% | - | - | $0.10 | $0.50 | 131.1k | - | 20.9 | Open | |
| Google | 🇺🇸 | Gemini 2.0 Flash Thinking | ✓ | 74.2% | - | - | - | - | - | - | Aug. 2024 | - | Closed | |
| Baidu | 🇨🇳 | ERNIE 4.5 | x | 74.0% | - | - | - | $0.40 | $4.00 | 128k | - | 21 | Closed | |
| DeepSeek | 🇨🇳 | DeepSeek R1 Zero | x | 73.3% | - | - | - | - | - | - | - | 671 | Open | |
| OpenAI | 🇺🇸 | o1-preview | x | 73.3% | - | 41.3% | - | $15.00 | $60.00 | 128k | - | - | Closed | |
| Qwen | 🇨🇳 | Qwen3 VL 32B Thinking | ✓ | 73.1% | 83.7% | - | - | - | - | - | - | 33 | Open | |
| Anthropic | 🇺🇸 | Claude Haiku 4.5 | ✓ | 73.0% | 80.7% | 73.3% | - | $1.00 | $5.00 | 200k | Feb. 2025 | - | Closed | |
| Qwen | 🇨🇳 | Qwen3-Next-80B-A3B-Instruct | x | 72.9% | 69.5% | - | - | $0.15 | $1.50 | 65.5k | - | 80 | Open | |
| OpenAI | 🇺🇸 | GPT OSS 20B | x | 71.5% | - | - | - | $0.05 | $0.20 | 131.1k | - | 20.9 | Open | |
| OpenAI | 🇺🇸 | GPT-5 nano | ✓ | 71.2% | 85.2% | - | - | $0.05 | $0.40 | 400k | May 2024 | - | Closed | |
| Mistral | 🇫🇷 | Ministral 3 (14B Reasoning 2512) | ✓ | 71.2% | 85.0% | - | - | $0.20 | $0.20 | 262.1k | - | 14 | Open | |
| Mistral | 🇫🇷 | Magistral Medium | ✓ | 70.8% | 64.9% | - | - | - | - | - | Jun. 2025 | 24 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 30B A3B Instruct | ✓ | 70.4% | 69.3% | - | - | $0.20 | $0.70 | 262.1k | - | 31 | Open | |
| OpenAI | 🇺🇸 | GPT-4o | ✓ | 70.1% | - | 33.2% | - | $2.50 | $10.00 | 128k | - | - | Closed | |
| MiniMax | 🇨🇳 | MiniMax M1 80K | x | 70.0% | 76.9% | 56.0% | 8.4% | $0.55 | $2.20 | 1M | - | 456 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 8B Thinking | ✓ | 69.9% | 80.3% | - | - | $0.18 | $2.09 | 262.1k | - | 9 | Open | |
| Meta | 🇺🇸 | Llama 4 Maverick | ✓ | 69.8% | - | - | - | $0.17 | $0.60 | 1M | - | 400 | Open | |
| OpenAI | 🇺🇸 | GPT-4.5 | ✓ | 69.5% | - | 38.0% | - | $75.00 | $150.00 | 128k | - | - | Closed | |
| MiniMax | 🇨🇳 | MiniMax M1 40K | x | 69.2% | 74.6% | 55.6% | 7.2% | - | - | - | - | 456 | Open | |
| Microsoft | 🇺🇸 | Phi 4 Reasoning Plus | x | 68.9% | 78.0% | - | - | - | - | - | Mar. 2025 | 14 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 32B Instruct | ✓ | 68.9% | 66.2% | - | - | - | - | - | - | 33 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek-V3 0324 | x | 68.4% | - | - | - | $0.28 | $1.14 | 163.8k | - | 671 | Open | |
| Mistral | 🇫🇷 | Magistral Small 2506 | x | 68.2% | 62.8% | - | - | - | - | - | Jun. 2025 | 24 | Open | |
| Anthropic | 🇺🇸 | Claude 3.5 Sonnet | ✓ | 67.2% | - | 49.0% | - | $3.00 | $15.00 | 200k | - | - | Closed | |
| Mistral | 🇫🇷 | Ministral 3 (8B Reasoning 2512) | ✓ | 66.8% | 78.7% | - | - | $0.15 | $0.15 | 262.1k | - | 8 | Open | |
| Nvidia | 🇺🇸 | Llama-3.3 Nemotron Super 49B v1 | x | 66.7% | 58.4% | - | - | - | - | - | Dec. 2023 | 49.9 | Open | |
| OpenAI | 🇺🇸 | GPT-4.1 | ✓ | 66.3% | 46.4% | 54.6% | - | $2.00 | $8.00 | 1.0M | Jun. 2024 | - | Closed | |
| Nous Research | 🇺🇸 | Hermes 3 70B | x | 66.1% | - | - | - | $0.35 | $1.40 | 131.1k | - | 70 | Open | |
| Microsoft | 🇺🇸 | Phi 4 Reasoning | x | 65.8% | 62.9% | - | - | - | - | - | Mar. 2025 | 14 | Open | |
| Qwen | 🇨🇳 | Qwen3 30B A3B | x | 65.8% | 70.9% | - | - | $0.10 | $0.30 | 128k | - | 30.5 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek R1 Distill Llama 70B | x | 65.2% | - | - | - | $0.10 | $0.40 | 128k | - | 70.6 | Open | |
| Qwen | 🇨🇳 | QwQ-32B | x | 65.2% | - | - | - | - | - | - | Nov. 2024 | 32.5 | Open | |
| Qwen | 🇨🇳 | QwQ-32B-Preview | x | 65.2% | - | - | - | $0.15 | $0.60 | 32.8k | Nov. 2024 | 32.5 | Open | |
| OpenAI | 🇺🇸 | GPT-4.1 mini | ✓ | 65.0% | 40.2% | 23.6% | - | $0.40 | $1.60 | 1.0M | May 2024 | - | Closed | |
| Google | 🇺🇸 | Gemini 2.5 Flash-Lite | ✓ | 64.6% | 49.8% | 31.6% | - | $0.10 | $0.40 | 1.0M | Jan. 2025 | - | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 4B Thinking | ✓ | 64.1% | 74.5% | - | - | $0.10 | $1.00 | 262.1k | - | 4 | Open | |
| Nvidia | 🇺🇸 | Nemotron Nano 9B v2 | x | 64.0% | 72.1% | - | - | - | - | - | Sep. 2024 | 8.9 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek R1 Distill Qwen 32B | x | 62.1% | - | - | - | $0.12 | $0.18 | 128k | - | 32.8 | Open | |
| Google | 🇺🇸 | Gemini 2.0 Flash | ✓ | 62.1% | - | - | - | $0.10 | $0.40 | 1.0M | Aug. 2024 | - | Closed | |
| Qwen | 🇨🇳 | Qwen3 Max | x | 62.0% | 81.6% | 69.6% | - | $0.50 | $5.00 | 256k | - | - | Open | |
| OpenAI | 🇺🇸 | o1-mini | x | 60.0% | - | - | - | $3.00 | $12.00 | 128k | - | - | Closed | |
| Anthropic | 🇺🇸 | Claude 3.5 Sonnet | ✓ | 59.4% | - | - | - | $3.00 | $15.00 | 200k | - | - | Closed | |
| DeepSeek | 🇨🇳 | DeepSeek R1 Distill Qwen 14B | x | 59.1% | - | - | - | - | - | - | - | 14.8 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek-V3 | x | 59.1% | - | 42.0% | - | $0.27 | $1.10 | 131.1k | - | 671 | Open | |
| Google | 🇺🇸 | Gemini 1.5 Pro | ✓ | 59.1% | - | - | - | $2.50 | $10.00 | 2.1M | Nov. 2023 | - | Closed | |
| Meta | 🇺🇸 | Llama 4 Scout | ✓ | 57.2% | - | - | - | $0.08 | $0.30 | 10M | - | 109 | Open | |
| Microsoft | 🇺🇸 | Phi 4 | x | 56.1% | - | - | - | $0.07 | $0.14 | 16k | Jun. 2024 | 14.7 | Open | |
| xAI | 🇺🇸 | Grok-2 | ✓ | 56.0% | - | - | - | $2.00 | $10.00 | 128k | - | - | Closed | |
| Nvidia | 🇺🇸 | Llama 3.1 Nemotron Nano 8B V1 | x | 54.1% | 47.1% | - | - | - | - | - | Dec. 2023 | 8 | Open | |
| OpenAI | 🇺🇸 | GPT-4o | ✓ | 53.6% | - | - | - | $2.50 | $10.00 | 128k | - | - | Closed | |
| Mistral | 🇫🇷 | Ministral 3 (3B Reasoning 2512) | ✓ | 53.4% | 72.1% | - | - | $0.10 | $0.10 | 131.1k | - | 3 | Open | |
| Microsoft | 🇺🇸 | Phi 4 Mini Reasoning | x | 52.0% | - | - | - | - | - | - | Feb. 2025 | 3.8 | Open | |
| Google | 🇺🇸 | Gemini 2.0 Flash-Lite | ✓ | 51.5% | - | - | - | $0.07 | $0.30 | 1.0M | Jun. 2024 | - | Closed | |
| Google | 🇺🇸 | Gemini 1.5 Flash | ✓ | 51.0% | - | - | - | $0.15 | $0.60 | 1.0M | Nov. 2023 | - | Closed | |
| xAI | 🇺🇸 | Grok-2 mini | ✓ | 51.0% | - | - | - | - | - | - | - | - | Closed | |
| Meta | 🇺🇸 | Llama 3.1 405B Instruct | x | 50.7% | - | - | - | $0.89 | $0.89 | 128k | - | 405 | Open | |
| Meta | 🇺🇸 | Llama 3.3 70B Instruct | x | 50.5% | - | - | - | $0.20 | $0.20 | 128k | - | 70 | Open | |
| Anthropic | 🇺🇸 | Claude 3 Opus | ✓ | 50.4% | - | - | - | $15.00 | $75.00 | 200k | - | - | Closed | |
| OpenAI | 🇺🇸 | GPT-4.1 nano | ✓ | 50.3% | - | - | - | $0.10 | $0.40 | 1.0M | May 2024 | - | Closed | |
| Qwen | 🇨🇳 | Qwen2.5 32B Instruct | x | 49.5% | - | - | - | - | - | - | - | 32.5 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek R1 Distill Qwen 7B | x | 49.1% | - | - | - | - | - | - | - | 7.6 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek R1 Distill Llama 8B | x | 49.0% | - | - | - | - | - | - | - | 8.0 | Open | |
| Qwen | 🇨🇳 | Qwen2.5 72B Instruct | x | 49.0% | - | - | - | $0.35 | $0.40 | 131.1k | - | 72.7 | Open | |
| MoonshotAI | 🇨🇳 | Kimi K2 Base | x | 48.1% | - | - | - | - | - | - | - | 1000 | Open | |
| OpenAI | 🇺🇸 | GPT-4 Turbo | x | 48.0% | - | - | - | $10.00 | $30.00 | 128k | Dec. 2023 | - | Closed | |
| Qwen | 🇨🇳 | Qwen3 235B A22B | x | 47.5% | 81.5% | - | - | $0.10 | $0.10 | 128k | - | 235 | Open | |
| Amazon | - | Nova Pro | ✓ | 46.9% | - | - | - | $0.80 | $3.20 | 300k | - | - | Closed | |
| Meta | 🇺🇸 | Llama 3.2 90B Instruct | ✓ | 46.7% | - | - | - | $0.35 | $0.40 | 128k | - | 90 | Open | |
| Mistral | 🇫🇷 | Mistral Small 3.2 24B Instruct | ✓ | 46.1% | - | - | - | - | - | - | Oct. 2023 | 23.6 | Open | |
| Mistral | 🇫🇷 | Mistral Small 3.1 24B Instruct | ✓ | 46.0% | - | - | - | - | - | - | - | 24 | Open | |
| Qwen | 🇨🇳 | Qwen2.5 VL 32B Instruct | ✓ | 46.0% | - | - | - | - | - | - | - | 33.5 | Open | |
| Qwen | 🇨🇳 | Qwen2.5 14B Instruct | x | 45.5% | - | - | - | - | - | - | - | 14.7 | Open | |
| Mistral | 🇫🇷 | Mistral Small 3 24B Instruct | x | 45.3% | - | - | - | $0.07 | $0.14 | 32k | Oct. 2023 | 24 | Open | |
| Mistral | 🇫🇷 | Mistral Large 3 (675B Instruct 2512) | ✓ | 43.9% | - | - | - | $0.50 | $1.50 | 262.1k | - | 675 | Open | |
| Mistral | 🇫🇷 | Mistral Large 3 (675B Base) | ✓ | 43.9% | - | - | - | - | - | - | - | 675 | Open | |
| Mistral | 🇫🇷 | Mistral Large 3 (675B Instruct 2512 Eagle) | ✓ | 43.9% | - | - | - | - | - | - | - | 675 | Open | |
| Mistral | 🇫🇷 | Mistral Large 3 (675B Instruct 2512 NVFP4) | ✓ | 43.9% | - | - | - | - | - | - | - | 675 | Open | |
| Google | 🇺🇸 | Gemma 3 27B | ✓ | 42.4% | - | - | - | $0.10 | $0.20 | 131.1k | - | 27 | Open | |
| Qwen | 🇨🇳 | Qwen2 72B Instruct | x | 42.4% | - | - | - | - | - | - | - | 72 | Open | |
| Amazon | - | Nova Lite | ✓ | 42.0% | - | - | - | $0.06 | $0.24 | 300k | - | - | Closed | |
| Meta | 🇺🇸 | Llama 3.1 70B Instruct | x | 41.7% | - | - | - | $0.20 | $0.20 | 128k | - | 70 | Open | |
| Anthropic | 🇺🇸 | Claude 3.5 Haiku | x | 41.6% | - | 40.6% | - | $0.80 | $4.00 | 200k | - | - | Closed | |
| Google | 🇺🇸 | Gemma 3 12B | ✓ | 40.9% | - | - | - | $0.05 | $0.10 | 131.1k | - | 12 | Open | |
| Anthropic | 🇺🇸 | Claude 3 Sonnet | ✓ | 40.4% | - | - | - | $3.00 | $15.00 | 200k | - | - | Closed | |
| Google | 🇺🇸 | Gemini Diffusion | x | 40.4% | 23.3% | 22.9% | - | - | - | - | - | - | Closed | |
| OpenAI | 🇺🇸 | GPT-4o mini | ✓ | 40.2% | - | 8.7% | - | $0.15 | $0.60 | 128k | Oct. 2023 | - | Closed | |
| Amazon | - | Nova Micro | x | 40.0% | - | - | - | $0.03 | $0.14 | 128k | - | - | Closed | |
| Google | 🇺🇸 | Gemini 1.5 Flash 8B | ✓ | 38.4% | - | - | - | $0.07 | $0.30 | 1.0M | Oct. 2024 | 8 | Closed | |
| Mistral | 🇫🇷 | Mistral Small 3.1 24B Base | ✓ | 37.5% | - | - | - | $0.10 | $0.30 | 128k | - | 24 | Open | |
| AI21 Labs | - | Jamba 1.5 Large | x | 36.9% | - | - | - | $2.00 | $8.00 | 256k | Mar. 2024 | 398 | Open | |
| Microsoft | 🇺🇸 | Phi-3.5-MoE-instruct | x | 36.8% | - | - | - | - | - | - | - | 60 | Open | |
| Qwen | 🇨🇳 | Qwen2.5 7B Instruct | x | 36.4% | - | - | - | $0.30 | $0.30 | 131.1k | - | 7.6 | Open | |
| xAI | 🇺🇸 | Grok-1.5 | x | 35.9% | - | - | - | - | - | - | - | - | Closed | |
| OpenAI | 🇺🇸 | GPT-4 | ✓ | 35.7% | - | - | - | $30.00 | $60.00 | 32.8k | Dec. 2022 | - | Closed | |
| Mistral | 🇫🇷 | Mistral Small 3 24B Base | ✓ | 34.4% | - | - | - | - | - | - | Oct. 2023 | 23.6 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek R1 Distill Qwen 1.5B | x | 33.8% | - | - | - | - | - | - | - | 1.8 | Open | |
| Anthropic | 🇺🇸 | Claude 3 Haiku | ✓ | 33.3% | - | - | - | $0.25 | $1.25 | 200k | - | - | Closed | |
| Meta | 🇺🇸 | Llama 3.2 11B Instruct | ✓ | 32.8% | - | - | - | $0.05 | $0.05 | 128k | Dec. 2023 | 10.6 | Open | |
| Meta | 🇺🇸 | Llama 3.2 3B Instruct | x | 32.8% | - | - | - | $0.01 | $0.02 | 128k | - | 3.2 | Open | |
| AI21 Labs | - | Jamba 1.5 Mini | x | 32.3% | - | - | - | $0.20 | $0.40 | 256.1k | Mar. 2024 | 52 | Open | |
| Google | 🇺🇸 | Gemma 3 4B | ✓ | 30.8% | - | - | - | $0.02 | $0.04 | 131.1k | Aug. 2024 | 4 | Open | |
| OpenAI | 🇺🇸 | GPT-3.5 Turbo | x | 30.8% | - | - | - | $0.50 | $1.50 | 16.4k | Sep. 2021 | - | Closed | |
| Qwen | 🇨🇳 | Qwen2.5-Omni-7B | ✓ | 30.8% | - | - | - | - | - | - | - | 7 | Open | |
| Meta | 🇺🇸 | Llama 3.1 8B Instruct | x | 30.4% | - | - | - | $0.03 | $0.03 | 131.1k | Dec. 2023 | 8 | Open | |
| Microsoft | 🇺🇸 | Phi-3.5-mini-instruct | x | 30.4% | - | - | - | $0.10 | $0.10 | 128k | - | 3.8 | Open | |
| Google | 🇺🇸 | Gemini 1.0 Pro | x | 27.9% | - | - | - | $0.50 | $1.50 | 32.8k | Feb. 2024 | - | Closed | |
| Qwen | 🇨🇳 | Qwen2 7B Instruct | x | 25.3% | - | - | - | - | - | - | - | 7.6 | Open | |
| Microsoft | 🇺🇸 | Phi 4 Mini | x | 25.2% | - | - | - | - | - | - | Jun. 2024 | 3.8 | Open | |
| Google | 🇺🇸 | Gemma 3n E2B Instructed | ✓ | 24.8% | 6.7% | - | - | - | - | - | Jun. 2024 | 8 | Closed | |
| Google | 🇺🇸 | Gemma 3n E2B Instructed LiteRT (Preview) | ✓ | 24.8% | 6.7% | - | - | - | - | - | Jun. 2024 | 1.9 | Open | |
| Google | 🇺🇸 | Gemma 3n E4B Instructed | ✓ | 23.7% | 11.6% | - | - | $20.00 | $40.00 | 32k | Jun. 2024 | 8 | Closed | |
| Google | 🇺🇸 | Gemma 3n E4B Instructed LiteRT Preview | ✓ | 23.7% | 11.6% | - | - | - | - | - | Jun. 2024 | 1.9 | Open | |
| Google | 🇺🇸 | Gemma 3 1B | x | 19.2% | - | - | - | - | - | - | - | 1 | Open | |
| OpenAI | 🇺🇸 | GPT-5.2 Codex | ✓ | - | - | - | - | $1.75 | $14.00 | 400k | - | - | Closed | |
| DeepSeek | 🇨🇳 | DeepSeek-V3.2-Speciale | x | - | 96.0% | 73.1% | 30.6% | $0.28 | $0.42 | 131.1k | - | 685 | Open | |
| OpenAI | 🇺🇸 | GPT-5.1 Medium | ✓ | - | 98.4% | - | - | $1.25 | $10.00 | 400k | - | - | Closed | |
| DeepSeek | 🇨🇳 | DeepSeek-V3.2 (Non-thinking) | x | - | - | - | - | $0.28 | $0.42 | 131.1k | - | 685 | Open | |
| OpenAI | 🇺🇸 | GPT-5.1 Codex | ✓ | - | - | 73.7% | - | $1.25 | $10.00 | 400k | Sep. 2024 | - | Closed | |
| Qwen | 🇨🇳 | Qwen3-Coder | x | - | - | - | - | $0.18 | $0.18 | 256k | - | 480 | Open | |
| OpenAI | 🇺🇸 | GPT-5.1 Codex High | ✓ | - | 96.7% | - | - | $1.25 | $10.00 | 400k | - | - | Closed | |
| xAI | 🇺🇸 | Grok-4 Fast Reasoning | ✓ | - | - | - | - | $0.20 | $0.50 | 2M | - | - | Closed | |
| xAI | 🇺🇸 | Grok Code Fast 1 | x | - | - | 70.8% | - | $0.20 | $1.50 | 256k | - | - | Closed | |
| xAI | 🇺🇸 | Grok-4.1 Fast Non-Reasoning | ✓ | - | - | - | - | $0.20 | $0.50 | 2M | - | - | Closed | |
| xAI | 🇺🇸 | Grok-4.1 Fast Reasoning | ✓ | - | - | - | - | $0.20 | $0.50 | 2M | - | - | Closed | |
| Qwen | 🇨🇳 | Qwen3-Coder 480B A35B Instruct | x | - | - | 69.6% | - | - | - | - | - | 480 | Open | |
| Mistral | 🇫🇷 | Codestral-22B | x | - | - | - | - | - | - | - | - | 22.2 | Open | |
| Cohere | 🇨🇦 | Command R+ | x | - | - | - | - | $0.25 | $1.00 | 128k | - | 104 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek-R1 | x | - | - | - | - | $0.55 | $2.19 | 131.1k | - | 671 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek-V2.5 | x | - | - | 16.8% | - | $0.14 | $0.28 | 8.2k | - | 236 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek VL2 | ✓ | - | - | - | - | $9.50 | $4800.00 | 129.3k | - | 27 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek VL2 Small | ✓ | - | - | - | - | - | - | - | - | 16 | Open | |
| DeepSeek | 🇨🇳 | DeepSeek VL2 Tiny | ✓ | - | - | - | - | - | - | - | - | 3 | Open | |
| Mistral | 🇫🇷 | Devstral Medium | x | - | - | 61.6% | - | $0.40 | $2.00 | 128k | - | - | Closed | |
| Mistral | 🇫🇷 | Devstral Small 1.1 | x | - | - | 53.6% | - | $0.10 | $0.30 | 128k | - | 24 | Open | |
| Google | 🇺🇸 | Gemma 2 27B | x | - | - | - | - | - | - | - | - | 27.2 | Open | |
| Google | 🇺🇸 | Gemma 2 9B | x | - | - | - | - | - | - | - | - | 9.2 | Open | |
| Google | 🇺🇸 | Gemma 3n E2B | ✓ | - | - | - | - | - | - | - | Jun. 2024 | 8 | Closed | |
| Google | 🇺🇸 | Gemma 3n E4B | ✓ | - | - | - | - | - | - | - | Jun. 2024 | 8 | Closed | |
| OpenAI | 🇺🇸 | GPT-5 Codex | x | - | - | 74.5% | - | - | - | - | Sep. 2024 | - | Closed | |
| IBM | - | Granite 3.3 8B Base | ✓ | - | - | - | - | - | - | - | Apr. 2024 | 8.2 | Open | |
| IBM | - | IBM Granite 4.0 Tiny Preview | x | - | - | - | - | - | - | - | - | 7 | Open | |
| xAI | 🇺🇸 | Grok-1.5V | ✓ | - | - | - | - | - | - | - | - | - | Closed | |
| xAI | 🇺🇸 | Grok-2 Image 1212 | x | - | - | - | - | - | - | 131.1k | - | - | Closed | |
| xAI | 🇺🇸 | Grok-4.1 | ✓ | - | - | - | - | $3.00 | $15.00 | 256k | - | - | Closed | |
| xAI | 🇺🇸 | Grok-4.1 Thinking | ✓ | - | - | - | - | $3.00 | $15.00 | 256k | - | - | Closed | |
| MoonshotAI | 🇨🇳 | Kimi-k1.5 | ✓ | - | - | - | - | - | - | - | - | - | Closed | |
| Nvidia | 🇺🇸 | Llama 3.1 Nemotron 70B Instruct | x | - | - | - | - | - | - | - | Dec. 2023 | 70 | Open | |
| Google | 🇺🇸 | MedGemma 4B IT | ✓ | - | - | - | - | - | - | - | - | 4.3 | Open | |
| Mistral | 🇫🇷 | Ministral 3 (14B Base 2512) | ✓ | - | - | - | - | - | - | - | - | 14 | Open | |
| Mistral | 🇫🇷 | Ministral 3 (14B Instruct 2512) | ✓ | - | - | - | - | - | - | - | - | 14 | Open | |
| Mistral | 🇫🇷 | Ministral 3 (3B Base 2512) | ✓ | - | - | - | - | - | - | - | - | 3 | Open | |
| Mistral | 🇫🇷 | Ministral 3 (3B Instruct 2512) | ✓ | - | - | - | - | - | - | - | - | 3 | Open | |
| Mistral | 🇫🇷 | Ministral 3 (8B Base 2512) | ✓ | - | - | - | - | - | - | - | - | 8 | Open | |
| Mistral | 🇫🇷 | Ministral 3 (8B Instruct 2512) | ✓ | - | - | - | - | - | - | - | - | 8 | Open | |
| Mistral | 🇫🇷 | Ministral 8B Instruct | x | - | - | - | - | $0.10 | $0.10 | 128k | - | 8.0 | Open | |
| Mistral | 🇫🇷 | Mistral Large 2 | x | - | - | - | - | $2.00 | $6.00 | 128k | - | 123 | Open | |
| Mistral | 🇫🇷 | Mistral Large 3 | ✓ | - | - | - | - | $2.00 | $5.00 | 128k | - | 675 | Open | |
| Mistral | 🇫🇷 | Mistral NeMo Instruct | x | - | - | - | - | $0.15 | $0.15 | 128k | - | 12 | Open | |
| Mistral | 🇫🇷 | Mistral Small | x | - | - | - | - | $0.20 | $0.60 | 32.8k | - | 22 | Open | |
| OpenAI | 🇺🇸 | o3-pro | ✓ | - | - | - | - | $20.00 | $80.00 | 200k | May 2024 | - | Closed | |
| Microsoft | 🇺🇸 | Phi-3.5-vision-instruct | ✓ | - | - | - | - | - | - | - | - | 4.2 | Open | |
| Microsoft | 🇺🇸 | Phi-4-multimodal-instruct | ✓ | - | - | - | - | $0.05 | $0.10 | 128k | Jun. 2024 | 5.6 | Open | |
| Mistral | 🇫🇷 | Pixtral-12B | ✓ | - | - | - | - | $0.15 | $0.15 | 128k | - | 12.4 | Open | |
| Mistral | 🇫🇷 | Pixtral Large | ✓ | - | - | - | - | $2.00 | $6.00 | 128k | - | 124 | Open | |
| Qwen | 🇨🇳 | QvQ-72B-Preview | ✓ | - | - | - | - | - | - | - | - | 73.4 | Open | |
| Qwen | 🇨🇳 | Qwen2.5-Coder 32B Instruct | x | - | - | - | - | $0.09 | $0.09 | 128k | - | 32 | Open | |
| Qwen | 🇨🇳 | Qwen2.5-Coder 7B Instruct | x | - | - | - | - | - | - | - | - | 7 | Open | |
| Qwen | 🇨🇳 | Qwen2.5 VL 72B Instruct | ✓ | - | - | - | - | - | - | - | - | 72 | Open | |
| Qwen | 🇨🇳 | Qwen2.5 VL 7B Instruct | ✓ | - | - | - | - | - | - | - | - | 8.3 | Open | |
| Qwen | 🇨🇳 | Qwen2-VL-72B-Instruct | ✓ | - | - | - | - | - | - | - | Jun. 2023 | 73.4 | Open | |
| Qwen | 🇨🇳 | Qwen3-Next-80B-A3B-Base | x | - | - | - | - | - | - | - | - | 80 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 235B A22B Instruct | ✓ | - | 74.7% | - | - | $0.30 | $1.49 | 262.1k | - | 236 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 235B A22B Thinking | ✓ | - | 89.7% | - | 13.6% | $0.45 | $3.49 | 262.1k | - | 236 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 4B Instruct | ✓ | - | 46.6% | - | - | $0.10 | $0.60 | 262.1k | - | 4 | Open | |
| Qwen | 🇨🇳 | Qwen3 VL 8B Instruct | ✓ | - | 45.9% | - | - | $0.08 | $0.50 | 262.1k | - | 9 | Open | |
| StepFun | 🇨🇳 | Step3-VL-10B | ✓ | - | 87.7% | - | - | - | - | - | - | 10 | Open | |
| xAI | 🇺🇸 | Grok-4 Fast Non-Reasoning | ✓ | - | - | - | - | $0.20 | $0.50 | 2M | - | - | Closed | |
| Qwen | 🇨🇳 | Qwen3 32B | x | - | 72.9% | - | - | $0.10 | $0.30 | 128k | - | 32.8 | Open | |
| ZAI | 🇨🇳 | GLM-4.5V | ✓ | - | - | - | - | $0.55 | $2.19 | 131.1k | - | 108 | Open | |
| OpenAI | 🇺🇸 | GPT-5.1 Codex Mini | ✓ | - | 42.1% | - | - | $0.25 | $2.00 | 400k | - | - | Closed | |
| LG AI Research | 🇰🇷 | K-EXAONE-236B-A23B | x | - | 92.8% | - | - | $0.60 | $1.00 | 32.8k | Oct. 2025 | 236 | Closed | |
| IBM | - | Granite 3.3 8B Instruct | ✓ | - | - | - | - | $0.50 | $0.50 | 128k | Apr. 2024 | 8 | Open | |
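
Sorting by a metric and narrowing down to a model for a given use case can be sketched in a few lines of Python. This is only an illustration: the six rows are copied from the table above, while the helper names `sort_by` and `cheapest_above` are hypothetical and not part of the leaderboard itself. A "-" cell is represented as `None` and always sorted last.

```python
# A few rows copied from the table above:
# (model, GPQA %, input $/M tokens, output $/M tokens). None stands for a "-" cell.
ROWS = [
    ("GPT-5.2", 92.4, 1.75, 14.00),
    ("Gemini 3 Flash", 90.4, 0.50, 3.00),
    ("Kimi K2.5", 87.6, 0.60, 2.50),
    ("Claude Opus 4.5", 87.0, 5.00, 25.00),
    ("ERNIE 5.0", 85.0, None, None),
    ("DeepSeek-V3.2 (Thinking)", 82.4, 0.28, 0.42),
]


def sort_by(rows, column, descending=True):
    """Sort rows by a column index, always placing missing values last."""
    present = [r for r in rows if r[column] is not None]
    missing = [r for r in rows if r[column] is None]
    return sorted(present, key=lambda r: r[column], reverse=descending) + missing


def cheapest_above(rows, min_gpqa):
    """Return the row with the lowest output price among rows meeting a GPQA floor."""
    candidates = [
        r for r in rows
        if r[1] is not None and r[1] >= min_gpqa and r[3] is not None
    ]
    return min(candidates, key=lambda r: r[3]) if candidates else None


if __name__ == "__main__":
    for row in sort_by(ROWS, 3, descending=False):
        print(row)
    print(cheapest_above(ROWS, 85.0))
```

Keeping rows with missing prices at the end, rather than treating "-" as zero, avoids ranking unpriced models as the cheapest.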