Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 12/15/2025, 1:02:07 PM

ModelProvider
GPT-4o-mini
openai
Azure
#14
$0.151.32s520.8T/s99.99%
Gemini 2.0 Flash
google
Google AI Studio
#31
$0.100.51s171.1T/s99.79%
Gemini 2.0 Flash Lite
google
Google AI Studio
-
$0.080.45s284.4T/s99.95%
Gemini 2.5 Flash
google
Google Vertex
#7
$0.320.93s97.3T/s99.92%
Gemini 2.5 Flash Lite
google
Google AI Studio
-
$0.100.52s86.3T/s99.93%
gpt-oss-120b
openai
Cerebras
-
$0.040.25s2129.7T/s99.43%
Mistral Nemo
mistralai
Mistral
-
$0.020.33s196.8T/s99.99%
Llama 3.1 70B Instruct
meta-llama
Hyperbolic
-
$0.400.67s73.2T/s99.99%
Llama 3.1 8B Instruct
meta-llama
Cerebras
-
$0.020.21s2800.0T/s99.98%
Llama 3.3 70B Instruct
meta-llama
Cerebras
#66
$0.100.37s2533.3T/s99.97%
Gemma 3 4B
google
DeepInfra
-
$0.020.48s125.0T/s99.98%
DeepSeek V3 0324
deepseek
SambaNova
#14
$0.210.55s288.5T/s99.87%
Llama 4 Scout
meta-llama
Groq
-
$0.080.13s1000.0T/s99.99%
GPT-4.1 Nano
openai
Azure
-
$0.100.60s220.4T/s99.99%
GPT-4.1 Mini
openai
Azure
-
$0.410.49s89.5T/s99.99%
Qwen3 32B
qwen
Cerebras
-
$0.081.26s618.7T/s99.98%
Gemini 2.5 Pro
google
Google Vertex (Global)
#1
$1.332.62s92.7T/s99.85%
DeepSeek R1T2 Chimera (free)
tngtech
Chutes
-
$0.002.42s28.2T/s99.98%
Qwen3 235B A22B Instruct 2507
qwen
Cerebras
-
$0.070.40s1117.6T/s99.64%
gpt-oss-20b
openai
Groq
-
$0.030.24s1478.5T/s99.82%
GPT-5 Mini
openai
OpenAI
-
$0.276.30s54.6T/s99.89%
DeepSeek V3.1
deepseek
SambaNova
-
$0.165.16s174.1T/s99.96%
Grok Code Fast 1
x-ai
xAI
-
$0.210.74s90.8T/s72.00%
Kimi K2 0905
moonshotai
Groq
-
$0.410.32s320.0T/s99.98%
Grok 4 Fast
x-ai
xAI
-
$0.203.63s103.0T/s99.80%
Gemini 2.5 Flash Lite Preview 09-2025
google
Google AI Studio
-
$0.100.37s123.6T/s99.79%
Gemini 2.5 Flash Preview 09-2025
google
Google AI Studio
-
$0.320.77s140.5T/s99.65%
Claude Sonnet 4.5
anthropic
Amazon Bedrock
-
$3.122.96s74.8T/s99.94%
GLM 4.6
z-ai
Cerebras
-
$0.410.58s285.2T/s99.78%
Nemotron Nano 12B 2 VL (free)
nvidia
NVIDIA
-
$0.003.32s65.1T/s-
KAT-Coder-Pro V1 (free)
kwaipilot
StreamLake
-
$0.000.80s54.4T/s99.91%
Grok 4.1 Fast
x-ai
xAI
-
$0.204.25s69.2T/s99.16%
DeepSeek V3.2
deepseek
SiliconFlow
-
$0.244.48s58.4T/s98.60%
Devstral 2 2512 (free)
mistralai
Mistral
-
$0.009.72s66.9T/s88.50%
GPT-4o
openai
Azure
-
$2.581.36s258.2T/s99.99%
Ministral 3B
mistralai
Mistral
-
$0.040.33s313.9T/s-
DeepSeek V3
deepseek-ai
Chutes
-
$0.311.31s60.7T/s99.99%
Mistral Small 3
mistralai
Together
-
$0.030.12s111.5T/s99.75%
Gemma 3 27B
google
Nebius Token Factory
-
$0.040.16s68.7T/s99.15%
Gemma 3 12B
google
Crusoe
-
$0.030.45s138.9T/s99.43%
Llama 4 Maverick
meta-llama
SambaNova
-
$0.151.62s661.8T/s99.86%
GPT-4.1
openai
OpenAI
-
$2.060.56s57.5T/s99.98%
Gemma 3n 4B
google
Together
-
$0.020.17s33.6T/s99.89%
Claude Sonnet 4
anthropic
Amazon Bedrock
-
$3.121.27s81.8T/s99.97%
Grok 3 Mini
x-ai
xAI Fast
-
$0.300.97s97.1T/s99.87%
Mistral Small 3.2 24B
mistralai
Mistral
-
$0.060.26s119.2T/s99.81%
GLM 4 32B
z-ai
Z.AI
-
$0.101.09s3000.0T/s16.00%
GPT-5 Nano
openai
Azure
-
$0.053.86s106.2T/s99.93%
GPT-5
openai
Azure
-
$1.335.97s64.8T/s99.89%
Qwen3 Next 80B A3B Instruct
qwen
Google Vertex
-
$0.100.29s277.9T/s99.98%
Claude Haiku 4.5
anthropic
Google Vertex
-
$1.040.83s138.1T/s99.93%
MiniMax M2
minimax
Google Vertex
-
$0.210.42s154.9T/s99.90%
GPT-5.1
openai
OpenAI
-
$1.332.99s44.8T/s99.90%
Gemini 3 Pro Preview
google
Google Vertex
-
$2.104.47s77.7T/s99.33%
Claude Opus 4.5
anthropic
Amazon Bedrock
-
$5.202.27s77.0T/s99.51%
GPT-5.2
openai
OpenAI
-
$1.863.41s40.3T/s99.97%