Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 2/10/2026, 12:02:08 PM

ModelProvider
Gemini 2.5 Flash
google
Google Vertex
-
$0.321.90s122.0T/s99.66%
GPT-4o-mini
openai
OpenAI
-
$0.150.46s21.0T/s99.99%
Llama 3.1 8B Instruct
meta-llama
Groq
-
$0.020.14s323.0T/s99.98%
Gemini 2.0 Flash
google
Google AI Studio
-
$0.100.47s77.0T/s96.82%
Gemini 2.5 Flash Lite
google
Google Vertex
-
$0.100.46s90.0T/s99.56%
gpt-oss-120b
openai
Cerebras
-
$0.040.40s728.0T/s99.65%
Grok 4.1 Fast
x-ai
xAI
-
$0.203.56s87.0T/s100.00%
DeepSeek V3.2
deepseek
Google Vertex
-
$0.250.80s32.5T/s99.80%
Gemini 3 Flash Preview
google
Google AI Studio
-
$0.521.05s75.0T/s98.70%
Mistral Nemo
mistralai
Mistral
-
$0.020.34s75.0T/s100.00%
Llama 3.3 70B Instruct
meta-llama
Groq
-
$0.100.26s220.0T/s99.92%
DeepSeek V3
deepseek-ai
Chutes
-
$0.310.76s41.0T/s99.92%
Gemini 2.0 Flash Lite
google
Google AI Studio
-
$0.080.39s95.0T/s99.35%
Gemma 3 27B
google
Phala
-
$0.040.76s33.0T/s91.03%
Gemma 3 12B
google
Cloudflare
-
$0.030.22s27.0T/s99.55%
Gemma 3 4B
google
Chutes
-
$0.020.45s37.0T/s99.98%
DeepSeek V3 0324
deepseek
Nebius AI Studio (Fast)
-
$0.201.06s100.0T/s99.96%
Llama 4 Maverick
meta-llama
Groq
-
$0.150.27s138.0T/s99.91%
GPT-4.1 Mini
openai
Azure
-
$0.410.70s48.0T/s99.99%
GPT-4.1
openai
Azure
-
$2.060.75s47.0T/s99.98%
Qwen3 32B
qwen
Cerebras
-
$0.080.40s608.0T/s99.98%
Gemini 2.5 Pro
google
Google AI Studio
-
$1.333.12s105.0T/s99.17%
Mistral Small 3.2 24B
mistralai
Mistral
-
$0.060.30s89.0T/s99.85%
Qwen3 235B A22B Instruct 2507
qwen
Cerebras
-
$0.070.33s135.0T/s99.93%
gpt-oss-20b
openai
Groq
-
$0.030.11s494.0T/s99.59%
GPT-5 Nano
openai
OpenAI
-
$0.053.12s90.0T/s99.89%
GPT-5 Mini
openai
Azure
-
$0.279.08s81.0T/s99.86%
DeepSeek V3.1
deepseek
Google Vertex
-
$0.160.98s92.0T/s99.89%
Grok Code Fast 1
x-ai
xAI
-
$0.213.76s113.0T/s92.00%
Qwen3 Next 80B A3B Thinking
qwen
Hyperbolic
-
$0.160.81s227.0T/s98.21%
Grok 4 Fast
x-ai
xAI
-
$0.202.29s110.0T/s100.00%
Qwen3 VL 235B A22B Instruct
qwen
Chutes
-
$0.212.11s76.0T/s99.96%
Gemini 2.5 Flash Lite Preview 09-2025
google
Google AI Studio
-
$0.101.16s319.0T/s99.90%
Claude Sonnet 4.5
anthropic
Google Vertex (Global)
-
$3.121.55s39.0T/s99.95%
Claude Haiku 4.5
anthropic
Anthropic
-
$1.040.76s64.0T/s99.99%
Gemini 3 Pro Preview
google
Google AI Studio
-
$2.103.46s78.0T/s99.50%
GPT-5.2
openai
OpenAI
-
$1.862.36s35.0T/s99.86%
GLM 4.7
z-ai
Cerebras
-
$0.410.64s154.0T/s99.81%
MiniMax M2.1
minimax
Fireworks
-
$0.280.78s104.0T/s99.94%
Kimi K2.5
moonshotai
Baseten
-
$0.470.74s96.0T/s99.81%
Trinity Large Preview (free)
arcee-ai
Arcee AI
-
$0.000.48s20.0T/s88.00%
Claude Opus 4.6
anthropic
Google Vertex
-
$5.201.49s42.0T/s99.46%
Pony Alpha
openrouter
Stealth
-
$0.009.73s19.0T/s-
GPT-4o
openai
Azure
-
$2.580.88s43.0T/s99.99%
Gemma 2 9B
google
Nebius AI Studio (Fast)
-
$0.030.25s95.0T/s-
Llama 3.1 70B Instruct
meta-llama
DeepInfra (Turbo)
-
$0.400.16s21.0T/s99.99%
Ministral 3B
mistralai
Mistral
-
$0.040.23s73.0T/s-
Llama 3.1 Nemotron Ultra 253B v1
nvidia
Nebius Token Factory
-
$0.610.14s20.0T/s-
GPT-4.1 Nano
openai
Azure
-
$0.100.59s79.5T/s99.99%
Gemma 3n 4B
google
Together
-
$0.020.19s26.0T/s100.00%
Claude Sonnet 4
anthropic
Anthropic
-
$3.121.31s47.0T/s99.78%
R1 0528
deepseek
Nebius AI Studio (Fast)
-
$0.410.91s130.5T/s99.90%
Grok 3 Mini
x-ai
xAI Fast
-
$0.305.44s73.0T/s99.96%
DeepSeek R1T2 Chimera (free)
tngtech
Chutes
-
$0.001.97s11.0T/s99.98%
GLM 4 32B
z-ai
Z.ai
-
$0.101.11s3.0T/s-
Kimi K2 0905
moonshotai
Groq
-
$0.410.25s136.0T/s99.97%
DeepSeek V3.1 Terminus
deepseek
SiliconFlow
-
$0.222.65s55.0T/s99.89%
DeepSeek V3.2 Exp
deepseek
SiliconFlow
-
$0.272.24s36.0T/s99.95%
Claude Opus 4.5
anthropic
Amazon Bedrock
-
$5.202.31s41.0T/s99.92%
MiMo-V2-Flash
xiaomi
AtlasCloud
-
$0.090.88s82.0T/s99.98%
Step 3.5 Flash (free)
stepfun
StepFun
-
$0.001.54s133.0T/s-