Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 10/25/2025, 5:01:58 PM

ModelProvider
Gemini 2.0 Flash
google
Google AI Studio
#31
$0.100.48s168.9T/s99.99%
Gemma 3 12B
google
Crusoe
-
$0.030.42s171.0T/s99.96%
Gemini 2.5 Flash
google
Google Vertex
#7
$0.320.84s103.1T/s99.98%
Gemini 2.5 Flash Lite
google
Google Vertex
-
$0.100.37s100.0T/s99.91%
Grok Code Fast 1
x-ai
xAI
-
$0.211.06s99.7T/s100.00%
GPT-4o-mini
openai
Azure
#14
$0.150.69s269.0T/s99.96%
Mistral Nemo
mistralai
Mistral
-
$0.020.33s169.5T/s99.99%
Llama 3.3 70B Instruct
meta-llama
Cerebras
#66
$0.130.22s2600.0T/s99.89%
Gemini 2.0 Flash Lite
google
Google AI Studio
-
$0.080.41s363.2T/s99.99%
Gemma 3 27B
google
Chutes
-
$0.091.35s67.8T/s99.60%
Gemma 3 4B
google
Chutes
-
$0.021.19s60.4T/s99.84%
DeepSeek V3 0324
deepseek
SambaNova
#14
$0.250.48s325.8T/s99.96%
Llama 4 Maverick
meta-llama
Groq
-
$0.150.24s687.4T/s99.97%
GPT-4.1 Mini
openai
Azure
-
$0.410.64s70.6T/s99.97%
Gemini 2.5 Pro
google
Google Vertex (Global)
#1
$1.332.54s88.5T/s99.78%
Gemini 2.5 Flash Lite Preview 06-17
google
Google AI Studio
-
$0.101.04s14.4T/s100.00%
Mistral Small 3.2 24B
mistralai
Chutes
-
$0.060.61s217.9T/s99.41%
Qwen3 235B A22B Instruct 2507
qwen
Cerebras
-
$0.082.18s2116.5T/s99.77%
gpt-oss-20b
openai
Groq
-
$0.030.12s4692.3T/s99.60%
gpt-oss-120b
openai
Cerebras
-
$0.040.33s2898.5T/s98.80%
DeepSeek V3.1
deepseek
SambaNova
-
$0.282.79s199.2T/s99.56%
Grok 4 Fast
x-ai
xAI
-
$0.202.13s135.7T/s100.00%
Gemini 2.5 Flash Lite Preview 09-2025
google
Google AI Studio
-
$0.100.57s167.6T/s99.73%
Gemini 2.5 Flash Preview 09-2025
google
Google AI Studio
-
$0.323.58s174.5T/s99.58%
DeepSeek V3.2 Exp
deepseek
DeepSeek
-
$0.271.55s27.1T/s99.71%
Claude Sonnet 4.5
anthropic
Anthropic
-
$3.122.60s61.8T/s99.92%
GLM 4.6
z-ai
Z.AI
-
$0.510.67s91.5T/s99.84%
Mistral Tiny
mistralai
Mistral
-
$0.250.22s300.0T/s-
Llama 3.1 8B Instruct
meta-llama
Cerebras
-
$0.020.32s2269.2T/s99.97%
DeepSeek V3
deepseek
Chutes
-
$0.311.35s81.6T/s99.96%
Mistral Small 3
mistralai
Mistral
-
$0.050.24s203.0T/s99.98%
Claude 3.7 Sonnet
anthropic
Google Vertex (Global)
-
$3.120.38s57.3T/s99.89%
Llama 4 Scout
meta-llama
Cerebras
-
$0.080.30s1802.4T/s99.97%
GPT-4.1 Nano
openai
Azure
-
$0.100.76s136.5T/s99.99%
GPT-4.1
openai
Azure
-
$2.060.71s137.9T/s99.97%
Qwen3 32B
qwen
Groq
-
$0.050.26s640.6T/s99.94%
Gemma 3n 4B
google
Together
-
$0.020.34s45.7T/s99.99%
Claude Sonnet 4
anthropic
Amazon Bedrock
-
$3.121.78s102.7T/s99.94%
Grok 3 Mini
x-ai
xAI Fast
-
$0.300.42s92.6T/s99.96%
DeepSeek R1T2 Chimera (free)
tngtech
Chutes
-
$0.003.13s17.7T/s99.88%
Qwen3 Coder 480B A35B
qwen
Cerebras
-
$0.230.35s2614.1T/s99.12%
GPT-5 Mini
openai
Azure
-
$0.277.10s79.6T/s99.55%
Qwen3 Next 80B A3B Instruct
qwen
Hyperbolic
-
$0.110.49s433.0T/s99.53%
Qwen3 VL 235B A22B Instruct
qwen
Chutes
-
$0.311.95s72.4T/s95.05%
Claude Haiku 4.5
anthropic
Anthropic
-
$1.041.37s164.6T/s99.93%