Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 9/2/2025, 9:01:59 PM

ModelProvider
Gemini 2.0 Flash
google
Google AI Studio
#31
$0.100.52s152.3T/s99.98%
Gemma 3 12B
google
Cloudflare
-
$0.050.32s78.8T/s91.48%
DeepSeek V3 0324
deepseek
GMICloud
#14
$0.213.65s810.5T/s99.91%
Gemini 2.5 Flash
google
Google AI Studio
#7
$0.320.43s111.1T/s99.99%
GPT-4o-mini
openai
Azure
#14
$0.151.05s214.5T/s99.94%
Mistral Nemo
mistralai
Nineteen
-
$0.010.50s271.5T/s100.00%
Qwen2.5 7B Instruct
qwen
NovitaAI
-
$0.040.61s171.5T/s99.97%
Llama 3.3 70B Instruct
meta-llama
Cerebras
#66
$0.040.41s3426.9T/s99.99%
Gemini 2.0 Flash Lite
google
Google Vertex
-
$0.080.38s153.3T/s100.00%
Gemma 3 4B
google
DeepInfra
-
$0.020.88s246.8T/s-
Llama 4 Maverick
meta-llama
Cerebras
-
$0.150.39s1155.4T/s99.98%
GPT-4.1 Nano
openai
OpenAI
-
$0.100.43s102.5T/s99.93%
GPT-4.1 Mini
openai
OpenAI
-
$0.410.51s86.2T/s99.91%
GPT-4.1
openai
OpenAI
-
$2.060.71s80.2T/s99.88%
Qwen3 32B
qwen
Groq
-
$0.020.33s629.0T/s99.89%
Qwen3 30B A3B
qwen
Chutes
-
$0.021.69s75.8T/s97.75%
Claude Sonnet 4
anthropic
Amazon Bedrock
-
$3.122.76s69.8T/s99.98%
Grok 3 Mini
x-ai
xAI Fast
-
$0.300.67s175.9T/s99.58%
Gemini 2.5 Pro
google
Google AI Studio
#1
$1.332.78s98.5T/s99.83%
Gemini 2.5 Flash Lite Preview 06-17
google
Google Vertex
-
$0.100.34s91.2T/s100.00%
Mistral Small 3.2 24B
mistralai
Mistral
-
$0.050.36s150.3T/s99.43%
Gemini 2.5 Flash Lite
google
Google Vertex
-
$0.100.34s202.4T/s99.99%
DeepSeek V3.1
deepseek
SambaNova
-
$0.212.56s185.1T/s99.89%
Grok Code Fast 1
x-ai
xAI
-
$0.211.33s88.2T/s92.00%
Mistral Tiny
mistralai
Mistral
-
$0.250.27s94.3T/s4.00%
Gemini 1.5 Flash
google
Google AI Studio
-
$0.080.40s161.2T/s100.00%
Llama 3.1 8B Instruct
meta-llama
Cerebras
-
$0.020.22s1500.0T/s100.00%
Gemini 1.5 Flash 8B
google
Google AI Studio
-
$0.040.46s192.3T/s99.98%
Nova Lite 1.0
amazon
Amazon Bedrock
-
$0.060.47s351.6T/s-
Claude 3.7 Sonnet
anthropic
Google Vertex (Europe)
-
$3.122.59s61.6T/s99.66%
Gemma 3 27B
google
Nebius AI Studio
-
$0.070.47s66.0T/s98.63%
Llama 4 Scout
meta-llama
Cerebras
-
$0.080.50s1455.7T/s99.89%
R1 0528
deepseek
Nebius (Fast)
-
$0.211.20s217.0T/s99.68%
Kimi K2
moonshotai
Groq
-
$0.161.48s336.5T/s99.75%
Qwen3 235B A22B Instruct 2507
qwen
Cerebras
-
$0.080.57s1187.8T/s99.92%
gpt-oss-120b
openai
Cerebras
-
$0.070.35s2568.4T/s99.36%
GPT-5 Nano
openai
OpenAI
-
$0.053.03s60.7T/s99.65%
GPT-5 Mini
openai
OpenAI
-
$0.274.54s52.7T/s99.88%