Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 1/30/2026, 6:02:02 AM

ModelProvider
Gemini 2.0 Flash
google
Google Vertex
-
$0.100.32s83.0T/s99.12%
Gemini 2.5 Flash
google
Google Vertex
-
$0.323.60s126.0T/s99.74%
Gemini 2.5 Flash Lite
google
Google AI Studio
-
$0.100.75s98.0T/s99.51%
Grok 4 Fast
x-ai
xAI
-
$0.203.10s117.0T/s100.00%
Claude Sonnet 4.5
anthropic
Amazon Bedrock
-
$3.121.75s102.0T/s99.90%
Grok 4.1 Fast
x-ai
xAI
-
$0.200.77s105.0T/s100.00%
DeepSeek V3.2
deepseek
Google Vertex
-
$0.251.85s34.0T/s99.85%
Gemini 3 Flash Preview
google
Google AI Studio
-
$0.521.11s94.0T/s99.04%
GPT-4o-mini
openai
OpenAI
-
$0.150.48s32.0T/s99.99%
Mistral Nemo
mistralai
Mistral
-
$0.020.22s136.0T/s99.99%
Llama 3.1 70B Instruct
meta-llama
Together
-
$0.400.41s20.0T/s99.82%
Llama 3.1 8B Instruct
meta-llama
Friendli
-
$0.020.10s138.0T/s99.99%
Ministral 3B
mistralai
Mistral
-
$0.040.28s91.0T/s-
Llama 3.3 70B Instruct
meta-llama
Cerebras
-
$0.100.31s355.5T/s99.87%
Gemini 2.0 Flash Lite
google
Google Vertex
-
$0.080.49s42.0T/s99.65%
Gemma 3 27B
google
Nebius Token Factory
-
$0.040.37s49.0T/s98.55%
Gemma 3 4B
google
Chutes
-
$0.020.94s34.0T/s99.99%
DeepSeek V3 0324
deepseek
Baseten
-
$0.200.36s114.0T/s99.96%
Llama 4 Maverick
meta-llama
Groq
-
$0.150.33s231.0T/s99.98%
GPT-4.1 Mini
openai
Azure
-
$0.410.65s45.0T/s99.98%
Qwen3 32B
qwen
Cerebras
-
$0.080.40s664.5T/s99.74%
Gemini 2.5 Pro
google
Google Vertex (Global)
-
$1.332.65s92.0T/s98.50%
DeepSeek R1T2 Chimera (free)
tngtech
Chutes
-
$0.002.13s29.0T/s99.99%
Qwen3 235B A22B Instruct 2507
qwen
Cerebras
-
$0.070.23s89.0T/s99.95%
Qwen3 Coder 480B A35B
qwen
DeepInfra (Turbo)
-
$0.230.31s110.0T/s99.97%
GLM 4 32B
z-ai
Z.ai
-
$0.100.64s5.0T/s-
gpt-oss-20b
openai
Groq
-
$0.020.11s476.0T/s99.72%
GPT-5 Nano
openai
OpenAI
-
$0.050.78s113.0T/s99.97%
GPT-5 Mini
openai
Azure
-
$0.279.53s88.0T/s99.84%
DeepSeek V3.1
deepseek
Google Vertex
-
$0.160.79s94.0T/s99.97%
Grok Code Fast 1
x-ai
xAI
-
$0.210.80s136.0T/s100.00%
Kimi K2 0905
moonshotai
Fireworks
-
$0.410.33s89.0T/s99.52%
Gemini 2.5 Flash Lite Preview 09-2025
google
Google AI Studio
-
$0.100.58s191.0T/s99.76%
Claude Haiku 4.5
anthropic
Google Vertex
-
$1.040.55s83.0T/s99.98%
LFM2-8B-A1B
liquid
Liquid
-
$0.010.37s28.0T/s-
GPT-5.1 Chat
openai
Azure
-
$1.332.56s99.0T/s99.74%
Claude Opus 4.5
anthropic
Google Vertex
-
$5.201.60s43.0T/s99.90%
GPT-5.2
openai
Azure
-
$1.862.92s38.0T/s99.46%
Kimi K2.5
moonshotai
GMICloud
-
$0.520.80s80.0T/s94.42%
Trinity Large Preview (free)
arcee-ai
Arcee AI
-
$0.000.59s39.0T/s92.00%
Nova Micro 1.0
amazon
Amazon Bedrock
-
$0.040.36s195.0T/s28.00%
Gemma 3 12B
google
DeepInfra
-
$0.030.57s39.0T/s99.09%
Llama 3.1 Nemotron Ultra 253B v1
nvidia
Nebius Token Factory
-
$0.610.13s21.0T/s-
GPT-4.1
openai
Azure
-
$2.060.76s50.0T/s99.98%
Qwen3 8B
qwen
Fireworks
-
$0.050.48s84.0T/s99.62%
Qwen3 30B A3B
qwen
Friendli
-
$0.060.10s125.0T/s97.37%
Claude Sonnet 4
anthropic
Amazon Bedrock
-
$3.121.73s55.0T/s99.95%
Grok 3 Mini
x-ai
xAI Fast
-
$0.300.61s75.0T/s99.96%
Mistral Small 3.2 24B
mistralai
Mistral
-
$0.060.23s103.0T/s99.89%
gpt-oss-120b (exacto)
openai
Groq
-
$0.040.34s418.0T/s98.99%
GPT-5
openai
Azure
-
$1.337.45s72.0T/s97.93%
Qwen3 VL 8B Instruct
qwen
Together
-
$0.080.27s94.0T/s99.90%
Gemini 3 Pro Preview
google
Google AI Studio
-
$2.103.31s81.0T/s98.01%
Mistral Large 3 2512
mistralai
Mistral
-
$0.510.72s37.0T/s100.00%
GLM 4.7
z-ai
Cerebras
-
$0.410.95s170.0T/s99.78%
MiniMax M2.1
minimax
Fireworks
-
$0.280.98s99.5T/s99.52%