Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 5/13/2026, 5:02:35 PM

Model | Author | Provider | - | Price | Latency | Throughput | Uptime
Gemini 2.5 Flash Lite | google | Google AI Studio | - | $0.10 | 0.85s | 95.0 T/s | 99.98%
gpt-oss-120b | openai | Cerebras | - | $0.04 | 0.26s | 567.0 T/s | 99.50%
GPT-4o-mini | openai | OpenAI | - | $0.15 | 0.56s | 30.0 T/s | 100.00%
Mistral Nemo | mistralai | DeepInfra | - | $0.02 | 0.34s | 44.0 T/s | 99.91%
Llama 3.1 8B Instruct | meta-llama | Cerebras | - | $0.02 | 0.14s | 217.0 T/s | 99.96%
GPT-4.1 Mini | openai | Azure | - | $0.41 | 0.57s | 46.0 T/s | 99.97%
Gemini 2.5 Flash | google | Google Vertex (Global) | - | $0.32 | 0.94s | 59.0 T/s | 99.56%
Qwen3 235B A22B Instruct 2507 | qwen | Google Vertex | - | $0.07 | 0.41s | 45.0 T/s | 97.65%
GLM 4.5 Air | z-ai | NovitaAI | - | $0.14 | 0.90s | 25.0 T/s | 99.08%
Grok 4.1 Fast | x-ai | xAI | - | $0.20 | 0.80s | 61.0 T/s | 100.00%
DeepSeek V3.2 | deepseek | Alibaba Cloud Int. | - | $0.26 | 1.00s | 32.0 T/s | 99.85%
Gemini 3 Flash Preview | google | Google AI Studio | - | $0.52 | 1.20s | 75.0 T/s | 99.89%
Claude Sonnet 4.6 | anthropic | Anthropic | - | $3.12 | 1.23s | 56.0 T/s | 99.90%
Gemini 3.1 Flash Lite Preview | google | Google Vertex | - | $0.26 | 1.11s | 98.0 T/s | 99.73%
Gemma 4 31B | google | Venice | - | $0.12 | 1.06s | 42.0 T/s | 63.37%
Gemma 4 26B A4B | google | Cloudflare | - | $0.06 | 0.37s | 49.0 T/s | 99.87%
Claude Opus 4.7 | anthropic | Amazon Bedrock (US) | - | $5.20 | 1.68s | 64.0 T/s | 98.61%
Claude Opus Latest | anthropic | Amazon Bedrock (US) | - | $5.20 | 1.17s | 73.0 T/s | -
Hy3 preview | tencent | SiliconFlow | - | $0.07 | 4.71s | 33.0 T/s | -
DeepSeek V4 Flash | deepseek | Alibaba Cloud Int. | - | $0.13 | 0.94s | 104.0 T/s | 99.47%
DeepSeek V4 Pro | deepseek | Alibaba Cloud Int. | - | $0.44 | 0.73s | 48.0 T/s | 99.53%
Anthropic Claude Sonnet Latest | anthropic | Anthropic | - | $3.12 | 1.23s | 56.0 T/s | -
Google Gemini Flash Latest | google | Google AI Studio | - | $0.52 | 1.20s | 75.0 T/s | -
Google Gemini Pro Latest | google | Google Vertex | - | $2.10 | 2.97s | 69.0 T/s | -
GPT-4o | openai | OpenAI | - | $2.58 | 0.64s | 25.0 T/s | 99.60%
Llama 3.3 70B Instruct | meta-llama | Groq | - | $0.10 | 0.34s | 125.0 T/s | 100.00%
DeepSeek V3 | deepseek-ai | NovitaAI | - | $0.33 | 1.18s | 17.0 T/s | 99.71%
Gemini 2.0 Flash | google | Google AI Studio | - | $0.10 | 0.70s | 55.0 T/s | 70.89%
Gemini 2.0 Flash Lite | google | Google Vertex | - | $0.08 | 0.48s | 65.0 T/s | 99.96%
Gemma 3 27B | google | Phala | - | $0.08 | 0.69s | 41.0 T/s | 99.60%
Gemma 3 12B | google | Cloudflare | - | $0.04 | 0.29s | 39.0 T/s | 100.00%
DeepSeek V3 0324 | deepseek | ModelRun | - | $0.21 | 1.47s | 28.0 T/s | 99.89%
Llama 4 Scout | meta-llama | Groq | - | $0.08 | 0.36s | 182.0 T/s | 99.86%
Llama 4 Maverick | meta-llama | Parasail | - | $0.15 | 0.69s | 85.0 T/s | 99.69%
GPT-4.1 Nano | openai | Azure | - | $0.10 | 1.47s | 45.0 T/s | 99.98%
GPT-4.1 | openai | Azure | - | $2.06 | 0.63s | 36.0 T/s | 99.98%
Qwen3 32B | qwen | Groq | - | $0.08 | 0.26s | 389.5 T/s | 99.93%
Qwen3 8B | qwen | Alibaba Cloud Int. | - | $0.05 | 0.37s | 83.0 T/s | 99.95%
Gemini 2.5 Pro | google | Google Vertex (EU) | - | $1.33 | 2.53s | 102.0 T/s | 99.55%
Mistral Small 3.2 24B | mistralai | Mistral | - | $0.08 | 0.39s | 54.0 T/s | 99.64%
GLM 4 32B | z-ai | Z.ai | - | $0.10 | 1.68s | 2.0 T/s | -
gpt-oss-20b | openai | Groq | - | $0.03 | 0.25s | 241.0 T/s | 99.91%
GPT-5 Nano | openai | Azure | - | $0.05 | 3.86s | 84.0 T/s | 99.94%
GPT-5 Mini | openai | OpenAI | - | $0.27 | 4.14s | 58.0 T/s | 99.96%
GPT-5 Chat | openai | OpenAI | - | $1.33 | 0.75s | 91.0 T/s | 96.00%
DeepSeek V3.1 | deepseek | SambaNova | - | $0.22 | 1.30s | 64.0 T/s | 99.94%
Qwen3 Next 80B A3B Instruct | qwen | Google Vertex | - | $0.10 | 0.50s | 143.0 T/s | 99.24%
Grok 4 Fast | x-ai | xAI | - | $0.20 | 5.08s | 91.0 T/s | 96.00%
Gemini 2.5 Flash Lite Preview 09-2025 | google | Google Vertex | - | $0.10 | 2.77s | 75.0 T/s | 76.00%
Claude Sonnet 4.5 | anthropic | Google Vertex | - | $3.12 | 0.98s | 38.0 T/s | 99.74%
Qwen3 VL 8B Instruct | qwen | Alibaba Cloud Int. | - | $0.08 | 0.43s | 51.0 T/s | 99.60%
Claude Haiku 4.5 | anthropic | Google Vertex (Europe) | - | $1.04 | 0.54s | 96.0 T/s | 100.00%
Qwen3 VL 32B Instruct | qwen | Alibaba Cloud Int. | - | $0.11 | 1.70s | 28.0 T/s | -
GPT-5.1 Chat | openai | OpenAI | - | $1.33 | 1.71s | 51.0 T/s | 99.97%
MiMo-V2-Flash | xiaomi | NovitaAI | - | $0.10 | 1.51s | 32.0 T/s | 98.70%
GLM 4.7 Flash | z-ai | DeepInfra | - | $0.06 | 0.35s | 63.0 T/s | 98.07%
Kimi K2.5 | moonshotai | Venice | - | $0.42 | 0.98s | 57.0 T/s | 99.74%
Step 3.5 Flash | stepfun | StepFun | - | $0.10 | 1.44s | 78.0 T/s | 99.93%
Qwen3 Coder Next | qwen | Ionstream | - | $0.12 | 0.34s | 45.0 T/s | 99.63%
Claude Opus 4.6 | anthropic | Google Vertex | - | $5.20 | 1.56s | 49.0 T/s | 99.98%
MiniMax M2.5 | minimax | MARA | - | $0.16 | 0.73s | 290.0 T/s | 99.92%
Gemini 3.1 Pro Preview | google | Google Vertex | - | $2.10 | 3.25s | 69.0 T/s | 98.64%
Qwen3.5-Flash | qwen | Alibaba Cloud Int. | - | $0.07 | 0.52s | 104.0 T/s | 72.00%
GPT-5.4 | openai | Azure | - | $2.62 | 1.67s | 44.0 T/s | 99.91%
Nemotron 3 Super (free) | nvidia | NVIDIA | - | $0.00 | 17.14s | 18.0 T/s | 11.11%
GPT-5.4 Mini | openai | OpenAI | - | $0.79 | 0.76s | 47.0 T/s | 99.93%
GPT-5.4 Nano | openai | Azure | - | $0.21 | 1.58s | 45.0 T/s | 100.00%
MiniMax M2.7 | minimax | Fireworks | - | $0.29 | 2.20s | 62.0 T/s | 99.83%
Qwen3.6 Plus | qwen | Alibaba Cloud Int. | - | $0.34 | 2.19s | 37.0 T/s | 100.00%
GLM 5.1 | z-ai | Friendli | - | $1.08 | 0.55s | 93.0 T/s | 99.91%
Kimi K2.6 | moonshotai | Baseten | - | $0.77 | 1.14s | 232.0 T/s | 99.85%
GPT-5.5 | openai | Azure | - | $5.24 | 4.96s | 49.0 T/s | 99.86%
OpenAI GPT Latest | openai | Azure | - | $5.24 | 4.96s | 49.0 T/s | -
MoonshotAI Kimi Latest | moonshotai | Baseten | - | $0.77 | 1.14s | 232.0 T/s | -
OpenAI GPT Mini Latest | openai | OpenAI | - | $0.79 | 0.76s | 47.0 T/s | -
Anthropic Claude Haiku Latest | anthropic | Google Vertex (Europe) | - | $1.04 | 0.54s | 96.0 T/s | -
Owl Alpha | openrouter | Stealth | - | $0.00 | 13.41s | 7.0 T/s | -
Gemini 3.1 Flash Lite | google | Google Vertex | - | $0.26 | 0.98s | 105.0 T/s | 99.56%
Ring-2.6-1T (free) | inclusionai | NovitaAI | - | $0.00 | 3.71s | 45.0 T/s | 95.99%
Llama 3.1 70B Instruct | meta-llama | Weights & Biases | - | $0.40 | 0.23s | 22.0 T/s | 100.00%
Qwen2.5 7B Instruct | qwen | Phala | - | $0.04 | 0.41s | 52.0 T/s | 99.99%
Claude 3.5 Haiku | anthropic | Amazon Bedrock | - | $0.83 | 0.87s | 46.0 T/s | 99.91%
Mistral Small 3 | mistralai | DeepInfra | - | $0.05 | 0.23s | 44.0 T/s | -
Gemma 3 4B | google | DeepInfra | - | $0.04 | 0.38s | 30.0 T/s | 72.00%
Llama Guard 4 12B | meta-llama | Together | - | $0.18 | 0.13s | 18.0 T/s | 77.94%
Claude Sonnet 4 | anthropic | Amazon Bedrock | - | $3.12 | 1.03s | 53.0 T/s | 100.00%
GPT-5 | openai | OpenAI | - | $1.33 | 6.82s | 52.0 T/s | 99.93%
DeepSeek V3.1 Terminus | deepseek | NovitaAI | - | $0.28 | 1.66s | 26.0 T/s | 99.98%
Qwen3 VL 235B A22B Instruct | qwen | Alibaba Cloud Int. | - | $0.21 | 0.84s | 41.0 T/s | 99.40%
DeepSeek V3.2 Exp | deepseek | AtlasCloud | - | $0.27 | 1.59s | 22.0 T/s | 99.93%
Qwen3 VL 30B A3B Instruct | qwen | Alibaba Cloud Int. | - | $0.13 | 0.64s | 50.0 T/s | 99.82%
Nano Banana (Gemini 2.5 Flash Image) | google | Google AI Studio | - | $0.32 | 0.84s | 194.0 T/s | 99.97%
GPT-5.1 | openai | Azure | - | $1.33 | 1.22s | 47.0 T/s | 99.96%
Ministral 3 3B 2512 | mistralai | Mistral | - | $0.10 | 0.16s | 54.0 T/s | 100.00%
GPT-5.2 | openai | Azure | - | $1.86 | 2.53s | 30.0 T/s | 99.89%
Nemotron 3 Nano 30B A3B | nvidia | DeepInfra | - | $0.05 | 1.14s | 67.0 T/s | -
GLM 5 | z-ai | Baseten | - | $0.62 | 0.48s | 79.0 T/s | 99.89%
Qwen3.5-27B | qwen | Phala | - | $0.21 | 0.81s | 99.0 T/s | 99.97%
Qwen3.5-35B-A3B | qwen | Alibaba Cloud Int. | - | $0.15 | 0.86s | 114.0 T/s | 99.89%
Qwen3.5-9B | qwen | Venice | - | $0.04 | 0.60s | 112.0 T/s | 99.27%
Mistral Small 4 | mistralai | Venice | - | $0.15 | 0.67s | 125.0 T/s | 99.39%
Laguna M.1 (free) | poolside | Poolside | - | $0.00 | 1.87s | 19.0 T/s | -
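
To compare entries programmatically, the metric values of a row can be parsed into numbers first. The sketch below is a minimal Python example, not part of the leaderboard itself. It assumes each row carries a price with two decimals, a latency in seconds, a throughput in T/s, and a trailing percentage (treated here as uptime, which is an assumption) or "-" when that value is missing. The names `METRICS_RE`, `Metrics`, and `parse_metrics` are hypothetical.

```python
import re
from typing import NamedTuple, Optional

# Assumed layout of the metric values: "$<price>", "<latency>s",
# "<throughput>T/s", then "<percent>%" or "-" when missing.
# Fields may be run together or separated by spaces and "|" characters.
METRICS_RE = re.compile(
    r"\$(?P<price>\d+\.\d{2})[\s|]*"
    r"(?P<latency>\d+\.\d+)s[\s|]*"
    r"(?P<throughput>\d+(?:\.\d+)?)\s*T/s[\s|]*"
    r"(?P<pct>\d+(?:\.\d+)?%|-)"
)


class Metrics(NamedTuple):
    price_usd: float
    latency_s: float
    throughput_tps: float
    uptime_pct: Optional[float]  # None when the row shows "-"


def parse_metrics(text: str) -> Metrics:
    """Extract the four metric values from one leaderboard row."""
    match = METRICS_RE.search(text)
    if match is None:
        raise ValueError(f"no metrics found in {text!r}")
    pct = match["pct"]
    return Metrics(
        price_usd=float(match["price"]),
        latency_s=float(match["latency"]),
        throughput_tps=float(match["throughput"]),
        uptime_pct=None if pct == "-" else float(pct.rstrip("%")),
    )


# Three rows taken from the listing, ranked by throughput (highest first).
rows = {
    "gpt-oss-120b": "$0.04 | 0.26s | 567.0 T/s | 99.50%",
    "Qwen3 32B": "$0.08 | 0.26s | 389.5 T/s | 99.93%",
    "MiniMax M2.5": "$0.16 | 0.73s | 290.0 T/s | 99.92%",
}
ranked = sorted(rows, key=lambda name: parse_metrics(rows[name]).throughput_tps,
                reverse=True)
for name in ranked:
    print(name, parse_metrics(rows[name]))
```

Because the separators are optional in the pattern, the same function also accepts values that are run together, such as `$0.0013.41s7.0T/s-`.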