Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 6/8/2026, 10:00:58 PM

| Model | Author | Provider | Price | Latency | Throughput | Percentage |
| --- | --- | --- | --- | --- | --- | --- |
| Gemini 2.5 Flash | google | Google AI Studio | $0.32 | 0.83 s | 72.0 T/s | 99.53% |
| Gemini 2.5 Flash Lite | google | Google AI Studio | $0.10 | 0.35 s | 119.0 T/s | 99.68% |
| gpt-oss-120b | openai | Cerebras | $0.04 | 0.21 s | 425.0 T/s | 99.51% |
| Gemini 3 Flash Preview | google | Google Vertex | $0.52 | 1.35 s | 63.0 T/s | 99.96% |
| DeepSeek V4 Flash | deepseek | SiliconFlow | $0.10 | 1.68 s | 77.0 T/s | 99.30% |
| GPT-4o-mini | openai | OpenAI | $0.15 | 0.51 s | 31.0 T/s | 100.00% |
| Mistral Nemo | mistralai | Mistral | $0.02 | 0.32 s | 78.0 T/s | 99.24% |
| Llama 3.1 8B Instruct | meta-llama | Groq | $0.02 | 0.23 s | 182.0 T/s | 98.40% |
| Llama 4 Maverick | meta-llama | Google Vertex | $0.15 | 0.39 s | 66.0 T/s | 99.99% |
| GPT-4.1 Nano | openai | Azure | $0.10 | 1.67 s | 42.0 T/s | 100.00% |
| Qwen3 235B A22B Instruct 2507 | qwen | Weights & Biases | $0.09 | 0.31 s | 89.0 T/s | 99.69% |
| GLM 4.5 Air | z-ai | NovitaAI | $0.13 | 0.86 s | 39.0 T/s | 99.82% |
| Claude Haiku 4.5 | anthropic | Amazon Bedrock | $1.04 | 0.84 s | 120.5 T/s | 99.99% |
| DeepSeek V3.2 | deepseek | Friendli | $0.23 | 0.29 s | 49.0 T/s | 99.67% |
| Claude Sonnet 4.6 | anthropic | Google Vertex (Europe) | $3.12 | 2.89 s | 76.0 T/s | 99.98% |
| Gemini 3.1 Flash Lite Preview | google | Google AI Studio | $0.26 | 0.61 s | 113.0 T/s | 99.98% |
| GPT-5.4 Mini | openai | OpenAI | $0.79 | 0.65 s | 69.0 T/s | 99.82% |
| Gemma 4 31B | google | Weights & Biases | $0.12 | 0.34 s | 40.0 T/s | 98.80% |
| Gemma 4 26B A4B | google | Cloudflare | $0.06 | 0.39 s | 77.5 T/s | 99.89% |
| Ling-2.6-flash | inclusionai | NovitaAI | $0.01 | 0.83 s | 44.0 T/s | 18.30% |
| MiMo-V2.5 | xiaomi | Xiaomi | $0.14 | 2.11 s | 61.0 T/s | 86.40% |
| Hy3 preview | tencent | SiliconFlow | $0.06 | 3.09 s | 54.0 T/s | 99.76% |
| DeepSeek V4 Pro | deepseek | DeepSeek | $0.44 | 1.35 s | 62.0 T/s | 99.61% |
| Anthropic Claude Sonnet Latest | anthropic | Google Vertex (Europe) | $3.12 | 2.89 s | 76.0 T/s | - |
| OpenAI GPT Mini Latest | openai | OpenAI | $0.79 | 0.65 s | 69.0 T/s | 100.00% |
| Anthropic Claude Haiku Latest | anthropic | Amazon Bedrock | $1.04 | 0.84 s | 120.5 T/s | 100.00% |
| Owl Alpha | openrouter | Stealth | $0.00 | 5.27 s | 14.0 T/s | - |
| Gemini 3.1 Flash Lite | google | Google AI Studio | $0.26 | 0.73 s | 108.0 T/s | 99.97% |
| MiniMax M3 | minimax | MiniMax | $0.31 | 3.22 s | 31.0 T/s | 93.43% |
| Qwen3.7 Plus | qwen | Alibaba Cloud Int. | $0.41 | 0.76 s | 10.0 T/s | 52.00% |
| Nemotron 3 Ultra (free) | nvidia | NVIDIA | $0.00 | 6.28 s | 9.0 T/s | - |
| Llama 3.1 70B Instruct | meta-llama | Weights & Biases | $0.40 | 0.32 s | 36.0 T/s | 100.00% |
| Llama 3.3 70B Instruct | meta-llama | SambaNova Turbo | $0.10 | 0.38 s | 119.0 T/s | 100.00% |
| Gemma 3 27B | google | Phala | $0.08 | 0.84 s | 27.0 T/s | 99.71% |
| DeepSeek V3 0324 | deepseek | NovitaAI | $0.21 | 1.22 s | 25.0 T/s | 99.99% |
| Llama 4 Scout | meta-llama | Groq | $0.10 | 0.43 s | 75.0 T/s | 99.92% |
| GPT-4.1 Mini | openai | OpenAI | $0.41 | 0.75 s | 34.0 T/s | 99.99% |
| GPT-4.1 | openai | Azure | $2.06 | 0.85 s | 45.0 T/s | 100.00% |
| Qwen3 32B | qwen | Groq | $0.08 | 0.31 s | 481.0 T/s | 99.29% |
| Mistral Small 3.2 24B | mistralai | Mistral | $0.08 | 0.33 s | 98.0 T/s | 99.89% |
| GLM 4 32B | z-ai | Z.ai | $0.10 | 1.47 s | 2.0 T/s | 100.00% |
| Qwen3 30B A3B Instruct 2507 | qwen | AtlasCloud | $0.05 | 1.78 s | 79.0 T/s | 99.98% |
| gpt-oss-20b | openai | Weights & Biases | $0.03 | 0.27 s | 280.0 T/s | 98.63% |
| GPT-5 Nano | openai | OpenAI | $0.05 | 2.79 s | 87.0 T/s | 100.00% |
| GPT-5 Mini | openai | OpenAI | $0.27 | 3.31 s | 67.0 T/s | 99.16% |
| DeepSeek V3.1 | deepseek | SambaNova | $0.22 | 1.29 s | 57.0 T/s | 99.31% |
| Kimi K2 0905 | moonshotai | Groq | $0.62 | 0.18 s | 180.0 T/s | 100.00% |
| DeepSeek V3.1 Terminus | deepseek | DeepInfra | $0.28 | 0.71 s | 30.0 T/s | 100.00% |
| Claude Sonnet 4.5 | anthropic | Google Vertex (Global) | $3.12 | 1.55 s | 43.0 T/s | 99.99% |
| Qwen3 VL 30B A3B Instruct | qwen | Alibaba Cloud Int. | $0.13 | 0.49 s | 62.0 T/s | 99.87% |
| Qwen3 VL 8B Instruct | qwen | Alibaba Cloud Int. | $0.08 | 0.68 s | 66.0 T/s | 99.39% |
| GPT-5.1 | openai | OpenAI | $1.33 | 3.65 s | 54.0 T/s | 99.92% |
| Ministral 3 3B 2512 | mistralai | Mistral | $0.10 | 0.21 s | 31.5 T/s | 100.00% |
| Ministral 3 8B 2512 | mistralai | Mistral | $0.15 | 0.23 s | 42.0 T/s | 99.99% |
| GPT-5.2 | openai | OpenAI | $1.86 | 4.12 s | 45.0 T/s | 99.95% |
| MiMo-V2-Flash | xiaomi | Xiaomi | $0.10 | 0.59 s | 57.0 T/s | 100.00% |
| Kimi K2.5 | moonshotai | Venice | $0.42 | 1.11 s | 86.0 T/s | 99.96% |
| Claude Opus 4.6 | anthropic | Google Vertex | $5.20 | 1.06 s | 49.0 T/s | 99.99% |
| MiniMax M2.5 | minimax | MARA | $0.16 | 1.38 s | 174.0 T/s | 99.97% |
| Gemini 3.1 Pro Preview | google | Google AI Studio | $2.10 | 2.72 s | 102.0 T/s | 98.64% |
| Qwen3.5-Flash | qwen | Alibaba Cloud Int. | $0.07 | 0.63 s | 82.0 T/s | 96.00% |
| GPT-5.4 | openai | Azure | $2.62 | 3.90 s | 49.0 T/s | 99.97% |
| Nemotron 3 Super (free) | nvidia | NVIDIA | $0.00 | 5.32 s | 7.0 T/s | 12.00% |
| GPT-5.4 Nano | openai | Azure | $0.21 | 1.04 s | 54.0 T/s | 99.96% |
| MiniMax M2.7 | minimax | MARA | $0.29 | 0.85 s | 235.0 T/s | 98.74% |
| GLM 5.1 | z-ai | Friendli | $1.00 | 0.57 s | 93.0 T/s | 99.92% |
| Claude Opus 4.7 | anthropic | Google Vertex | $5.20 | 1.51 s | 58.0 T/s | 99.99% |
| Kimi K2.6 | moonshotai | Weights & Biases | $0.71 | 0.64 s | 153.0 T/s | 99.62% |
| Claude Opus Latest | anthropic | Google Vertex | $5.20 | 2.89 s | 79.0 T/s | - |
| MiMo-V2.5-Pro | xiaomi | DeepInfra | $0.44 | 1.08 s | 79.0 T/s | 99.90% |
| GPT-5.5 | openai | Azure | $5.24 | 4.68 s | 45.0 T/s | 98.68% |
| OpenAI GPT Latest | openai | Azure | $5.24 | 4.68 s | 45.0 T/s | - |
| Google Gemini Flash Latest | google | Google Vertex | $1.57 | 3.00 s | 110.0 T/s | - |
| MoonshotAI Kimi Latest | moonshotai | Weights & Biases | $0.71 | 0.64 s | 153.0 T/s | - |
| Google Gemini Pro Latest | google | Google AI Studio | $2.10 | 2.72 s | 102.0 T/s | - |
| Laguna M.1 (free) | poolside | Poolside | $0.00 | 3.43 s | 10.0 T/s | 100.00% |
| Grok 4.3 | x-ai | xAI | $1.27 | 0.75 s | 147.0 T/s | 100.00% |
| Gemini 3.5 Flash | google | Google Vertex | $1.57 | 3.00 s | 110.0 T/s | 99.76% |
| Claude Opus 4.8 | anthropic | Google Vertex | $5.20 | 2.89 s | 79.0 T/s | 99.99% |
| Step 3.7 Flash | stepfun | StepFun | $0.21 | 1.91 s | 50.0 T/s | 20.00% |
| GPT-4o | openai | OpenAI | $2.58 | 0.54 s | 47.0 T/s | 99.97% |
| Qwen2.5 7B Instruct | qwen | Together | $0.04 | 0.40 s | 83.0 T/s | 99.74% |
| Claude 3.5 Haiku | anthropic | Amazon Bedrock (US-WEST) | $0.83 | 0.82 s | 34.0 T/s | 99.94% |
| DeepSeek V3 | deepseek-ai | StreamLake | $0.21 | 0.86 s | 25.0 T/s | 99.97% |
| Gemma 3 12B | google | DeepInfra | $0.05 | 0.63 s | 33.0 T/s | 96.05% |
| Qwen3 235B A22B | qwen | Alibaba Cloud Int. | $0.47 | 0.49 s | 60.0 T/s | - |
| Llama Guard 4 12B | meta-llama | Together | $0.18 | 0.11 s | 19.0 T/s | 100.00% |
| Claude Sonnet 4 | anthropic | Google Vertex (Europe) | $3.12 | 0.50 s | 46.0 T/s | 100.00% |
| Gemini 2.5 Pro | google | Google AI Studio | $1.33 | 2.50 s | 94.0 T/s | 96.96% |
| GPT-5 | openai | OpenAI | $1.33 | 2.32 s | 45.0 T/s | 99.90% |
| Gemini 2.5 Flash Lite Preview 09-2025 | google | Google Vertex | $0.10 | 0.38 s | 201.0 T/s | 12.00% |
| Nano Banana (Gemini 2.5 Flash Image) | google | Google AI Studio | $0.32 | 5.71 s | 169.0 T/s | 99.99% |
| Qwen3 VL 32B Instruct | qwen | Alibaba Cloud Int. | $0.11 | 0.64 s | 19.0 T/s | 98.63% |
| gpt-oss-safeguard-20b | openai | Groq | $0.08 | 0.27 s | 428.0 T/s | 100.00% |
| Mistral Large 3 2512 | mistralai | Mistral | $0.51 | 0.57 s | 3.0 T/s | 88.00% |
| Ministral 3 14B 2512 | mistralai | Mistral | $0.20 | 0.26 s | 52.0 T/s | 99.38% |
| Nemotron 3 Nano 30B A3B | nvidia | DeepInfra | $0.05 | 1.33 s | 105.0 T/s | 4.00% |
| GLM 4.7 | z-ai | Cerebras | $0.41 | 0.51 s | 448.0 T/s | 99.63% |
| GLM 5 | z-ai | Friendli | $0.62 | 0.44 s | 92.0 T/s | 99.84% |
| Qwen3.5 397B A17B | qwen | Together | $0.41 | 0.48 s | 96.0 T/s | 99.94% |
| LFM2-24B-A2B | liquid | Together | $0.03 | 0.20 s | 59.0 T/s | - |
| Nano Banana 2 (Gemini 3.1 Flash Image Preview) | google | Google AI Studio | $0.52 | 11.56 s | 131.0 T/s | 99.98% |
| Qwen3.5-9B | qwen | Venice | $0.10 | 0.72 s | 66.0 T/s | 99.48% |
| Qwen3.6 Plus | qwen | Alibaba Cloud Int. | $0.34 | 1.15 s | 38.0 T/s | 64.00% |
| Qwen3.6 27B | qwen | SiliconFlow | $0.31 | 2.85 s | 32.0 T/s | 99.71% |
| Qwen3.7 Max | qwen | Alibaba Cloud Int. | $1.28 | 1.41 s | 51.0 T/s | 92.00% |
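
One way to compare these metrics is to filter and rank the rows programmatically. The sketch below hard-codes a handful of rows copied from the leaderboard, keeps those that meet a latency ceiling and a percentage floor, and ranks the rest by price. The field names (`price_usd`, `latency_s`, `tokens_per_s`, `pct`), the default thresholds, and the `shortlist` helper are illustrative assumptions, not part of the leaderboard. The source does not say what the percentage column measures, so the code treats it as an opaque score.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Row:
    model: str
    provider: str
    price_usd: float       # dollar figure as listed (billing unit not stated in the source)
    latency_s: float       # seconds
    tokens_per_s: float    # T/s
    pct: Optional[float]   # percentage column; None where the leaderboard shows "-"


# A few rows copied from the leaderboard for illustration.
ROWS = [
    Row("gpt-oss-120b", "Cerebras", 0.04, 0.21, 425.0, 99.51),
    Row("Qwen3 32B", "Groq", 0.08, 0.31, 481.0, 99.29),
    Row("Kimi K2 0905", "Groq", 0.62, 0.18, 180.0, 100.00),
    Row("Ling-2.6-flash", "NovitaAI", 0.01, 0.83, 44.0, 18.30),
    Row("Owl Alpha", "Stealth", 0.00, 5.27, 14.0, None),
    Row("Claude Opus 4.8", "Google Vertex", 5.20, 2.89, 79.0, 99.99),
]


def shortlist(rows: List[Row], max_latency_s: float = 1.0, min_pct: float = 99.0) -> List[Row]:
    """Hypothetical helper: drop rows that miss the thresholds, then sort by
    price (ascending), breaking ties by throughput (descending)."""
    kept = [
        r for r in rows
        if r.pct is not None and r.pct >= min_pct and r.latency_s <= max_latency_s
    ]
    return sorted(kept, key=lambda r: (r.price_usd, -r.tokens_per_s))


for r in shortlist(ROWS):
    print(f"{r.model:<16} {r.provider:<10} ${r.price_usd:.2f}  {r.latency_s:.2f}s  {r.tokens_per_s:.1f}T/s")
```

Rows with no percentage value are excluded rather than treated as zero, since a missing value is not the same as a bad one. Loosening `max_latency_s` or `min_pct` widens the shortlist.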