Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 3/16/2026, 2:02:39 PM

ModelProvider
MiniMax M2.5
minimax
SambaNova
-
$0.261.11s202.0T/s100.00%
Gemini 3.1 Flash Lite Preview
google
Google AI Studio
-
$0.260.93s71.0T/s100.00%
Healer Alpha
openrouter
Stealth
-
$0.001.52s47.0T/s100.00%
Hunter Alpha
openrouter
Stealth
-
$0.002.18s38.0T/s100.00%
Grok 4.1 Fast
x-ai
xAI
-
$0.203.55s116.0T/s100.00%
DeepSeek V3.2
deepseek
NovitaAI
-
$0.261.58s24.0T/s100.00%
Gemini 3 Flash Preview
google
Google AI Studio
-
$0.521.21s83.0T/s100.00%
GPT-4o-mini
openai
Azure
-
$0.150.78s56.0T/s100.00%
Llama 3.1 8B Instruct
meta-llama
Groq
-
$0.020.15s250.0T/s100.00%
Gemini 2.0 Flash
google
Google AI Studio
-
$0.100.61s85.0T/s100.00%
Gemma 3 12B
google
Cloudflare
-
$0.040.28s21.0T/s100.00%
GPT-4.1 Mini
openai
Azure
-
$0.413.48s68.0T/s100.00%
Gemini 2.5 Flash
google
Google Vertex (Global)
-
$0.320.68s67.0T/s100.00%
Gemini 2.5 Flash Lite
google
Google AI Studio
-
$0.100.52s133.0T/s100.00%
gpt-oss-120b
openai
SambaNova
-
$0.040.78s316.0T/s100.00%
Kimi K2.5
moonshotai
Inceptron
-
$0.470.59s54.0T/s100.00%
Trinity Large Preview (free)
arcee-ai
Arcee (Prime Intellect)
-
$0.000.32s37.0T/s100.00%
Step 3.5 Flash (free)
stepfun
StepFun
-
$0.003.18s50.0T/s100.00%
Claude Opus 4.6
anthropic
Google Vertex
-
$5.201.59s42.0T/s100.00%
GLM 5
z-ai
Fireworks
-
$0.740.91s93.0T/s100.00%
Claude Sonnet 4.6
anthropic
Google Vertex (Global)
-
$3.121.06s40.0T/s100.00%
Gemini 3.1 Pro Preview
google
Google AI Studio
-
$2.104.18s69.0T/s100.00%
GPT-5.4
openai
OpenAI
-
$2.623.35s48.0T/s100.00%
Nemotron 3 Super (free)
nvidia
NVIDIA
-
$0.005.37s17.0T/s100.00%
GLM 5 Turbo
z-ai
Z.ai
-
$0.993.48s27.0T/s100.00%
Grok 4 Fast
x-ai
xAI
-
$0.202.86s149.0T/s100.00%
Gemini 2.5 Flash Lite Preview 09-2025
google
Google AI Studio
-
$0.100.45s152.0T/s100.00%
Claude Sonnet 4.5
anthropic
Amazon Bedrock
-
$3.122.39s43.0T/s100.00%
Claude Haiku 4.5
anthropic
Google Vertex
-
$1.040.64s87.0T/s100.00%
Ministral 3 3B 2512
mistralai
Mistral
-
$0.100.23s60.0T/s100.00%
GPT-5.2
openai
OpenAI
-
$1.863.18s43.0T/s100.00%
MiMo-V2-Flash
xiaomi
Xiaomi
-
$0.092.13s44.0T/s100.00%
Llama 3 8B Instruct
meta-llama
DeepInfra
-
$0.030.20s45.0T/s100.00%
GPT-4o
openai
OpenAI
-
$2.580.54s47.0T/s100.00%
Mistral Nemo
mistralai
Mistral
-
$0.020.23s122.0T/s100.00%
Llama 3.1 70B Instruct
meta-llama
DeepInfra
-
$0.400.26s20.0T/s100.00%
Llama 3.3 70B Instruct
meta-llama
Groq
-
$0.100.25s189.0T/s100.00%
Gemini 2.0 Flash Lite
google
Google Vertex
-
$0.080.55s67.0T/s100.00%
Gemma 3 27B
google
DeepInfra
-
$0.030.47s46.0T/s100.00%
DeepSeek V3 0324
deepseek
SambaNova
-
$0.210.59s77.5T/s100.00%
Llama 4 Scout
meta-llama
Google Vertex
-
$0.080.46s42.5T/s100.00%
Llama 4 Maverick
meta-llama
Parasail
-
$0.150.38s94.0T/s100.00%
GPT-4.1 Nano
openai
Azure
-
$0.100.61s85.0T/s100.00%
GPT-4.1
openai
Azure
-
$2.060.80s52.0T/s100.00%
Qwen3 32B
qwen
Groq
-
$0.080.30s410.0T/s100.00%
Gemini 2.5 Pro
google
Google AI Studio
-
$1.333.05s107.0T/s100.00%
Qwen3 235B A22B Instruct 2507
qwen
Weights & Biases
-
$0.070.36s67.0T/s100.00%
gpt-oss-20b
openai
Groq
-
$0.030.15s720.0T/s100.00%
GPT-5 Nano
openai
Azure
-
$0.0511.37s95.0T/s100.00%
GPT-5 Mini
openai
Azure
-
$0.2720.12s79.0T/s100.00%
GPT-5 Chat
openai
OpenAI
-
$1.330.70s95.0T/s100.00%
DeepSeek V3.1
deepseek
Google Vertex
-
$0.160.78s115.0T/s100.00%
Grok 4.20 Beta
x-ai
xAI
-
$2.050.41s108.0T/s100.00%
GLM 4.7 Flash
z-ai
Venice
-
$0.060.74s52.0T/s100.00%
Qwen3.5 397B A17B
qwen
Together
-
$0.410.47s72.0T/s100.00%
Qwen3.5-Flash
qwen
Alibaba Cloud Int.
-
$0.100.50s67.0T/s100.00%
Qwen3 VL 8B Instruct
qwen
Alibaba Cloud Int.
-
$0.080.66s73.0T/s100.00%
gpt-oss-safeguard-20b
openai
Groq
-
$0.080.18s789.0T/s100.00%
GPT-5.1 Chat
openai
OpenAI
-
$1.331.50s54.0T/s100.00%
GPT-5.1
openai
Azure
-
$1.332.14s63.0T/s100.00%
Nemotron 3 Nano 30B A3B
nvidia
DeepInfra
-
$0.050.60s93.0T/s100.00%
Mistral Small Creative
mistralai
Mistral
-
$0.100.16s41.0T/s100.00%
GLM 4.7
z-ai
Google Vertex
-
$0.400.82s126.0T/s100.00%
GPT-4o-mini (2024-07-18)
openai
OpenAI
-
$0.150.57s34.0T/s100.00%
Claude 3.5 Haiku
anthropic
Amazon Bedrock (US-WEST)
-
$0.830.84s40.0T/s100.00%
DeepSeek V3
deepseek-ai
DeepInfra
-
$0.330.41s25.0T/s100.00%
Qwen-Turbo
qwen
Alibaba Cloud Int.
-
$0.030.44s30.0T/s100.00%
Gemma 3n 4B
google
Together
-
$0.020.27s29.0T/s100.00%
Claude Sonnet 4
anthropic
Amazon Bedrock
-
$3.121.51s68.0T/s100.00%
Mistral Small 3.2 24B
mistralai
DeepInfra
-
$0.060.49s72.0T/s100.00%
GLM 4 32B
z-ai
Z.ai
-
$0.101.11s3.0T/s100.00%
GLM 4.5 Air
z-ai
Nebius Token Factory
-
$0.140.33s51.0T/s100.00%
Codestral 2508
mistralai
Mistral
-
$0.310.28s55.0T/s100.00%
GPT-5
openai
Azure
-
$1.3315.19s68.0T/s100.00%
Kimi K2 0905
moonshotai
Groq
-
$0.420.28s161.0T/s100.00%
Nano Banana (Gemini 2.5 Flash Image)
google
Google AI Studio
-
$0.326.96s154.0T/s100.00%
GPT-5.3-Codex
openai
OpenAI
-
$1.864.09s44.0T/s100.00%