Model Leaderboard

Compare key performance metrics for LLM APIs.

Updated at: 6/22/2025, 5:03:33 AM

Model | Author | Provider | Rank | Price | Latency | Throughput | %
Gemini 1.5 Flash | google | Google AI Studio | #83 | $0.08 | 0.94s | 157.2T/s | 99.97%
GPT-4o-mini | openai | Azure | #12 | $0.15 | 1.08s | 176.7T/s | 99.90%
Mistral Nemo | mistralai | Parasail | - | $0.01 | 0.39s | 145.0T/s | 100.00%
Gemini 1.5 Flash 8B | google | Google AI Studio | #102 | $0.04 | 0.21s | 185.6T/s | 94.66%
Llama 3.3 70B Instruct | meta-llama | Cerebras | #57 | $0.05 | 0.18s | 3783.0T/s | 99.91%
Gemini 2.0 Flash | google | Google AI Studio | #25 | $0.10 | 0.51s | 176.1T/s | 99.99%
Gemini 2.0 Flash Lite | google | Google Vertex | #15 | $0.08 | 0.42s | 172.0T/s | 93.95%
Gemma 3 27B | google | kluster.ai | #26 | $0.10 | 1.25s | 64.6T/s | 99.01%
DeepSeek V3 0324 | deepseek | SambaNova | #12 | $0.31 | 1.90s | 182.3T/s | 99.41%
Llama 4 Maverick | meta-llama | Groq | #43 | $0.15 | 0.23s | 1259.3T/s | 99.82%
Claude Sonnet 4 | anthropic | Google Vertex (Europe) | #12 | $3.12 | 2.00s | 67.6T/s | 99.69%
Gemini 2.5 Flash Lite Preview 06-17 | google | Google AI Studio | #12 | $0.10 | 0.33s | 289.7T/s | 99.62%
MythoMax 13B | gryphe | NovitaAI | - | $0.07 | 1.17s | 85.1T/s | 99.99%
Llama 3.1 70B Instruct | meta-llama | Together | - | $0.10 | 0.93s | 140.1T/s | 99.82%
Qwen2.5 72B Instruct | qwen | Fireworks | #72 | $0.12 | 3.82s | 39.6T/s | 80.57%
Llama 3.2 3B Instruct | meta-llama | SambaNova | #144 | $0.01 | 0.34s | 3117.6T/s | 99.99%
Qwen2.5 7B Instruct | qwen | Together | - | $0.04 | 0.32s | 156.9T/s | 99.99%
GPT-4.1 Mini | openai | OpenAI | #21 | $0.41 | 0.48s | 69.3T/s | 99.83%
GPT-4.1 | openai | OpenAI | #6 | $2.06 | 0.49s | 75.3T/s | 99.79%
Gemini 2.5 Pro Preview 05-06 | google | Google Vertex | #2 | $1.33 | 4.50s | 82.8T/s | 99.55%
Mixtral 8x7B Instruct | mistralai | DeepInfra | #135 | $0.08 | 0.36s | 119.3T/s | 99.81%
Mistral Tiny | mistralai | Mistral | - | $0.25 | 0.29s | 164.4T/s | 100.00%
WizardLM-2 8x22B | microsoft | Parasail | - | $0.48 | 1.18s | 56.2T/s | 99.99%
GPT-4o | openai | Azure | - | $2.58 | 1.95s | 143.2T/s | 99.87%
Hermes 2 Pro - Llama-3 8B | nousresearch | Lambda | - | $0.03 | 0.29s | 152.8T/s | 99.97%
GPT-4o-mini (2024-07-18) | openai | OpenAI | #57 | $0.15 | 0.36s | 81.5T/s | 99.98%
Llama 3.1 8B Instruct | meta-llama | Cerebras | #128 | $0.02 | 0.14s | 4852.9T/s | 99.99%
Llama 3 8B Lunaris | sao10k | NovitaAI | - | $0.02 | 1.11s | 68.6T/s | 99.99%
Hermes 3 405B Instruct | nousresearch | Lambda | - | $0.71 | 1.16s | 34.5T/s | 99.78%
Hermes 3 70B Instruct | nousresearch | Lambda | - | $0.12 | 0.43s | 49.2T/s | 99.99%
Rocinante 12B | thedrummer | Infermatic | - | $0.25 | 0.42s | 64.9T/s | 99.90%
Ministral 8B | mistralai | Mistral | #111 | $0.10 | 0.25s | 149.3T/s | 99.87%
Claude 3.5 Sonnet | anthropic | Google Vertex | - | $3.12 | 1.27s | 58.8T/s | 99.78%
Claude 3.5 Haiku | anthropic | Anthropic | - | $0.83 | 1.33s | 70.1T/s | 98.48%
UnslopNemo 12B | thedrummer | Infermatic | - | $0.45 | 0.54s | 93.9T/s | 99.89%
GPT-4o (2024-11-20) | openai | OpenAI | - | $2.58 | 0.41s | 92.4T/s | 99.85%
DeepSeek V3 | deepseek | Fireworks | #30 | $0.39 | 1.03s | 76.8T/s | 99.95%
MiniMax-01 | minimax | Minimax | - | $0.21 | 1.63s | 27.3T/s | 98.37%
R1 | deepseek | DeepInfra Turbo | #12 | $0.47 | 0.41s | 147.4T/s | 99.93%
R1 Distill Llama 70B | deepseek | Cerebras | - | $0.10 | 0.20s | 2829.8T/s | 99.99%
LFM 3B | liquid | Liquid | - | $0.02 | 1.04s | 18.2T/s | 99.84%
LFM 7B | liquid | Lambda | - | $0.01 | 0.44s | 117.0T/s | 99.99%
Mistral Small 3 | mistralai | Mistral | #92 | $0.05 | 0.31s | 82.6T/s | 99.96%
Gemma 3 4B | google | DeepInfra | #66 | $0.02 | 0.27s | 106.0T/s | 99.87%
Mistral Small 3.1 24B | mistralai | Mistral | #69 | $0.05 | 0.23s | 101.3T/s | 99.98%
Llama 4 Scout | meta-llama | Cerebras | #51 | $0.08 | 0.54s | 2000.0T/s | 99.98%
Grok 3 Beta | x-ai | xAI | #8 | $3.12 | 0.54s | 63.3T/s | 99.69%
Grok 3 Mini Beta | x-ai | xAI Fast | #26 | $0.30 | 0.32s | 189.1T/s | 99.93%
GPT-4.1 Nano | openai | OpenAI | #50 | $0.10 | 0.25s | 244.0T/s | 99.68%
Qwen3 235B A22B | qwen | Fireworks | #22 | $0.13 | 0.62s | 77.7T/s | 97.98%
Qwen3 32B | qwen | Cerebras | #24 | $0.10 | 0.64s | 721.1T/s | 99.88%
Gemini 2.5 Flash Preview 05-20 (thinking) | google | AI Studio Thinking | #6 | $0.18 | 1.68s | 131.9T/s | 99.52%
R1 0528 | deepseek | Baseten | #12 | $0.52 | 0.36s | 132.3T/s | 99.51%
Gemini 2.5 Pro Preview 06-05 | google | Google Vertex | #1 | $1.33 | 2.64s | 93.0T/s | 99.67%
Gemini 2.5 Pro | google | Google Vertex | #1 | $1.33 | 2.20s | 85.7T/s | 99.78%
Gemini 2.5 Flash | google | Google AI Studio | #6 | $0.32 | 0.52s | 114.5T/s | 99.89%
GPT-3.5 Turbo | openai | OpenAI | #125 | $0.51 | 0.34s | 172.7T/s | 99.58%
ReMM SLERP 13B | undi95 | Mancer (private) | - | $0.81 | 0.73s | 43.6T/s | 99.98%
Claude 3 Haiku | anthropic | Google Vertex | #103 | $0.26 | 1.22s | 168.7T/s | 99.53%
Gemini 1.5 Pro | google | Google Vertex | #54 | $1.29 | 1.31s | 66.0T/s | 99.93%
Llama 3 70B Instruct | meta-llama | Groq | #79 | $0.30 | 0.20s | 427.6T/s | 99.99%
Llama 3 8B Instruct | meta-llama | Groq | #122 | $0.03 | 0.39s | 5269.7T/s | 99.77%
Mistral 7B Instruct | mistralai | Together | #161 | $0.03 | 0.53s | 219.1T/s | 99.95%
Gemma 2 9B | google | Groq | #101 | $0.20 | 0.39s | 1018.9T/s | 98.14%
Llama 3.1 405B Instruct | meta-llama | SambaNova | - | $0.81 | 1.79s | 93.1T/s | 99.76%
ChatGPT-4o | openai | OpenAI | #2 | $5.12 | 0.44s | 99.9T/s | 99.54%
Llama 3.1 Euryale 70B v2.2 | sao10k | DeepInfra | - | $0.71 | 0.32s | 39.7T/s | 99.98%
Lumimaid v0.2 8B | neversleep | Mancer (private) | - | $0.21 | 0.77s | 66.0T/s | 99.98%
Ministral 3B | mistralai | Mistral | - | $0.04 | 0.19s | 243.1T/s | 99.91%
Qwen2.5 Coder 32B Instruct | qwen | Together | #89 | $0.06 | 0.23s | 342.0T/s | 99.86%
Gemini 2.0 Flash Experimental (free) | google | Google Vertex | #15 | $0.00 | 1.03s | 173.2T/s | 45.09%
Grok 2 Vision 1212 | x-ai | xAI | - | $2.08 | 0.97s | 76.6T/s | 99.91%
Llama 3.3 Euryale 70B | sao10k | Infermatic | - | $0.71 | 0.52s | 45.9T/s | 99.93%
Phi 4 | microsoft | Nebius AI Studio | #103 | $0.07 | 0.10s | 125.5T/s | 98.20%
Codestral 2501 | mistralai | Mistral | - | $0.31 | 0.30s | 293.4T/s | 99.89%
R1 Distill Qwen 32B | deepseek | DeepInfra | - | $0.12 | 0.50s | 46.9T/s | 99.04%
o3 Mini | openai | OpenAI | #32 | $1.14 | 7.95s | 361.0T/s | 99.23%
Qwen2.5 VL 72B Instruct | qwen | Hyperbolic | - | $0.26 | 1.60s | 36.5T/s | 98.18%
Qwen-Turbo | qwen | Alibaba | - | $0.05 | 0.65s | 110.1T/s | 99.71%
Claude 3.7 Sonnet (thinking) | anthropic | Anthropic | #21 | $3.12 | 1.65s | 57.4T/s | 98.92%
GPT-4.5 (Preview) | openai | OpenAI | #4 | $76.20 | 1.48s | 11.5T/s | 95.43%
QwQ 32B | qwen | Groq | #145 | $0.15 | 1.05s | 524.0T/s | 99.92%
Skyfall 36B V2 | thedrummer | Parasail | - | $0.51 | 0.65s | 53.5T/s | 99.88%
Gemma 3 12B | google | Cloudflare | #33 | $0.05 | 0.55s | 71.7T/s | 87.92%
o4 Mini | openai | OpenAI | #12 | $1.14 | 5.18s | 99.7T/s | 63.03%
MAI DS R1 (free) | microsoft | Chutes | - | $0.00 | 1.12s | 67.3T/s | 97.72%
DeepSeek R1T Chimera (free) | tngtech | Chutes | - | $0.00 | 2.46s | 32.0T/s | 70.51%
Qwen3 30B A3B | qwen | Fireworks | #49 | $0.08 | 0.84s | 140.9T/s | 99.72%
Claude Opus 4 | anthropic | Anthropic | - | $15.60 | 2.70s | 32.5T/s | 96.95%
Valkyrie 49B V1 | thedrummer | Parasail | - | $0.51 | 0.67s | 44.9T/s | 99.89%
Deepseek R1 0528 Qwen3 8B | deepseek | Parasail | - | $0.05 | 0.62s | 99.9T/s | 99.18%
Grok 3 Mini | x-ai | xAI Fast | #26 | $0.30 | 0.40s | 193.9T/s | 96.64%
GPT-3.5 Turbo 16k | openai | OpenAI | #125 | $0.51 | 0.43s | 173.4T/s | 99.80%
GPT-4 Turbo (older v1106) | openai | OpenAI | #57 | $10.24 | 1.40s | 9.7T/s | 95.25%
Mistral Large | mistralai | Mistral | #60 | $2.05 | 0.29s | 90.0T/s | 99.30%
GPT-4 Turbo | openai | OpenAI | - | $10.24 | 0.81s | 47.7T/s | 98.67%
Llama 3.1 Sonar 70B Online | perplexity | Perplexity | - | $1.01 | 1.66s | 87.0T/s | 93.18%
GPT-4o (2024-08-06) | openai | Azure | #44 | $2.58 | 0.82s | 152.3T/s | 99.17%
Command R (08-2024) | cohere | Cohere | #103 | $0.15 | 0.83s | 82.2T/s | 99.96%
Pixtral 12B | mistralai | Hyperbolic | - | $0.10 | 1.55s | 85.9T/s | 99.91%
Claude 3.5 Haiku (2024-10-22) | anthropic | Google Vertex | #57 | $0.83 | 2.28s | 78.6T/s | 95.97%
Mistral Large 2411 | mistralai | Mistral | #72 | $2.05 | 0.54s | 38.2T/s | 99.80%
Sonar | perplexity | Perplexity | - | $1.01 | 1.88s | 113.1T/s | 99.86%
Anubis Pro 105B V1 | thedrummer | Parasail | - | $0.81 | 1.05s | 26.9T/s | 98.21%
DeepSeek V3 Base (free) | deepseek | Chutes | - | $0.00 | 1.28s | 81.9T/s | 99.71%
Gemini 2.5 Flash Preview 04-17 (thinking) | google | AI Studio Thinking | #8 | $0.18 | 1.39s | 169.4T/s | 97.04%
Qwen3 14B | qwen | Nebius AI Studio | - | $0.06 | 0.63s | 90.2T/s | 98.59%
Mistral Medium 3 | mistralai | Mistral | #18 | $0.42 | 0.53s | 59.0T/s | 95.82%
Devstral Small (free) | mistralai | Chutes | - | $0.00 | 1.50s | 82.6T/s | 94.71%
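
Comparing entries on several metrics at once is easier with a small script. The sketch below is illustrative only: it copies four entries from the leaderboard and keeps those that clear a minimum percentage and a maximum dollar figure, fastest first. The page does not label the dollar and percentage figures, so they are treated here simply as "price" and "percent". The `Entry` type, the `fastest` helper, and the threshold defaults are hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    model: str
    provider: str
    price_usd: float       # dollar figure as listed (pricing basis not stated on the page)
    latency_s: float       # seconds figure as listed
    throughput_tps: float  # "T/s" figure as listed
    percent: float         # percentage figure as listed (meaning assumed, not stated)


# Four entries copied from the leaderboard above.
ENTRIES = [
    Entry("Llama 3.1 8B Instruct", "Cerebras", 0.02, 0.14, 4852.9, 99.99),
    Entry("Gemini 2.0 Flash", "Google AI Studio", 0.10, 0.51, 176.1, 99.99),
    Entry("Qwen2.5 72B Instruct", "Fireworks", 0.12, 3.82, 39.6, 80.57),
    Entry("Claude Sonnet 4", "Google Vertex (Europe)", 3.12, 2.00, 67.6, 99.69),
]


def fastest(entries, min_percent=99.0, max_price=1.0):
    """Return entries meeting both thresholds, highest throughput first."""
    kept = [
        e for e in entries
        if e.percent >= min_percent and e.price_usd <= max_price
    ]
    return sorted(kept, key=lambda e: e.throughput_tps, reverse=True)


if __name__ == "__main__":
    for e in fastest(ENTRIES):
        print(f"{e.model} via {e.provider}: {e.throughput_tps} T/s at ${e.price_usd:.2f}")
```

With these defaults, the Qwen2.5 72B Instruct entry is dropped for its 80.57% figure and the Claude Sonnet 4 entry for its $3.12 price. Changing the sort key to `latency_s` (without `reverse=True`) would rank by the seconds figure instead.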