YPerf

Track & Compare LLM API Performance Metrics

Last updated: 4/18/2025, 6:04:26 AM

Model | Author | Provider | Rank | Price | Latency | Throughput | Rate (%)
Gemini 1.5 Flash 8B | google | Google AI Studio | #83 | $0.04 | 0.38s | 345.3T/s | 99.06%
Qwen2.5 7B Instruct | qwen | Together | - | $0.05 | 0.18s | 177.1T/s | 99.99%
Gemini 2.0 Flash | google | Google AI Studio | #14 | $0.10 | 0.52s | 183.0T/s | 99.87%
Gemini 1.5 Flash | google | Google AI Studio | #65 | $0.08 | 1.01s | 176.2T/s | 99.29%
GPT-4o-mini | openai | Azure | #6 | $0.15 | 1.27s | 176.7T/s | 98.83%
Mistral Nemo | mistralai | Parasail | - | $0.04 | 0.70s | 138.3T/s | 99.97%
Llama 3.1 8B Instruct | meta-llama | Groq | #109 | $0.02 | 0.81s | 1631.6T/s | 99.74%
Llama 3.3 70B Instruct | meta-llama | Groq | #42 | $0.10 | 0.42s | 372.1T/s | 99.98%
Claude 3.7 Sonnet (thinking) | anthropic | Google Vertex | #11 | $3.12 | 1.70s | 56.2T/s | 99.28%
Claude 3.7 Sonnet | anthropic | Google Vertex | #11 | $3.12 | 1.70s | 56.2T/s | 98.51%
Gemini 2.0 Flash Lite | google | Google AI Studio | #15 | $0.08 | 0.87s | 152.0T/s | 99.76%
DeepSeek V3 0324 | deepseek | SambaNova | #4 | $0.28 | 1.49s | 186.8T/s | 99.57%
MythoMax 13B | gryphe | Together | - | $0.07 | 0.44s | 137.1T/s | 99.99%
OpenChat 3.5 7B | openchat | Lepton | - | $0.07 | 0.31s | 96.3T/s | 99.99%
Claude 3 Haiku | anthropic | Google Vertex | #85 | $0.26 | 0.77s | 156.1T/s | 99.96%
WizardLM-2 8x22B | microsoft | Together | - | $0.50 | 0.44s | 79.3T/s | 99.98%
GPT-4o | openai | Azure | - | $2.58 | 1.61s | 180.6T/s | 98.67%
Hermes 2 Pro - Llama-3 8B | nousresearch | NovitaAI | - | $0.03 | 0.95s | 145.9T/s | 99.97%
Mistral 7B Instruct | mistralai | NovitaAI | #142 | $0.03 | 1.19s | 79.3T/s | 99.98%
Llama 3.1 70B Instruct | meta-llama | Fireworks | - | $0.12 | 0.68s | 113.8T/s | 99.94%
Hermes 3 405B Instruct | nousresearch | Nebius AI Studio | - | $0.81 | 0.99s | 29.1T/s | 99.82%
Qwen2.5 72B Instruct | qwen | Together | #57 | $0.12 | 0.51s | 90.6T/s | 99.96%
Llama 3.2 3B Instruct | meta-llama | SambaNova | #124 | $0.02 | 0.40s | 3590.9T/s | 100.00%
Claude 3.5 Sonnet | anthropic | Google Vertex | - | $3.12 | 1.13s | 59.1T/s | 99.82%
Claude 3.5 Sonnet (self-moderated) | anthropic | Anthropic | - | $3.12 | 0.90s | 56.3T/s | 99.61%
Nova Lite 1.0 | amazon | Amazon Bedrock | #80 | $0.06 | 0.43s | 189.6T/s | 98.60%
DeepSeek V3 | deepseek | Fireworks | #17 | $0.39 | 1.30s | 44.3T/s | 99.90%
DeepSeek V3 (free) | deepseek | Chutes | #17 | $0.00 | 1.49s | 43.0T/s | 94.04%
R1 | deepseek | SambaNova | #6 | $0.52 | 4.82s | 404.7T/s | 99.61%
R1 (free) | deepseek | Chutes | #6 | $0.00 | 9.81s | 67.1T/s | 98.40%
LFM 3B | liquid | Liquid | - | $0.02 | 0.76s | 29.7T/s | 99.76%
Mistral Small 3 | mistralai | Mistral | #74 | $0.07 | 0.31s | 131.2T/s | 99.96%
Gemma 3 27B | google | Parasail | #14 | $0.10 | 1.53s | 53.6T/s | 98.98%
Gemma 3 4B | google | DeepInfra | - | $0.02 | 0.34s | 79.0T/s | 95.43%
DeepSeek V3 0324 (free) | deepseek | Chutes | #4 | $0.00 | 1.68s | 39.0T/s | 88.36%
Llama 4 Maverick | meta-llama | SambaNova | #24 | $0.18 | 1.13s | 641.0T/s | 99.47%
GPT-4.1 Mini | openai | OpenAI | - | $0.41 | 0.46s | 89.6T/s | 99.76%
GPT-4.1 | openai | OpenAI | - | $2.06 | 0.54s | 59.8T/s | 99.56%
Mixtral 8x7B Instruct | mistralai | Fireworks | #116 | $0.24 | 0.23s | 210.0T/s | 100.00%
Mistral Tiny | mistralai | Mistral | - | $0.25 | 0.27s | 148.3T/s | 100.00%
Claude 3 Sonnet | anthropic | Anthropic | #71 | $3.12 | 0.53s | 71.3T/s | 96.88%
Claude 3 Sonnet (self-moderated) | anthropic | Anthropic | #71 | $3.12 | 0.53s | 71.3T/s | 98.02%
Claude 3 Haiku (self-moderated) | anthropic | Anthropic | #85 | $0.26 | 0.69s | 148.2T/s | 99.82%
Gemini 1.5 Pro | google | Google AI Studio | #40 | $1.29 | 0.69s | 71.2T/s | 95.18%
Llama 3 70B Instruct | meta-llama | Groq | #63 | $0.30 | 0.26s | 394.0T/s | 100.00%
Llama 3 8B Instruct | meta-llama | Groq | #101 | $0.03 | 0.36s | 2869.6T/s | 99.39%
Llama 3 Lumimaid 8B | neversleep | Mancer (private) | - | $0.10 | 0.58s | 64.2T/s | 99.98%
Llama 3 Lumimaid 8B (extended) | neversleep | Mancer (private) | - | $0.10 | 0.58s | 64.2T/s | 99.98%
GPT-4o-mini (2024-07-18) | openai | OpenAI | #43 | $0.15 | 0.39s | 81.8T/s | 99.98%
Llama 3 8B Lunaris | sao10k | NovitaAI | - | $0.02 | 0.61s | 77.9T/s | 100.00%
ChatGPT-4o | openai | OpenAI | #2 | $5.12 | 0.55s | 104.2T/s | 99.40%
Hermes 3 70B Instruct | nousresearch | Lambda | - | $0.12 | 1.07s | 38.2T/s | 99.97%
Llama 3.1 Euryale 70B v2.2 | sao10k | DeepInfra | - | $0.71 | 0.33s | 36.7T/s | 99.98%
Command R (08-2024) | cohere | Cohere | #84 | $0.15 | 0.41s | 70.9T/s | 99.95%
Llama 3.2 11B Vision Instruct | meta-llama | SambaNova | - | $0.05 | 1.75s | 573.1T/s | 99.42%
Rocinante 12B | thedrummer | Infermatic | - | $0.25 | 0.39s | 84.3T/s | 99.92%
Qwen2.5 7B Instruct (free) | qwen | Nineteen | - | $0.00 | 0.67s | 283.8T/s | 99.94%
Ministral 3B | mistralai | Mistral | - | $0.04 | 0.17s | 237.6T/s | 99.99%
Claude 3.5 Haiku (2024-10-22) | anthropic | Google Vertex | #43 | $0.83 | 1.29s | 51.8T/s | 99.86%
Claude 3.5 Haiku (2024-10-22) (self-moderated) | anthropic | Anthropic | #43 | $0.83 | 6.43s | 51.2T/s | 95.13%
Claude 3.5 Haiku | anthropic | Anthropic | - | $0.83 | 6.82s | 61.7T/s | 99.82%
Unslopnemo 12B | thedrummer | Infermatic | - | $0.50 | 0.58s | 79.5T/s | 99.99%
Qwen2.5 Coder 32B Instruct | qwen | Together | #71 | $0.07 | 0.61s | 72.2T/s | 99.77%
Mistral Large 2411 | mistralai | Mistral | #57 | $2.05 | 0.39s | 50.9T/s | 99.97%
GPT-4o (2024-11-20) | openai | OpenAI | - | $2.58 | 0.50s | 100.7T/s | 99.64%
Gemini 2.0 Flash Experimental (free) | google | Google Vertex | #8 | $0.00 | 0.92s | 200.6T/s | 84.08%
Grok 2 Vision 1212 | x-ai | xAI | - | $2.08 | 0.82s | 62.0T/s | 99.52%
Llama 3.3 Euryale 70B | sao10k | Infermatic | - | $0.71 | 0.51s | 48.2T/s | 99.98%
Phi 4 | microsoft | Nebius AI Studio | #84 | $0.07 | 0.34s | 124.8T/s | 99.34%
MiniMax-01 | minimax | Minimax | - | $0.21 | 1.82s | 28.4T/s | 99.75%
R1 Distill Llama 70B | deepseek | SambaNova | - | $0.10 | 3.17s | 1302.0T/s | 99.90%
LFM 7B | liquid | Liquid | - | $0.01 | 0.69s | 63.1T/s | 100.00%
R1 Distill Qwen 32B | deepseek | NovitaAI | - | $0.12 | 27.03s | 65.4T/s | 99.88%
o3 Mini | openai | OpenAI | #20 | $1.14 | 6.15s | 148.4T/s | 99.69%
Qwen-Max | qwen | Alibaba | - | $1.65 | 1.01s | 38.7T/s | 96.23%
Claude 3.7 Sonnet (self-moderated) | anthropic | Anthropic | #11 | $3.12 | 1.72s | 49.0T/s | 98.11%
QwQ 32B (free) | qwen | Nineteen | #126 | $0.00 | 5.33s | 438.0T/s | 98.37%
Gemma 3 12B | google | DeepInfra | - | $0.05 | 1.00s | 34.5T/s | 98.93%
Mistral Small 3.1 24B | mistralai | Mistral | - | $0.10 | 0.27s | 143.2T/s | 99.79%
Llama 4 Scout | meta-llama | Groq | - | $0.08 | 0.39s | 756.5T/s | 99.97%
Llama 4 Maverick (free) | meta-llama | Chutes | - | $0.00 | 1.05s | 73.6T/s | 99.69%
Grok 3 Beta | x-ai | xAI Fast | #4 | $3.12 | 0.74s | 65.3T/s | 99.45%
Grok 3 Mini Beta | x-ai | xAI Fast | - | $0.30 | 4.24s | 187.2T/s | 74.85%
GPT-4.1 Nano | openai | OpenAI | - | $0.10 | 0.40s | 245.2T/s | 99.92%
o4 Mini | openai | OpenAI | - | $1.14 | 4.45s | 81.9T/s | 82.78%
Gemini 2.5 Flash Preview | google | AI Studio Non-Thinking | - | $0.15 | 0.86s | 168.4T/s | 41.64%
Gemini 2.5 Pro Preview | google | Google Vertex | #1 | $1.33 | 9.79s | 438.1T/s | 97.69%
GPT-3.5 Turbo | openai | OpenAI | #106 | $0.51 | 0.38s | 146.4T/s | 99.65%
ReMM SLERP 13B | undi95 | Mancer | - | $0.57 | 1.68s | 43.6T/s | 99.99%
Mistral Large | mistralai | Mistral | #43 | $2.05 | 0.42s | 45.7T/s | 99.97%
Dolphin 2.9.2 Mixtral 8x22B 🐬 | cognitivecomputations | NovitaAI | - | $0.91 | 2.24s | 12.1T/s | 99.95%
Gemma 2 9B | google | Groq | #82 | $0.07 | 0.25s | 1046.2T/s | 100.00%
Llama 3.1 405B Instruct | meta-llama | Fireworks | - | $0.81 | 0.61s | 75.1T/s | 99.93%
GPT-4o (2024-08-06) | openai | Azure | #29 | $2.58 | 1.46s | 124.3T/s | 99.56%
Llama 3.2 1B Instruct | meta-llama | SambaNova | #152 | $0.01 | 0.14s | 2666.7T/s | 99.96%
Claude 3.5 Haiku (self-moderated) | anthropic | Anthropic | #43 | $0.83 | 6.82s | 61.7T/s | 89.96%
Mistral Large 2407 | mistralai | Mistral | #43 | $2.05 | 0.88s | 45.0T/s | 95.83%
Codestral 2501 | mistralai | Mistral | - | $0.31 | 0.27s | 186.3T/s | 99.96%
Gemini 2.0 Flash Thinking Experimental 01-21 (free) | google | Google AI Studio | #9 | $0.00 | 3.87s | 190.8T/s | 80.41%
Qwen2.5 VL 72B Instruct | qwen | Together | - | $0.71 | 1.16s | 37.9T/s | 99.51%
Qwen-Turbo | qwen | Alibaba | - | $0.05 | 1.09s | 105.5T/s | 99.87%
R1 Distill Llama 8B | deepseek | NovitaAI | - | $0.04 | 16.71s | 170.0T/s | 97.58%
QwQ 32B | qwen | Groq | #126 | $0.15 | 1.21s | 3558.6T/s | 99.71%
Sonar Reasoning Pro | perplexity | Perplexity | - | $2.06 | 10.75s | 112.5T/s | 95.32%
Phi 4 Multimodal Instruct | microsoft | Parasail | - | $0.05 | 1.91s | 99.0T/s | 88.18%
Skyfall 36B V2 | thedrummer | Parasail | - | $0.51 | 3.86s | 20.4T/s | 98.92%
Gemma 3 27B (free) | google | Chutes | - | $0.00 | 1.34s | 70.7T/s | 85.64%
Qwen2.5 VL 32B Instruct | qwen | Fireworks | - | $0.91 | 0.34s | 68.2T/s | 94.07%
Gemini 2.5 Pro Experimental (free) | google | Google Vertex | #1 | $0.00 | 13.51s | 417.8T/s | 57.08%
o4 Mini High | openai | OpenAI | - | $1.14 | 8.52s | 91.3T/s | 69.37%

Created by JCdata from OpenRouter and Chatbot Arena