YPerf

Track & Compare LLM API Performance Metrics

Last updated: 4/2/2025, 7:00:52 AM

| Model | Author | Provider | Rank | Price | Latency | Throughput | |
|---|---|---|---|---|---|---|---|
| Llama 3.3 70B Instruct | meta-llama | Groq | #41 | $0.12 | 0.44s | 304.6T/s | 99.88% |
| Gemini Flash 1.5 8B | google | Google AI Studio | #78 | $0.04 | 0.43s | 336.5T/s | 99.87% |
| Gemini Flash 2.0 | google | Google Vertex | #13 | $0.10 | 0.46s | 176.7T/s | 99.97% |
| GPT-4o-mini | openai | Azure | #5 | $0.15 | 1.06s | 125.5T/s | 99.86% |
| Gemini 2.0 Flash Lite | google | Google AI Studio | #15 | $0.08 | 0.46s | 148.2T/s | 99.61% |
| Gemini Flash 1.5 | google | Google AI Studio | #55 | $0.08 | 0.83s | 175.1T/s | 99.98% |
| DeepSeek V3 0324 (free) | deepseek | Targon | - | $0.00 | 7.08s | 220.6T/s | 94.68% |
| Claude 3.7 Sonnet (thinking) | anthropic | Amazon Bedrock | #12 | $3.12 | 0.82s | 87.6T/s | 97.45% |
| R1 (free) | deepseek | Chutes | #7 | $0.00 | 9.29s | 68.4T/s | 99.14% |
| Claude 3.7 Sonnet | anthropic | Amazon Bedrock | #11 | $3.12 | 0.82s | 87.6T/s | 96.75% |
| R1 Distill Llama 70B | deepseek | SambaNova | - | $0.24 | 2.25s | 605.8T/s | 98.90% |
| Claude 3.5 Sonnet | anthropic | Google Vertex | - | $3.12 | 1.72s | 60.2T/s | 99.85% |
| Claude 3.5 Haiku | anthropic | Google Vertex | - | $0.83 | 0.75s | 109.1T/s | 99.24% |
| MythoMax 13B | gryphe | Together | - | $0.07 | 0.45s | 120.1T/s | 99.99% |
| DeepSeek V3 0324 | deepseek | SambaNova | - | $0.28 | 5.94s | 222.2T/s | 97.73% |
| Mistral Small 3 | mistralai | Mistral | #74 | $0.07 | 0.30s | 119.2T/s | 100.00% |
| Claude 3.5 Haiku (self-moderated) | anthropic | Anthropic | #44 | $0.83 | 1.30s | 108.8T/s | 98.92% |
| Mistral 7B Instruct | mistralai | Together | #142 | $0.03 | 0.38s | 163.1T/s | 100.00% |
| GPT-4o-mini (2024-07-18) | openai | OpenAI | #42 | $0.15 | 0.33s | 76.9T/s | 99.98% |
| Command R (08-2024) | cohere | Cohere | #85 | $0.15 | 0.37s | 54.7T/s | 99.67% |
| R1 | deepseek | SambaNova | #7 | $0.72 | 8.28s | 337.6T/s | 93.03% |
| Gemma 3 27B | google | Parasail | #15 | $0.10 | 0.74s | 46.5T/s | 99.50% |
| Qwen2.5 72B Instruct | qwen | SambaNova | #56 | $0.13 | 0.39s | 360.2T/s | 99.68% |
| Llama 3.1 8B Instruct | meta-llama | SambaNova | #108 | $0.02 | 0.30s | 816.6T/s | 99.99% |
| DeepSeek V3 | deepseek | Fireworks | #17 | $0.41 | 1.40s | 52.9T/s | 99.96% |
| DeepSeek V3 (free) | deepseek | Chutes | #16 | $0.00 | 1.55s | 38.7T/s | 99.51% |
| Qwen2.5 Coder 32B Instruct | qwen | SambaNova | #68 | $0.07 | 0.52s | 526.7T/s | 99.90% |
| LFM 3B | liquid | Liquid | - | $0.02 | 0.40s | 36.1T/s | 100.00% |
| Claude 3.5 Sonnet (self-moderated) | anthropic | Anthropic | - | $3.12 | 1.51s | 54.0T/s | 93.79% |
| WizardLM-2 8x22B | microsoft | Together | - | $0.50 | 0.52s | 76.0T/s | 99.99% |
| GPT-4o | openai | OpenAI | - | $2.58 | 0.46s | 73.1T/s | 98.68% |
| Hermes 2 Pro - Llama-3 8B | nousresearch | Lambda | - | $0.03 | 0.50s | 144.5T/s | 100.00% |
| Mistral Nemo | mistralai | Parasail | - | $0.04 | 0.74s | 135.8T/s | 99.99% |
| Llama 3.1 70B Instruct | meta-llama | SambaNova | - | $0.12 | 0.57s | 281.3T/s | 99.98% |
| Llama 3.1 405B Instruct | meta-llama | SambaNova | - | $0.81 | 2.55s | 90.2T/s | 99.86% |
| Hermes 3 405B Instruct | nousresearch | Lambda | - | $0.81 | 1.16s | 27.9T/s | 99.90% |
| Gemini Pro 2.5 Experimental (free) | google | Google Vertex | - | $0.00 | 12.82s | 120.2T/s | 48.84% |
| Llama 3 8B Instruct | meta-llama | Groq | #100 | $0.03 | 0.28s | 941.0T/s | 99.95% |
| Claude 3 Haiku | anthropic | Anthropic | #84 | $0.26 | 0.66s | 155.4T/s | 99.77% |
| Phi 4 | microsoft | Nebius AI Studio | #85 | $0.07 | 0.14s | 115.3T/s | 99.81% |
| Gemma 2 9B (free) | google | Chutes | #79 | $0.00 | 1.52s | 114.3T/s | 99.93% |
| Gemma 2 9B | google | Groq | #79 | $0.03 | 0.29s | 550.3T/s | 100.00% |
| QwQ 32B | qwen | SambaNova | #126 | $0.12 | 4.62s | 428.5T/s | 99.50% |
| GPT-4.5 (Preview) | openai | OpenAI | #2 | $76.20 | 1.22s | 9.4T/s | 99.55% |
| o3 Mini | openai | OpenAI | #18 | $1.14 | 6.51s | 117.5T/s | 99.93% |
| Mixtral 8x7B Instruct | mistralai | Groq | #115 | $0.24 | 0.36s | 648.5T/s | 100.00% |
| Claude 3.7 Sonnet (self-moderated) | anthropic | Anthropic | #9 | $3.12 | 1.34s | 56.9T/s | 92.06% |
| Claude 3 Haiku (self-moderated) | anthropic | Anthropic | #83 | $0.26 | 0.66s | 155.4T/s | 98.39% |
| Llama 3.2 3B Instruct | meta-llama | SambaNova | #121 | $0.02 | 0.33s | 1401.8T/s | 100.00% |
| Claude 3.5 Haiku (2024-10-22) | anthropic | Google Vertex | #40 | $0.83 | 1.35s | 56.5T/s | 85.69% |
| Claude 3.5 Haiku (2024-10-22) (self-moderated) | anthropic | Anthropic | #40 | $0.83 | 1.40s | 52.6T/s | 99.01% |
| Gemini Pro 2.0 Experimental (free) | google | Google Vertex | #4 | $0.00 | 1.59s | 47.3T/s | 92.93% |
| Gemini Flash 2.0 Experimental (free) | google | Google Vertex | #8 | $0.00 | 0.79s | 206.0T/s | 93.84% |
| Grok 2 1212 | x-ai | xAI | - | $2.08 | 0.16s | 50.9T/s | 99.92% |
| Grok 2 Vision 1212 | x-ai | xAI | - | $2.08 | 0.37s | 26.1T/s | 99.38% |
| Llama 3.3 Euryale 70B | sao10k | DeepInfra | - | $0.71 | 0.62s | 36.4T/s | 99.91% |
| Codestral 2501 | mistralai | Mistral | - | $0.31 | 0.25s | 179.1T/s | 99.97% |
| MiniMax-01 | minimax | Minimax | - | $0.21 | 1.23s | 25.4T/s | 99.95% |
| LFM 7B | liquid | Liquid | - | $0.01 | 0.69s | 58.9T/s | 99.99% |
| R1 Distill Qwen 32B | deepseek | Groq | - | $0.12 | 0.67s | 137.3T/s | 99.81% |
| Qwen-Max | qwen | Alibaba | - | $1.65 | 0.90s | 36.6T/s | 99.66% |
| Qwen-Turbo | qwen | Alibaba | - | $0.05 | 0.87s | 104.1T/s | 99.63% |
| Gemini Flash Lite 2.0 Preview (free) | google | Google Vertex | #13 | $0.00 | 0.76s | 144.3T/s | 98.55% |
| Mistral Small 3.1 24B | mistralai | Mistral | - | $0.10 | 0.47s | 109.2T/s | 76.80% |
| GPT-4o (2024-11-20) | openai | OpenAI | - | $2.58 | 0.71s | 37.5T/s | 99.81% |
| Rocinante 12B | thedrummer | Infermatic | - | $0.25 | 0.65s | 24.8T/s | 99.92% |
| Qwen2.5 7B Instruct | qwen | Together | - | $0.03 | 0.36s | 107.1T/s | 100.00% |
| Unslopnemo 12B | thedrummer | Infermatic | - | $0.50 | 0.55s | 68.9T/s | 99.68% |
| Mistral Tiny | mistralai | Mistral | - | $0.25 | 0.31s | 127.3T/s | 100.00% |
| Gemini Pro 1.5 | google | Google Vertex | - | $1.29 | 1.13s | 76.4T/s | 99.96% |
| Llama 3 Lumimaid 8B | neversleep | Mancer (private) | - | $0.20 | 0.57s | 62.1T/s | 99.98% |
| Llama 3 Lumimaid 8B (extended) | neversleep | Mancer (private) | - | $0.20 | 0.57s | 62.1T/s | 100.00% |
| GPT-4o (2024-08-06) | openai | OpenAI | #19 | $2.58 | 0.45s | 85.9T/s | 99.93% |
| Hermes 3 70B Instruct | nousresearch | Hyperbolic | - | $0.12 | 1.04s | 31.8T/s | 99.91% |
| Llama 3.1 Euryale 70B v2.2 | sao10k | DeepInfra | - | $0.71 | 0.45s | 35.6T/s | 99.95% |
| Llama 3 70B Instruct | meta-llama | Groq | #63 | $0.23 | 0.18s | 316.7T/s | 100.00% |
| GPT-3.5 Turbo | openai | OpenAI | #105 | $0.51 | 0.33s | 117.8T/s | 99.52% |
| QwQ 32B (free) | qwen | Nineteen | #126 | $0.00 | 3.23s | 128.5T/s | 98.03% |
| o1-mini | openai | OpenAI | #25 | $1.14 | 0.65s | 162.2T/s | 99.19% |
| Llama 3.2 1B Instruct | meta-llama | SambaNova | #150 | $0.01 | 0.49s | 1931.0T/s | 100.00% |
| Mistral Small 3.1 24B (free) | mistralai | Chutes | - | $0.00 | 1.33s | 62.0T/s | 98.32% |
| Command R+ (08-2024) | cohere | Cohere | #65 | $2.45 | 0.51s | 41.7T/s | 99.59% |
| o3 Mini High | openai | OpenAI | #13 | $1.14 | 9.32s | 119.2T/s | 98.71% |
| Llama 3.3 70B Instruct (free) | meta-llama | Together | #39 | $0.00 | 1.18s | 89.5T/s | 99.60% |
| Gemma 3 27B (free) | google | Chutes | #13 | $0.00 | 1.18s | 41.7T/s | 96.76% |
| Nova Lite 1.0 | amazon | Amazon Bedrock | #77 | $0.06 | 0.37s | 128.7T/s | 98.94% |
| R1 Distill Llama 70B (free) | deepseek | Together | - | $0.00 | 1.10s | 49.4T/s | 98.41% |
| Sonar | perplexity | Perplexity | - | $1.01 | 1.91s | 37.6T/s | 99.56% |
| Qwen2.5 VL 72B Instruct | qwen | Parasail | - | $0.71 | 0.71s | 33.5T/s | 99.60% |
| R1 Distill Llama 8B | deepseek | NovitaAI | - | $0.04 | 0.84s | 55.1T/s | 97.94% |
| Ministral 3B | mistralai | Mistral | - | $0.04 | 0.19s | 197.8T/s | 99.95% |
| Magnum v4 72B | anthracite-org | Infermatic | - | $1.89 | 0.53s | 29.9T/s | 99.98% |
| Llama 3.2 11B Vision Instruct | meta-llama | SambaNova | - | $0.06 | 0.70s | 776.1T/s | 99.18% |
| ReMM SLERP 13B | undi95 | Mancer | - | $0.81 | 0.77s | 45.7T/s | 100.00% |
| OpenChat 3.5 7B | openchat | Lepton | - | $0.06 | 0.39s | 108.3T/s | 99.94% |
| Dolphin 2.9.2 Mixtral 8x22B 🐬 | cognitivecomputations | NovitaAI | - | $0.91 | 4.06s | 8.9T/s | 98.49% |
| ChatGPT-4o | openai | OpenAI | #2 | $5.12 | 0.62s | 95.4T/s | 99.75% |

Created by JCdata from OpenRouter and Chatbot Arena