leaderboards - a MoritzLaurer Collection

MoritzLaurer 's Collections

prompt-templates

Zeroshot Classifiers

other-interesting

code generation

leaderboards

updated Apr 2, 2025

Running

4.71k

LMArena Leaderboard

🏆

4.71k

View LMArena model leaderboard
Running on CPU Upgrade

13.9k

Open LLM Leaderboard

🏆

13.9k

Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade

7.03k

MTEB Leaderboard

🥇

7.03k

Embedding Leaderboard
Running on CPU Upgrade

Featured

1.22k

Open ASR Leaderboard

🏆

1.22k

Explore and compare speech‑recognition model benchmarks
Running

Featured

583

LLM-Perf Leaderboard

🏆

583

Explore LLM performance across hardware configurations
Running

1.49k

Big Code Models Leaderboard

📈

1.49k

Explore and submit code model evaluations on a leaderboard
Runtime error

78

Human & GPT-4 Evaluation of LLMs Leaderboard

👩

78
Running

450

Can Ai Code Results

🏆

450

Can AI Code? An LLM leaderboard inclquantized models.
Runtime error

145

Hallucinations Leaderboard

🔥

145

View and submit LLM evaluations
Build error

105

Enterprise Scenarios Leaderboard

🥇

105
Running on CPU Upgrade

93

LLM Safety Leaderboard

🥇

93

Explore and submit LLM benchmarks
Running

Featured

560

Vision Arena (Testing VLMs side-by-side)

🖼

560

Analyze images with multiple vision models for labels and boxes
Running

71

CyberSecEvalTest

📈

71

Evaluate LLMs' cybersecurity risks and capabilities
Running

Featured

438

LLM Performance Leaderboard

🐨

438

View AI model performance leaderboard
Running on CPU Upgrade

75

AIR-Bench Leaderboard

🥇

75

Explore and compare QA and long doc benchmarks
Running on CPU Upgrade

990

Open VLM Leaderboard

🌎

990

VLMEvalKit Evaluation Results Collection
Running

420

Reward Bench Leaderboard

📐

420

Explore RewardBench model rankings and scores
Running

230

BigCodeBench Leaderboard

🥇

230

Explore code-generation model leaderboards and task details
Runtime error

10

MJ Bench Leaderboard

🥇

10

Display and filter multimodal model leaderboard results
Running

116

MTEB Arena

⚔

116

Display MTEB Arena interface
Runtime error

Featured

151

Open LLM Progress Tracker

🔬

151

Visualize Open vs. Proprietary LLM Progress
Running

109

Judge Arena

💻

109

View and compare open‑source AI model rankings with ELO scores
Running

463

TTS Spaces Arena

🤗

463

Blind vote on HF TTS models!
Running

Featured

141

smolagents LLM leaderboard

🏆

141

A leaderboard for LLMs powering smolagents