LMArena Leaderboard
View LMArena model leaderboard
View LMArena model leaderboard
Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Explore and compare speechβrecognition model benchmarks
Explore LLM performance across hardware configurations
Explore and submit code model evaluations on a leaderboard
Can AI Code? An LLM leaderboard inclquantized models.
View and submit LLM evaluations
Explore and submit LLM benchmarks
Analyze images with multiple vision models for labels and boxes
Evaluate LLMs' cybersecurity risks and capabilities
View AI model performance leaderboard
Explore and compare QA and long doc benchmarks
VLMEvalKit Evaluation Results Collection
Explore RewardBench model rankings and scores
Explore code-generation model leaderboards and task details
Display and filter multimodal model leaderboard results
Display MTEB Arena interface
Visualize Open vs. Proprietary LLM Progress
View and compare openβsource AI model rankings with ELO scores
Blind vote on HF TTS models!
A leaderboard for LLMs powering smolagents