Alan Blanchet

Alanox

https://alan-blanchet.fr/

AlanBlanchet

AI & ML interests

None yet

Alanox's collections (1)

LLM Evaluation Benchmarks

This collection gathers references to the evaluation benchmarks commonly seen in LLM papers.

240

MMLU-Pro Leaderboard

🥇

More advanced and challenging multi-task evaluation

574

GAIA Leaderboard

🦾

Submit and evaluate models on GAIA leaderboard
