-
233
MMLU-Pro Leaderboard
๐ฅMore advanced and challenging multi-task evaluation
-
56
Stick To Your Role! Leaderboard
๐ญBenchmarking LLMs on the stability of simulated populations
-
53
ZeroEval Leaderboard
๐Embed ZeroEval for evaluation
-
26
Decentralized Arena Leaderboard
๐ฅView and compare LLM evaluations across various domains
Hristo Panev
hppdqdq
AI & ML interests
None yet
Recent Activity
liked
a model
11 days ago
lightx2v/Autoencoders
liked
a model
19 days ago
JunhaoZhuang/FlashVSR
liked
a model
about 1 month ago
Phr00t/Qwen-Image-Edit-Rapid-AIO
Organizations
None yet