Running on CPU Upgrade 18 BigCodeBench Evaluator 🥇 18 Evaluate code samples using specified parameters
Running on CPU Upgrade 13.8k Open LLM Leaderboard 🏆 13.8k Track, rank and evaluate open LLMs and chatbots