Running on CPU Upgrade 590 GAIA Leaderboard 🦾 590 Submit your model answers to GAIA benchmark and view leaderboard
Running 230 BigCodeBench Leaderboard 🥇 230 Explore code-generation model leaderboards and task details
Running 124 Berkeley Function Calling Leaderboard 🏃 124 View the Berkeley Function-Calling Leaderboard
Running 1.5k Big Code Models Leaderboard 📈 1.5k Explore and submit code model evaluations on a leaderboard
Running Featured 584 LLM-Perf Leaderboard 🏆 584 Explore LLM performance across hardware configurations