UncheatableEval 🏆 (Running; Agents; 68 likes): Explore model scaling metrics with interactive tables and plots
GOT Online 💬 (Configuration error; Agents, Featured; 360 likes): Extract text from images using various OCR modes
FineWeb: decanting the web for the finest text data at scale 🍷 (Running; Featured; 1.33k likes): Explore and download the FineWeb web-text dataset
AI2 WildBench Leaderboard (V2) 🦁 (Running; Agents; 232 likes): Display and explore a leaderboard of language models