Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published 12 days ago
Visualize Dataset (v2.0+ latest dataset format): Explore and visualize LeRobot datasets easily