Running 62 Stick To Your Role! Leaderboard 🎠62 Benchmarking LLMs on the stability of simulated populations
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated May 22, 2025 • 367k • • 1.27k
Running 596 Scaling test-time compute 📈 596 Run advanced search strategies to boost LLM problem solving
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 1 day ago • 38
meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 492k • • 2.73k
Running Agents 110 Judge Arena 💻 110 View and compare open‑source AI model rankings with ELO scores
meta-llama/Meta-Llama-3-8B-Instruct Text Generation • 8B • Updated Jun 18, 2025 • 1.3M • • 4.48k