Running 192 The ultimate guide to RL environments: building and scaling them in the LLM era ๐ 192 Building and scaling RL environments for LLM training
Running Featured 678 The Tokenizer Playground ๐ 678 Experiment with and compare different tokenizers
Running 114 Unlocking On-Policy Distillation for Any Model Family ๐ 114 Explore on-policy distillation visualization for any model
Running Featured 88 Distilling 100B+ Models 40x Faster with TRL ๐ 88 TRL distillation for 100B+ teachers, 40x faster
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard ๐ 1.02k VLMEvalKit Evaluation Results Collection
Running on CPU Upgrade 261 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens ๐ 261 Visualize syntheticโdata experiments as an interactive bookshelf
Running Featured 74 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems ๐ 74 Who needs 1T parameters? Olympiad proofs with a 4B model
Running on CPU Upgrade 14k Open LLM Leaderboard ๐ 14k Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade Featured 3.21k The Smol Training Playbook ๐ 3.21k The secrets to building world-class LLMs
Running 230 FineVision: Open Data is All You Need ๐ 230 A new open-source dataset for training VLMs
Running Featured 1.37k FineWeb: decanting the web for the finest text data at scale ๐ท 1.37k Explore and download the FineWeb webโscale text dataset