Running 105 Unlocking On-Policy Distillation for Any Model Family 📝 105 Visualize on-policy distillation for any model family
Running Featured 84 Distilling 100B+ Models 40x Faster with TRL 📝 84 TRL distillation for 100B+ teachers, 40x faster
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 244
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook 📚 3.18k The secrets to building world-class LLMs
Running 3.85k The Ultra-Scale Playbook 🌌 3.85k The ultimate guide to training LLM on large GPU Clusters
Running Agents 18 Chapter 1 Quiz - Transformers Fundementals 🔥 18 Test your knowledge of the Transformers Fundementals