view article Article 🚀 DTS: A Candidate for the Best Parallel Reasoning in LLMs guan-wang • Feb 11 • 14
view article Article NEO-unify: Building Native Multimodal Unified Models End to End sensenova • Mar 5 • 159
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family lightonai • Jan 19 • 93
Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs Paper • 2601.05851 • Published Jan 9 • 3
Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published Dec 17, 2025 • 17
view article Article Why You Should Care About Partial Differential Equations (PDEs) hugging-science • Dec 12, 2025 • 45
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 380
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 134
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR lightonai • Oct 23, 2025 • 73
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98
nablaNABLA: Neighborhood Adaptive Block-Level Attention Paper • 2507.13546 • Published Jul 17, 2025 • 126
view article Article Understanding Gemma 3n: How MatFormer Gives You Many Models in One rishiraj • Jun 26, 2025 • 50
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19, 2025 • 89
view article Article Learn the Hugging Face Kernel Hub in 5 Minutes +5 drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb • Jun 12, 2025 • 164