deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 673k • • 1.49k
Running 3.79k The Ultra-Scale Playbook 🌌 3.79k The ultimate guide to training LLM on large GPU Clusters
oliverguhr/fullstop-punctuation-multilang-large Token Classification • Updated Nov 16, 2023 • 416k • • 174
Running Featured 1.33k FineWeb: decanting the web for the finest text data at scale 🍷 1.33k Explore and download the FineWeb web‑text dataset
Running on L4 Agents 1.19k ControlNet V1.1 📉 1.19k Generate edited images using edge, pose, and other guides