-
open-thoughts/OpenThoughts-Agent-SFT-ColdStartForRL-10K
Viewer • Updated • 9.44k • 106 • 1 -
open-thoughts/OpenThoughts-Agent-RL-5K
Viewer • Updated • 5k • 101 • 1 -
open-thoughts/OpenThoughts-Agent-SFT-100K
Viewer • Updated • 94.3k • 251 • 9 -
open-thoughts/OpenThinkerAgent-8B-ColdStartSFTForRL
Text Generation • 308k • Updated • 98 • 1
Collections
Discover the best community collections!
Collections trending this week
-
JetBrains/Mellum2-12B-A2.5B-Thinking
Text Generation • 12B • Updated • 27.2k • 307 -
JetBrains/Mellum2-12B-A2.5B-Instruct
Text Generation • 12B • Updated • 7.73k • 77 -
JetBrains/Mellum2-12B-A2.5B-Thinking-SFT
Text Generation • 12B • Updated • 850 • 24 -
JetBrains/Mellum2-12B-A2.5B-Instruct-SFT
Text Generation • 12B • Updated • 390 • 14
-
DFlash: Block Diffusion for Flash Speculative Decoding
Paper • 2602.06036 • Published • 87 -
z-lab/Qwen3.5-397B-A17B-DFlash
Text Generation • 1B • Updated • 4.08k • 7 -
z-lab/gemma-4-31B-it-DFlash
Text Generation • 2B • Updated • 6.99k • 99 -
z-lab/gemma-4-26B-A4B-it-DFlash
Text Generation • 0.4B • Updated • 15.7k • 53
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 43.2k • 86 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 60.8k • • 407 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 246k • 148 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 109k • • 784
-
google/gemma-4-E2B-it-qat-q4_0-unquantized
Any-to-Any • 5B • Updated • 11.5k • 25 -
google/gemma-4-E4B-it-qat-q4_0-unquantized
Any-to-Any • 8B • Updated • 11.1k • 20 -
google/gemma-4-12B-it-qat-q4_0-unquantized
Any-to-Any • 12B • Updated • 76.2k • 61 -
google/gemma-4-26B-A4B-it-qat-q4_0-unquantized
Image-Text-to-Text • 27B • Updated • 30.3k • 32
-
LFM2.5 1.2B Thinking WebGPU
💧120Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU
-
Voxtral Realtime WebGPU
💬137Real-time speech transcription, entirely in your browser.
-
Nemotron 3 Nano WebGPU
⚛78A compact reasoning-capable model running in your browser.
-
Qwen3.5 WebGPU
😻87Run Qwen3.5 (0.8B, 2B, 4B) in-browser with Transformers.js
-
nvidia/Nemotron-Pretraining-Dataset-sample
Viewer • Updated • 27.7k • 1.06k • 64 -
nvidia/Nemotron-Pretraining-Legal-v1
Viewer • Updated • 9.62M • 930 • 12 -
nvidia/Nemotron-Pretraining-Specialized-v1.2
Viewer • Updated • 600M • 2.53k • 9 -
nvidia/Nemotron-Pretraining-Code-v3
Viewer • Updated • 146M • 2.73k • 54
-
open-thoughts/OpenThoughts-Agent-SFT-ColdStartForRL-10K
Viewer • Updated • 9.44k • 106 • 1 -
open-thoughts/OpenThoughts-Agent-RL-5K
Viewer • Updated • 5k • 101 • 1 -
open-thoughts/OpenThoughts-Agent-SFT-100K
Viewer • Updated • 94.3k • 251 • 9 -
open-thoughts/OpenThinkerAgent-8B-ColdStartSFTForRL
Text Generation • 308k • Updated • 98 • 1
-
google/gemma-4-E2B-it-qat-q4_0-unquantized
Any-to-Any • 5B • Updated • 11.5k • 25 -
google/gemma-4-E4B-it-qat-q4_0-unquantized
Any-to-Any • 8B • Updated • 11.1k • 20 -
google/gemma-4-12B-it-qat-q4_0-unquantized
Any-to-Any • 12B • Updated • 76.2k • 61 -
google/gemma-4-26B-A4B-it-qat-q4_0-unquantized
Image-Text-to-Text • 27B • Updated • 30.3k • 32
-
JetBrains/Mellum2-12B-A2.5B-Thinking
Text Generation • 12B • Updated • 27.2k • 307 -
JetBrains/Mellum2-12B-A2.5B-Instruct
Text Generation • 12B • Updated • 7.73k • 77 -
JetBrains/Mellum2-12B-A2.5B-Thinking-SFT
Text Generation • 12B • Updated • 850 • 24 -
JetBrains/Mellum2-12B-A2.5B-Instruct-SFT
Text Generation • 12B • Updated • 390 • 14
-
LFM2.5 1.2B Thinking WebGPU
💧120Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU
-
Voxtral Realtime WebGPU
💬137Real-time speech transcription, entirely in your browser.
-
Nemotron 3 Nano WebGPU
⚛78A compact reasoning-capable model running in your browser.
-
Qwen3.5 WebGPU
😻87Run Qwen3.5 (0.8B, 2B, 4B) in-browser with Transformers.js
-
DFlash: Block Diffusion for Flash Speculative Decoding
Paper • 2602.06036 • Published • 87 -
z-lab/Qwen3.5-397B-A17B-DFlash
Text Generation • 1B • Updated • 4.08k • 7 -
z-lab/gemma-4-31B-it-DFlash
Text Generation • 2B • Updated • 6.99k • 99 -
z-lab/gemma-4-26B-A4B-it-DFlash
Text Generation • 0.4B • Updated • 15.7k • 53
-
nvidia/Nemotron-Pretraining-Dataset-sample
Viewer • Updated • 27.7k • 1.06k • 64 -
nvidia/Nemotron-Pretraining-Legal-v1
Viewer • Updated • 9.62M • 930 • 12 -
nvidia/Nemotron-Pretraining-Specialized-v1.2
Viewer • Updated • 600M • 2.53k • 9 -
nvidia/Nemotron-Pretraining-Code-v3
Viewer • Updated • 146M • 2.73k • 54
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 43.2k • 86 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 60.8k • • 407 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 246k • 148 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 109k • • 784