Running 10 TurboQuant on Consumer GPUs β 100K Context on RTX 3090, 64K on RTX 4070 π 10 Extend LLM context to 100K tokens on consumer GPUs
Running 10 TurboQuant on Consumer GPUs β 100K Context on RTX 3090, 64K on RTX 4070 π 10 Extend LLM context to 100K tokens on consumer GPUs
Running 10 TurboQuant on Consumer GPUs β 100K Context on RTX 3090, 64K on RTX 4070 π 10 Extend LLM context to 100K tokens on consumer GPUs
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF Image-Text-to-Text β’ 27B β’ Updated Apr 6 β’ 70.2k β’ 606
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation β’ 67B β’ Updated May 1 β’ 1.56M β’ 328
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation β’ 124B β’ Updated Apr 29 β’ 475k β’ 251
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text β’ 28B β’ Updated Apr 6 β’ 163k β’ β’ 2.87k
Running on CPU Upgrade 245 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens π 245 Explore synthetic data experiments on an interactive bookshelf