NVIDIA Nemotron TwoTower 30B-A3B diffusion LM in MLX for Apple Silicon: AR tower + two-tower diffusion, 4/6/8-bit + bf16.
-
pipenetwork/Nemotron-3-Nano-30B-A3B-context-mlx-4bit
Text Generation • 32B • Updated -
pipenetwork/Nemotron-3-Nano-30B-A3B-context-mlx-6bit
Text Generation • 32B • Updated • 1 -
pipenetwork/Nemotron-3-Nano-30B-A3B-context-mlx-8bit
Text Generation • 32B • Updated -
pipenetwork/Nemotron-3-Nano-30B-A3B-context-mlx-bf16
Text Generation • 32B • Updated