inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4 Text Generation • Updated 3 days ago • 171
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 4 days ago • 153
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 4 days ago • 175
RedHatAI/Qwen3-235B-A22B-Instruct-2507-quantized.w8a8 Text Generation • 235B • Updated 4 days ago • 95
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated 4 days ago • 169