How accurate is the model?

by dw1932 - opened about 21 hours ago

dw1932

about 21 hours ago

I would like to know how accurate the model is after quantization. Has anyone run a test?

spectator2026

about 3 hours ago

Yes. On held-out WikiText, the full-vocab KL divergence against the bf16 master is ≈ 0.151. For reference, a solid int4 W4A16 quant of the same model (my own: https://huggingface.co/spectator2026/MiniMax-M3-AWQ-int4) scores ~0.16 on that metric, so this NVFP4 is a touch better. (Measured weight-only on an A100.)
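
In case it helps anyone reproduce this kind of number, here is a minimal sketch of a full-vocab KL metric. It is not the exact script behind the figures above. It assumes you already have per-token logits from the bf16 reference and from the quantized model at the same positions, takes the direction as KL(reference || quantized), and reports nats averaged over tokens. Those choices and the function names (`log_softmax`, `token_kl`, `mean_kl`) are my own assumptions for illustration.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over one token's full-vocabulary logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]

def token_kl(ref_logits, quant_logits):
    """KL(P_ref || P_quant) for one token position, in nats.

    Assumption: the reference (bf16) distribution is the first argument.
    """
    log_p = log_softmax(ref_logits)
    log_q = log_softmax(quant_logits)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))

def mean_kl(ref_batch, quant_batch):
    """Average per-token KL over a sequence of positions."""
    kls = [token_kl(r, q) for r, q in zip(ref_batch, quant_batch)]
    return sum(kls) / len(kls)

if __name__ == "__main__":
    # Toy 4-token vocabulary, two positions; real logits would come from the models.
    ref = [[2.0, 1.0, 0.5, -1.0], [0.1, 0.2, 3.0, 0.0]]
    quant = [[1.9, 1.1, 0.4, -0.8], [0.0, 0.3, 2.8, 0.1]]
    print(f"mean KL (nats): {mean_kl(ref, quant):.6f}")
```

Lower is better, and 0 means the quantized model's next-token distribution matches the reference exactly. Numbers are only comparable between quants when the evaluation text, context length, and KL direction are the same.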
