This is a prebuilt wheel (.whl) for Windows, Python 3.12.9, and PyTorch 2.10.0+cu130. Builds are available for Sage-Attention2, Flash-Attention2, and Block-Sparse-Attention.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support