Inference Providers
Active filters: vLLM
mistralai/Mistral-Small-4-119B-2603-eagle
Updated • 417
• 44
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 75.9k
• 350
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 373k
• 37
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
• 31B • Updated • 100k
• 11
QuantTrio/Qwopus3.5-27B-v3-AWQ
Image-Text-to-Text
• 27B • Updated • 16.1k
• 6
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 44.1k
• 57
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated • 69.1k
• 14
Image-Text-to-Text
• 10B • Updated • 345k
• 12
mistralai/Mistral-Small-4-119B-2603-NVFP4
Updated • 3.56k
• 81
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 158k
• 16
QuantTrio/Qwen3.5-122B-A10B-AWQ
Image-Text-to-Text
• 125B • Updated • 57.1k
• 25
Text Generation
• 586B • Updated • 5.16k
• 5
Image-Text-to-Text
• 5B • Updated • 39.5k
• 7
QuantTrio/Qwopus3.5-27B-v3-AWQ-6Bit
Image-Text-to-Text
• 27B • Updated • 1.45k
• 2
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 7.38k
• 6
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 35.2k
• 3
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 83
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 16
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 91
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 82
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 5
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 114
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 229
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 87
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 8
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 194
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 9
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 1.95k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 272
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 4
• 1