Drop VLLM_USE_DEEP_GEMM=0 from vllm serve recipe (DeepGEMM is supported on Hopper and datacenter Blackwell) b17bd8d verified joerowell committed 4 days ago
Enable thinking by default in non-Hopper FP8-KV serve command e8b30a7 verified joerowell committed 25 days ago
Update non-Hopper FP8-KV serve command and link to vLLM recipes page 2ad0232 verified joerowell committed 25 days ago