Commit History

Drop VLLM_USE_DEEP_GEMM=0 from vllm serve recipe (DeepGEMM is supported on Hopper and datacenter Blackwell)
b17bd8d
verified

joerowell commited on

Enable thinking by default in non-Hopper FP8-KV serve command
e8b30a7
verified

joerowell commited on

Update non-Hopper FP8-KV serve command and link to vLLM recipes page
2ad0232
verified

joerowell commited on

Laguna XS.2 upload
fba1514

joerowell commited on