[NVIDIA] Add Cutlass MLA backend (#17625)
This commit is contained in:
@@ -1395,6 +1395,7 @@ class EngineArgs:
|
||||
"PALLAS_VLLM_V1",
|
||||
"TRITON_ATTN_VLLM_V1",
|
||||
"TRITON_MLA",
|
||||
"CUTLASS_MLA_VLLM_V1",
|
||||
"FLASHMLA",
|
||||
"FLASHINFER",
|
||||
"FLASHINFER_VLLM_V1",
|
||||
|
||||
Reference in New Issue
Block a user