[ROCm] Split AITER unified attention into its own backend (#25507)
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
committed by GitHub
parent 2161efe978
commit f231e5bc21
@@ -1623,6 +1623,7 @@ class EngineArgs:
 "TREE_ATTN",
 "XFORMERS",
 "ROCM_ATTN",
+"ROCM_AITER_UNIFIED_ATTN",
 ]
 if (
 envs.is_set("VLLM_ATTENTION_BACKEND")
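The hunk adds "ROCM_AITER_UNIFIED_ATTN" to a list of backend names that sits directly before a check on whether VLLM_ATTENTION_BACKEND is set. The surrounding logic in EngineArgs is not visible here, so the following is only a minimal sketch of how such a list might gate the environment variable. The names VISIBLE_BACKENDS and requested_backend_is_listed are hypothetical, and the list holds only the entries shown in the hunk, not the full list.

```python
import os

# Only the entries visible in the hunk; the real list in EngineArgs is longer.
VISIBLE_BACKENDS = [
    "TREE_ATTN",
    "XFORMERS",
    "ROCM_ATTN",
    "ROCM_AITER_UNIFIED_ATTN",  # entry added by this commit
]


def requested_backend_is_listed(environ=None):
    """Hypothetical helper, not part of vLLM.

    Returns True when VLLM_ATTENTION_BACKEND is unset or names a backend
    in VISIBLE_BACKENDS, and False otherwise.
    """
    environ = os.environ if environ is None else environ
    name = environ.get("VLLM_ATTENTION_BACKEND")
    return name is None or name in VISIBLE_BACKENDS
```

Under this assumption, a run with VLLM_ATTENTION_BACKEND=ROCM_AITER_UNIFIED_ATTN would pass the membership check after this commit, while a name missing from the list would not.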