[Model] Add support for OLMo Hybrid (#32550)

This commit is contained in:
Yanhong Li
2026-03-05 11:51:06 -08:00
committed by GitHub
parent 5395471d29
commit a911f4dd20
10 changed files with 1520 additions and 53 deletions

View File

@@ -666,6 +666,7 @@ class CompilationConfig:
"vllm::linear_attention",
"vllm::plamo2_mamba_mixer",
"vllm::gdn_attention_core",
"vllm::olmo_hybrid_gdn_full_forward",
"vllm::kda_attention",
"vllm::sparse_attn_indexer",
"vllm::rocm_aiter_sparse_attn_indexer",