biondizzle/vllm
Path: vllm/vllm/model_executor/layers/mamba
Commit: 5042815ab6eb96596730e2853beef6cdfe0a3996
Latest commit: 5206e5e28c [V1][Hybrid] Mamba Prefix Caching with align mode (#30877)
Author: Harry Huang, 2026-01-23 09:56:48 -08:00
Signed-off-by: huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com>
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Name             Last commit                                                                  Date
ops              [Chore] Migrate V0 attention utils (#31891)                                  2026-01-07 13:44:36 +00:00
__init__.py      [Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (#7651)   2024-08-28 15:06:52 -07:00
abstract.py      [V1][Hybrid] Mamba Prefix Caching with align mode (#30877)                   2026-01-23 09:56:48 -08:00
linear_attn.py   [1/N][Attention] Restructure attention: move files (#31916)                  2026-01-09 13:10:24 -08:00
mamba_mixer2.py  [V1][Hybrid] Mamba Prefix Caching with align mode (#30877)                   2026-01-23 09:56:48 -08:00
mamba_mixer.py   [V1][Hybrid] Mamba Prefix Caching with align mode (#30877)                   2026-01-23 09:56:48 -08:00
mamba_utils.py   [V1][Hybrid] Mamba Prefix Caching with align mode (#30877)                   2026-01-23 09:56:48 -08:00
short_conv.py    [1/N][Attention] Restructure attention: move files (#31916)                  2026-01-09 13:10:24 -08:00
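For orientation, a minimal sketch of how these modules are typically imported in a vLLM checkout. The import paths follow directly from the directory listing above; the class names (MambaMixer, MambaMixer2, ShortConv) are assumptions inferred from the filenames and vLLM's usual module-to-class naming, so verify them against the files at this commit.

```python
# A minimal sketch, assuming vLLM is installed and that each module exposes
# a class matching its filename (an assumption, not confirmed by this page).
from vllm.model_executor.layers.mamba.mamba_mixer import MambaMixer    # assumed: Mamba-1 SSM mixer
from vllm.model_executor.layers.mamba.mamba_mixer2 import MambaMixer2  # assumed: Mamba-2 mixer
from vllm.model_executor.layers.mamba.short_conv import ShortConv      # assumed: short causal-conv layer

# Quick smoke check that the symbols resolve at this commit.
print(MambaMixer, MambaMixer2, ShortConv)
```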