vllm/vllm/model_executor/layers/mamba at 613abb50d5715ba693ee9d5b727e8385b98e7185

biondizzle/vllm

Shanshan Shen d44e9df7d4 [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)

Signed-off-by: shen-shanshan <467638484@qq.com>

2025-11-19 16:24:55 +00:00

[Hybrid] [Kernel] Fix chunk scan kernel when BLOCK_SIZE_DSTATE > 128 (#28295)

2025-11-14 22:55:42 +00:00

__init__.py

[Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (#7651)

2024-08-28 15:06:52 -07:00

abstract.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)

2025-11-19 16:24:55 +00:00

linear_attn.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)

2025-11-19 16:24:55 +00:00

mamba_mixer2.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)

2025-11-19 16:24:55 +00:00

mamba_mixer.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)

2025-11-19 16:24:55 +00:00

mamba_utils.py

[Model] Introduce Kimi Linear to vLLM (#27809)

2025-10-30 21:02:27 +08:00

short_conv.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)

2025-11-19 16:24:55 +00:00