vllm/vllm/model_executor/layers/mamba at 0ff70821c9b0b991197fa7f3264bf9dd78b8d4b3 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Cyrus Leung aab0102a26 [V0 deprecation] Remove more V0 references (#29088 )

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2025-11-21 11:56:59 +00:00

..

[Hybrid] [Kernel] Fix chunk scan kernel when BLOCK_SIZE_DSTATE > 128 (#28295 )

2025-11-14 22:55:42 +00:00

__init__.py

[Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (#7651 )

2024-08-28 15:06:52 -07:00

abstract.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487 )

2025-11-19 16:24:55 +00:00

linear_attn.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487 )

2025-11-19 16:24:55 +00:00

mamba_mixer2.py

[V0 deprecation] Remove more V0 references (#29088 )

2025-11-21 11:56:59 +00:00

mamba_mixer.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487 )

2025-11-19 16:24:55 +00:00

mamba_utils.py

[Model] Introduce Kimi Linear to vLLM (#27809 )

2025-10-30 21:02:27 +08:00

short_conv.py

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487 )

2025-11-19 16:24:55 +00:00