vllm/vllm/model_executor/layers/mamba at c373b5c00d1a6f0830099ce5c4b5276e70bc6388 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Benjamin Chislett 8a680463fa [Bugfix] Fix NemotronH MTP + Chunked Prefill (#35447 )

2026-03-17 07:07:33 +01:00

..

[Bugfix] Fix NemotronH MTP + Chunked Prefill (#35447 )

2026-03-17 07:07:33 +01:00

__init__.py

[Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (#7651 )

2024-08-28 15:06:52 -07:00

abstract.py

[Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726 )

2026-02-24 09:49:56 -08:00

linear_attn.py

[Model] Ring 2.5 (#35102 )

2026-02-26 02:17:11 -08:00

mamba_mixer2.py

[Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726 )

2026-02-24 09:49:56 -08:00

mamba_mixer.py

[Mamba1] - Kernel Level Chunk Alignment for Prefix Caching (#34798 )

2026-03-01 20:40:23 +08:00

mamba_utils.py

[Deprecation] Deprecate code in 0.17 as scheduled (#35441 )

2026-02-28 17:32:37 +00:00

short_conv.py

[Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726 )

2026-02-24 09:49:56 -08:00