vllm/vllm/model_executor/layers/mamba at a8c6ee9b787d273916206a29b77feebadb80c368 - vllm

Files

Artem Perevedentsev cb10b7e80b [GDN] Eliminate GPU->CPU sync in prepare_chunk_indices during prefill (#38361 )

Signed-off-by: Artem Perevedentsev <aperevedents@nvidia.com>
Signed-off-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>

2026-04-03 13:38:02 +00:00

2026-04-03 01:50:09 +02:00

__init__.py

2024-08-28 15:06:52 -07:00

abstract.py

2026-02-24 09:49:56 -08:00

gdn_linear_attn.py

2026-04-03 13:38:02 +00:00

linear_attn.py

2026-03-23 20:10:11 -07:00

mamba_mixer2.py

2026-04-03 01:50:09 +02:00

mamba_mixer.py

2026-04-03 01:50:09 +02:00

mamba_utils.py

2026-04-03 01:50:09 +02:00

short_conv.py

2026-04-03 01:50:09 +02:00