vllm/vllm/model_executor at a0e50a4260d20d021d6caa137a078ae2d16a8f93 - vllm

Files

Benjamin Chislett f5972a872f [Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726 )

Signed-off-by: Shahar Mor <smor@nvidia.com>
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Shahar Mor <smor@nvidia.com>
Co-authored-by: Roi Koren <roik@nvidia.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>

2026-02-24 09:49:56 -08:00

kernels

[Refactor] [1/N] Reorganize kernel abstraction directory (#34055 )

2026-02-24 06:47:22 +00:00

layers

[Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726 )

2026-02-24 09:49:56 -08:00

model_loader

Use Xet high performance mode for Transformers v5 (#35098 )

2026-02-23 08:19:30 -08:00

models

[Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726 )