biondizzle/vllm
4f6eed3bd4a92c6bd513460ee85b917d6df88a17
vllm/vllm/model_executor/layers/mamba
Latest commit: b779eb3363 by Xu Jinyang, 2026-03-31 23:03:24 +04:00
[Model] Sync upstream BT=chunk_size fix for GDN chunk_fwd_kernel_o, simplify warmup to single pass (#38343)
Signed-off-by: AuYang <459461160@qq.com>
Co-authored-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Name | Last commit | Date
ops | [Bugfix] clamp dA_cumsum differences to prevent Inf in Mamba2 SSD kernels (#37501) | 2026-03-31 17:35:51 +02:00
__init__.py | [Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (#7651) | 2024-08-28 15:06:52 -07:00
abstract.py | [Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726) | 2026-02-24 09:49:56 -08:00
gdn_linear_attn.py | [Model] Sync upstream BT=chunk_size fix for GDN chunk_fwd_kernel_o, simplify warmup to single pass (#38343) | 2026-03-31 23:03:24 +04:00
linear_attn.py | [V0 Deprecation] Refactor kv cache from list to element (#37487) | 2026-03-23 20:10:11 -07:00
mamba_mixer2.py | [Mamba] Add stochastic rounding support (#35753) | 2026-03-30 12:33:49 -04:00
mamba_mixer.py | [Mamba] Add stochastic rounding support (#35753) | 2026-03-30 12:33:49 -04:00
mamba_utils.py | [Deprecation] Deprecate code in 0.17 as scheduled (#35441) | 2026-02-28 17:32:37 +00:00
short_conv.py | [V0 Deprecation] Refactor kv cache from list to element (#37487) | 2026-03-23 20:10:11 -07:00