vllm/csrc/attention/mla/cutlass_sm100_mla @ c8bde93367fb252eca1e9a6ae78650caa4a9a951
Latest commit: 1210e4d95b by Alexander Matveev, 2025-09-23 16:57:55 -07:00
[Bugfix] [B200] cutlass_mla - ensure kv_split == 1 for batch size > 1 (#25509)
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
device | [Bugfix] [B200] cutlass_mla - ensure kv_split == 1 for batch size > 1 (#25509) | 2025-09-23 16:57:55 -07:00
kernel | SM100 Cutlass MLA decode with unrestricted num_heads (< 128) for DeepSeek TP (#20769) | 2025-07-15 01:06:38 +00:00