biondizzle/vllm
vllm/csrc/attention/mla/cutlass_sm100_mla @ 4e33a7ea85cb702090c07fb7a8ebdbf44c472f5c
Alexander Matveev 1210e4d95b [Bugfix] [B200] cutlass_mla - ensure kv_split == 1 for batch size > 1 (#25509)
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
2025-09-23 16:57:55 -07:00
device    [Bugfix] [B200] cutlass_mla - ensure kv_split == 1 for batch size > 1 (#25509)    2025-09-23 16:57:55 -07:00
kernel    SM100 Cutlass MLA decode with unrestricted num_heads (< 128) for DeepSeek TP (#20769)    2025-07-15 01:06:38 +00:00