biondizzle/vllm
4e4d017b6f70c729e7c78f74e4328a4ebca7b8ec
vllm/csrc/attention/mla/cutlass_sm100_mla/device
Alexander Matveev
8cdc371217
SM100 Cutlass MLA decode with unrestricted num_heads (< 128) for DeepSeek TP (#20769)
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
2025-07-15 01:06:38 +00:00
sm100_mla.hpp
SM100 Cutlass MLA decode with unrestricted num_heads (< 128) for DeepSeek TP (#20769)
2025-07-15 01:06:38 +00:00