This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
fa6a6be51978bd4b49ba0da17039e60f96dc5b13
vllm
/
benchmarks
/
kernels
/
benchmark_mla_k_concat.py
Ming Yang
fba8906930
[perf] Use direct copy (broadcast) instead of cat for k_nope/k_pe in MLA prefill (
#29710
)
...
Signed-off-by: Ming Yang <
minos.future@gmail.com
>
2025-12-11 08:20:45 +00:00
4.5 KiB
Raw
Blame
History
View Raw
Reference in New Issue
View Git Blame
Copy Permalink