This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
12,987
Commits
2
Branches
140
Tags
d084e9fca7d5d40cbb62eb5fe8ab5cbc6c769cf0
Commit Graph
2 Commits
Author
SHA1
Message
Date
R3hankhan
8e27663b6a
[CPU] Add head sizes 80 and 112 with vec16 fallback (
#31968
)
...
Signed-off-by: Rehan Khan <
Rehan.Khan7@ibm.com
>
2026-01-09 22:14:46 +08:00
Fadi Arafeh
730bd35378
[perf][cpu] Accelerate paged attention GEMMs (QK, PV) on Arm CPUs with NEON (
#29193
)
...
Signed-off-by: Fadi Arafeh <
fadi.arafeh@arm.com
>
2025-11-22 09:04:36 -08:00