biondizzle/vllm
13,562 Commits, 2 Branches, 140 Tags
f0d525171557e3fe74e8e6df52257f9d66831d3f
Commit Graph
2 Commits
Author: R3hankhan
SHA1: 8e27663b6a
Message: [CPU] Add head sizes 80 and 112 with vec16 fallback (#31968)
Signed-off-by: Rehan Khan <Rehan.Khan7@ibm.com>
Date: 2026-01-09 22:14:46 +08:00

Author: Fadi Arafeh
SHA1: 730bd35378
Message: [perf][cpu] Accelerate paged attention GEMMs (QK, PV) on Arm CPUs with NEON (#29193)
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
Date: 2025-11-22 09:04:36 -08:00