biondizzle/vllm
808d6fd7b97f71f64a14ad4eecb9afd7b4d9dcf8
Commit Graph
2 Commits
Author
SHA1
Message
Date
R3hankhan
8e27663b6a
[CPU] Add head sizes 80 and 112 with vec16 fallback (#31968)
Signed-off-by: Rehan Khan <Rehan.Khan7@ibm.com>
2026-01-09 22:14:46 +08:00
Fadi Arafeh
730bd35378
[perf][cpu] Accelerate paged attention GEMMs (QK, PV) on Arm CPUs with NEON (#29193)
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
2025-11-22 09:04:36 -08:00