This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
11,864
Commits
2
Branches
140
Tags
1528e079e2b2cf8a807e4dce86ef05540e16a430
Commit Graph
2 Commits
Author
SHA1
Message
Date
Fadi Arafeh
730bd35378
[perf][cpu] Accelerate paged attention GEMMs (QK, PV) on Arm CPUs with NEON (
#29193
)
...
Signed-off-by: Fadi Arafeh <
fadi.arafeh@arm.com
>
2025-11-22 09:04:36 -08:00
Li, Jiang
7f829be7d3
[CPU] Refactor CPU attention backend (
#27954
)
...
Signed-off-by: jiang1.li <
jiang1.li@intel.com
>
2025-11-12 09:43:06 +08:00