This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
7f829be7d3d734020606fcca520f3c500581beb8
vllm
/
tests
/
models
/
language
History
Li, Jiang
7f829be7d3
[CPU] Refactor CPU attention backend (
#27954
)
...
Signed-off-by: jiang1.li <
jiang1.li@intel.com
>
2025-11-12 09:43:06 +08:00
..
generation
[CPU] Refactor CPU attention backend (
#27954
)
2025-11-12 09:43:06 +08:00
generation_ppl_test
[Model][0/N] Improve all pooling task | clean up (
#25817
)
2025-10-13 16:44:50 +08:00
pooling
[CPU] Refactor CPU attention backend (
#27954
)
2025-11-12 09:43:06 +08:00
pooling_mteb_test
[Bugfix] Fix out of bound index issue for Jina-embedding-v3 RoPE with cuda graph (
#26687
)
2025-10-13 03:21:48 -07:00
__init__.py
[CI/Build] Reorganize models tests (
#17459
)
2025-04-30 23:03:08 -07:00