This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
67a6882da474a45dde0d35b3789e096e7bd0fd4e
vllm
/
vllm
/
lora
/
ops
History
Jee Jee Li
9b0e3ec970
[Kernel][LoRA] Add assertion for punica sgmv kernels (
#7585
)
2024-09-23 18:57:42 +00:00
..
__init__.py
[Kernel][RFC] Refactor the punica kernel based on Triton (
#5036
)
2024-07-31 17:12:24 -07:00
bgmv_expand_slice.py
[Kernel][LoRA] Add assertion for punica sgmv kernels (
#7585
)
2024-09-23 18:57:42 +00:00
bgmv_expand.py
[Kernel][LoRA] Add assertion for punica sgmv kernels (
#7585
)
2024-09-23 18:57:42 +00:00
bgmv_shrink.py
[Bugfix] Make torch registration of punica ops optional (
#7970
)
2024-08-28 16:11:49 -06:00
sgmv_expand_slice.py
[Kernel][LoRA] Add assertion for punica sgmv kernels (
#7585
)
2024-09-23 18:57:42 +00:00
sgmv_expand.py
[Kernel][LoRA] Add assertion for punica sgmv kernels (
#7585
)
2024-09-23 18:57:42 +00:00
sgmv_shrink.py
[Kernel][LoRA] Add assertion for punica sgmv kernels (
#7585
)
2024-09-23 18:57:42 +00:00
utils.py
[Kernel][RFC] Refactor the punica kernel based on Triton (
#5036
)
2024-07-31 17:12:24 -07:00