[Kernel][RFC] Refactor the punica kernel based on Triton (#5036)
@@ -66,7 +66,6 @@ You can also build and install vLLM from source:
$ git clone https://github.com/vllm-project/vllm.git
$ cd vllm
$ # export VLLM_INSTALL_PUNICA_KERNELS=1 # optionally build for multi-LoRA capability
$ pip install -e . # This may take 5-10 minutes.
.. tip::