vllm/vllm/lora at 42c194964341bea9fc59e0d35db04dfafc3c473d

Files

Xin Yang a491b0911b [LoRA] Support FusedMoE LoRA Triton kernel for mxfp4 (#29708)

Signed-off-by: Xin Yang <xyangx@amazon.com>
Signed-off-by: Xin Yang <105740670+xyang16@users.noreply.github.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>

2025-11-30 10:37:25 +08:00

layers

[LoRA] Support FusedMoE LoRA Triton kernel for mxfp4 (#29708)

2025-11-30 10:37:25 +08:00

ops

Add support for --fully-sharded-loras in fused_moe (#28761)

2025-11-19 16:32:00 +08:00

punica_wrapper

[Doc]: fixing typos in diverse files (#29492)

2025-11-27 07:15:50 -08:00

__init__.py

[Experimental] Add multi-LoRA support (#1804)

2024-01-23 15:26:37 -08:00

lora_weights.py

[LoRA] Continue optimizing MoE LoRA weight loading (#29322)

2025-11-27 05:56:28 -08:00

models.py

[LoRA] Clean up unused LoRA code (#29611)

2025-11-28 22:52:58 -08:00

peft_helper.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00

request.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00

resolver.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00

utils.py

[LoRA] Continue optimizing MoE LoRA weight loading (#29322)

2025-11-27 05:56:28 -08:00

worker_manager.py

[LoRA] Clean up unused LoRA code (#29611)

2025-11-28 22:52:58 -08:00