vllm/tests/lora at 3b7178cfa4a317922d4aef9dd3b2647b8d950e7d - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Liangfu Chen 3b7178cfa4 [Neuron] Support inference with transformers-neuronx (#2569 )

2024-02-28 09:34:34 -08:00

..

__init__.py

[Experimental] Add multi-LoRA support (#1804 )

2024-01-23 15:26:37 -08:00

conftest.py

[Neuron] Support inference with transformers-neuronx (#2569 )

2024-02-28 09:34:34 -08:00

test_layers.py

chore(vllm): codespell for spell checking (#2820 )

2024-02-21 18:56:01 -08:00

test_llama.py

chore(vllm): codespell for spell checking (#2820 )

2024-02-21 18:56:01 -08:00

test_lora_manager.py

Add LoRA support for Mixtral (#2831 )

2024-02-14 00:55:45 +01:00

test_lora.py

[Experimental] Add multi-LoRA support (#1804 )

2024-01-23 15:26:37 -08:00

test_mixtral.py

Add LoRA support for Mixtral (#2831 )

2024-02-14 00:55:45 +01:00

test_punica.py

[Experimental] Add multi-LoRA support (#1804 )

2024-01-23 15:26:37 -08:00

test_tokenizer.py

[Experimental] Add multi-LoRA support (#1804 )

2024-01-23 15:26:37 -08:00

test_utils.py

[Experimental] Add multi-LoRA support (#1804 )

2024-01-23 15:26:37 -08:00

test_worker.py

Remove hardcoded device="cuda" to support more devices (#2503 )

2024-02-01 15:46:39 -08:00

utils.py

[Experimental] Add multi-LoRA support (#1804 )

2024-01-23 15:26:37 -08:00