vllm/vllm/model_executor at 90fbf12540da089fcc7dc825ce2ceb7ea3a3df33 - vllm

Files

felixzhu555 703e42ee4b Add guided decoding for OpenAI API server (#2819)

Co-authored-by: br3no <breno@veltefaria.de>
Co-authored-by: simon-mo <simon.mo@hey.com>

2024-02-29 22:13:08 +00:00

2024-02-28 21:52:23 -08:00

2024-02-29 00:51:48 -08:00

2024-02-21 18:56:01 -08:00

__init__.py                  2024-02-28 09:34:34 -08:00
guided_decoding.py           2024-02-29 22:13:08 +00:00
guided_logits_processors.py  2024-02-29 22:13:08 +00:00
input_metadata.py            2024-01-28 16:43:54 -08:00
model_loader.py              2024-02-28 09:34:34 -08:00
neuron_model_loader.py       2024-02-28 09:34:34 -08:00
sampling_metadata.py         2024-02-28 09:34:34 -08:00
utils.py                     2024-02-28 09:34:34 -08:00
weight_utils.py              2024-02-01 15:41:58 -08:00