biondizzle/vllm
vllm/vllm/entrypoints/openai at commit 249b88228d1d371a5830c3394be010d51c7e5cbf
Latest commit: 249b88228d [Frontend] Support embeddings in the run_batch API (#7132)
Author: Pooya Davoodi (co-authored-by: Simon Mo <simon.mo@hey.com>)
Date: 2024-08-09 09:48:21 -07:00
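The head commit (#7132) extends run_batch.py, which consumes OpenAI-style batch files, to accept /v1/embeddings requests alongside the existing chat-completion requests. Below is a minimal sketch of preparing and running such a batch; the file names and the embedding model are placeholders, not prescribed by the commit.

# Sketch: build a batch input file containing an embeddings request, then run
# it through vllm.entrypoints.openai.run_batch. Assumes the OpenAI-style batch
# JSONL format; file names and the model choice are placeholders.
import json
import subprocess

request = {
    "custom_id": "embed-1",      # caller-chosen id, echoed back in the output file
    "method": "POST",
    "url": "/v1/embeddings",     # endpoint newly accepted by run_batch in #7132
    "body": {
        "model": "intfloat/e5-mistral-7b-instruct",   # placeholder embedding model
        "input": "The quick brown fox jumped over the lazy dog.",
    },
}

with open("batch_input.jsonl", "w") as f:
    f.write(json.dumps(request) + "\n")

# CLI equivalent: python -m vllm.entrypoints.openai.run_batch \
#     -i batch_input.jsonl -o batch_output.jsonl --model <embedding-model>
subprocess.run(
    ["python", "-m", "vllm.entrypoints.openai.run_batch",
     "-i", "batch_input.jsonl", "-o", "batch_output.jsonl",
     "--model", "intfloat/e5-mistral-7b-instruct"],
    check=True,
)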
Name                      Last commit                                                                       Date
rpc/                      [Frontend] Kill the server on engine death (#6594)                                2024-08-08 09:47:48 -07:00
__init__.py               Change the name to vLLM (#150)                                                    2023-06-17 03:07:40 -07:00
api_server.py             [Frontend] Kill the server on engine death (#6594)                                2024-08-08 09:47:48 -07:00
cli_args.py               [ Frontend ] Multiprocessing for OpenAI Server with zeromq (#6883)                2024-08-02 18:27:28 -07:00
logits_processors.py      [Core] Support serving encoder/decoder models (#7258)                             2024-08-09 10:39:41 +08:00
protocol.py               [Frontend] Support embeddings in the run_batch API (#7132)                        2024-08-09 09:48:21 -07:00
run_batch.py              [Frontend] Support embeddings in the run_batch API (#7132)                        2024-08-09 09:48:21 -07:00
serving_chat.py           [Frontend] Gracefully handle missing chat template and fix CI failure (#7238)     2024-08-07 09:12:05 +00:00
serving_completion.py     [BugFix] Overhaul async request cancellation (#7111)                              2024-08-07 13:21:41 +08:00
serving_embedding.py      [Frontend] Support embeddings in the run_batch API (#7132)                        2024-08-09 09:48:21 -07:00
serving_engine.py         [Core] Support serving encoder/decoder models (#7258)                             2024-08-09 10:39:41 +08:00
serving_tokenization.py   [Frontend] Gracefully handle missing chat template and fix CI failure (#7238)     2024-08-07 09:12:05 +00:00
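For orientation: api_server.py hosts the OpenAI-compatible HTTP server (its flags live in cli_args.py), while the serving_*.py modules implement the individual routes. A minimal sketch of launching the server and exercising the completion route handled by serving_completion.py, assuming a placeholder model and the default-style port; nothing here is pinned to this exact commit.

# Sketch: start the OpenAI-compatible server from api_server.py in a
# subprocess, wait for its /health route, and send one completion request.
# Model name and port are placeholders.
import subprocess
import time

import requests

server = subprocess.Popen(
    ["python", "-m", "vllm.entrypoints.openai.api_server",
     "--model", "facebook/opt-125m",    # placeholder model
     "--port", "8000"],
)
try:
    # Poll the health route exposed by api_server.py until the engine is up.
    for _ in range(120):
        try:
            if requests.get("http://localhost:8000/health", timeout=1).ok:
                break
        except requests.ConnectionError:
            pass
        time.sleep(1)

    # /v1/completions is handled by serving_completion.py.
    resp = requests.post(
        "http://localhost:8000/v1/completions",
        json={"model": "facebook/opt-125m",
              "prompt": "Hello, my name is",
              "max_tokens": 16},
    )
    print(resp.json())
finally:
    server.terminate()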