vllm/examples/offline_inference_chat.py at b4522474a32b6e0bf5573a9b6a6830cb787dfb63

Andy 2529d09b5a [Frontend] Batch inference for llm.chat() API (#8648)

Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>

2024-09-24 09:44:11 -07:00

2.0 KiB
