vllm/vllm/v1 at 8f10d5e3930f05c2057a831cd80ba24c52b8ceef - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Cyrus Leung 8f10d5e393 [Misc] Split up pooling tasks (#10820 )

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2024-12-11 01:28:00 -08:00

..

[torch.compile] add a flag to track batchsize statistics (#11059 )

2024-12-10 12:40:52 -08:00

[V1] Multiprocessing Tensor Parallel Support for v1 (#9856 )

2024-12-10 06:28:14 +00:00

[Misc] Split up pooling tasks (#10820 )

2024-12-11 01:28:00 -08:00

[V1] Multiprocessing Tensor Parallel Support for v1 (#9856 )

2024-12-10 06:28:14 +00:00

[V1] Multiprocessing Tensor Parallel Support for v1 (#9856 )

2024-12-10 06:28:14 +00:00

[Model] PP support for Mamba-like models (#10992 )

2024-12-10 21:53:37 -05:00

__init__.py

[V1] AsyncLLM Implementation (#9826 )

2024-11-11 23:05:38 +00:00

outputs.py

[V1] Multiprocessing Tensor Parallel Support for v1 (#9856 )

2024-12-10 06:28:14 +00:00

request.py

[V1] VLM - Run the mm_mapper preprocessor in the frontend process (#10640 )

2024-12-03 10:33:10 +00:00

serial_utils.py

[V1] Use pickle for serializing EngineCoreRequest & Add multimodal inputs to EngineCoreRequest (#10245 )

2024-11-12 08:57:14 -08:00

utils.py

[V1] Multiprocessing Tensor Parallel Support for v1 (#9856 )

2024-12-10 06:28:14 +00:00