vllm/vllm/executor at 5eda2ea02a01b2457f4d6ac2a217f2fa8a2e5d5f - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Nick Hill eb6d3c264d [Core] Eliminate parallel worker per-step task scheduling overhead (#4894 )

2024-05-23 06:17:27 +09:00

..

__init__.py

Add distributed model executor abstraction (#3191 )

2024-03-11 11:03:45 -07:00

cpu_executor.py

[Misc][Refactor] Introduce ExecuteModelData (#4540 )

2024-05-03 17:47:07 -07:00

distributed_gpu_executor.py

[Core] Eliminate parallel worker per-step task scheduling overhead (#4894 )

2024-05-23 06:17:27 +09:00

executor_base.py

[Core] Eliminate parallel worker per-step task scheduling overhead (#4894 )

2024-05-23 06:17:27 +09:00

gpu_executor.py

[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )

2024-05-16 00:53:51 -07:00

multiproc_gpu_executor.py

[Core] Eliminate parallel worker per-step task scheduling overhead (#4894 )

2024-05-23 06:17:27 +09:00

multiproc_worker_utils.py

[Misc] centralize all usage of environment variables (#4548 )

2024-05-02 11:13:25 -07:00

neuron_executor.py

[Misc][Refactor] Introduce ExecuteModelData (#4540 )

2024-05-03 17:47:07 -07:00

ray_gpu_executor.py

[Core] Eliminate parallel worker per-step task scheduling overhead (#4894 )

2024-05-23 06:17:27 +09:00

ray_utils.py

[Core] Add MultiprocessingGPUExecutor (#4539 )

2024-05-14 10:38:59 -07:00