vllm/vllm/executor at bc0c0192d13ca6ea4aeea4725f752a89483895bc - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Isotr0py 0ce0539d47 [Bugfix] Fix Llava inference with Tensor Parallelism. (#3883 )

2024-04-07 22:54:13 +08:00

..

__init__.py

Add distributed model executor abstraction (#3191 )

2024-03-11 11:03:45 -07:00

cpu_executor.py

[Hardware][Intel] Add CPU inference backend (#3634 )

2024-04-01 22:07:30 -07:00

executor_base.py

[Speculative decoding] Adding configuration object for speculative decoding (#3706 )

2024-04-03 00:40:57 +00:00

gpu_executor.py

[Speculative decoding] Adding configuration object for speculative decoding (#3706 )

2024-04-03 00:40:57 +00:00

neuron_executor.py

[Speculative decoding] Adding configuration object for speculative decoding (#3706 )

2024-04-03 00:40:57 +00:00

ray_gpu_executor.py

[Bugfix] Fix Llava inference with Tensor Parallelism. (#3883 )

2024-04-07 22:54:13 +08:00

utils.py

Add distributed model executor abstraction (#3191 )

2024-03-11 11:03:45 -07:00