biondizzle/vllm
vllm/vllm/executor @ bc0c0192d13ca6ea4aeea4725f752a89483895bc
Isotr0py  0ce0539d47  [Bugfix] Fix Llava inference with Tensor Parallelism. (#3883)  2024-04-07 22:54:13 +08:00
File                 Last commit                                                                          Date
__init__.py          Add distributed model executor abstraction (#3191)                                   2024-03-11 11:03:45 -07:00
cpu_executor.py      [Hardware][Intel] Add CPU inference backend (#3634)                                  2024-04-01 22:07:30 -07:00
executor_base.py     [Speculative decoding] Adding configuration object for speculative decoding (#3706)  2024-04-03 00:40:57 +00:00
gpu_executor.py      [Speculative decoding] Adding configuration object for speculative decoding (#3706)  2024-04-03 00:40:57 +00:00
neuron_executor.py   [Speculative decoding] Adding configuration object for speculative decoding (#3706)  2024-04-03 00:40:57 +00:00
ray_gpu_executor.py  [Bugfix] Fix Llava inference with Tensor Parallelism. (#3883)                        2024-04-07 22:54:13 +08:00
utils.py             Add distributed model executor abstraction (#3191)                                   2024-03-11 11:03:45 -07:00