vllm/vllm/engine at 3963a5335bb4106f2ecd1139527e3568d2151933 - vllm

Files

Swapnil Parekh 4d6ada947c [CORE] Adding support for insertion of soft-tuned prompts (#4645 )

Co-authored-by: Swapnil Parekh <swapnilp@ibm.com>
Co-authored-by: Joe G <joseph.granados@h2o.ai>
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>

2024-07-09 13:26:36 -07:00

output_processor

[Core] Pipeline Parallel Support (#4412 )

2024-07-02 10:58:08 -07:00

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

[CORE] Adding support for insertion of soft-tuned prompts (#4645 )

2024-07-09 13:26:36 -07:00

async_llm_engine.py

[CORE] Adding support for insertion of soft-tuned prompts (#4645 )

2024-07-09 13:26:36 -07:00

async_timeout.py

[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654 )

2024-06-19 13:57:12 -07:00

llm_engine.py

[CORE] Adding support for insertion of soft-tuned prompts (#4645 )

2024-07-09 13:26:36 -07:00

metrics.py

[Speculative Decoding 2/2 ] Integrate typical acceptance sampler into Spec Decode Worker (#5348 )

2024-07-01 00:33:05 -07:00