vllm/vllm/model_executor at 55211b01e87a02bdd0045b455715dfe508580738 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Szymon Ożóg 55211b01e8 [Bugfix] Fix chunked prefill for GGUF (#14666 )

Signed-off-by: SzymonOzog <szymon.ozog@aleph-alpha.com>

2025-03-13 07:19:03 +00:00

..

guided_decoding

[Bugfix][Structured Output] Support outlines engine with reasoning outputs for DeepSeek R1 (#14114 )

2025-03-06 03:49:20 +00:00

[Bugfix] Fix chunked prefill for GGUF (#14666 )

2025-03-13 07:19:03 +00:00

[BugFix][TritonMLA] Process weights after model loading for GGUF (#14555 )

2025-03-12 20:14:36 -07:00

[Quant] Bamba SupportsQuant (#14698 )

2025-03-13 04:57:05 +00:00

__init__.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

custom_op.py

[Neuron] Add custom_ops for neuron backend (#13246 )

2025-02-25 11:47:49 -08:00

parameter.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

pooling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

sampling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00