vllm/vllm/model_executor at 32ef4983cd029d613172dbcf1edf91e62920bbc8

biondizzle/vllm

Lucas Wilkinson d47807ba08 [Attention] Remove slow setattr in MLA (#14769)

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>

2025-03-13 21:31:14 +00:00

guided_decoding

[Bugfix][Structured Output] Support outlines engine with reasoning outputs for DeepSeek R1 (#14114)

2025-03-06 03:49:20 +00:00

[Attention] Remove slow setattr in MLA (#14769)

2025-03-13 21:31:14 +00:00

[BugFix][TritonMLA] Process weights after model loading for GGUF (#14555)

2025-03-12 20:14:36 -07:00

[Bugfix] Fix prompt format of GLM4V (#14539)

2025-03-13 11:37:17 +00:00

__init__.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

custom_op.py

[Neuron] Add custom_ops for neuron backend (#13246)

2025-02-25 11:47:49 -08:00

parameter.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

pooling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

sampling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00