biondizzle/vllm
296f927f2493908984707354e3cc5d7b2e41650b
vllm/vllm/model_executor
Chih-Chieh Yang 296f927f24 [Model] RE: Mamba2 Prefill Performance Tweaks: Fixing Flurry of Unnecessary Memory Copies (#14857)
Signed-off-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
2025-03-20 19:21:08 -07:00
guided_decoding
[Core][V0] Add guidance backend for structured output (#14589)
2025-03-19 21:33:51 -07:00
layers
[Model] RE: Mamba2 Prefill Performance Tweaks: Fixing Flurry of Unnecessary Memory Copies (#14857)
2025-03-20 19:21:08 -07:00
model_loader
[Bugfix] Fix bnb quantization for models with both HF-format and Mistral-format weights (#14950)
2025-03-17 23:27:26 +00:00
models
[Bugfix] Fix incorrect qwen2.5-vl attention mask pre-computation (#15200)
2025-03-20 19:18:04 -07:00
__init__.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00
custom_op.py
[Neuron] Add custom_ops for neuron backend (#13246)
2025-02-25 11:47:49 -08:00
parameter.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00
pooling_metadata.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00
sampling_metadata.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00
utils.py
[Misc] Add SPDX-License-Identifier headers to python source files (#12628)
2025-02-02 11:58:18 -08:00