biondizzle/vllm
537d5ee0251bcfcedbb8ca934d273366e05f80fa
vllm/vllm/model_executor
rasmith 68af5f6c5c [AMD][FP8][BugFix] Remove V1 check in arg_utils.py for FP8 since it is not necessary (#17215)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
2025-04-25 19:55:05 -07:00
Name | Last commit | Date
guided_decoding | [Bugfix] remove fallback in guided_json (int range, patterns) (#16725) | 2025-04-25 06:54:43 +00:00
layers | [AMD][FP8][BugFix] Remove V1 check in arg_utils.py for FP8 since it is not necessary (#17215) | 2025-04-25 19:55:05 -07:00
model_loader | More informative error when using Transformers backend (#16988) | 2025-04-23 19:54:03 -07:00
models | [Bugfix] gemma[2,3] interleaved attention when sliding window is disabled (#17180) | 2025-04-25 19:53:51 -07:00
__init__.py | [Misc] Add SPDX-License-Identifier headers to python source files (#12628) | 2025-02-02 11:58:18 -08:00
custom_op.py | [Neuron] Add custom_ops for neuron backend (#13246) | 2025-02-25 11:47:49 -08:00
parameter.py | [Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036) | 2025-04-22 09:01:36 +01:00
pooling_metadata.py | [Misc] Add SPDX-License-Identifier headers to python source files (#12628) | 2025-02-02 11:58:18 -08:00
sampling_metadata.py | [Bugfix] Fix extra comma (#15851) | 2025-03-31 22:57:28 -07:00
utils.py | [Misc] Add SPDX-License-Identifier headers to python source files (#12628) | 2025-02-02 11:58:18 -08:00