biondizzle/vllm
dc6b57846686206d6d77fe788f71ab7fe8e568ab
vllm/vllm/model_executor
Xin Yang dc6b578466 [Kernel] Add fused_sigmoid_gating_delta_rule_update kernel for Qwen3 Next (#35777)
Signed-off-by: Xin Yang <xyangx@amazon.com>
2026-03-08 23:41:01 -07:00
kernels/ | [Bugfix][CI] fix typos (#34934) | 2026-03-05 17:05:46 +00:00
layers/ | [Kernel] Add fused_sigmoid_gating_delta_rule_update kernel for Qwen3 Next (#35777) | 2026-03-08 23:41:01 -07:00
model_loader/ | fix: Use iterator as not to store all the file loads in memory at once (#36149) | 2026-03-08 20:25:21 -07:00
models/ | [Kernel] Add fused_sigmoid_gating_delta_rule_update kernel for Qwen3 Next (#35777) | 2026-03-08 23:41:01 -07:00
offloader/ | [UX] Remove NoOpOffloader log (#35678) | 2026-03-04 12:13:40 -08:00
warmup/ | [MoE Refactor] Create MK for TRTLLM Kernels (#32564) | 2026-03-03 10:39:50 -08:00
__init__.py | [Platform] Deprecate seed_everything (#31659) | 2026-01-04 18:34:04 -08:00
custom_op.py | [Model Bash][DSR1] Add selective dynamic shape marking for CustomOp (#34900) | 2026-02-21 19:28:01 -05:00
parameter.py | [QeRL] Layerwise Reloading (#32133) | 2026-01-30 08:50:05 -07:00
utils.py | [BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262) | 2026-01-29 16:52:11 +08:00