biondizzle/vllm
a307ac073432d3f224c58a54d363fa93f6a659f4
vllm/vllm/model_executor
xuebwang-amd 629584bfc9 [Kernel][MoE] fix computation order of MoE weight multiplication and improve flow (#31962)
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
2026-01-12 17:17:30 -05:00
| Name | Last commit | Date |
| --- | --- | --- |
| layers | [Kernel][MoE] fix computation order of MoE weight multiplication and improve flow (#31962) | 2026-01-12 17:17:30 -05:00 |
| model_loader | [FixBug] Improve exception string in tensorizer.py (#31680) | 2026-01-11 05:01:53 -08:00 |
| models | [BUGFIX] Add missed remaping of the names of fp8 kv-scale (#32199) | 2026-01-12 20:42:06 +00:00 |
| warmup | [UX] Reduce DeepGEMM warmup log output to single progress bar (#30903) | 2025-12-17 20:21:51 -08:00 |
| __init__.py | [Platform] Deprecate seed_everything (#31659) | 2026-01-04 18:34:04 -08:00 |
| custom_op.py | [Doc] Add developer guide for CustomOp (#30886) | 2026-01-09 16:21:11 +00:00 |
| parameter.py | [Docs] Replace rst style double-backtick with md single-backtick (#27091) | 2025-10-17 02:47:34 -07:00 |
| utils.py | [Platform] Deprecate seed_everything (#31659) | 2026-01-04 18:34:04 -08:00 |