This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
bde57ab2edb67533158a762823fba31e32f71a85
vllm
/
vllm
/
model_executor
History
Matt
bde57ab2ed
[Hardware][AMD][CI][Bugfix] Fix AMD Quantization test group (
#31713
)
...
Signed-off-by: Matthew Wong <
Matthew.Wong2@amd.com
>
2026-01-10 23:19:46 -08:00
..
layers
[Hardware][AMD][CI][Bugfix] Fix AMD Quantization test group (
#31713
)
2026-01-10 23:19:46 -08:00
model_loader
[Core] Use weights_only=True with torch.load (
#32045
)
2026-01-10 00:28:57 +00:00
models
[MTP][GLM][Bugfix] Fixed .weight_scale loading logic that dropped MTP prediction accuracy with fp8+mtp (
#32101
)
2026-01-10 23:14:54 -08:00
warmup
[UX] Reduce DeepGEMM warmup log output to single progress bar (
#30903
)
2025-12-17 20:21:51 -08:00
__init__.py
[Platform] Deprecate seed_everything (
#31659
)
2026-01-04 18:34:04 -08:00
custom_op.py
[Doc] Add developer guide for CustomOp (
#30886
)
2026-01-09 16:21:11 +00:00
parameter.py
[Docs] Replace
rst
style double-backtick with
md
single-backtick (
#27091
)
2025-10-17 02:47:34 -07:00
utils.py
[Platform] Deprecate seed_everything (
#31659
)
2026-01-04 18:34:04 -08:00