biondizzle/vllm
0ada960a20766fa1de9063576979f9d807583785
vllm/vllm/model_executor
Xin Yang 0ada960a20 [Kernel] Support bias type in grouped_topk kernel (#31781)
Signed-off-by: Xin Yang <xyangx@amazon.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
2026-01-07 12:16:32 -08:00
layers        [Kernel] Support bias type in grouped_topk kernel (#31781)  2026-01-07 12:16:32 -08:00
model_loader  Enable quantized attention in NemotronH models (#31898)  2026-01-07 17:37:19 +00:00
models        Enable quantized attention in NemotronH models (#31898)  2026-01-07 17:37:19 +00:00
warmup        [UX] Reduce DeepGEMM warmup log output to single progress bar (#30903)  2025-12-17 20:21:51 -08:00
__init__.py   [Platform] Deprecate seed_everything (#31659)  2026-01-04 18:34:04 -08:00
custom_op.py  [Bugfix][CPU] Fix RotaryEmbedding fallback causing gibberish with --enforce-eager (#31643)  2026-01-06 01:25:38 +08:00
parameter.py  [Docs] Replace rst style double-backtick with md single-backtick (#27091)  2025-10-17 02:47:34 -07:00
utils.py      [Platform] Deprecate seed_everything (#31659)  2026-01-04 18:34:04 -08:00