biondizzle/vllm
vllm/vllm/model_executor at commit faedbb4d4fe4a56e111d23c9d657a1ef47cd7981
Latest commit: faedbb4d4f by Paul Zhang, 2025-11-05 10:04:49 -08:00
[Feature] Extend batch invariant torch.compile to B200 (#27856)
Signed-off-by: PaulZhang12 <paulzhan@fb.com>
| Name | Last commit | Date |
| --- | --- | --- |
| layers | [Feature] Extend batch invariant torch.compile to B200 (#27856) | 2025-11-05 10:04:49 -08:00 |
| model_loader | [V0 deprecation] Remove VLLM_USE_V1 usage in most modules (#27955) | 2025-11-04 20:51:16 -08:00 |
| models | [FlashInfer] Avoid FlashInfer block_size 16 + head_size 256 on blackwell (#27994) | 2025-11-05 09:25:32 -08:00 |
| warmup | [Kernels] Isolate modular kernel code from FusedMoEMethodBase subclasses. (#27123) | 2025-11-04 21:59:45 +08:00 |
| __init__.py | Convert formatting to use `ruff` instead of `yapf` + `isort` (#26247) | 2025-10-05 07:06:22 -07:00 |
| custom_op.py | [FrontEnd] UNREVERT CompilationConfig overhaul (#20283): deprecate use_inductor in favor of backend, simplify custom_ops (#26502) | 2025-10-13 22:47:16 +00:00 |
| parameter.py | [Docs] Replace `rst` style double-backtick with `md` single-backtick (#27091) | 2025-10-17 02:47:34 -07:00 |
| utils.py | [Chore] Clean up pytorch helper functions in `vllm.utils` (#26908) | 2025-10-18 09:48:22 -07:00 |