vllm/tests/compile/piecewise at 56dcf4e7e965e34043acf20ca4e4aceda21d41ec - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Yong Hoon Shin dfd2382039 [torch.compile] Support conditional torch.compile per module (#22269 )

Signed-off-by: Yong Hoon Shin <yhshin@meta.com>

2025-08-20 16:52:59 +00:00

..

__init__.py

[torch.compile] rework compile control with piecewise cudagraph (#9715 )

2024-10-29 23:03:49 -07:00

test_full_cudagraph.py

[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059 )

2025-08-15 10:01:39 -04:00

test_multiple_graphs.py

[torch.compile] Support conditional torch.compile per module (#22269 )

2025-08-20 16:52:59 +00:00

test_simple.py

[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059 )

2025-08-15 10:01:39 -04:00

test_toy_llama.py

[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059 )

2025-08-15 10:01:39 -04:00