biondizzle/vllm
ea37530b474fa738a99a53a8975af4e389b968c7
vllm/tests/v1/cudagraph
Lucas Wilkinson  c7914d30f9  Reapply [Attention][FA3] Update FA3 to include new swizzle optimization (#34043)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
2026-02-11 07:07:56 -08:00
..
__init__.py  [Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059)  2025-08-15 10:01:39 -04:00
test_cudagraph_dispatch.py  Reapply [Attention][FA3] Update FA3 to include new swizzle optimization (#34043)  2026-02-11 07:07:56 -08:00
test_cudagraph_mode.py  [Attention] Update tests to remove deprecated env vars (#30563)  2025-12-17 09:49:59 -08:00