vllm/vllm/attention at 4abfd8796f37adc8fccc9481f37f20de1bce62e4 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Michael Goin f81c1bb055 [Bugfix] Check NVIDIA artifactory is accessible before using flashinfer cubin kernels (#21893 )

2025-08-01 08:28:45 -04:00

..

[Bugfix] Check NVIDIA artifactory is accessible before using flashinfer cubin kernels (#21893 )

2025-08-01 08:28:45 -04:00

[V0 Deprecation] Deprecate BlockSparse Attention & Phi3-Small (#21217 )

2025-07-19 13:53:17 -07:00

[MISC] Add init files for python package (#20908 )

2025-07-15 12:16:33 +00:00

__init__.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

layer.py

[V1] Fix local chunked attention always disabled (#21419 )

2025-07-23 15:59:30 -07:00

selector.py

[V0 Deprecation] Deprecate BlockSparse Attention & Phi3-Small (#21217 )

2025-07-19 13:53:17 -07:00