biondizzle/vllm
vllm/vllm/attention/ops/blocksparse_attention @ a17cef70eacba81577e1eaa91f2b5dd18624e5d5
Latest commit: f9bc5a0693 by Mengqing Cao, 2025-05-06 17:53:09 +08:00
[Bugfix] Fix triton import with local TritonPlaceholder (#17446)
Signed-off-by: Mengqing Cao <cmq0113@163.com>
__init__.py | [Kernel][Backend][Model] Blocksparse flash attention kernel and Phi-3-Small model (#4799) | 2024-05-24 22:00:52 -07:00
blocksparse_attention_kernel.py | [Bugfix] Fix triton import with local TritonPlaceholder (#17446) | 2025-05-06 17:53:09 +08:00
interface.py | [Misc] Add SPDX-License-Identifier headers to python source files (#12628) | 2025-02-02 11:58:18 -08:00
utils.py | [Bugfix] Fix triton import with local TritonPlaceholder (#17446) | 2025-05-06 17:53:09 +08:00