This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
4,791
Commits
2
Branches
140
Tags
2f42a4888cacc979bc542b77b43b8c1a7ea4e76e
Commit Graph
3 Commits
Author
SHA1
Message
Date
leoneo
839b27c6cc
[Kernel]Add streamK for block-quantized CUTLASS kernels (
#12978
)
2025-02-20 22:14:24 -08:00
Tyler Michael Smith
c1e37bf71b
[Kernel][Bugfix] Refactor and Fix CUTLASS 2:4 Sparse Kernels (
#13198
)
...
Signed-off-by: Tyler Michael Smith <
tyler@neuralmagic.com
>
2025-02-14 00:01:14 +00:00
Lucas Wilkinson
9798b2fb00
[Kernel] Update
cutlass_scaled_mm
to support 2d group (blockwise) scaling (
#11868
)
2025-01-30 18:33:00 -08:00