This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
4,867
Commits
2
Branches
140
Tags
b526ca672630e4dfd63173161dcc3eed5821e2b2
Commit Graph
3 Commits
Author
SHA1
Message
Date
leoneo
839b27c6cc
[Kernel]Add streamK for block-quantized CUTLASS kernels (
#12978
)
2025-02-20 22:14:24 -08:00
Tyler Michael Smith
c1e37bf71b
[Kernel][Bugfix] Refactor and Fix CUTLASS 2:4 Sparse Kernels (
#13198
)
...
Signed-off-by: Tyler Michael Smith <
tyler@neuralmagic.com
>
2025-02-14 00:01:14 +00:00
Lucas Wilkinson
9798b2fb00
[Kernel] Update
cutlass_scaled_mm
to support 2d group (blockwise) scaling (
#11868
)
2025-01-30 18:33:00 -08:00