This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
f62cad6431e2bce91c033c74e20835c8e0c9b288
vllm
/
csrc
/
quantization
/
cutlass_w8a8
History
TherLF
c12df53b60
[Bugfix] Fix cutlass dispatch for fp8/int8 to properly invoke M<=16 c… (
#16751
)
...
Signed-off-by: Ther-LF <
2639852836@qq.com
>
2025-04-27 19:38:42 -07:00
..
c3x
…
moe
[Kernel] Add expert_map support to Cutlass FP8 MOE (
#16861
)
2025-04-21 20:44:32 -07:00
Epilogues.md
…
scaled_mm_c2x_sm75_dispatch.cuh
…
scaled_mm_c2x_sm80_dispatch.cuh
…
scaled_mm_c2x_sm89_fp8_dispatch.cuh
[Bugfix] Fix cutlass dispatch for fp8/int8 to properly invoke M<=16 c… (
#16751
)
2025-04-27 19:38:42 -07:00
scaled_mm_c2x_sm89_int8_dispatch.cuh
[Bugfix] Fix cutlass dispatch for fp8/int8 to properly invoke M<=16 c… (
#16751
)
2025-04-27 19:38:42 -07:00
scaled_mm_c2x.cu
…
scaled_mm_c2x.cuh
…
scaled_mm_c3x_sm90.cu
[Build/BugFix] Fix hopper 12.8 build (
#14354
)
2025-03-08 08:11:56 +00:00
scaled_mm_c3x_sm100.cu
[Build/BugFix] Fix hopper 12.8 build (
#14354
)
2025-03-08 08:11:56 +00:00
scaled_mm_entry.cu
[Kernel] CUTLASS grouped gemm fp8 MoE kernel (
#13972
)
2025-03-27 00:54:44 +00:00