[Kernel] some optimizations for dense marlin and moe marlin (#16850)
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
This commit is contained in:
1
csrc/quantization/gptq_marlin/.gitignore
vendored
Normal file
1
csrc/quantization/gptq_marlin/.gitignore
vendored
Normal file
@@ -0,0 +1 @@
|
||||
kernel_*.cu
|
||||
Reference in New Issue
Block a user