biondizzle / vllm
Files: vllm/model_executor/layers/quantization/kernels (at commit f6227c22ab8976a24913122874c24624102da1b4)
Latest commit: f6227c22ab by czhu-cohere: [Kernel]Support W4A8 Grouped GEMM on Hopper (#29691)
Signed-off-by: czhu-cohere <conway.zhu@cohere.com>
2025-12-08 19:29:06 -08:00
mixed_precision    [Kernel]Support W4A8 Grouped GEMM on Hopper (#29691)    2025-12-08 19:29:06 -08:00
scaled_mm          [ROCM] Fix ROCm warnings, environment flag access, and GEMM kernel naming for consistency in _aiter_ops.py (#28464)    2025-11-12 21:46:57 +00:00
__init__.py        [TPU][Quantization] TPU W8A8 (#11785)    2025-01-08 19:33:29 +00:00