vllm/csrc/quantization/fp8 at 9da4aad44b7878032ef2bb32eb1b4e1ab86f8351 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Michael Goin 47f0954af0 [Kernel] Expand FP8 support to Ampere GPUs using FP8 Marlin (#5975 )

2024-07-03 17:38:00 +00:00

..

[CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722 )

2024-05-22 07:18:41 +00:00

[CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722 )

2024-05-22 07:18:41 +00:00

common.cu

[Kernel] Vectorized FP8 quantize kernel (#5396 )

2024-06-12 14:07:26 -07:00

fp8_marlin.cu

[Kernel] Expand FP8 support to Ampere GPUs using FP8 Marlin (#5975 )

2024-07-03 17:38:00 +00:00