Logo
Explore Help
Register Sign In
biondizzle/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions 2 Packages Projects Releases Wiki Activity
Files
b6c16cf8ff8d558ec943f1f17342c2c081f3f5af
vllm/csrc/quantization/fp8
History
Michael Goin 47f0954af0 [Kernel] Expand FP8 support to Ampere GPUs using FP8 Marlin (#5975)
2024-07-03 17:38:00 +00:00
..
amd
[CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722)
2024-05-22 07:18:41 +00:00
nvidia
[CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722)
2024-05-22 07:18:41 +00:00
common.cu
[Kernel] Vectorized FP8 quantize kernel (#5396)
2024-06-12 14:07:26 -07:00
fp8_marlin.cu
[Kernel] Expand FP8 support to Ampere GPUs using FP8 Marlin (#5975)
2024-07-03 17:38:00 +00:00
Powered by Gitea Version: 1.25.2 Page: 76ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API