This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
10e94c84f6ce4e5f1dff8a2fefc242e9ee687d0c
vllm
/
csrc
/
quantization
/
gptq_marlin
History
Robert Shaw
85f55c943c
[Quantization][Deprecation] Deprecate HQQ (
#32681
)
...
Signed-off-by: Robert Shaw <
robshaw@redhat.com
> Co-authored-by: Robert Shaw <
robshaw@redhat.com
>
2026-01-21 09:32:40 -05:00
..
.gitignore
[Kernel][Quantization][MoE] add marlin kernel support for turing (sm75) (
#29901
)
2025-12-16 14:35:28 -08:00
awq_marlin_repack.cu
[Kernel][Quantization] add w4a8 support for marlin kernel (
#24722
)
2025-11-29 07:19:33 -08:00
dequant.h
[Kernel][Quantization][MoE] add marlin kernel support for turing (sm75) (
#29901
)
2025-12-16 14:35:28 -08:00
generate_kernels.py
[Quantization][Deprecation] Deprecate HQQ (
#32681
)
2026-01-21 09:32:40 -05:00
gptq_marlin_repack.cu
[Kernel][Quantization] add w4a8 support for marlin kernel (
#24722
)
2025-11-29 07:19:33 -08:00
gptq_marlin.cu
[Kernel][Quantization][MoE] add marlin kernel support for turing (sm75) (
#29901
)
2025-12-16 14:35:28 -08:00
kernel.h
[Kernel][Quantization] add w4a8 support for marlin kernel (
#24722
)
2025-11-29 07:19:33 -08:00
marlin_dtypes.cuh
[Kernel][Quantization] add w4a8 support for marlin kernel (
#24722
)
2025-11-29 07:19:33 -08:00
marlin_int4_fp8_preprocess.cu
[Kernel][Quantization] add w4a8 support for marlin kernel (
#24722
)
2025-11-29 07:19:33 -08:00
marlin_mma.h
[Kernel][Quantization][MoE] add marlin kernel support for turing (sm75) (
#29901
)
2025-12-16 14:35:28 -08:00
marlin_template.h
[Kernel][Quantization][MoE] add marlin kernel support for turing (sm75) (
#29901
)
2025-12-16 14:35:28 -08:00
marlin.cuh
[Kernel][Quantization][MoE] add marlin kernel support for turing (sm75) (
#29901
)
2025-12-16 14:35:28 -08:00