vllm/tests/models/quantization at 73f48ce559e230fd0d738c52cb2e99bd0dd08754 - vllm

Files

Michael Goin db5d0719e1 [Kernel] Add MXFP8 to Marlin GEMM/MoE and refactor Mxfp8LinearOp (#34664 )

Signed-off-by: mgoin <mgoin64@gmail.com>

2026-04-01 09:41:42 -07:00

__init__.py

2025-04-30 23:03:08 -07:00

test_awq.py

2026-02-07 05:24:40 -08:00

test_bitsandbytes.py

2026-03-12 18:03:25 +00:00

test_fp8.py

2026-01-09 13:10:24 -08:00

test_gguf.py

2025-12-03 10:33:46 +00:00

test_gpt_oss.py

2026-03-31 22:32:54 +08:00

test_gptq_marlin.py

2025-10-05 07:06:22 -07:00

test_modelopt.py

2025-10-05 07:06:22 -07:00

test_mxfp4.py

2025-10-05 07:06:22 -07:00

test_mxfp8.py

2026-04-01 09:41:42 -07:00

test_nvfp4.py

2026-01-13 15:22:53 -08:00