vllm/tests/models/quantization at d53cb9cb8e9eac9d38a3b7bf027df2a6ef15def4 - vllm

Files

EdalatiAli e5b807607c [Quant][Feature] Support online MXFP8 quantization for MoE and dense models (#35448 )

Signed-off-by: EdalatiAli <aliedalati@cohere.com>

2026-03-16 18:07:39 -04:00

__init__.py

2025-04-30 23:03:08 -07:00

test_awq.py

2026-02-07 05:24:40 -08:00

test_bitsandbytes.py

2026-03-12 18:03:25 +00:00

test_fp8.py

2026-01-09 13:10:24 -08:00

test_gguf.py

2025-12-03 10:33:46 +00:00

test_gpt_oss.py

2026-03-07 13:50:17 -08:00

test_gptq_marlin.py

2025-10-05 07:06:22 -07:00

test_modelopt.py

2025-10-05 07:06:22 -07:00

test_mxfp4.py

2025-10-05 07:06:22 -07:00

test_mxfp8.py

2026-03-16 18:07:39 -04:00

test_nvfp4.py

2026-01-13 15:22:53 -08:00