vllm/tests/models/quantization at e812bf70bd668b4d28e7135ae1577d252c08ee5c - vllm

Files

EdalatiAli e5b807607c [Quant][Feature] Support online MXFP8 quantization for MoE and dense models (#35448 )

Signed-off-by: EdalatiAli <aliedalati@cohere.com>

2026-03-16 18:07:39 -04:00

__init__.py

2025-04-30 23:03:08 -07:00

test_awq.py

2026-02-07 05:24:40 -08:00

test_bitsandbytes.py

2026-03-12 18:03:25 +00:00

test_fp8.py

2026-01-09 13:10:24 -08:00

test_gguf.py

2025-12-03 10:33:46 +00:00

test_gpt_oss.py

2026-03-07 13:50:17 -08:00

test_gptq_marlin.py

2025-10-05 07:06:22 -07:00

test_modelopt.py

2025-10-05 07:06:22 -07:00

test_mxfp4.py

2025-10-05 07:06:22 -07:00

test_mxfp8.py

2026-03-16 18:07:39 -04:00

test_nvfp4.py

2026-01-13 15:22:53 -08:00