vllm/tests/quantization at 1b67b0465647490c6abf2afe330a0cbb2eb2949b - vllm

Files

Michael Goin f708bd4904 [CI] Add E2E Blackwell Quantized MoE Test (#25723 )

Signed-off-by: mgoin <mgoin64@gmail.com>

2025-09-26 12:23:00 -07:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

reference_mxfp4.py

[Feature][Quantization] MXFP4 support for MOE models (#17888 )

2025-07-09 13:19:02 -07:00

test_auto_round.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_blackwell_moe.py

[CI] Add E2E Blackwell Quantized MoE Test (#25723 )

2025-09-26 12:23:00 -07:00

test_compressed_tensors.py

Revert "[Performance] Move apply_w8a8_block_fp8_linear to an op class… (#25607 )

2025-09-25 08:05:21 +00:00

test_configs.py

[Kernel/Quant] Remove the original marlin format and qqq (#23204 )

2025-08-20 15:13:36 -04:00

test_cpu_offload.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_experts_int8.py

Update transformers to v4.55 (#21931 )

2025-08-05 22:56:14 -07:00

test_fp8.py

[V1] Support LLM.apply_model (#18465 )

2025-09-20 07:14:35 +00:00

test_gptq_dynamic.py

[V1] Support LLM.apply_model (#18465 )

2025-09-20 07:14:35 +00:00

test_ipex_quant.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_lm_head.py

[V1] Support LLM.apply_model (#18465 )

2025-09-20 07:14:35 +00:00

test_modelopt.py

[V1] Support LLM.apply_model (#18465 )

2025-09-20 07:14:35 +00:00

test_ptpc_fp8.py

[V1] Support LLM.apply_model (#18465 )

2025-09-20 07:14:35 +00:00

test_quark.py

[V1] Support LLM.apply_model (#18465 )

2025-09-20 07:14:35 +00:00

test_register_quantization_config.py

[V1] Support LLM.apply_model (#18465 )

2025-09-20 07:14:35 +00:00

test_rtn.py

[Feature] Add support for MoE models in the calibration-free RTN-based quantization (#20766 )

2025-07-25 18:09:34 -07:00

test_torchao.py

[torchao] Support quantization configs using module swap (#21982 )

2025-09-10 23:53:24 -07:00

utils.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00