Robert Shaw
42135d6898
[MoE Refactor] Oracle Select FP8+NVFP4 Kernels In Priority ( #32414 )
2026-01-21 08:22:33 -05:00
Micah Williamson
22375f8d13
[ROCm][CI] Remove DS async eplb accuracy test from AMD CI ( #32717 )
...
Signed-off-by: Micah Williamson <micah.williamson@amd.com >
2026-01-20 13:40:48 -08:00
vllmellm
148117ea2e
[Refactor] Make FP8 Linear Ops use kernel abstraction ( #27814 )
...
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com >
2026-01-20 14:48:20 +08:00
Matthew Bonanni
1a1fc3bbc0
[Attention][MLA] Make FLASHINFER_MLA the default MLA backend on Blackwell, and TRTLLM the default prefill ( #32615 )
...
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com >
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com >
2026-01-19 18:41:34 -05:00
Yanan Cao
9d1e611f0e
[CI] Add Helion as an optional dependency ( #32482 )
...
Signed-off-by: Yanan Cao <gmagogsfm@gmail.com >
2026-01-19 19:09:56 +00:00
qli88
a0490be8f1
[CI][amd] Revert NIXL connector change to avoid crash ( #32570 )
...
Signed-off-by: Qiang Li <qiang.li2@amd.com >
Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com >
2026-01-19 18:39:16 +00:00
Robert Shaw
afc3622602
[CI] Move Distributed Tests from H200 -> H100 ( #32555 )
2026-01-18 10:25:23 -08:00
Shengqi Chen
965765aef9
[build] fix cu130 related release pipeline steps and publish as nightly image ( #32522 )
...
Signed-off-by: Shengqi Chen <harry-chen@outlook.com >
2026-01-17 18:36:11 -08:00
Shengqi Chen
8e61425ee6
[CI] Implement uploading to PyPI and GitHub in the release pipeline, enable release image building for CUDA 13.0 ( #31032 )
2026-01-17 04:52:33 +00:00
Lucas Wilkinson
ca21288080
[CI] Fix OOM in Hopper Fusion E2E Tests (H100) ( #32489 )
...
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com >
2026-01-16 21:27:16 +00:00
Lucas Wilkinson
14ce524249
[CI] Breakup h200 tests ( #30499 )
...
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com >
2026-01-16 06:23:22 +00:00
TJian
41c544f78a
[ROCm] [CI] [Release] Rocm wheel pipeline with sccache ( #32264 )
...
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com >
2026-01-16 02:56:18 +08:00
Douglas Lehr
c5891b5430
[ROCM] Add ROCm image build to release pipeline ( #31995 )
...
Signed-off-by: Doug Lehr <douglehr@amd.com >
Co-authored-by: Doug Lehr <douglehr@amd.com >
2026-01-15 19:01:40 +08:00
Micah Williamson
6fa6e7ef0c
[ROCm][CI] Disable Async Scheduling For Qwen3-Next-80B-A3B-Instruct MTP Async EPLB Accuracy Test ( #32275 )
...
Signed-off-by: Micah Williamson <micah.williamson@amd.com >
2026-01-14 13:29:42 +08:00
Roberto L. Castro
8ef50d9a6b
[Kernel][Performance] Enable smaller Scaling Factor tiling for NVFP4 small-batch decoding ( #30885 )
...
Signed-off-by: LopezCastroRoberto <roberto.lopez.castro@udc.es >
Signed-off-by: Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com >
Signed-off-by: LopezCastroRoberto <rocastro@redhat.com >
2026-01-13 15:22:53 -08:00
Simon Mo
0db574b185
[Build] Add scripts for cherry-picking and trigger build ( #32282 )
...
Co-authored-by: Cursor Agent <cursoragent@cursor.com >
2026-01-13 13:21:05 -08:00
Divakar Verma
a28d9f4470
[ROCm][CI] Handle pytest status code 5 when a shard isn't allocated any tests ( #32040 )
...
Signed-off-by: Divakar Verma <divakar.verma@amd.com >
2026-01-12 17:35:49 -05:00
Cyrus Leung
a374532111
[CI/Build] Separate out flaky responses API tests ( #32110 )
...
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk >
2026-01-11 05:01:12 -08:00
Matt
bde57ab2ed
[Hardware][AMD][CI][Bugfix] Fix AMD Quantization test group ( #31713 )
...
Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com >
2026-01-10 23:19:46 -08:00
Micah Williamson
ac0675ff6b
[CI] Allow Deprecated Quantization For LM Eval Tests ( #32065 )
...
Signed-off-by: Micah Williamson <micah.williamson@amd.com >
2026-01-09 19:10:47 -07:00
Matthew Bonanni
2612ba9285
[1/N][Attention] Restructure attention: move files ( #31916 )
...
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com >
2026-01-09 13:10:24 -08:00
Nicolò Lucchesi
83e1c76dbe
[CI][ROCm] Fix NIXL tests on ROCm ( #31728 )
...
Signed-off-by: NickLucche <nlucches@redhat.com >
2026-01-09 01:34:43 +08:00
TJian
72c068b8e0
[CI] [Bugfix] Fix unbounded variable in run-multi-node-test.sh ( #31967 )
...
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com >
2026-01-08 05:42:01 -08:00
rasmith
f1b1bea5c3
[CI][BugFix][AMD] Actually skip tests marked @pytest.mark.skip_v1 ( #31873 )
...
Signed-off-by: Randall Smith <ransmith@amd.com >
Co-authored-by: Randall Smith <ransmith@amd.com >
2026-01-08 13:06:09 +08:00
Charlie Fu
cddbc2b4b2
[ROCm][CI] Add rocm support for run-multi-node-test.sh ( #31922 )
...
Signed-off-by: charlifu <charlifu@amd.com >
Signed-off-by: Charlie Fu <Charlie.Fu@amd.com >
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-01-08 04:36:39 +00:00
Robert Shaw
5dcd7ef1f2
[MoE Refactor][15/N] Apply Refactor to Fp8 ( #31415 )
2026-01-07 19:42:33 -05:00
Andreas Karatzas
364a8bc6dc
[ROCm][CI] Fix plugin tests (2 GPUs) failures on ROCm and removing VLLM_FLOAT32_MATMUL_PRECISION from all ROCm tests ( #31829 )
...
Signed-off-by: Andreas Karatzas <akaratza@amd.com >
2026-01-07 01:12:23 +00:00
Robert Shaw
d3e477c013
[MoE Refactor] Add Temporary Integration Tests - H100/B200 ( #31759 )
...
Signed-off-by: Robert Shaw <robshaw@redhat.com >
Co-authored-by: Robert Shaw <robshaw@redhat.com >
2026-01-06 10:34:17 -05:00
Michael Goin
ccb309a964
Revert "[CI Failure] Disable B200 tests while runner is broken" ( #31750 )
...
Signed-off-by: Michael Goin <mgoin64@gmail.com >
2026-01-05 17:26:33 -08:00
Matthew Bonanni
276e03b92c
[CI][DeepSeek] Add nightly DeepSeek R1 lm_eval tests on H200 ( #30356 )
...
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com >
Co-authored-by: Michael Goin <mgoin64@gmail.com >
2026-01-05 17:17:59 -05:00
Michael Goin
eefa713a66
[CI Failure] Disable B200 tests while runner is broken ( #31732 )
...
Signed-off-by: mgoin <mgoin64@gmail.com >
2026-01-05 08:50:51 -08:00
Andreas Karatzas
89f1f25310
[CI] Skip Phi-MoE test due to old API util ( #31632 )
...
Signed-off-by: Andreas Karatzas <akaratza@amd.com >
2026-01-05 08:52:07 +08:00
Andreas Karatzas
5cc4876630
[ROCm][CI] Fix failure in Language Models Tests (Extra Standard) by reducing agent pool size ( #31553 )
...
Signed-off-by: Andreas Karatzas <akaratza@amd.com >
2026-01-01 19:29:42 -08:00
TJian
578c8f51f6
[CI] [Critical] [CUDA] Fix duplicated test name ( #31562 )
...
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com >
2025-12-30 21:01:09 -08:00
Nicolò Lucchesi
ab1af6aa3e
[CI][NIXL] Split DPEP tests ( #31491 )
...
Signed-off-by: NickLucche <nlucches@redhat.com >
2025-12-30 07:26:12 -05:00
qli88
0f35429a0c
[CI]Test Group 'NixlConnector PD accuracy tests' is fixed ( #31460 )
...
Signed-off-by: qli88 <qiang.li2@amd.com >
2025-12-29 23:48:56 +00:00
Li, Jiang
17347daaa2
[CI/Build][CPU] Update CPU CI test cases ( #31466 )
...
Signed-off-by: jiang1.li <jiang1.li@intel.com >
2025-12-29 14:17:52 +08:00
Andreas Karatzas
f70368867e
[ROCm][CI] Add TorchCodec source build for transcription tests ( #31323 )
...
Signed-off-by: Andreas Karatzas <akaratza@amd.com >
2025-12-28 16:06:05 +08:00
Louie Tsai
f15185fbdb
[Benchmark Suite] improve cpu Benchmark Suite tests and comparison report for 0.12.0 ( #30994 )
...
Signed-off-by: Tsai, Louie <louie.tsai@intel.com >
2025-12-25 08:51:45 +00:00
Cyrus Leung
d201807339
[Chore] Bump lm-eval version ( #31264 )
...
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk >
2025-12-24 05:39:13 -08:00
Micah Williamson
6559d96796
[ROCm][CI] Set TORCH_NCCL_BLOCKING_WAIT Distributed Tests On ROCm ( #31259 )
...
Signed-off-by: Micah Williamson <micah.williamson@amd.com >
2025-12-24 07:19:07 +00:00
kliuae
1c74150bca
[ROCm][CI] Fix "Distributed Tests (H200)" Test ( #31227 )
...
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com >
2025-12-24 06:56:30 +00:00
Micah Williamson
3ce791ac77
[ROCm][CI] Set VLLM_FLOAT32_MATMUL_PRECISION="tf32" For terratorch Tests In AMD CI ( #31242 )
...
Signed-off-by: Micah Williamson <micah.williamson@amd.com >
2025-12-24 03:21:50 +00:00
Micah Williamson
a5bc77c253
[AMD][CI] Add "V1 Test e2e + engine" to mi325_8 Agent Pool ( #31040 )
...
Signed-off-by: Micah Williamson <micah.williamson@amd.com >
2025-12-22 10:41:56 -05:00
Shengqi Chen
2cf91c2ea4
[CI] add polling for precompiled wheel in python_only_compile.sh, fix index generation for releases ( #30781 )
...
Signed-off-by: Shengqi Chen <harry-chen@outlook.com >
2025-12-22 13:24:21 +00:00
Kevin McKay
8c084de59d
[Misc] Fix spelling typos in comments ( #31114 )
...
Signed-off-by: c0de128 <kevin.mckay@outlook.com >
2025-12-21 21:13:14 -08:00
Lucas Wilkinson
7e065eba59
[CI] Fix "2 Node Tests (4 GPUs in total)" ( #31090 )
...
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com >
2025-12-22 10:32:40 +08:00
Ameen Patel
93cabc417c
ci: add nvidia-smi warmup before Prime-RL integration test ( #31093 )
...
Signed-off-by: AmeenP <ameenp360@gmail.com >
2025-12-21 15:43:01 +00:00
Lucas Wilkinson
ae0770fa6b
[CI] Fix H200 Distributed test ( #31054 )
...
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com >
2025-12-20 16:48:49 -05:00
Li, Jiang
420ba2dbb6
Enable aarch64 CPU performance benchmarks ( #26494 )
...
Signed-off-by: Ioana Ghiban <ioana.ghiban@arm.com >
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com >
Co-authored-by: Ioana Ghiban <ioana.ghiban@arm.com >
Co-authored-by: Fadi Arafeh <fadi.arafeh@arm.com >
2025-12-19 12:16:18 +00:00