Andreas Karatzas
|
7b43db210c
|
[ROCm][CI][Bugfix] Multi-Modal Model Support Fixes and Attention Backend Improvements (#30270)
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
|
2025-12-19 02:17:27 +00:00 |
|
Nick Hill
|
45c0526ac9
|
[BugFix] Handle errors when preprocessing added requests (#30895)
Signed-off-by: Nick Hill <nhill@redhat.com>
|
2025-12-19 01:29:11 +00:00 |
|
Elizabeth Thomas
|
41b6f9200f
|
Remove all2all backend envvar (#30363)
Signed-off-by: Elizabeth Thomas <email2eliza@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-12-18 19:46:28 +00:00 |
|
Li, Jiang
|
e3ab93c896
|
[CPU] Refactor CPU fused MOE (#30531)
Signed-off-by: jiang1.li <jiang1.li@intel.com>
|
2025-12-18 14:36:49 +08:00 |
|
Divakar Verma
|
82dc338ad6
|
[AMD][CI] fix lm eval ci arg (#30911)
Signed-off-by: Divakar Verma <divakar.verma@amd.com>
|
2025-12-18 13:18:26 +08:00 |
|
Andrey Talman
|
e06d0bf0aa
|
2.9.1 PyTorch release update (#28495)
|
2025-12-17 12:20:22 -08:00 |
|
Matthew Bonanni
|
7eb6cb6c18
|
[Attention] Update tests to remove deprecated env vars (#30563)
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
|
2025-12-17 09:49:59 -08:00 |
|
Kevin H. Luu
|
4bf6c23668
|
[ci] Sync test areas yaml file with test-pipeline (#30862)
Signed-off-by: Kevin H. Luu <khluu000@gmail.com>
|
2025-12-17 02:30:56 -08:00 |
|
Chauncey
|
9ad5b21710
|
[Refactor] [4/N] Move VLLM_SERVER_DEV endpoints into the serve directory (#30749)
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
|
2025-12-17 02:27:30 -08:00 |
|
Michael Goin
|
10ee1c64cf
|
[CI] Generalize gsm8k test args and add Qwen3-Next MTP B200 test (#30723)
Signed-off-by: mgoin <mgoin64@gmail.com>
|
2025-12-16 14:28:34 -05:00 |
|
Lucas Wilkinson
|
00a8d7628c
|
[BugFix] Fix memory spike in workspace allocation (#30744)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
|
2025-12-16 06:46:22 -08:00 |
|
Cyrus Leung
|
ed586e7724
|
[Refactor] [3/N] Move tool parser tests and run on CPU (#30693)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2025-12-15 13:45:36 +00:00 |
|
Micah Williamson
|
97f2f160fd
|
[ROCm][CI] Add "Qwen3-Next-80B-A3B-Instruct MTP Async EPLB Accuracy Test" Back Into AMD CI (#30590)
Signed-off-by: David Chen <530634352@qq.com>
Signed-off-by: WeiQing Chen <40507679+david6666666@users.noreply.github.com>
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
Co-authored-by: WeiQing Chen <40507679+david6666666@users.noreply.github.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
|
2025-12-14 06:56:26 +00:00 |
|
Michael Goin
|
2f32a68d75
|
[CI] Update several models in registry that are available online now (#30514)
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
|
2025-12-12 18:28:13 -08:00 |
|
Kevin H. Luu
|
b4039c08b5
|
[ci] Mark PrimeRL integration test as soft fail (#30578)
Signed-off-by: Kevin H. Luu <khluu000@gmail.com>
|
2025-12-12 14:13:09 -08:00 |
|
Li, Jiang
|
9693dd0fe3
|
[CI/Build] Add x86 CPU wheel release pipeline (#28848)
Signed-off-by: jiang1.li <jiang1.li@intel.com>
|
2025-12-12 19:21:35 +00:00 |
|
shivampr
|
cd7740ac5c
|
[ROCm] Enable Triton ScaledMM fallback + kernel selection fix (#26668)
Signed-off-by: Shivam <shivampr.dev@gmail.com>
Signed-off-by: Shivam <shivamprasad91@gmail.com>
|
2025-12-12 13:28:20 -05:00 |
|
Andreas Karatzas
|
b5945d49c0
|
[ROCm][CI] Use mi325_4 agent pool for V1 e2e tests (#30526)
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
|
2025-12-12 01:37:24 +00:00 |
|
Concurrensee
|
2cc5affc38
|
[ROCM][CI] Fix AMD Examples Test Group (#30276)
Signed-off-by: Yida Wu <yida.wu@amd.com>
Signed-off-by: Yida <yida.wu@amd.com>
|
2025-12-11 18:03:54 -05:00 |
|
Shengqi Chen
|
305b168a9f
|
[CI] refine more logic when generating and using nightly wheels & indices, add cuda130 build for aarch64, specify correct manylinux version (#30341)
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
|
2025-12-12 00:42:30 +08:00 |
|
Sage Moore
|
b4054c8ab4
|
Revert "[CI] Add Async Eplb nightly CI tests (#29385)" (#30431)
|
2025-12-11 00:48:35 +00:00 |
|
Aditya Tewari
|
cebda2a4af
|
[CPU] Support for Whisper (#30062)
Signed-off-by: Aditya Tewari <aditya.tewari@arm.com>
|
2025-12-10 04:58:42 -08:00 |
|
Ilya Markov
|
0b6a8a304c
|
[BugFix] Fix non detected failing tests (#30277)
Signed-off-by: ilmarkov <markovilya197@gmail.com>
|
2025-12-09 17:57:55 +00:00 |
|
Alexei-V-Ivanov-AMD
|
804e3468c0
|
Update AMD test definitions (2025-12-08) (#30298)
Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>
|
2025-12-09 17:31:30 +00:00 |
|
Fadi Arafeh
|
e130845984
|
[CPU][CI] Enable fused MoE tests in Arm CI (#30132)
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
|
2025-12-09 04:55:39 +00:00 |
|
Or Ozeri
|
4c6fd25880
|
kv_transfer: Rename the shared storage connectors (#30201)
Signed-off-by: Or Ozeri <oro@il.ibm.com>
|
2025-12-08 20:46:09 -08:00 |
|
Kevin H. Luu
|
db14f61f2d
|
[ci] Refactor CI file structure (#29343)
|
2025-12-08 17:25:43 -09:00 |
|
Micah Williamson
|
78c7503364
|
[ROCm][CI] Skip NVIDIA-Only Prime-RL Test in AMD CI (#29420)
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
|
2025-12-09 02:14:02 +00:00 |
|
Yi Liu
|
0d8a7d8a26
|
[Compressed Tensors] Add XPU wNa16 support (#29484)
Signed-off-by: yiliu30 <yi4.liu@intel.com>
|
2025-12-05 22:02:09 +08:00 |
|
Shengqi Chen
|
aaddc9c82a
|
[CI] fix silent error in nightly wheel index generation script, add generation time to HTML index (#30060)
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
|
2025-12-05 00:48:59 +00:00 |
|
Zhewen Li
|
263c38d74d
|
[CI/Build] Update batch invariant test trigger (#30080)
Signed-off-by: zhewenli <zhewenli@meta.com>
|
2025-12-05 00:42:37 +00:00 |
|
Zhewen Li
|
bcf43ab1f3
|
[CI/Build][AMD] Add Llama4 Maverick FP8 to AMD CI (#28695)
Signed-off-by: zhewenli <zhewenli@meta.com>
|
2025-12-04 16:07:20 -08:00 |
|
Kevin H. Luu
|
1b7c7f5159
|
[release] install regex (#30008)
Signed-off-by: Kevin H. Luu <khluu000@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
|
2025-12-04 03:18:29 -08:00 |
|
Andreas Karatzas
|
e96a6a6dca
|
[ROCm][CI][Bugfix] Fixing the Multi-Modal Models Test (Extended) 1 group (#30013)
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
|
2025-12-04 11:00:16 +00:00 |
|
Li Wang
|
5f91cdda75
|
[Misc] Add docker build env for Ascend NPU (#30015)
Signed-off-by: wangli <wangli858794774@gmail.com>
|
2025-12-03 19:53:00 -08:00 |
|
Zhewen Li
|
c493b9d092
|
[CI/Build] Add MM code path to Examples Test (#29986)
Signed-off-by: zhewenli <zhewenli@meta.com>
|
2025-12-03 19:21:45 -08:00 |
|
Shengqi Chen
|
1109f98288
|
[CI] fix docker image build by specifying merge-base commit id when downloading pre-compiled wheels (#29930)
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
|
2025-12-03 14:08:19 -08:00 |
|
Wentao Ye
|
ac1886588f
|
[CI] Fix re import error (#29973)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
|
2025-12-03 15:16:54 -05:00 |
|
WeiQing Chen
|
7fe9c1a223
|
[CI] Add Async Eplb nightly CI tests (#29385)
Signed-off-by: David Chen <530634352@qq.com>
Signed-off-by: WeiQing Chen <40507679+david6666666@users.noreply.github.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
|
2025-12-03 09:51:08 +00:00 |
|
Alexei-V-Ivanov-AMD
|
5f67361fd1
|
Reverting re-direction to amd_mi355_X. (#29914)
Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>
|
2025-12-03 00:40:02 +00:00 |
|
Alexei-V-Ivanov-AMD
|
c77b9929a0
|
Update AMD-CI testing mirror (as of 2025-12-02) (#29898)
Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>
|
2025-12-02 08:52:54 -09:00 |
|
wang.yuqi
|
2eb4fe9129
|
[examples] Resettle pooling examples. (#29365)
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-12-02 15:54:28 +00:00 |
|
Louie Tsai
|
8bbcf8b6e7
|
[vLLM Benchmark Suite] Add default parameters section and update CPU benchmark cases (#29381)
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
Signed-off-by: Louie Tsai <louie.tsai@intel.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Li, Jiang <bigpyj64@gmail.com>
|
2025-12-02 09:00:23 +00:00 |
|
Shengqi Chen
|
4b612664fd
|
[CI] Renovation of nightly wheel build & generation (take 2) (#29838)
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
|
2025-12-01 22:17:10 -08:00 |
|
Alexei-V-Ivanov-AMD
|
342c4f1472
|
Updated CI mirror 2025-11-25 (#29434)
Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>
Signed-off-by: Alexei-V-Ivanov-AMD <156011006+Alexei-V-Ivanov-AMD@users.noreply.github.com>
Co-authored-by: Kevin H. Luu <khluu000@gmail.com>
|
2025-12-01 23:44:33 +00:00 |
|
Kevin H. Luu
|
1336a1ea24
|
Revert #29787 and #29690 (#29815)
|
2025-12-01 13:42:03 -08:00 |
|
Kevin H. Luu
|
ec7035c9d4
|
[ci] Make distributed 8 gpus test optional (#29801)
Signed-off-by: Kevin H. Luu <khluu000@gmail.com>
|
2025-12-01 10:22:05 -08:00 |
|
Shengqi Chen
|
37593deb02
|
[CI] fix url-encoding behavior in nightly metadata generation (#29787)
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
|
2025-12-01 23:17:20 +08:00 |
|
Shengqi Chen
|
36db0a35e4
|
[CI] Renovation of nightly wheel build & generation (#29690)
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
|
2025-12-01 21:25:39 +08:00 |
|
Cyrus Leung
|
2afcec4dec
|
[Misc] Update TokenizerLike interface and move get_cached_tokenizer (#29730)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2025-11-30 14:59:47 +08:00 |
|