biondizzle/vllm - vllm - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Harry Mellor	ecde7af9c4	Fix import that was moved in Transformers 5.2.0 (#36120 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-03-05 13:59:44 +00:00
Andreas Karatzas	edba15045a	[Bugfix] Guard mm_token_type_ids kwarg in get_mrope_input_positions (#35711 ) Signed-off-by: Andreas Karatzas <akaratza@amd.com>	2026-03-04 04:12:51 +00:00
SteadfastAsArt	2decec9856	[Transformers backend] Ignore MTP weights when num_nextn_predict_layers=0 (#34888 ) Signed-off-by: SteadfastAsArt <695488173@qq.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-02-27 19:39:23 +00:00
Raushan Turganbay	fd6de37fca	[BugFix] Fix 3D rope in transformers backend (#35097 ) Signed-off-by: raushan <raushan@huggingface.co> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-02-27 18:34:49 +00:00
Cyrus Leung	392645454b	[Refactor] Decouple TimingContext from InputProcessingContext (#35083 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-02-23 14:15:50 +00:00
Harry Mellor	103e614b14	Fix pipeline parallel with embed scaling in the Transformers modelling backend (#35094 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-02-23 05:04:47 -08:00
Cyrus Leung	987506bca6	[Refactor] Simplify dummy data generation (#35025 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-02-22 20:55:27 -08:00
Cyrus Leung	a0d8d944e2	[Renderer] Move MM Hash parsing into Renderer (#34711 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-02-17 19:18:55 -08:00
Cyrus Leung	ec17bdd894	[Renderer] Move InputPreprocessor into Renderer (1.5/2) (#34598 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-02-15 23:46:33 -08:00
Harry Mellor	679ca5d8d3	Fix MoE for the Transformers modelling backend (#34436 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-02-12 09:29:42 -08:00
Isotr0py	0ab06100f4	[Multimodal] Expose `mm_processor_kwargs` for `DummyInputsBuilder` (#34330 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2026-02-11 09:37:40 -08:00
Cyrus Leung	88c3e114d8	[Refactor] Move MM data parsing outside processor (#33408 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-01-31 16:46:14 +00:00
Matthew Bonanni	a608b4c6c2	[5/N][Attention] Finish eliminating `vllm/attention` folder (#32064 ) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>	2026-01-27 10:02:51 -05:00
Harry Mellor	14385c80fc	Fix weight mapping test for Transfomers v5 (#33162 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-01-27 12:30:14 +00:00
Cyrus Leung	c25dbee40d	[Model] Bump transformers version for test registry (#33100 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-01-26 18:53:22 +00:00
Andreas Karatzas	22aeb43007	[Bugfix][VLM] Fix transformers backend embed_multimodal for Qwen2.5-VL profiling (#32969 ) Signed-off-by: Andreas Karatzas <akaratza@amd.com>	2026-01-26 08:34:05 +08:00
Raushan Turganbay	d95d650762	[Bugfix] Fix getting vision features in Transformer Multimodal backend (#32933 ) Signed-off-by: raushan <raushan@huggingface.co>	2026-01-23 13:34:48 +00:00
Cyrus Leung	9ea07b41da	[1/N] Reorganize multimodal processing code (#32327 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-01-14 15:25:31 +00:00
Cyrus Leung	9101dc756c	[Model] Avoid hardcoding pooling type (#32119 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-01-11 21:28:12 -08:00
Matthew Bonanni	2612ba9285	[1/N][Attention] Restructure attention: move files (#31916 ) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>	2026-01-09 13:10:24 -08:00
Shanshan Shen	08d954f036	[Doc] Add developer guide for CustomOp (#30886 ) Signed-off-by: shen-shanshan <467638484@qq.com>	2026-01-09 16:21:11 +00:00
Cyrus Leung	c8ed39b9dd	[Model] Reorganize pooling layers (#31973 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2026-01-09 11:02:14 +00:00
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟	482914849c	[BugFix] LoRA: Support loading base_layer of experts (#31104 ) Signed-off-by: Hollow Man <hollowman@opensuse.org>	2026-01-07 14:49:39 +08:00
Harry Mellor	e37e7349e6	Replace `nn.ConvNd` with vLLM's `ConvNdLayer` for Transformers modeling backend (#31498 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-12-29 16:20:01 +00:00
Harry Mellor	b10d47e0e0	Add util function for checking nesting of rope parameters (#31146 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-12-23 11:41:49 +00:00
Andreas Karatzas	7b43db210c	[ROCm][CI][Bugfix] Multi-Modal Model Support Fixes and Attention Backend Improvements (#30270 ) Signed-off-by: Andreas Karatzas <akaratza@amd.com>	2025-12-19 02:17:27 +00:00
Harry Mellor	8781cd6b88	Add Eagle and Eagle3 support to Transformers modeling backend (#30340 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-12-11 17:02:10 +00:00
Cyrus Leung	c46b932df2	[Chore] Deprecate `SupportsMultiModal.merge_by_field_config` (#30170 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-12-06 07:57:28 +00:00
Jee Jee Li	39e63dec7c	[LoRA] Cleanup LoRA unused code (#29611 ) Signed-off-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-11-28 22:52:58 -08:00
Matthew Bonanni	430dd4d9eb	[Attention] Remove imports from `vllm/attention/__init__.py` (#29342 ) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>	2025-11-26 10:53:15 -07:00
Harry Mellor	a8b70304d6	Update `rope_scaling` to `rope_parameters` in preparation for Transformers v5 (#28542 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-19 09:06:36 -08:00
Harry Mellor	4f5299f717	Relax Transformers modeling backend MoE experts check (#28952 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-19 21:50:30 +08:00
Harry Mellor	5f3cd7f7f2	[Docs] Update the name of `Transformers backend` -> `Transformers modeling backend` (#28725 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-14 16:34:14 +00:00
Harry Mellor	97d1c99302	Rename clashing method names for vLLM model protocol (#27583 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-12 19:14:33 -08:00
Jee Jee Li	9d1c474704	[LoRA][1/N]Remove LoRA extra vocab (#28382 ) Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>	2025-11-11 11:06:21 -08:00
Cyrus Leung	afffd3cc8a	[Model] Pass `mm_features` directly into `get_mrope_input_positions` (#28399 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-11 21:14:48 +08:00
Cyrus Leung	d0e186c16f	[V0 Deprecation] Remove unused `context_len` and `seq_len` from M-RoPE (#28395 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-11 00:30:06 +08:00
Harry Mellor	c0a4b95d64	Fix issues from #28242 (#28257 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-07 04:23:17 +00:00
Lucas Kabela	4bf56c79cc	[Multimodal][torch.compile] Add compilation config field for turning off ViT/MM compile (#28242 ) Signed-off-by: Lucas Kabela <lucaskabela@meta.com>	2025-11-07 00:16:03 +00:00
Ilya Markov	e50c454672	[BugFix] Support EP/DP + EPLB with MTP (#25311 ) Signed-off-by: ilmarkov <markovilya197@gmail.com> Signed-off-by: Sage Moore <sage@neuralmagic.com> Co-authored-by: Sage Moore <sage@neuralmagic.com> Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>	2025-11-05 15:22:17 +00:00
Isotr0py	0ff05e3770	[Bugfix] Fix encoder-only model support for transformers backend (#28021 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-04 22:24:41 -08:00
Harry Mellor	1f9460c4c1	Fix pooling adapters for Transformers backend (#27338 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-23 20:23:55 -07:00
Isotr0py	6ac5e06f7c	[Chore] Clean up pytorch helper functions in `vllm.utils` (#26908 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by: isotr0py <2037008807@qq.com>	2025-10-18 09:48:22 -07:00
Harry Mellor	fb5e10d3fb	Refactor Transformers backend to use mixins (#26906 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-16 21:50:39 +00:00

44 Commits