biondizzle/vllm - vllm - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
wang.yuqi	f9e2a38386	[Docs] Reorganize pooling docs. (#35592 ) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-03-19 11:25:47 +00:00
Harry Mellor	a0f44bb616	Allow `markdownlint` to run locally (#36398 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-03-08 20:05:24 -07:00
Kyle Sayers	64ac1395e8	[Docs] Clean up speculators docs (#34065 ) Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>	2026-02-18 13:48:11 -08:00
Aidan Reilly	133765760b	[Docs] Adding links and intro to Speculators and LLM Compressor (#32849 ) Signed-off-by: Aidan Reilly <aireilly@redhat.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2026-01-29 14:12:35 -08:00
Yan Ma	f1c2c20136	[XPU] decrease IGC_ForceOCLSIMDWidth for speculative decoding triton-xpu kernel compilation (#30538 ) Signed-off-by: Yan Ma <yan.ma@intel.com>	2025-12-23 05:22:15 +00:00
Fanli Lin	c2e1987a6e	[Doc] update Intel GPU MM status in Feature x Hardware matrix (#30294 ) Signed-off-by: Lin, Fanli <fanli.lin@intel.com>	2025-12-09 05:16:44 +00:00
wang.yuqi	74c4d80c6c	[Model][6/N] Improve all pooling task \| Support chunked prefill with ALL pooling (#27145 ) Signed-off-by: wang.yuqi <noooop@126.com> Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-12-04 13:44:15 +00:00
Rob Mulla	dd39f91edb	[Doc] cleanup TPU documentation and remove outdated examples (#29048 ) Signed-off-by: Rob Mulla <rob.mulla@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-21 00:05:59 +00:00
Andrew Sansom	ff93cc8c84	[CORE] Support Prefix Caching with Prompt Embeds (#27219 ) Signed-off-by: Andrew Sansom <andrew@protopia.ai>	2025-10-22 22:18:07 -07:00
Chendi.Xue	12e21701e7	[DOC][FEATURES][CPU]update cpu feature for v1 (#27135 ) Signed-off-by: Chendi Xue <chendi.xue@intel.com>	2025-10-18 01:10:45 -07:00
Harry Mellor	483ea64611	[Docs] Replace all explicit anchors with real links (#27087 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-17 02:22:06 -07:00
Harry Mellor	4ffd6e8942	[Docs] Reduce custom syntax used in docs (#27009 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-16 20:05:34 -07:00
Chendi.Xue	509cdc0370	[DOC][XPU]update feature parity with Intel GPU (#26954 ) Signed-off-by: Chendi Xue <Chendi.Xue@intel.com> Signed-off-by: Chendi Xue <chendi.xue@intel.com>	2025-10-15 20:07:10 -07:00
Andrew Sansom	78a47f87ce	Test Prompt Embeds/LoRA compatibility and Enable LoRA Support for OPT Models (#25717 ) Signed-off-by: Andrew Sansom <andrew@protopia.ai>	2025-09-30 08:10:58 +08:00
Andrew Sansom	b8a287a0a8	[docs] Prompt Embedding feature support (#25288 ) Signed-off-by: Andrew Sansom <andrew@protopia.ai>	2025-09-19 17:46:23 -07:00
Harry Mellor	abc7989adc	[Docs] Remove Neuron install doc as backend no longer exists (#24396 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-13 00:15:03 -07:00
Harry Mellor	717fc00e98	[Docs] Move feature compatibility tables to README (#24431 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-08 06:45:14 -07:00

17 Commits