vllm/vllm/model_executor/models
Lain 9a3835aaa9 Fix trtllm-gen attention env and add attention sink (#22378)
Signed-off-by: Siyuan Fu <siyuanf@nvidia.com>
Signed-off-by: Lain <fusiyuan2000@hotmail.com>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
2025-08-06 18:07:41 -07:00