[Bugfix] Fix ModernBert cuda graph capturing in v1 (#21901)

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: Isotr0py <2037008807@qq.com>
Isotr0py
2025-08-09 13:17:22 +08:00
committed by GitHub
parent 35afe1b30b
commit 429e4e2d42
5 changed files with 39 additions and 42 deletions

@@ -466,7 +466,7 @@ class BertEmbeddingModel(nn.Module, SupportsQuant):
     def forward(
         self,
-        input_ids: Optional[torch.Tensor],
+        input_ids: torch.Tensor,
         positions: torch.Tensor,
         token_type_ids: Optional[torch.Tensor] = None,
         intermediate_tensors: Optional[IntermediateTensors] = None,