biondizzle/vllm
vllm/benchmarks/kernels/benchmark_trtllm_decode_attention.py @ b9a1c4c8a2430555f1959938b87b10640c751e57
Latest commit bba1042c6f by elvischenv: [Flashinfer] Support Flashinfer TRTLLM FP8-qkv BF16/FP16-out Attention Kernel (#23647)
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
2025-09-08 20:53:07 -07:00
9.0 KiB