Signed-off-by: Zhu, Zufang <zufang.zhu@intel.com> Signed-off-by: zofia <110436990+zufangzhu@users.noreply.github.com>
vllm/attention
use_data_parallel
CustomOp.forward_native
SiluAndMul
QuantFP8