Signed-off-by: Daniel Serebrenik <daserebrenik@nvidia.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
vllm/attention
CustomOp.forward_native
SiluAndMul
QuantFP8
rst
md