Files
vllm/benchmarks/fused_kernels/layernorm_rms_benchmarks.py
Luka Govedič 30870b4f66 [torch.compile] Dynamic fp8 + rms_norm fusion (#10906)
Signed-off-by: luka <luka@neuralmagic.com>
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
2024-12-13 03:19:23 +00:00

5.0 KiB