vllm/vllm/model_executor at cc7ae5e7cab77765369630c1401410ca54184065 - vllm

Files

rasmith cc7ae5e7ca [BugFix][AMD][Quantization] Fix torch.compile issue where wvSplitKQ not being called when it should when using quantized FP8 model (#22281 )

Signed-off-by: Randall Smith <Randall.Smith@amd.com>

2025-08-22 21:47:57 +00:00

2025-08-22 21:47:57 +00:00

2025-08-22 13:04:22 -06:00

2025-08-22 17:50:52 +00:00

2025-08-14 16:03:55 -04:00

__init__.py

2025-06-03 11:20:17 -07:00

custom_op.py

2025-08-04 21:43:24 -07:00

parameter.py

2025-06-03 11:20:17 -07:00

pooling_metadata.py

2025-08-21 13:26:09 +00:00

sampling_metadata.py

2025-08-01 05:24:46 -07:00

utils.py

2025-08-01 11:09:54 +00:00