vllm/vllm/model_executor at 537d5ee0251bcfcedbb8ca934d273366e05f80fa - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

rasmith 68af5f6c5c [AMD][FP8][BugFix] Remove V1 check in arg_utils.py for FP8 since it is not necessary (#17215 )

Signed-off-by: Randall Smith <Randall.Smith@amd.com>

2025-04-25 19:55:05 -07:00

..

guided_decoding

[Bugfix] remove fallback in guided_json (int range, patterns) (#16725 )

2025-04-25 06:54:43 +00:00

[AMD][FP8][BugFix] Remove V1 check in arg_utils.py for FP8 since it is not necessary (#17215 )

2025-04-25 19:55:05 -07:00

More informative error when using Transformers backend (#16988 )

2025-04-23 19:54:03 -07:00

[Bugfix] gemma[2,3] interleaved attention when sliding window is disabled (#17180 )

2025-04-25 19:53:51 -07:00

__init__.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

custom_op.py

[Neuron] Add custom_ops for neuron backend (#13246 )

2025-02-25 11:47:49 -08:00

parameter.py

[Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036 )

2025-04-22 09:01:36 +01:00

pooling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

sampling_metadata.py

[Bugfix] Fix extra comma (#15851 )

2025-03-31 22:57:28 -07:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00