vllm/vllm/model_executor/model_loader at 630dd9e0aea166085a4c897e21a98ec752954265 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Woosuk Kwon 23993a7997 [Bugfix][TPU] Do not use torch.Generator for TPUs (#6981 )

2024-07-31 18:50:28 -07:00

..

__init__.py

[vlm] Remove vision language config. (#6089 )

2024-07-03 22:14:16 +00:00

loader.py

[Bugfix] Support cpu offloading with fp8 quantization (#6960 )

2024-07-31 12:47:46 -07:00

neuron.py

[Typing] Mypy typing part 2 (#4043 )

2024-04-17 17:28:43 -07:00

openvino.py

[Hardware][Intel] OpenVINO vLLM backend (#5379 )

2024-06-28 13:50:16 +00:00

tensorizer.py

[Frontend] Add FlexibleArgumentParser to support both underscore and dash in names (#5718 )

2024-06-20 17:00:13 -06:00

utils.py

[Kernel] FP8 support for MoE kernel / Mixtral (#4244 )

2024-04-24 01:18:23 +00:00

weight_utils.py

[Bugfix][TPU] Do not use torch.Generator for TPUs (#6981 )

2024-07-31 18:50:28 -07:00