vllm/vllm/model_executor at 9117f892f0e4d3b0f07bf0b9b409321bc743dabc - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Saurabh Dash 9117f892f0 [Model] Cohere CommandR+ (#3829 )

2024-04-04 13:31:49 -07:00

..

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290 )

2024-04-03 14:15:55 -07:00

[Model] Cohere CommandR+ (#3829 )

2024-04-04 13:31:49 -07:00

[Core] manage nccl via a pypi package & upgrade to pt 2.2.1 (#3805 )

2024-04-04 10:26:19 -07:00

__init__.py

[Core] Refactor Attention Take 2 (#3462 )

2024-03-25 04:39:33 +00:00

guided_decoding.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

guided_logits_processors.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

model_loader.py

Usage Stats Collection (#2852 )

2024-03-28 22:16:12 -07:00

neuron_model_loader.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

sampling_metadata.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

utils.py

[Hardware][Neuron] Refactor neuron support (#3471 )

2024-03-22 01:22:17 +00:00

weight_utils.py

[Core] Enable hf_transfer by default if available (#3817 )

2024-04-04 04:02:43 +00:00