vllm/vllm/model_executor at 0ce0539d4750f9ebcd9b19d7085ca3b934b9ec67 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

youkaichao 95baec828f [Core] enable out-of-tree model register (#3871 )

2024-04-06 17:11:41 -07:00

..

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290 )

2024-04-03 14:15:55 -07:00

[Core] enable out-of-tree model register (#3871 )

2024-04-06 17:11:41 -07:00

[Core] improve robustness of pynccl (#3860 )

2024-04-04 16:52:12 -07:00

__init__.py

[Core] Refactor Attention Take 2 (#3462 )

2024-03-25 04:39:33 +00:00

guided_decoding.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

guided_logits_processors.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

model_loader.py

Usage Stats Collection (#2852 )

2024-03-28 22:16:12 -07:00

neuron_model_loader.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

sampling_metadata.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

utils.py

[Hardware][Neuron] Refactor neuron support (#3471 )

2024-03-22 01:22:17 +00:00

weight_utils.py

[Core] Enable hf_transfer by default if available (#3817 )

2024-04-04 04:02:43 +00:00