biondizzle/vllm
vllm/vllm/config at commit 25bb9e8c65424e3bf24d2eab259743f9a97b7a3c
Latest commit: 25bb9e8c65 by wang.yuqi, 2025-09-11 03:31:23 -07:00
[CI Failure] fix models/language/pooling/test_auto_prefix_cache_support.py (#24636)
Signed-off-by: wang.yuqi <noooop@126.com>
File              Date                        Last commit
__init__.py       2025-09-11 03:31:23 -07:00  [CI Failure] fix models/language/pooling/test_auto_prefix_cache_support.py (#24636)
cache.py          2025-09-08 21:34:37 -07:00  [Core] Use sha256 bytes instead of BlockHash to reduce GC overhead (#23673)
compilation.py    2025-09-11 15:32:09 +08:00  Add the support for the qwen3 next model (a hybrid attention model). (#24526)
kv_events.py      2025-09-08 06:41:27 -07:00  Move KVEventsConfig from config/__init__.py to config/kv_events.py (#24433)
kv_transfer.py    2025-09-08 20:30:32 -07:00  Move KVTransferConfig from config/__init__.py to config/kv_transfer.py (#24434)
load.py           2025-09-10 23:10:01 -07:00  [Core] feat: Add --safetensors-load-strategy flag for faster safetensors loading from Lustre (#24469)
parallel.py       2025-09-10 00:32:36 +00:00  [Bugfix] Improve EPLB config validation error message (#24524)
scheduler.py      2025-08-12 20:38:18 -07:00  [V0 Deprecation] Remove args for multi-step scheduling (#22779)
utils.py          2025-08-08 16:34:25 -07:00  Extract CompilationConfig from config.py (#22524)
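
These files are the per-domain pieces of vLLM's configuration subpackage; the commit messages above document CompilationConfig, KVEventsConfig, and KVTransferConfig being split out of a single config/__init__.py into their own modules. A minimal sketch of how the split subpackage would be consumed, assuming vllm is installed and __init__.py still re-exports the moved classes (only the class names and file moves come from this page; everything else is an assumption):

```python
# Minimal sketch, assuming vllm is installed and that vllm/config/__init__.py
# still re-exports the classes moved into compilation.py, kv_events.py, and
# kv_transfer.py (the moves are documented in the commit messages above).
from vllm.config import CompilationConfig, KVEventsConfig, KVTransferConfig

# __module__ shows which config/*.py file each class now lives in.
for cls in (CompilationConfig, KVEventsConfig, KVTransferConfig):
    print(cls.__name__, "->", cls.__module__)
```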