vllm/vllm/config at 9f1c6422549d37eee22bfa4dbadaaa91d95e98ba - vllm

Files

afeldman-nm bf7f470b22 [V1] Logits processors extensibility (#19912 )

Signed-off-by: Andrew Feldman <afeldman@redhat.com>
Signed-off-by: Andrew Feldman <afeld2012@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Andrew Feldman <afeld2012@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-08-16 12:59:17 -07:00

__init__.py

[V1] Logits processors extensibility (#19912 )

2025-08-16 12:59:17 -07:00

cache.py

[V1] [Hybrid] Support using float32 for state in Hybrid Models (Mamba2, Mamba1, Minimax) (#22928 )

2025-08-15 12:57:06 +00:00

compilation.py

[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059 )

2025-08-15 10:01:39 -04:00

parallel.py

Move ParallelConfig from config/__init__.py to config/parallel.py (#22565 )

2025-08-09 08:33:46 -07:00

scheduler.py

[V0 Deprecation] Remove args for multi-step scheduling (#22779 )

2025-08-12 20:38:18 -07:00

utils.py

Extract CompilationConfig from config.py (#22524 )

2025-08-08 16:34:25 -07:00