vllm/tests/v1/worker at c7ea0b56cd9a5e607f30cc637b4c800ce47bafca - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Isotr0py 5f1ac1e1d1 Revert "[v1] Add fp32 support to v1 engine through flex attn" (#19404 )

2025-06-10 01:30:20 -07:00

..

__init__.py

[V1] Adding min tokens/repetition/presence/frequence penalties to V1 sampler (#10681 )

2024-12-26 19:02:58 +09:00

test_gpu_input_batch.py

[Core] Use tuple for kv cache group block ids (#19175 )

2025-06-10 07:01:17 +02:00

test_gpu_model_runner.py

Revert "[v1] Add fp32 support to v1 engine through flex attn" (#19404 )

2025-06-10 01:30:20 -07:00