vllm/tests/kernels at 04a9e064db4dcf57519f1333796ba7face46248b - vllm


linhaifeng 7901109ea5 [Bugfix] Fix Off-by-one error in _num_tokens_to_min_blocks calculation (#32603)

Signed-off-by: linhaifeng <1371675203@qq.com>

2026-01-20 11:13:39 -05:00

attention

Revert "[Kernels][FI] Skip trtllm attention when num_kv_heads=1 (#308… (#31617)

2026-01-10 12:39:59 -08:00

core

[CI][Hardware][AMD] Fix test_rotary_embedding_mla_cache_fused (#32408)

2026-01-19 08:25:47 +00:00

helion

[CI] Add Helion as an optional dependency (#32482)

2026-01-19 19:09:56 +00:00

mamba

[Chore] Migrate V0 attention utils (#31891)

2026-01-07 13:44:36 +00:00

moe

[MoE Refactor] Separate Router into OO Classes (#30623)

2026-01-18 11:40:49 -05:00

quantization

[Refactor] Make FP8 Linear Ops use kernel abstraction (#27814)

2026-01-20 14:48:20 +08:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425)

2024-05-13 23:50:09 +09:00

allclose_default.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

quant_utils.py

[Bugfix][Hardware][AMD] Consolidate FP8 min/max values helper function (#31106)

2026-01-07 06:55:03 +00:00

test_apply_repetition_penalties.py

[Platform] Deprecate seed_everything (#31659)

2026-01-04 18:34:04 -08:00

test_cache_kernels.py

[Bugfix][cache_kernels]: Fix OOB in cache_kernels.cu (#28760)

2025-11-20 02:52:02 -08:00

test_fla_layernorm_guard.py

[Platform] Deprecate seed_everything (#31659)

2026-01-04 18:34:04 -08:00

test_flex_attention.py

[Fix][FlexAttention] return max logical block index to handle reused blocks (#30915)

2025-12-18 06:42:21 +00:00

test_fused_quant_activation.py

[Misc] Fix Current vLLM config is not set. warnings, assert to avoid issues in the future (#31747)

2026-01-08 15:20:49 -08:00

test_onednn.py

[CPU] Refactor CPU attention backend (#27954)

2025-11-12 09:43:06 +08:00

test_shuffle_rows.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_top_k_per_row.py

[DeepSeek v3.2] Make top-k work for any logit values. (#27568)

2025-12-08 06:55:58 -08:00

utils.py

[Bugfix] Fix Off-by-one error in _num_tokens_to_min_blocks calculation (#32603)

2026-01-20 11:13:39 -05:00