vllm/tests/models/decoder_only/language at ccb1aabccaa7aaf07b08fd8be30380e828efba0f - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Wallas Henrique 8b79f9e107 [Bugfix] Fix guided decoding with tokenizer mode mistral (#11046 )

2024-12-17 22:34:08 -08:00

..

__init__.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_aqlm.py

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

test_fp8.py

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

test_gguf.py

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

test_gptq_marlin_24.py

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

test_gptq_marlin.py

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

test_granite.py

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

test_jamba.py

[core] clean up cudagraph batchsize padding logic (#10996 )

2024-12-13 06:57:50 +00:00

test_mamba.py

[core] clean up cudagraph batchsize padding logic (#10996 )

2024-12-13 06:57:50 +00:00

test_mistral.py

[Bugfix] Fix guided decoding with tokenizer mode mistral (#11046 )

2024-12-17 22:34:08 -08:00

test_modelopt.py

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

test_models.py

[Model] Support Qwen2 embeddings and use tags to select model tests (#10184 )

2024-11-14 20:23:09 -08:00

test_phimoe.py

[Hardware][CPU] using current_platform.is_cpu (#9536 )

2024-10-22 00:50:43 -07:00