vllm/tests at 9c1352eb5736d9e71d37959db44b6a641e898772 - vllm

biondizzle/vllm

Jason Zhu 7a0b011dd5 Add a 1-line docstring to explain why calling context_attention_fwd twice in test_prefix_prefill.py (#2553)

2024-01-22 14:47:25 -08:00

OpenAI Server refactoring (#2360)

2024-01-16 21:33:14 -08:00

Simplify broadcast logic for control messages (#2501)

2024-01-19 11:23:30 -08:00

Migrate linter from pylint to ruff (#1665)

2023-11-20 11:58:01 -08:00

refactor complemention api for readability (#2499)

2024-01-18 16:45:14 -08:00

Add a 1-line docstring to explain why calling context_attention_fwd twice in test_prefix_prefill.py (#2553)

2024-01-22 14:47:25 -08:00

Add StableLM3B model (#2372)

2024-01-16 20:32:40 -08:00

[Experimental] Prefix Caching Support (#1669)

2024-01-17 16:32:10 -08:00

[BugFix] Fix input positions for long context with sliding window (#2088)

2023-12-13 12:28:13 -08:00

[Experimental] Prefix Caching Support (#1669)

2024-01-17 16:32:10 -08:00

[Speculative decoding 2/9] Multi-step worker for draft model (#2424)

2024-01-21 16:31:47 -08:00

__init__.py

[Small] Formatter only checks lints in changed files (#1528)

2023-10-31 15:39:38 -07:00

conftest.py

[BUGFIX] Fix the path of test prompts (#2273)

2023-12-26 10:37:21 -08:00

test_regression.py

[Minor] Fix duplication of ignored seq group in engine step (#1666)

2023-11-16 13:11:41 -08:00