vllm/tests at aebfcb262a2c6a66f96d8a82efc4ac4c35092222 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Yanming W 8efe23f150 Fix input_metadata.selected_token_indices in worker prepare_inputs (#1546 )

2023-11-08 14:19:12 -08:00

..

Implement prompt logprobs & Batched topk for computing logprobs (#1328 )

2023-10-16 10:56:50 -07:00

TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )

2023-10-02 15:36:09 -07:00

TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181 )

2023-10-02 15:36:09 -07:00

Fix integer overflows in attention & cache ops (#1514 )

2023-10-31 15:19:30 -07:00

Add Mistral 7B to test_models (#1366 )

2023-10-16 17:49:54 -07:00

Added logits processor API to sampling params (#1469 )

2023-11-03 14:12:15 -07:00

Fix input_metadata.selected_token_indices in worker prepare_inputs (#1546 )

2023-11-08 14:19:12 -08:00

__init__.py

[Small] Formatter only checks lints in changed files (#1528 )

2023-10-31 15:39:38 -07:00

conftest.py

Implement prompt logprobs & Batched topk for computing logprobs (#1328 )

2023-10-16 10:56:50 -07:00