vllm/cacheflow/worker at 0f40557af6141ced118b81f2a04e651a0c6c9dbd - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Woosuk Kwon 0f40557af6 Implement block copy kernel to optimize beam search (#32 )

2023-04-07 17:45:07 -07:00

..

cache_engine.py

Implement block copy kernel to optimize beam search (#32 )

2023-04-07 17:45:07 -07:00

controller.py

Add CUDA graph-based all reduce launcher (#26 )

2023-04-05 11:16:57 -07:00

worker.py

Add CUDA graph-based all reduce launcher (#26 )

2023-04-05 11:16:57 -07:00