Logo
Explore Help
Register Sign In
biondizzle/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions 2 Packages Projects Releases Wiki Activity
Files
c4bca740e8498987184466d2f85ed43f1e1feb80
vllm/tests/distributed
History
Lily Liu 7041de4384 [Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>, bong-furiosa <bongwon.jang@furiosa.ai>
2024-06-28 15:28:49 -07:00
..
__init__.py
[CI/Build] Move test_utils.py to tests/utils.py (#4425)
2024-05-13 23:50:09 +09:00
test_basic_distributed_correctness.py
[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
2024-06-28 15:28:49 -07:00
test_chunked_prefill_distributed.py
[CI/Test] improve robustness of test (vllm_runner) (#5357)
2024-06-08 08:59:20 +00:00
test_comm_ops.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_custom_all_reduce.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_parallel_state.py
[Distributed] Make it clear that % should not be in tensor dict keys. (#5927)
2024-06-28 15:20:22 +00:00
test_pynccl.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_same_node.py
[Core][Distributed] add same-node detection (#5369)
2024-06-11 10:53:59 -07:00
test_shm_broadcast.py
[bugfix][distributed] fix shm broadcast when the queue size is full (#5801)
2024-06-25 21:56:02 -07:00
test_utils.py
[Hardware][AMD][CI/Build][Doc] Upgrade to ROCm 6.1, Dockerfile improvements, test fixes (#5422)
2024-06-25 15:56:15 -07:00
Powered by Gitea Version: 1.25.2 Page: 111ms Template: 9ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API