Logo
Explore Help
Register Sign In
biondizzle/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions 2 Packages Projects Releases Wiki Activity
Files
3cb5769883fa104e42248f2b3f41a310947f357c
vllm/tests/models
History
Cyrus Leung eeec9e3390 [Frontend] Separate pooling APIs in offline inference (#11129)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2024-12-13 10:40:07 +00:00
..
decoder_only
[core] clean up cudagraph batchsize padding logic (#10996)
2024-12-13 06:57:50 +00:00
embedding
[Frontend] Separate pooling APIs in offline inference (#11129)
2024-12-13 10:40:07 +00:00
encoder_decoder
[Model] Support Qwen2 embeddings and use tags to select model tests (#10184)
2024-11-14 20:23:09 -08:00
fixtures
[CI/Build] Update pixtral tests to use JSON (#8436)
2024-09-13 03:47:52 +00:00
__init__.py
[CI/Build] Move test_utils.py to tests/utils.py (#4425)
2024-05-13 23:50:09 +09:00
registry.py
[Model] Add support for embedding model GritLM (#10816)
2024-12-12 06:39:16 +00:00
test_initialization.py
[CI/Build] Bump test transformers version (#10106)
2024-12-05 16:05:52 +00:00
test_oot_registration.py
[Frontend] Separate pooling APIs in offline inference (#11129)
2024-12-13 10:40:07 +00:00
test_registry.py
[Misc] Rename embedding classes to pooling (#10801)
2024-12-01 14:36:51 +08:00
utils.py
[CI/Build] Update CPU tests to include all "standard" tests (#5481)
2024-11-08 23:30:04 +08:00
Powered by Gitea Version: 1.25.2 Page: 326ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API