biondizzle/vllm: vllm/vllm at commit 05bdf4eaf3bd8c577d09a6556acc3688094d0f6b
Latest commit: mezuzza 6774bd50b0 "Fix typing in AsyncLLMEngine & add toml to requirements-dev (#2100)" (2023-12-14 00:19:41 -08:00)
Name | Last commit | Date
core | [FIX] Fix formatting error | 2023-11-29 00:40:19 +00:00
engine | Fix typing in AsyncLLMEngine & add toml to requirements-dev (#2100) | 2023-12-14 00:19:41 -08:00
entrypoints | Fix completion API echo and logprob combo (#1992) | 2023-12-10 13:20:30 -08:00
model_executor | Optimize Mixtral with expert parallelism (#2090) | 2023-12-13 23:55:07 -08:00
transformers_utils | Fix Baichuan tokenizer error (#1874) | 2023-11-30 18:35:50 -08:00
worker | [BugFix] Fix input positions for long context with sliding window (#2088) | 2023-12-13 12:28:13 -08:00
__init__.py | Bump up to v0.2.5 (#2095) | 2023-12-13 23:56:15 -08:00
block.py | [Quality] Add code formatter and linter (#326) | 2023-07-03 11:31:55 -07:00
config.py | Optimize Mixtral with expert parallelism (#2090) | 2023-12-13 23:55:07 -08:00
logger.py | [Fix] Fix duplicated logging messages (#1524) | 2023-10-31 09:04:47 -07:00
outputs.py | docs: add description (#1553) | 2023-11-03 09:14:52 -07:00
py.typed | Add py.typed so consumers of vLLM can get type checking (#1509) | 2023-10-30 14:50:47 -07:00
sampling_params.py | add custom server params (#1868) | 2023-12-03 12:59:18 -08:00
sequence.py | [FIX] Fix class naming (#1803) | 2023-11-28 14:08:01 -08:00
utils.py | Fix peak memory profiling (#2031) | 2023-12-12 22:01:53 -08:00