This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
bd7a8eef25cd85be7eb9f2a94fd752d27ee7dce3
vllm
/
vllm
/
model_executor
History
Isotr0py
fbf152d976
[Bugfix][Model] Refactor OLMo model to support new HF format in transformers 4.40.0 (
#4324
)
...
Co-authored-by: Woosuk Kwon <
woosuk.kwon@berkeley.edu
>
2024-04-25 09:35:56 -07:00
..
guided_decoding
[Bugfix] Add fix for JSON whitespace (
#4189
)
2024-04-19 20:49:22 -07:00
layers
[Model] Adds Phi-3 support (
#4298
)
2024-04-25 03:06:57 +00:00
model_loader
[Kernel] FP8 support for MoE kernel / Mixtral (
#4244
)
2024-04-24 01:18:23 +00:00
models
[Bugfix][Model] Refactor OLMo model to support new HF format in transformers 4.40.0 (
#4324
)
2024-04-25 09:35:56 -07:00
__init__.py
[Core] Refactor Attention Take 2 (
#3462
)
2024-03-25 04:39:33 +00:00
sampling_metadata.py
[Typing] Mypy typing part 2 (
#4043
)
2024-04-17 17:28:43 -07:00
utils.py
[Hardware][Neuron] Refactor neuron support (
#3471
)
2024-03-22 01:22:17 +00:00