This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
258a2c58d08fc7a242556120877a89404861fbce
vllm
/
vllm
/
model_executor
History
Cody Yu
a62aaf1df5
[Misc][Refactor] Generalize linear_method to be quant_method (
#4373
)
2024-04-26 16:41:14 -04:00
..
guided_decoding
[Bugfix] Add fix for JSON whitespace (
#4189
)
2024-04-19 20:49:22 -07:00
layers
[Misc][Refactor] Generalize linear_method to be quant_method (
#4373
)
2024-04-26 16:41:14 -04:00
model_loader
[Misc][Refactor] Generalize linear_method to be quant_method (
#4373
)
2024-04-26 16:41:14 -04:00
models
[Misc][Refactor] Generalize linear_method to be quant_method (
#4373
)
2024-04-26 16:41:14 -04:00
__init__.py
[Core] Refactor Attention Take 2 (
#3462
)
2024-03-25 04:39:33 +00:00
sampling_metadata.py
[Core] Refactoring sampler and support prompt logprob for chunked prefill (
#4309
)
2024-04-26 13:02:02 +00:00
utils.py
[Hardware][Neuron] Refactor neuron support (
#3471
)
2024-03-22 01:22:17 +00:00