biondizzle/vllm - vllm

Author	SHA1	Message	Date
Jee Jee Li	15859f2357	[Misc]Upgrade bitsandbytes to the latest version 0.45.0 (#11201)	2024-12-15 03:03:06 +00:00
yansh97	cfb3bf25fb	[bugfix] fix the default value of llm_int8_threshold in BitsAndBytesConfig (#10657)	2024-11-27 13:55:23 +08:00
Michael Goin	7576cd38df	[Bugfix] Check bnb_4bit_quant_storage for bitsandbytes (#10642)	2024-11-26 12:29:00 -08:00
Michael Goin	399c798608	Remove ScaledActivation for AWQ (#10057) Signed-off-by: mgoin <michael@neuralmagic.com>	2024-11-06 14:27:06 +00:00
Jee Jee Li	b9c64c0ca7	[Misc] Modify BNB parameter name (#9997) Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>	2024-11-05 14:40:08 -05:00
Michael Goin	37a4947dcd	[Bugfix] Fix layer skip logic with bitsandbytes (#9887) Signed-off-by: mgoin <michael@neuralmagic.com>	2024-11-01 13:12:44 +08:00
Isotr0py	09500f7dde	[Model] Add BNB quantization support for Mllama (#9720)	2024-10-29 08:20:02 -04:00
chenqianfzh	2f4117c38e	support bitsandbytes quantization with more models (#9148)	2024-10-08 19:52:19 -06:00
Jee Jee Li	13f9f7a3d0	[Misc]Upgrade bitsandbytes to the latest version 0.44.0 (#8768)	2024-09-24 17:08:55 -07:00
chenqianfzh	4664ceaad6	support bitsandbytes 8-bit and FP4 quantized models (#7445)	2024-08-29 19:09:08 -04:00
dongmao zhang	87525fab92	[bitsandbytes]: support read bnb pre-quantized model (#5753) Co-authored-by: Michael Goin <michael@neuralmagic.com>	2024-07-23 23:45:09 +00:00
Robert Shaw	683e3cb9c4	[ Misc ] `fbgemm` checkpoints (#6559)	2024-07-20 09:36:57 -07:00
Dipika Sikka	7836fdcc11	[Misc] Fix `get_min_capability` (#5971)	2024-06-30 20:15:16 +00:00
chenqianfzh	b9c0605a8e	[Feature][Kernel] Support bitsandbytes quantization and QLoRA (#4776)	2024-06-01 14:51:10 -06:00

14 Commits