Zhuohan Li
|
002800f081
|
Align vLLM's beam search implementation with HF generate (#857)
|
2023-09-04 17:29:42 -07:00 |
|
JFDuan
|
0d93f15694
|
Accelerate LLaMA model loading (#234)
|
2023-08-30 01:00:13 -07:00 |
|
Wen Sun
|
eedac9dba0
|
fix: revert code to avoid no attribute problem (#827)
|
2023-08-22 11:55:16 -07:00 |
|
zhaoyang-star
|
4f8584756d
|
Fix mqa is false case in gpt_bigcode (#806)
|
2023-08-21 22:22:06 -07:00 |
|
Zhuohan Li
|
7d5a155e4a
|
[Fix] Fix GPTBigcoder for distributed execution (#503)
|
2023-07-24 18:36:33 -07:00 |
|
Zhuohan Li
|
96853af5a8
|
Optimize MQA Kernel (#452)
|
2023-07-14 20:06:40 -04:00 |
|
Zhuohan Li
|
d6fa1be3a8
|
[Quality] Add code formatter and linter (#326)
|
2023-07-03 11:31:55 -07:00 |
|
Zhuohan Li
|
598dc4b79a
|
[Fix] Weight loading for GPTBigCode (#313)
|
2023-06-29 22:14:17 -07:00 |
|
Michael Feil
|
298695b766
|
GPTBigCode (StarCoder, SantaCoder Support) (#209)
|
2023-06-23 01:49:27 +08:00 |
|