Zhean Xu | 0f5f266202 | Multiple updates and refactorings (#280) | 2026-01-16 17:06:52 +08:00
Ray Wang | 38f8ef73a4 | Multiple updates and refactorings (#231) | 2025-11-21 17:49:47 +08:00
Chenggang Zhao | 8da33d6bd9 | Clean up | 2025-11-19 11:00:55 +08:00
Guoteng | f63d7f24d6 | fix: prevent int32 overflow in k-grouped GEMM size calculations (#226) | 2025-11-19 10:52:08 +08:00
Simon Mo | 59f2c07cf2 | Add SM100 kernels (#201) Signed-off-by: simon-mo <simon.mo@hey.com> | 2025-09-29 17:07:28 +08:00
Chenggang Zhao | 80ceeb2c76 | Add SM90 kernels (#200) | 2025-09-29 17:00:23 +08:00
Ray Wang | 3f71de7aa9 | Make various updates and fixes (#198) | 2025-09-25 16:19:07 +08:00
Ray Wang | f85ec649d7 | Make various updates and fixes: (#164) - Add BF16 support for SM90 and SM100 - Refactor Python APIs - Other fixes and code refactoring | 2025-08-15 18:32:35 +08:00