13 Commits

Author SHA1 Message Date
Chenggang Zhao
ea9c5d9270 Use driver API 2025-08-28 09:40:49 +08:00
Rain Jiang
51d1e9cdd3 Support compilation with CUDA 13.0 (#174) 2025-08-27 09:30:08 +08:00
Ray Wang
f85ec649d7 Make various updates and fixes: (#164)
- Add BF16 support for SM90 and SM100
- Refactor Python APIs
- Other fixes and code refactoring
2025-08-15 18:32:35 +08:00
Ray Wang
d9c363f86f Make various updates and fixes:
- Add support for legacy CUDA versions; now compatible with CUDA 12.3 and newer
- Add support for NVRTC compilation
- Other fixes and code refactoring
2025-08-02 19:52:22 -07:00
LJC00118
a581263f06 Fix indent 2025-07-29 16:14:46 +08:00
dan_the_3rd
fb7c687548 Merge pull request #135 from danthe3rd/patch-3
Fix import-time error: "undefined symbol: _ZN3c104cuda20getCurrentCUDAStreamEa"
2025-07-29 16:14:15 +08:00
Yuxian Qiu
dd6ed14acb Add torch as build dependency. (#139)
* Add torch as dependency.

Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>

* Add pyproject.toml

Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>

* Fix setup.py.

Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>

* Add build dependency.

Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>

---------

Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
2025-07-28 17:01:39 +08:00
dan_the_3rd
8987798502 Update setup.py (#134) 2025-07-28 16:58:23 +08:00
Ray Wang
9da4a23561 Add more GPU architectures support (#112)
* Add more GPU architectures support

* Update layout.py

* Optimize performance, Add SM90 support, Add 1D2D SM100 support

* Add fmtlib submodule at commit 553ec11

---------

Co-authored-by: fzyzcjy <5236035+fzyzcjy@users.noreply.github.com>
2025-07-18 11:32:22 +08:00
yz-tang
6c59e0f40d fix setup build error when setuptools version is lower 2025-03-04 16:53:00 +08:00
Chenggang Zhao
b05ed2f017 Code format 2025-02-27 10:50:20 +08:00
Dmytro Dzhulgakov
fc7c3f8299 setup.py: fix wheel building 2025-02-26 17:48:57 +00:00
Chenggang Zhao
a6d97a1c1b Initial commit 2025-02-25 22:52:41 +08:00