Logo
Explore Help
Register Sign In
biondizzle/DeepGEMM
1
0
Fork 0
You've already forked DeepGEMM
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
1f13b2435464b97845a6dfc1fd3abb2c49da053a
DeepGEMM/csrc/apis
History
biondizzle 80df24a641 fix: add kInt8 dtype support to TMA descriptor + change activation tensors to kInt8
- runtime_utils.hpp: added kInt8 -> CU_TENSOR_MAP_DATA_TYPE_UINT8 mapping
- mega_nvfp4.hpp: changed activation tensor dtypes from kUInt8 to kInt8
  (same byte layout, but kInt8 is recognized by the TMA dtype switch)
2026-05-11 22:54:47 +00:00
..
attention.hpp
Add various optimizations and Mega MoE benchmarks (#316)
2026-04-24 18:41:37 +08:00
einsum.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
gemm.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
hyperconnection.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
layout.hpp
fix: gran_k=16 in transform_sf + sm_100a arch for NVFP4 mega_moe
2026-05-11 16:11:11 +00:00
mega_nvfp4.hpp
fix: add kInt8 dtype support to TMA descriptor + change activation tensors to kInt8
2026-05-11 22:54:47 +00:00
mega.hpp
Add various optimizations and Mega MoE benchmarks (#316)
2026-04-24 18:41:37 +08:00
runtime.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
Powered by Gitea Version: 1.25.2 Page: 92ms Template: 9ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API