Logo
Explore Help
Register Sign In
biondizzle/DeepGEMM
1
0
Fork 0
You've already forked DeepGEMM
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
d6551617c0c68a5f71ba15e79fe91d14f6a8cdfe
DeepGEMM/csrc/apis
History
biondizzle 80df24a641 fix: add kInt8 dtype support to TMA descriptor + change activation tensors to kInt8
- runtime_utils.hpp: added kInt8 -> CU_TENSOR_MAP_DATA_TYPE_UINT8 mapping
- mega_nvfp4.hpp: changed activation tensor dtypes from kUInt8 to kInt8
  (same byte layout, but kInt8 is recognized by the TMA dtype switch)
2026-05-11 22:54:47 +00:00
..
attention.hpp
Add various optimizations and Mega MoE benchmarks (#316)
2026-04-24 18:41:37 +08:00
einsum.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
gemm.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
hyperconnection.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
layout.hpp
fix: gran_k=16 in transform_sf + sm_100a arch for NVFP4 mega_moe
2026-05-11 16:11:11 +00:00
mega_nvfp4.hpp
fix: add kInt8 dtype support to TMA descriptor + change activation tensors to kInt8
2026-05-11 22:54:47 +00:00
mega.hpp
Add various optimizations and Mega MoE benchmarks (#316)
2026-04-24 18:41:37 +08:00
runtime.hpp
[Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304)
2026-04-17 09:45:14 +08:00
Powered by Gitea Version: 1.25.2 Page: 55ms Template: 10ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API