biondizzle/DeepGEMM
54a7de03a08726d90e0ec1afe03c3d6d39daa764
DeepGEMM/deep_gemm
biondizzle 54a7de03a0 fix: add UTCCP SMEM warp transpose for NVFP4 packed UE4M3 scales
2026-05-12 16:48:06 +00:00
include/deep_gemm | fix: add UTCCP SMEM warp transpose for NVFP4 packed UE4M3 scales | 2026-05-12 16:48:06 +00:00
legacy | [Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304) | 2026-04-17 09:45:14 +08:00
mega | fix: no GPU tensor ops in crash handler (CUDA is broken after 715) | 2026-05-12 16:20:11 +00:00
testing | [Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304) | 2026-04-17 09:45:14 +08:00
utils | [Public release 26/04] Introducing Mega MoE, FP4 Indexer and other features/fixes (#304) | 2026-04-17 09:45:14 +08:00
__init__.py | Add various optimizations and Mega MoE benchmarks (#316) | 2026-04-24 18:41:37 +08:00