From 904b7217318be16bc86d0ee4c4a54484b6284375 Mon Sep 17 00:00:00 2001 From: Chenggang Zhao Date: Thu, 25 Sep 2025 16:27:57 +0800 Subject: [PATCH] Update README --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 814099d..a80558d 100644 --- a/README.md +++ b/README.md @@ -33,7 +33,7 @@ Despite its lightweight design, DeepGEMM's performance matches or exceeds expert - [ ] Larger TMA multicast size for some shapes - [x] MMA template refactor with CUTLASS - [x] Remove shape limitations on N and K -- [ ] BF16 kernels +- [x] BF16 kernels - [ ] Split/stream-k optimizations - [ ] Ampere kernels - [ ] Polish docs