Logo
Explore Help
Register Sign In
biondizzle/nvfp4-megamoe-kernel
1
0
Fork 0
You've already forked nvfp4-megamoe-kernel
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
e0f60b9f052d2aa1fdd5ff2f7df922a00f26e76b
nvfp4-megamoe-kernel/dsv4/cache/block_table.py

3 lines
54 B
Python
Raw Normal View History

Restructure: cutedsl/ -> dsv4/ with proper layering - Split bridge.py -> ops/quantize.py, ops/layouts.py, ops/gemm_runner.py - Renamed classes: CuTeDSLNvfp4Linear -> Nvfp4Linear, etc. - Moved kernel code to dsv4/kernels/ (gemm, attention, compressor, decode, cuda) - Moved PyTorch bridges to dsv4/ops/ - Moved nn.Module layers to dsv4layers/ - Moved reference implementations to dsv4/reference/ - Moved vendored CUTLASS code to vendored/ - Archived ~190 debug tests to tests/archive/ - Kept ~15 canonical tests in tests/unit/ - Updated all import paths - Added stubs for future components (model/, cache/, loader/) - Updated pyproject.toml: dsv4-inference package name
2026-05-21 17:30:44 +00:00
"""Block table for paged KV cache."""
# TODO: Phase 3
Reference in New Issue Copy Permalink
Powered by Gitea Version: 1.25.2 Page: 113ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API