[cuda backend] store scale/zero in int4_plain_mm in [N, n_groups] layout #20038

Merged
Gasoonjia merged 1 commit into main from g4-opt-coalesced-scale
Jun 9, 2026

Commits

Commits on Jun 9, 2026