Pull requests: NVIDIA/cutlass

Pull requests list

Fix hopper grouped gemm example elect_sync api use
#3262 opened May 22, 2026 by LongshengDu Contributor
[CLI] Update FMHA & improve perf
#3251 opened May 20, 2026 by keithzzzzz Contributor
Filter SM120 mixed 8-bit tiles for FP6 ElementD
#3247 opened May 19, 2026 by zhils
fix an intermittent accuracy issue
#3233 opened May 15, 2026 by dishengbin
W4a8 speedup v2
#3226 opened May 11, 2026 by mak-corp
Avoid unordered_map for runtime datatype mapping
#3223 opened May 11, 2026 by LwhJesse
FMHA examples: use cute::min in device functions
#3222 opened May 11, 2026 by LwhJesse
[examples][CuTeDSL] add MoE dispatch+combine example with NVSHMEM
#3221 opened May 11, 2026 by shubaoyu2 Contributor
Add Hopper FP8 grouped blockwise GEMM (sparse-groups) CuTeDSL example
#3195 opened Apr 29, 2026 by Johnsonms Contributor Draft
7 tasks done