Pull requests: NVIDIA/cutlass
Fix hopper grouped gemm example elect_sync api use
#3262
opened May 22, 2026 by
LongshengDu
Contributor
fix: handle ComposedLayout slicing with dynamic strides (fixes #3255)
#3261
opened May 22, 2026 by
zhils
[DOC] Drop empty duplicated Numeric Conversion code block from fundamental_types.md
#3260
opened May 22, 2026 by
adityasingh2400
3 tasks done
fix(CuTeDSL): restore trailing Int<1> dimension in SM90 MMA atom TV L…
#3258
opened May 21, 2026 by
zhils
[Cutlass SM90] Per-group aux TMA descriptor update for grouped GEMM + Gated-SwiGLU example
#3256
opened May 21, 2026 by
Butterfingrz
test: add CPU-only unit coverage for sharding helpers
#3250
opened May 19, 2026 by
Pritiks23
fix(base_dsl): drop ArchMeta alias so Arch.sm_*.value is correct
#3248
opened May 19, 2026 by
lingolin128
Fix FastDivmod divisor SSA transport for kernel regions (#3243)
#3246
opened May 19, 2026 by
zhils
[cutlass-library] Alias cutlass_lib to the static target when shared is off (fixes #3179)
#3245
opened May 18, 2026 by
LeSingh1
[CuTe] Add missing include for smem_ptr_flag_bits in print_tensor.hpp
#3244
opened May 18, 2026 by
LeSingh1
[fast_math] Add bfloat16_t PTX specializations for fast_exp and fast_tanh
#3242
opened May 16, 2026 by
VittoriaLanzo
5 tasks done
Fix MSVC CUDA build: is_unsigned_v not available in cutlass::platform
#3229
opened May 12, 2026 by
TxsharDev
[example Cute C++] Add CuTe C++ tutorial for Blackwell MXFP8 block-scaled GEMM.
#3225
opened May 11, 2026 by
haowen-han
[examples][CuTeDSL] add MoE dispatch+combine example with NVSHMEM
#3221
opened May 11, 2026 by
shubaoyu2
Contributor
[CuTe][Fix]: Add missing template specialization for F8F6F4 MMA Op (#3207)
#3209
opened May 7, 2026 by
infinitron
[CuTeDSL] Make editable installs use exact runtime companion wheels
#3204
opened May 5, 2026 by
alecco
[CuTe][SM70] Add comment clarifying signed cast requirement for blockIdx coords
#3203
opened May 2, 2026 by
Flink-ddd
Contributor
[CuTe] [Fix] MSVC's inability to deduce a non-type parameter pack from a dependent template alias
#3198
opened Apr 30, 2026 by
SystemPanic
Contributor