Skip to content

Cache JIT-compiled CUDA kernels#3587

Merged
zcbenz merged 1 commit into
mainfrom
test/compile-cache
Jun 16, 2026
Merged

Cache JIT-compiled CUDA kernels#3587
zcbenz merged 1 commit into
mainfrom
test/compile-cache

Conversation

@zcbenz

@zcbenz zcbenz commented May 24, 2026

Copy link
Copy Markdown
Collaborator

This reduces test run time to 7m, it is a must when we start JIT-compiling CUTLASS/CuTe kernels which would spend over an hour on running tests in CI.

The cache key is the hash of all files under mlx/backend/cuda, which is not 100% bullet-proof but should be robust enough to dodge most cache invalidation problems.

@angeloskath angeloskath left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's awesome!

@zcbenz zcbenz merged commit 8edbda0 into main Jun 16, 2026
48 checks passed
@zcbenz zcbenz deleted the test/compile-cache branch June 16, 2026 23:42
tpegolotti pushed a commit that referenced this pull request Jun 22, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants