Skip to content

[ml] Fix out-of-bounds read in filtered cluster split#22440

Merged
guitargeek merged 1 commit into
root-project:masterfrom
guitargeek:ci_march_native_failues
May 31, 2026
Merged

[ml] Fix out-of-bounds read in filtered cluster split#22440
guitargeek merged 1 commit into
root-project:masterfrom
guitargeek:ci_march_native_failues

Conversation

@guitargeek
Copy link
Copy Markdown
Contributor

In RClusterLoader::LoadTrainingClusterInto the boundary entry was read from rdfEntries unconditionally before checking valCount, so when one side of the split is empty the access was past end-of-vector. This was silent in non-hardened builds but trips libstdc++ assertions (e.g. with libcxxhardeningfast), aborting test09_filtered_last_chunk.

Compute the boundary only when the corresponding rdfEntries index is in-bounds, falling back to the cluster endpoint otherwise.

In RClusterLoader::LoadTrainingClusterInto the boundary entry was read
from rdfEntries unconditionally before checking valCount, so when one
side of the split is empty the access was past end-of-vector. This was
silent in non-hardened builds but trips libstdc++ assertions (e.g. with
libcxxhardeningfast), aborting test09_filtered_last_chunk.

Compute the boundary only when the corresponding rdfEntries index is
in-bounds, falling back to the cluster endpoint otherwise.
@guitargeek guitargeek self-assigned this May 31, 2026
@guitargeek guitargeek added the in:ML Everything under ROOT/ML label May 31, 2026
@github-actions
Copy link
Copy Markdown

Test Results

    22 files      22 suites   3d 10h 14m 28s ⏱️
 3 860 tests  3 860 ✅ 0 💤 0 ❌
76 176 runs  76 176 ✅ 0 💤 0 ❌

Results for commit df5c093.

@guitargeek guitargeek added the pr:emergency🚨 This PR is fixing breakage, it's urgent and may overtake others label May 31, 2026
@guitargeek guitargeek linked an issue May 31, 2026 that may be closed by this pull request
1 task
@guitargeek guitargeek merged commit 6183bad into root-project:master May 31, 2026
41 checks passed
@guitargeek guitargeek deleted the ci_march_native_failues branch May 31, 2026 22:46
@guitargeek
Copy link
Copy Markdown
Contributor Author

\backport to 6.40

@guitargeek
Copy link
Copy Markdown
Contributor Author

/backport to 6.40

@root-project-bot
Copy link
Copy Markdown

Preparing to backport PR #22440 to branch 6.40 requested by guitargeek

@root-project-bot
Copy link
Copy Markdown

Something went wrong when assigning the PR or setting labels @guitargeek please see the logs

@root-project-bot
Copy link
Copy Markdown

This PR has been backported to branch 6.40: #22441

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

in:ML Everything under ROOT/ML pr:emergency🚨 This PR is fixing breakage, it's urgent and may overtake others

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Failing ML Dataloader test on alma10 builds with auto-registration off

3 participants