HDDS-15601. Improve ContainerHealthSchemaManager delete performance by umesh9794 · Pull Request #10532 · apache/ozone

umesh9794 · 2026-06-17T10:12:16Z

What changes were proposed in this pull request?

Further speed up TestUnhealthyContainersDerbyPerformance by using BETWEEN predicate instead of many IN clauses while deleting the contiguous container IDs

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-15601

How was this patch tested?

Ran the test locally and the elapsed time of this run is significantly reduced from ~313s to ~59s:

[INFO] Running org.apache.hadoop.ozone.recon.persistence.TestUnhealthyContainersDerbyPerformance

[INFO] Tests run: 10, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 58.63 s -- in org.apache.hadoop.ozone.recon.persistence.TestUnhealthyContainersDerbyPerformance

adoroszlai · 2026-06-17T11:29:11Z

Thanks @umesh9794 for the patch. Please enable the build-branch workflow in your fork.

adoroszlai · 2026-06-17T13:02:24Z

Further speed up TestUnhealthyContainersDerbyPerformance by using BETWEEN predicate instead of many IN clauses while deleting the contiguous container IDs

While the speed improvement in TestUnhealthyContainersDerbyPerformance is nice (173 -> 51 seconds locally), the test no longer seems to simulate real workload, where a continuous range of 200K containers is unlikely.

adoroszlai · 2026-06-17T13:05:34Z

@ArafatKhan2198 @devmadhuu ContainerHealthSchemaManager.batchDeleteSCMStatesForContainers is only called from tests, can we delete it?

umesh9794 · 2026-06-18T04:06:25Z

@adoroszlai I have enabled the workflow in my fork. Thanks.

Yes as I was not aware of the codebase very much, this method ContainerHealthSchemaManager.batchDeleteSCMStatesForContainers only gets called from the couple of test classes. Upon conformation of @ArafatKhan2198 and @devmadhuu we can clean-up this probably.

devmadhuu · 2026-06-18T08:08:23Z

@ArafatKhan2198 @devmadhuu ContainerHealthSchemaManager.batchDeleteSCMStatesForContainers is only called from tests, can we delete it?

Yes, we can remove it as long as our test uses some other real code logic hitting the actual product code and able to test the batch delete performance.

devmadhuu

Thanks @umesh9794 for the patch. Given few comments. Pls check.

devmadhuu · 2026-06-18T08:02:19Z

+    List<Long> inClauseBatch = new ArrayList<>(MAX_IN_CLAUSE_CHUNK_SIZE);
+
+    for (int i = 0; i < sortedIds.size(); ) {
+      int rangeStart = i;


This below code assumption seems incorrect that in real cluster that the unhealthy container ids all will be in continous sequence.

Real container IDs may not form one continuous sequence.

Consider this input:

1, 2, 4, 5, 7, 8, 10, 11

The PR sees four small continuous ranges and executes:

BETWEEN 1 AND 2
BETWEEN 4 AND 5
BETWEEN 7 AND 8
BETWEEN 10 AND 11

That means four separate DELETE statements.

The old implementation could delete all eight IDs using one statement:

WHERE container_id IN (1, 2, 4, 5, 7, 8, 10, 11)

With a larger realistic list containing many small pairs, the difference could become:

Old code: 50 DELETE statements
New code: 10,000 DELETE statements

Each statement must be compiled and executed by Derby. Consequently, production could become significantly slower even though this test becomes faster.

1, 2, 3, 4, ... 200,000

That is the best possible input for BETWEEN.

It does not test inputs such as:

1, 2, 10, 11, 20, 21, ...

I think we can reduce delete statement count by combining BETWEEN and IN clauses in the same query using OR logic. Delay submitting the statement if number of items for IN reaches the limit or number of BETWEEN ranges reaches another (lower) threshold.

@devmadhuu yes current code executes more delete statements and it might slowdown in the real production envs. Let me try to optimize this for production scenarios. Thanks!

@adoroszlai sure, trying to check the logic to improve the performance. Thanks!

adoroszlai · 2026-06-18T09:36:46Z

ContainerHealthSchemaManager.batchDeleteSCMStatesForContainers is only called from tests, can we delete it?

Yes, we can remove it as long as our test uses some other real code logic hitting the actual product code and able to test the batch delete performance.

Thanks @devmadhuu for checking. Yes, product code uses another method:

ozone/hadoop-ozone/recon/src/main/java/org/apache/hadoop/ozone/recon/fsck/ReconReplicationManager.java

Lines 498 to 504 in fa1b3a3

    
           private void persistUnhealthyRecords( 
        
               List<Long> containerIdsToDelete, 
        
               List<ContainerHealthSchemaManager.UnhealthyContainerRecord> recordsToInsert) { 
        
             LOG.info("Replacing unhealthy container records atomically: deleteRowsFor={} containers, insert={}", 
        
                 containerIdsToDelete.size(), recordsToInsert.size()); 
        
             healthSchemaManager.replaceUnhealthyContainerRecordsAtomically( 
        
                 containerIdsToDelete, recordsToInsert);

which is also covered by the performance test:

ozone/hadoop-ozone/recon/src/test/java/org/apache/hadoop/ozone/recon/persistence/TestUnhealthyContainersDerbyPerformance.java

Line 635 in fa1b3a3

    
           schemaManager.replaceUnhealthyContainerRecordsAtomically(idsToReplace, replacementRecords);

So I suggest splitting the task:

"Further speed up TestUnhealthyContainersDerbyPerformance": remove test case for batchDeleteSCMStatesForContainers
"Improve deleteScmStatesForContainers performance": this PR, needs further work

devmadhuu · 2026-06-18T10:18:30Z

ContainerHealthSchemaManager.batchDeleteSCMStatesForContainers is only called from tests, can we delete it?

Yes, we can remove it as long as our test uses some other real code logic hitting the actual product code and able to test the batch delete performance.

Thanks @devmadhuu for checking. Yes, product code uses another method:

ozone/hadoop-ozone/recon/src/main/java/org/apache/hadoop/ozone/recon/fsck/ReconReplicationManager.java

Lines 498 to 504 in fa1b3a3

private void persistUnhealthyRecords(

List<Long> containerIdsToDelete,

List<ContainerHealthSchemaManager.UnhealthyContainerRecord> recordsToInsert) {

LOG.info("Replacing unhealthy container records atomically: deleteRowsFor={} containers, insert={}",

containerIdsToDelete.size(), recordsToInsert.size());

healthSchemaManager.replaceUnhealthyContainerRecordsAtomically(

containerIdsToDelete, recordsToInsert);

which is also covered by the performance test:

ozone/hadoop-ozone/recon/src/test/java/org/apache/hadoop/ozone/recon/persistence/TestUnhealthyContainersDerbyPerformance.java

Line 635 in fa1b3a3

schemaManager.replaceUnhealthyContainerRecordsAtomically(idsToReplace, replacementRecords);

So I suggest splitting the task:
1. "Further speed up TestUnhealthyContainersDerbyPerformance": remove test case for `batchDeleteSCMStatesForContainers`

2. "Improve deleteScmStatesForContainers performance": this PR, needs further work

Yes. I agree.

HDDS-15582 : Further speed up TestUnhealthyContainersDerbyPerformance

4d9430f

adoroszlai added test recon and removed test labels Jun 17, 2026

adoroszlai requested review from ArafatKhan2198 and devmadhuu June 17, 2026 12:51

devmadhuu reviewed Jun 18, 2026

View reviewed changes

adoroszlai changed the title ~~HDDS-15582. Further speed up TestUnhealthyContainersDerbyPerformance~~ HDDS-15601. Improve ContainerHealthSchemaManager delete performance Jun 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HDDS-15601. Improve ContainerHealthSchemaManager delete performance#10532

HDDS-15601. Improve ContainerHealthSchemaManager delete performance#10532
umesh9794 wants to merge 1 commit into
apache:masterfrom
umesh9794:HDDS-15582

umesh9794 commented Jun 17, 2026 •

edited by adoroszlai

Loading

Uh oh!

adoroszlai commented Jun 17, 2026

Uh oh!

adoroszlai commented Jun 17, 2026

Uh oh!

adoroszlai commented Jun 17, 2026

Uh oh!

umesh9794 commented Jun 18, 2026

Uh oh!

devmadhuu commented Jun 18, 2026

Uh oh!

devmadhuu left a comment

Uh oh!

devmadhuu Jun 18, 2026

Uh oh!

adoroszlai Jun 18, 2026

Uh oh!

umesh9794 Jun 18, 2026

Uh oh!

umesh9794 Jun 18, 2026

Uh oh!

adoroszlai commented Jun 18, 2026

Uh oh!

devmadhuu commented Jun 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

umesh9794 commented Jun 17, 2026 • edited by adoroszlai Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

What is the link to the Apache JIRA

How was this patch tested?

Uh oh!

adoroszlai commented Jun 17, 2026

Uh oh!

adoroszlai commented Jun 17, 2026

Uh oh!

adoroszlai commented Jun 17, 2026

Uh oh!

umesh9794 commented Jun 18, 2026

Uh oh!

devmadhuu commented Jun 18, 2026

Uh oh!

devmadhuu left a comment

Choose a reason for hiding this comment

Uh oh!

devmadhuu Jun 18, 2026

Choose a reason for hiding this comment

Uh oh!

adoroszlai Jun 18, 2026

Choose a reason for hiding this comment

Uh oh!

umesh9794 Jun 18, 2026

Choose a reason for hiding this comment

Uh oh!

umesh9794 Jun 18, 2026

Choose a reason for hiding this comment

Uh oh!

adoroszlai commented Jun 18, 2026

Uh oh!

devmadhuu commented Jun 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

umesh9794 commented Jun 17, 2026 •

edited by adoroszlai

Loading