Skip to content

[AURON #2307] Add @transient to large fields and use NativePartition to reduce serialization overhead#2309

Open
Deegue wants to merge 1 commit into
apache:masterfrom
Deegue:auron#2307_perf-add-transient-reduce-serialization
Open

[AURON #2307] Add @transient to large fields and use NativePartition to reduce serialization overhead#2309
Deegue wants to merge 1 commit into
apache:masterfrom
Deegue:auron#2307_perf-add-transient-reduce-serialization

Conversation

@Deegue
Copy link
Copy Markdown
Contributor

@Deegue Deegue commented Jun 4, 2026

Which issue does this PR close?

Closes #2307

Rationale for this change

Mark large non-serializable fields with @transient annotation across NativeExec classes to reduce task serialization size and memory usage.

Introduce NativePartition[P] wrapper to carry original partition payloads through NativeRDD without re-indexing into the partitions array at execution time, which avoids serializing the full partitions array.

What changes are included in this PR?

  • Add @transient to rddPartitions in NativeRDD
  • Add NativePartition case class for type-safe partition payload access
  • Wrap input partitions in NativePartition before NativeRDD creation
  • Extract original partition via .asInstanceOf[NativePartition[P]].payload
  • Add @transient to case class parameters in shim Exec classes

Are there any user-facing changes?

No.

How was this patch tested?

UTs.

…uce serialization overhead

Introduce NativePartition[P] wrapper to carry original partition payloads
through NativeRDD without re-indexing into the partitions array at
execution time, which avoids serializing the full partitions array.

Key changes:
- Add @transient to rddPartitions in NativeRDD
- Add NativePartition case class for type-safe partition payload access
- Wrap input partitions in NativePartition before NativeRDD creation
- Extract original partition via .asInstanceOf[NativePartition[P]].payload
@Deegue Deegue force-pushed the auron#2307_perf-add-transient-reduce-serialization branch from 141cd86 to 38d309a Compare June 4, 2026 16:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add @transient to large fields and use NativePartition to reduce serialization overhead

1 participant