FastCompute implementation of GPA witness layer low-to-high HAL op by djadjka · Pull Request #813 · IrreducibleOSS/binius

djadjka · 2025-06-18T10:58:24Z

TL;DR

Implemented the pairwise_product_reduce function for the FastCpuLayer and added tests for it.

What changed?

Implemented the previously unimplemented pairwise_product_reduce function in the ComputeLayerExecutor trait for FastCpuLayer
Added input validation to ensure the input length is a power of 2 and greater than or equal to 2
Implemented the pairwise product reduction algorithm using parallel chunks processing
Removed the unused fill method from SmallOwnedChunk
Simplified the fill_constant implementation to use the new slice abstraction
Added necessary imports (Itertools and PackedMemorySlice)
Added tests for the new functionality with both single round and multi-round reductions

How to test?

Run the new tests:

cargo test -p fast_compute test_pairwise_product_reduce
cargo test -p fast_compute test_pairwise_product_reduce_single_round

Why make this change?

This change implements a previously unimplemented function that is needed for computing pairwise products and reducing them, which is a common operation in cryptographic protocols. The implementation is optimized for CPU execution using parallel processing where possible.

djadjka · 2025-06-18T10:58:41Z

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

How to use the Graphite Merge Queue

Add the label merge-ready to this PR to add it to the merge queue.

You must have a Graphite account in order to use the merge queue. Sign up using this link.

_{An organization admin has enabled the Graphite Merge Queue in this repository.} _{Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.}

This stack of pull requests is managed by Graphite. Learn more about stacking.

GraDKh · 2025-06-18T11:25:02Z

crates/fast_compute/src/layer.rs

+		let value = P::broadcast(value);
+
+		for element in slice.as_slice_mut() {
+			*element = value;
+		}


Nice!
Can be simplified even further:

slice.as_slice_mut().fill(P::broadcast(value));

GraDKh · 2025-06-18T11:48:23Z

crates/fast_compute/src/layer.rs

+						.for_each(|(chunk, output)| {
+							let scalar_iter = P::iter_slice(chunk)
+								.tuples()
+								.map(|(left, right)| left * right);
+							*output = P::from_scalars(scalar_iter);
+						});
+				}


Potentially it would be faster to re-use the packed multiplication by interleaving values:

let (lhs, rhs) = PackedField::interleave(chunk[0], chunk[1]); let mults = lhs*rhs; PackedField::from_scalars(mults.iter().step_by(2).copied(), mults.iter().skip(1).step_by(2).copied())

BenjaminTrapani · 2025-06-18T12:19:04Z

crates/fast_compute/src/layer.rs

+			Some(log_num_inputs) => log_num_inputs,
+		};
+		let expected_round_outputs_len = log_num_inputs;
+		if round_outputs.len() != expected_round_outputs_len as usize {


nit: maybe move the logic verifying the input and output dimensions to a separate helper function to share between the reference and fast implementations?

BenjaminTrapani mentioned this pull request Jun 18, 2025

SYS-348: add pairwise product reduction HAL op #803

Closed

djadjka requested review from BenjaminTrapani and GraDKh June 18, 2025 10:58

djadjka marked this pull request as ready for review June 18, 2025 10:59

GraDKh approved these changes Jun 18, 2025

View reviewed changes

graphite-app bot changed the base branch from btrapani/sys-348-add-gpa-partial-product-hal-op-ref-impl to graphite-base/813 June 18, 2025 12:10

graphite-app bot force-pushed the adziadziuk/cry-490-fastcompute-implementation-of-gpa-witness-layer-low-to-high branch from 344ff0b to 1de23a6 Compare June 18, 2025 12:10

graphite-app bot force-pushed the graphite-base/813 branch from e0e8aef to f1b3aa2 Compare June 18, 2025 12:10

graphite-app bot changed the base branch from graphite-base/813 to main June 18, 2025 12:11

FastCompute implementation of GPA witness layer low-to-high HAL op

07af836

graphite-app bot force-pushed the adziadziuk/cry-490-fastcompute-implementation-of-gpa-witness-layer-low-to-high branch from 1de23a6 to 07af836 Compare June 18, 2025 12:11

BenjaminTrapani approved these changes Jun 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FastCompute implementation of GPA witness layer low-to-high HAL op#813

FastCompute implementation of GPA witness layer low-to-high HAL op#813
djadjka wants to merge 1 commit intomainfrom
adziadziuk/cry-490-fastcompute-implementation-of-gpa-witness-layer-low-to-high

djadjka commented Jun 18, 2025 •

edited

Loading

Uh oh!

djadjka commented Jun 18, 2025

Uh oh!

GraDKh Jun 18, 2025

Uh oh!

GraDKh Jun 18, 2025

Uh oh!

BenjaminTrapani Jun 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

djadjka commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

TL;DR

What changed?

How to test?

Why make this change?

Uh oh!

djadjka commented Jun 18, 2025

How to use the Graphite Merge Queue

Uh oh!

GraDKh Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

GraDKh Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

BenjaminTrapani Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

djadjka commented Jun 18, 2025 •

edited

Loading