Changed reduce implementation and added tests #6877

BhoomishGupta · 2026-02-04T13:00:47Z

Proposed Changes

Created a helper function that reduces the partition value without needing an initial value.
Added a test case that was mentioned in the issue #6647.
Corrected a typo in documentation.

I only made changes in:

hpx\libs\core\algorithms\include\hpx\parallel\algorithms\reduce.hpp
hpx\libs\core\algorithms\tests\regressions\CMakeLists.txt

The remaining changes are formatting changes caused by clang-format and editor.config.

And I created this file:

hpx\libs\core\algorithms\tests\regressions\reduce_6647.cpp

Any background context you want to provide?

Previously, the implementation assumed that the *first was convertible to T. This assumption does not hold for the actual function logic required here. This PR removes that dependency, ensuring the type handling is correct.
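
To illustrate the type issue described above, here is a minimal sequential sketch. It is not the code from this PR: `Total`, `AddOp`, and `fold_without_init` are hypothetical names, and the sketch assumes an element type (`int`) that is not convertible to the result type. Under that assumption an accumulator cannot be seeded with `T acc = *first;`, so it is seeded with `op(*first, *second)` instead.

```cpp
#include <cassert>
#include <cstddef>
#include <iterator>
#include <vector>

// Hypothetical result type: deliberately NOT constructible from int.
struct Total
{
    long value;
};

// Hypothetical reduction operation covering the operand combinations
// a reduction may need (element/element, result/element, ...).
struct AddOp
{
    Total operator()(int a, int b) const { return Total{long(a) + b}; }
    Total operator()(Total a, int b) const { return Total{a.value + b}; }
    Total operator()(int a, Total b) const { return Total{a + b.value}; }
    Total operator()(Total a, Total b) const
    {
        return Total{a.value + b.value};
    }
};

// Folds [first, last) without ever converting *first to T.
// Assumes the range holds at least two elements.
template <typename T, typename Iter, typename Op>
T fold_without_init(Iter first, Iter last, Op op)
{
    assert(std::distance(first, last) >= 2);
    Iter second = std::next(first);
    T acc = op(*first, *second);    // seed from two elements, no T(*first)
    for (Iter it = std::next(second); it != last; ++it)
        acc = op(acc, *it);
    return acc;
}
```

Calling `fold_without_init<Total>(v.begin(), v.end(), AddOp{})` on a `std::vector<int>` compiles even though `Total t = v[0];` would not.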

Checklist

Not all points below apply to all pull requests.

I have added a new feature and have added tests to go along with it.
I have fixed a bug and have added a regression test.
I have added a test using random numbers; I have made sure it uses a seed, and that random numbers generated are valid inputs for the tests.

StellarBot · 2026-02-04T13:05:03Z

Can one of the admins verify this patch?

hkaiser

Could you please separate the formatting changes into a different PR? It's close to impossible to review your actual changes here.

Also, for the formatting - are you sure you're using clang-format V20?

BhoomishGupta · 2026-02-04T14:02:30Z

Yeah, I was not using V20.
So, should I close this PR and open a new one, that is only with the changes?

hkaiser · 2026-02-04T14:10:14Z

> Yeah, I was not using V20. So, should I close this PR and open a new one, that is only with the changes?

Simply keep (force) pushing to your branch; this will update the PR.

Signed-off-by: Bhoomish <[email protected]>

hkaiser · 2026-02-04T21:33:40Z

libs/core/algorithms/include/hpx/parallel/algorithms/reduce.hpp

+        {
+            if (part_size == 1)
+            {
+                return HPX_INVOKE(r, *part_begin, *part_begin);


This is not doing the correct thing as it would apply the reduction to the element of a single-element partition 'twice'. We have to avoid creating single-element partitions to begin with.
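
The effect can be reproduced with a small sequential sketch. `reduce_partition_buggy` and `chunked_sum_buggy` are hypothetical names that only mirror the shape of the branch quoted above; they are not the PR's actual code, and the chunking loop is a simplifying assumption rather than HPX's partitioner.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical partition helper mirroring the reviewed branch:
// a single-element partition is reduced as r(x, x).
// Assumes random-access iterators and part_size >= 1.
template <typename Iter, typename Op>
auto reduce_partition_buggy(Iter part_begin, std::size_t part_size, Op r)
{
    if (part_size == 1)
        return r(*part_begin, *part_begin);    // element is counted twice

    Iter it = part_begin;
    auto acc = r(*it, *(it + 1));    // seed from the first two elements
    it += 2;
    for (std::size_t i = 2; i < part_size; ++i, ++it)
        acc = r(acc, *it);
    return acc;
}

// Splits v into chunks of chunk_size (the last chunk may be shorter),
// reduces each chunk with the helper above and adds up the partial
// results. Assumes chunk_size > 0.
inline int chunked_sum_buggy(
    std::vector<int> const& v, std::size_t chunk_size)
{
    int total = 0;
    for (std::size_t pos = 0; pos < v.size(); pos += chunk_size)
    {
        std::size_t const size = (std::min)(chunk_size, v.size() - pos);
        auto const first = v.begin() + static_cast<std::ptrdiff_t>(pos);
        total += reduce_partition_buggy(first, size, std::plus<>{});
    }
    return total;
}
```

For the input `{1, 2, 3}` and a chunk size of 2, the chunks are `{1, 2}` and `{3}`; the second chunk contributes `3 + 3`, so the sketch yields 9 instead of 6. With a chunk size of 3 there is no single-element chunk and the result is 6.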

Currently, (almost) all algorithms use this functionality to determine the chunk size to use:

hpx/libs/core/algorithms/include/hpx/parallel/util/detail/chunk_size.hpp

Lines 182 to 261 in f36e02d

    HPX_CXX_EXPORT template <typename ExPolicy, typename Future, typename F1,
        typename IterOrR, typename Stride = std::size_t>
    hpx::util::iterator_range<chunk_size_iterator<IterOrR>>
    get_bulk_iteration_shape(ExPolicy& policy, std::vector<Future>& workitems,
        F1&& f1, IterOrR& it_or_r, std::size_t& count, std::size_t& cores,
        Stride s = Stride(1))
    {
        if (count == 0)
        {
            cores = 1;
            auto it = chunk_size_iterator(it_or_r, 1);
            return hpx::util::iterator_range(it, it);
        }

        Stride stride = parallel::detail::abs(s);

        auto test_function = [&](std::size_t test_chunk_size) -> std::size_t {
            if (test_chunk_size == 0)
                return 0;

            if (stride != 1)
            {
                // rounding up
                test_chunk_size = (std::max) (static_cast<std::size_t>(stride),
                    (test_chunk_size + stride - 1) / stride * stride);
            }

            add_ready_future(workitems, f1, it_or_r, test_chunk_size);

            test_chunk_size = (std::min) (count, test_chunk_size);

            count -= test_chunk_size;
            it_or_r = next_or_subrange(it_or_r, test_chunk_size, count);

            return test_chunk_size;
        };

        // note: running the test function will modify 'count'
        auto iteration_duration =
            hpx::execution::experimental::measure_iteration(
                policy.parameters(), policy.executor(), test_function, count);

        cores = hpx::execution::experimental::processing_units_count(
            policy.parameters(), policy.executor(), iteration_duration, count);

        std::size_t max_chunks =
            hpx::execution::experimental::maximal_number_of_chunks(
                policy.parameters(), policy.executor(), cores, count);

        std::size_t chunk_size =
            hpx::execution::experimental::get_chunk_size(policy.parameters(),
                policy.executor(), iteration_duration, cores, count);

        // make sure, chunk size and max_chunks are consistent
        adjust_chunk_size_and_max_chunks(cores, count, max_chunks, chunk_size);

        auto last = next_or_subrange(it_or_r, count, 0);

        if (stride != 1)
        {
            chunk_size = (std::max) (static_cast<std::size_t>(stride),
                (chunk_size + stride - 1) / stride * stride);
        }

        // Report the calculated parameters to the corresponding parameters
        // object.
        hpx::execution::experimental::collect_execution_parameters(
            policy.parameters(), policy.executor(), count, cores, max_chunks,
            chunk_size);

        // update executor with new values
        policy = hpx::experimental::prefer(
            hpx::execution::experimental::with_processing_units_count, policy,
            cores);

        auto shape_begin = chunk_size_iterator(it_or_r, chunk_size, count);
        auto shape_end = chunk_size_iterator(last, chunk_size, count, count);

        return hpx::util::iterator_range(shape_begin, shape_end);
    }

In particular, the invocation of the customization point here is responsible for computing the chunk size:

hpx/libs/core/algorithms/include/hpx/parallel/util/detail/chunk_size.hpp

Lines 231 to 233 in f36e02d

    std::size_t chunk_size =
        hpx::execution::experimental::get_chunk_size(policy.parameters(),
            policy.executor(), iteration_duration, cores, count);

I'm inclined to think that we should introduce a new customization point replacing the final chunk size adjustments here:

hpx/libs/core/algorithms/include/hpx/parallel/util/detail/chunk_size.hpp

Line 236 in f36e02d

    adjust_chunk_size_and_max_chunks(cores, count, max_chunks, chunk_size);

That would allow installing a custom implementation of said customization point for the reduce algorithm without interfering with any other, possibly user-supplied customizations.
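
As a rough illustration of what a reduce-specific adjustment might compute, here is a standalone sketch. `avoid_single_element_chunks` is a hypothetical function, not an existing HPX API or customization point, and it assumes the range is split into chunks of `chunk_size` elements with a shorter trailing chunk of `count % chunk_size` elements when that remainder is non-zero.

```cpp
#include <cstddef>

// Hypothetical reduce-specific adjustment (not an existing HPX API).
// Returns a chunk size for which no chunk consists of exactly one
// element, under the chunking assumption stated above.
inline std::size_t avoid_single_element_chunks(
    std::size_t count, std::size_t chunk_size)
{
    if (count < 2)
        return count;    // nothing to split; the caller handles this case

    // A chunk size below two would make every chunk a single element.
    if (chunk_size < 2)
        chunk_size = 2;
    if (chunk_size > count)
        chunk_size = count;

    // Grow the chunk size until the trailing chunk is not a single
    // element. This terminates at the latest when chunk_size == count,
    // where the remainder is zero.
    while (count % chunk_size == 1)
        ++chunk_size;

    return chunk_size;
}
```

For example, 7 elements with a requested chunk size of 2 would be adjusted to a chunk size of 4, giving chunks of 4 and 3 elements. Growing the chunk size reduces the number of chunks, which is one possible trade-off; a real implementation would also have to stay consistent with `max_chunks` and `cores`.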

BhoomishGupta requested a review from hkaiser as a code owner February 4, 2026 13:00

hkaiser reviewed Feb 4, 2026


hkaiser added the category: algorithms label Feb 4, 2026

BhoomishGupta force-pushed the fix/reduce-6647 branch from 94b487d to 22a1f44 on February 4, 2026 14:47

BhoomishGupta added 2 commits February 4, 2026 14:58

Fix bug STEllAR-GROUP#6647: Correct type handling in reduce

636ec4e

Signed-off-by: Bhoomish <[email protected]>

Fixed Iterator Category Mismatch

25bccb7

Signed-off-by: Bhoomish <[email protected]>

BhoomishGupta force-pushed the fix/reduce-6647 branch from 22a1f44 to 25bccb7 on February 4, 2026 20:34

Added test file in CMakeLists

4c3a5d6

Signed-off-by: Bhoomish <[email protected]>

hkaiser reviewed Feb 4, 2026


