[RFC] task_group_dynamic_dependencies #1469

vossmjp · 2024-08-05T13:15:37Z

Description

A proposal to extend task_group:

Extend semantics and useful lifetime of task_handle. We propose task_handle to represent tasks for the purpose of adding dependencies. The useful lifetime and semantics of task_handle will need to be extended to include tasks that have been submitted, are currently executing, or have been completed.
Add functions to set task dependencies. In the current task_group, tasks can only be waited for as a group and there is no direct way to add any before-after relationships between individual tasks. We will discuss options for spelling.
Add a function to move successors from a currently executing task to a new task. This functionality is necessary for recursively generated task graphs. This case represents a situation where it is safe to modify dependencies for an already submitted task.
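To make the intended before-after semantics concrete, here is a minimal single-threaded sketch. It is a toy model, not the oneTBB API: `ToyGraph`, its members, and `run_diamond_example` are hypothetical names, and the rule that a submitted task only executes once its predecessor count reaches zero is an assumption drawn from the description above.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Toy, single-threaded model of tasks with predecessors (NOT the oneTBB API).
struct ToyGraph {
    struct Node {
        std::function<void()> body;
        std::vector<std::size_t> successors; // tasks waiting for this one
        int pending = 0;                     // predecessors not yet completed
        bool submitted = false;
        bool done = false;
    };
    std::vector<Node> nodes;

    // Analogue of task_group::defer: create a task without submitting it.
    std::size_t defer(std::function<void()> f) {
        Node n;
        n.body = std::move(f);
        nodes.push_back(std::move(n));
        return nodes.size() - 1;
    }

    // Analogue of succ.add_predecessor(pred): pred must finish before succ starts.
    void add_predecessor(std::size_t succ, std::size_t pred) {
        assert(!nodes[succ].submitted && "toy rule: only created tasks gain predecessors");
        if (nodes[pred].done) return; // a completed predecessor imposes no wait
        nodes[pred].successors.push_back(succ);
        ++nodes[succ].pending;
    }

    // Analogue of task_group::run: submit; execute once no predecessors remain.
    void run(std::size_t id) {
        nodes[id].submitted = true;
        try_execute(id);
    }

private:
    void try_execute(std::size_t id) {
        if (!nodes[id].submitted || nodes[id].done || nodes[id].pending > 0) return;
        std::function<void()> body = std::move(nodes[id].body);
        body();
        nodes[id].done = true;
        // Copy the list so that releasing successors cannot invalidate it.
        const std::vector<std::size_t> released = nodes[id].successors;
        for (std::size_t s : released) {
            --nodes[s].pending;
            try_execute(s);
        }
    }
};

// Task C is submitted first but only runs after both A and B have completed.
std::string run_diamond_example() {
    std::string log;
    ToyGraph g;
    const std::size_t a = g.defer([&log] { log += 'A'; });
    const std::size_t b = g.defer([&log] { log += 'B'; });
    const std::size_t c = g.defer([&log] { log += 'C'; });
    g.add_predecessor(c, a);
    g.add_predecessor(c, b);
    g.run(c); // submitted, but not yet eligible for execution
    g.run(b);
    g.run(a);
    return log;
}
```

In this sketch the submission order is C, B, A, yet C executes last because it is held back by its two predecessors.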

Type of change

bug fix - change that fixes an issue
new feature - change that adds functionality
tests - change in tests
infrastructure - change in infrastructure and CI
documentation - documentation update

Tests

added - required for new features and some bug fixes
not needed

Documentation

updated in # - add PR number
needs to be updated
not needed

Breaks backward compatibility

Yes
No
Unknown

Notify the following users

List users with @ to send notifications

Other information

rfcs/proposed/task_group_dynamic_dependencies/README.md

pavelkumbrasev · 2024-08-26T12:59:24Z

I think this proposal is lacking the final definition of class task_handle including all the new methods.

rfcs/proposed/task_group_dynamic_dependencies/README.md

aleksei-fedotov · 2024-10-14T13:15:40Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

+            void add_successor(task_handle& th);
+        };
+
+        void transfer_successors_to(task_handle& th);


Since this method is related to the currently executing task, what about including this API into tbb::this_task:: namespace? By analogy with tbb::this_task_arena:: namespace.

~~I would use tbb::task namespace, since it is already used for suspend (which applies to the currently running task) and resume functions.~~
Actually no, I think it should not be in the namespace task or this_task or just tbb, but rather it should be a static function or a member function in task_group. The reason is that, since task_group::defer is the only way to create a new non-empty task_handle, the method to transfer successors cannot be used in arbitrary tasks, only in task_group tasks.

Will add to task_group

rfcs/proposed/task_group_dynamic_dependencies/README.md

aleksei-fedotov · 2024-10-15T13:11:44Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

+- Are the suggested APIs sufficient?
+- Are there additional use cases that should be considered that we missed in our analysis?
+- Are there other parts of the pre-oneTBB tasking API that developers have struggled to find a good alternative for?


These are interesting questions.

While reading this RFC, I rushed to propose some additional syntactic sugar that, besides being more user-friendly (it seems to cover popular use cases), can also save some CPU cycles. I am posting these ideas here for discussion.

Would it be useful to have a method that simultaneously joins (or even fuses?) the instantiation of a task and transferring of the current task successors?
Something like:

template <typename Func> task_handle transfer_successors_to(Func&& f);

For the recursive decomposition scenario, instantiating a new task within an executing task and transferring successors to that new task seems to be the main model for writing such algorithms. Although it does not seem to save much (only one call into the library and a couple of pointer assignments?), this sequence is always(?) going to appear in the code. Otherwise, how else could the execution of already submitted tasks be postponed?

Shall we also consider having that API instead of, or in addition to, the one proposed?

As for add_predecessor(s) and add_successor(s) I have a couple of thoughts.
a. Again, it might be useful to merge the instantiation of new task handles with adding them as successors/predecessors:

    template <typename Func> void add_predecessor(Func&& f);
    template <typename Func> void add_successor(Func&& f);

b. I also think an API that adds more than one predecessor/successor at once can be useful, since usually several successors/predecessors are instantiated together. We need not limit ourselves to two parameters, as was optionally proposed, but could accept an arbitrary number of task handles or even user lambdas. Of course, a single task producer might be viewed as a limiting pattern, but there may be cases where tasks cannot be made available to the scheduler (i.e. spawned) until all of them are gathered from different producers, which essentially represents a barrier in the execution pipeline. In addition, spawning a batch of tasks together is faster than spawning them one by one. As far as I remember, batch spawning was implemented in the old TBB.
So here I suggest having something like:

    template <typename... Func> void add_predecessors(Func&& ...f);
    template <typename... Func> void add_successor(Func&& ...f);

However, this also seems to call for task_group::run() accepting an arbitrary number of task handles and/or functors. So perhaps it is more related to another RFC/extension...

I think we should start with a minimal API and then extend it with syntactic sugar based on use cases. I suspect this will start as an experimental feature to allow some feedback on the API.

Even so, I'm also open to including these additional APIs in the initial implementation, if others think they're likely needed.
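As a rough illustration of the batch variant discussed above, a variadic `add_predecessors` can be layered on top of a single-argument `add_predecessor` with a C++17 fold expression. `ToyHandle` is a hypothetical stand-in and does not reflect the real `tbb::task_handle`; the sketch only shows that the sugar needs no extra machinery beyond the minimal API.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a task handle; it only records its predecessors.
struct ToyHandle {
    std::vector<const ToyHandle*> predecessors;

    // Minimal API: add one predecessor.
    void add_predecessor(const ToyHandle& p) {
        assert(&p != this && "a task cannot precede itself");
        predecessors.push_back(&p);
    }

    // Syntactic sugar: add any number of predecessors in one call.
    template <typename... Handles>
    void add_predecessors(const Handles&... hs) {
        (add_predecessor(hs), ...); // C++17 fold over the comma operator
    }
};

// Returns how many predecessors `join` has after a single batch call.
std::size_t count_after_batch() {
    ToyHandle a, b, c, join;
    join.add_predecessors(a, b, c);
    return join.predecessors.size();
}
```

A real batch call could also let the library update its bookkeeping once rather than per handle, which is the performance argument made in the comment; this toy does not model that.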


rfcs/proposed/task_group_dynamic_dependencies/README.md

akukanov · 2024-12-05T22:32:16Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

+Given two `task_handle` objects `h1` and `h2`, some possible options 
+for adding `h1` as an in-dependence / predecessor of `h2` include:
+
+- `h2.add_predecessor(h1)`
+- `h2 = defer([]() { … }, h1)`
+- `make_edge(h1, h2)`
+
+We propose including the first option. Similarly, there could be


I would prefer not to add methods to task_handle but use "external" functions, perhaps in the task_group class. This would be more consistent with the current approach (defer, run, run_and_wait) as well as with transfer_successors_to.

What would the signature of such a function look like? The nice part of it being in task_handle is that h2.add_predecessor(h1) is easy to understand -- h1 becomes a predecessor of h2. As part of task_group, the order becomes less clear: tg.add_predecessor(h1,h2) might be read as adding two predecessors to task_group tg.

What would the signature of such a function look like?

Similar to make_edge, but perhaps with a different name: connect, set_order, set_dependency, etc., with the left-to-right ordering semantics.

Currently, a task_handle is what its name says - a handle, an object that might represent a created task or be empty. It cannot even be copied, only moved.

The proposal suggests that a task_handle should represent a task at any state, so should perhaps internally track the task state somehow - that's OK. We can even think of a method to query the task state; that would also be OK.

But making the handle also tracking task dependencies, and so serving as a part of the scheduling system, rather than just an object to be scheduled, does not sound right to me. Conceptually, I see scheduling and dependency management as functions of a task group.

Last year, we discussed naming quite a bit. One of the reasons to offer make_edge as an option was to be consistent with flow graph terminology and therefore the expected left-to-right ordering. Some people who had worked with tasks before thought in terms of parent-child relationships, or of dependency relations read as "a depends on b". So something like task_group::set_dependency(a, b) could easily be read two ways. The benefit of task_handle::add_predecessor is its readability, while task_group::make_edge(a, b) might benefit from familiarity with flow::make_edge. I don't think we should pursue it now, but we could later consider make_edge between a task_handle and a flow graph continue_node.
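The two spellings under discussion can be compared side by side in a small sketch. `ToyNode` is hypothetical and the free function is named `make_edge` only because the thread uses that name; neither is the real oneTBB API. Both calls below record the same relation, and the only difference is where the successor appears in the call.

```cpp
#include <cassert>
#include <vector>

// Hypothetical node type that records edges; not a real oneTBB class.
struct ToyNode {
    std::vector<ToyNode*> successors; // tasks that must wait for this one
    int pending = 0;                  // number of unfinished predecessors

    // Member spelling: the receiver is the successor.
    void add_predecessor(ToyNode& pred) {
        assert(&pred != this && "a task cannot precede itself");
        pred.successors.push_back(this);
        ++pending;
    }
};

// Free-function spelling with left-to-right order: `pred` runs before `succ`.
void make_edge(ToyNode& pred, ToyNode& succ) {
    succ.add_predecessor(pred);
}

// Checks that both spellings produce the same bookkeeping.
bool spellings_agree() {
    ToyNode a1, b1, a2, b2;
    b1.add_predecessor(a1); // reads as "b1 gets predecessor a1"
    make_edge(a2, b2);      // reads as "edge from a2 to b2"
    return a1.successors.size() == 1 && a1.successors[0] == &b1 && b1.pending == 1
        && a2.successors.size() == 1 && a2.successors[0] == &b2 && b2.pending == 1;
}
```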

rfcs/proposed/task_group_dynamic_dependencies/README.md


akukanov

I support this proposal in principle, and would approve it up to but not including the proposed API changes.

The API changes seem both insufficient (see the comment about the run function semantics) and arguable (when it comes to the way to set dependencies, see the discussion in comments above). I would not approve it even for an experimental API, at least until alternative API semantics are considered and compared.

rfcs/proposed/task_group_dynamic_dependencies/README.md

akukanov · 2025-02-01T15:10:14Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

+task. A created task remains created until it is submitted through
+`task_group::run` or `task_group::run_and_wait`. The current
+`task_group` specification treats accessing a `task_handle` after it is submitted
+via one of the run functions as undefined behavior. Therefore, a


Note that this is technically done by the run functions accepting a task_handle as an rvalue. The task group can then move-construct or move-assign from that handle, which will make it empty.

In order to "extend useful lifetime of a task handle", the run functions should therefore treat the handle argument differently. This part is not covered by the proposal.

Yes, needs to be added.
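The mechanism described in this thread can be shown with a move-only toy handle. `ToyHandle` and this free function `run` are hypothetical; the sketch only mirrors the pattern of taking the handle as an rvalue and move-constructing from it, which leaves the caller's handle empty.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <utility>

// Hypothetical move-only handle: either owns a task or is empty.
class ToyHandle {
    std::unique_ptr<std::function<void()>> task_;
public:
    ToyHandle() = default;
    explicit ToyHandle(std::function<void()> f)
        : task_(std::make_unique<std::function<void()>>(std::move(f))) {}
    ToyHandle(ToyHandle&&) = default;            // moved-from handle becomes empty
    ToyHandle& operator=(ToyHandle&&) = default;
    explicit operator bool() const { return task_ != nullptr; }
    void execute() {
        assert(task_ && "executing an empty handle");
        (*task_)();
    }
};

// Rvalue parameter, so callers must write run(std::move(h)).
void run(ToyHandle&& h) {
    ToyHandle owned = std::move(h); // move-construct: `h` is now empty
    owned.execute();
}

// After run(), the caller's handle no longer refers to the task.
bool handle_is_empty_after_run() {
    int calls = 0;
    ToyHandle h([&calls] { ++calls; });
    run(std::move(h));
    return !h && calls == 1;
}
```

Keeping the handle usable after submission would require a different contract here, for example a run function that takes the handle by reference, or a handle that shares ownership of the task state. Which of these the RFC should choose is exactly the open point raised above.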

akukanov · 2025-02-01T15:14:12Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

+In that case, passing a `task_handle` to `task_group::run` or `task_group::run_and_wait` only makes
+it available for dependency tracking but does not make it immediately eligible for execution.


Do not forget also about task_arena::enqueue. What would be its behavior?

Yes, run and enqueue functions are not yet well covered.

rfcs/proposed/task_group_dynamic_dependencies/README.md

akukanov · 2025-02-01T16:20:19Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

+- Should we add a function to adds more than one predecessor as single call, such as `add_predecessors`?
+- Should we add functions that merge creation and definition of predecessor tasks, such as
+`template <typename Func> add_predecessor(Func&& f);`.
+- Are there additional use cases that should be considered that we missed in our analysis?


I can suggest a few more potentially interesting cases:

a wavefront, where tasks have multiple predecessors and multiple successors;

a two-stage parallel scan (reduce-then-scan), with left-to-right propagation of the accumulated prefix sum;

non-trivial divide-and-conquer patterns, such as Matteo Frigo's algorithm for the N-Body problem (see e.g. https://dspace.mit.edu/bitstream/handle/1721.1/122680/6-172-fall-2010/contents/projects/MIT6_172F10_proj4_1.pdf).

These seem to be interesting examples of "dynamic task graphs that are not trees", as stated in the introduction. I do not suggest that we must support these patterns, but it is interesting to see if we could, and what would be needed for that.

Yes, more examples would be good. In particular, we need to see whether the very limited ways to add and modify predecessors are sufficient, and how the lifetimes of task handles will be managed.
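For the wavefront case, the dependency structure itself is easy to state: assuming each cell depends on its north and west neighbours, a cell becomes ready once both have completed. The sketch below is plain C++ with predecessor counters and a ready queue; it does not use oneTBB, and `wavefront_levels` is a hypothetical helper that reports the earliest wave in which each cell could run.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <queue>
#include <utility>
#include <vector>

using Grid = std::vector<std::vector<int>>;

// Returns, for each cell, the earliest wave in which it can execute.
Grid wavefront_levels(std::size_t rows, std::size_t cols) {
    Grid pending(rows, std::vector<int>(cols, 0)); // unfinished predecessors
    Grid level(rows, std::vector<int>(cols, 0));
    for (std::size_t i = 0; i < rows; ++i)
        for (std::size_t j = 0; j < cols; ++j)
            pending[i][j] = (i > 0 ? 1 : 0) + (j > 0 ? 1 : 0);

    std::queue<std::pair<std::size_t, std::size_t>> ready;
    if (rows > 0 && cols > 0) ready.push({0, 0}); // only the corner starts ready

    while (!ready.empty()) {
        const std::size_t i = ready.front().first;
        const std::size_t j = ready.front().second;
        ready.pop();
        assert(pending[i][j] == 0);
        // Completing (i, j) releases its south and east neighbours.
        const std::pair<std::size_t, std::size_t> succs[2] = {{i + 1, j}, {i, j + 1}};
        for (const auto& s : succs) {
            if (s.first >= rows || s.second >= cols) continue;
            level[s.first][s.second] =
                std::max(level[s.first][s.second], level[i][j] + 1);
            if (--pending[s.first][s.second] == 0) ready.push(s);
        }
    }
    return level;
}
```

Interior cells have two predecessors and two successors, which is the "not a tree" property mentioned above. Expressing this with the proposed API would need one handle per cell to stay valid until both neighbours have registered their edges, so it is a useful test for the handle-lifetime question.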

rfcs/proposed/task_group_dynamic_dependencies/README.md


kboyarinov · 2025-02-06T16:16:31Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

+Where `h` is a `task_handle` to a created task, and the 
+`transfer_successors_to` function must be called from within a task. Calling
+this function from outside a task or passing anything other than a `task_handle`
+representing a task in the created state is undefined behavior.


I think we need to add information about possible limitations (or the lack of them) on the task states of the task_handle to which we are transferring the successors. Is it sufficient to allow only handles representing created tasks, or are the other three states also allowed (e.g. transferring successors to a task_handle that was already submitted via task_group::run)?

kboyarinov · 2025-02-06T16:26:55Z

rfcs/proposed/task_group_dynamic_dependencies/README.md

@@ -0,0 +1,378 @@
+# Extend ``task_group`` for Dynamic Task Dependencies


I have noticed one thing that is not related to dependencies between tasks but to migration from the old tasking API, so I decided to add it as a comment here.

Let's consider the recursive Fibonacci example rewritten using the API proposed in this RFC (the splitting stage):

    long* left = new long(0);
    long* right = new long(0);
    tbb::task_handle fib_left = tg.defer([&tg, num, left] {
        recursive_fib(tg, num - 2, *left);
    });
    tbb::task_handle fib_right = tg.defer([&tg, num, right] {
        recursive_fib(tg, num - 1, *right);
    });
    tbb::task_handle fib_sum = tg.defer([&result, left, right] {
        result = *left + *right;
        delete left;
        delete right;
    });

The main difference between merge sort and this example is that the tasks need extra data to execute (left and right, the placeholders for the partial results of the Fibonacci calculations on the leaves).
Since this data must stay alive until the sum task is executed, it cannot be placed on the stack of the current function and must be allocated dynamically.
In the old TBB, this data was placed inside the corresponding task, which provided the required lifetime guarantees.
The question is whether we need to extend the task_handle API to allow attaching additional data to the task.
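One workaround that needs no API change is to let the closures share ownership of the partial results, so the data lives exactly as long as the last task that captures it. The sketch below uses only the standard library: `FibState`, `make_split_tasks`, and `fib_via_deferred_tasks` are hypothetical names, the three closures stand in for the deferred tasks, and they are simply executed in dependency order instead of being scheduled.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <vector>

// Partial results shared by the three closures of one split.
struct FibState {
    long left = 0;
    long right = 0;
};

long fib_serial(int n) { return n < 2 ? n : fib_serial(n - 1) + fib_serial(n - 2); }

// Builds the left, right, and sum closures; the shared state is released
// automatically when the last closure holding it is destroyed.
std::vector<std::function<void()>> make_split_tasks(int num, long& result) {
    assert(num >= 2 && "the split stage only applies above the base case");
    auto state = std::make_shared<FibState>();
    std::vector<std::function<void()>> tasks;
    tasks.push_back([state, num] { state->left = fib_serial(num - 2); });
    tasks.push_back([state, num] { state->right = fib_serial(num - 1); });
    tasks.push_back([state, &result] { result = state->left + state->right; });
    return tasks;
}

// Runs the closures in an order that respects the dependencies (sum last).
long fib_via_deferred_tasks(int num) {
    long result = 0;
    auto tasks = make_split_tasks(num, result);
    for (auto& t : tasks) t();
    return result;
}
```

This removes the manual `new`/`delete` pair at the cost of one reference-counted allocation per split, so it does not fully answer whether storing the data inside the task itself, as the old TBB did, would be cheaper.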

vossmjp requested review from akukanov and pavelkumbrasev August 5, 2024 13:15

tobiasweinzierl80 reviewed Aug 23, 2024

tobiasweinzierl80 reviewed Aug 24, 2024

tobiasweinzierl80 reviewed Aug 24, 2024

tobiasweinzierl80 reviewed Aug 24, 2024

tobiasweinzierl80 reviewed Aug 24, 2024

tobiasweinzierl80 reviewed Aug 24, 2024

vossmjp added the documentation label Aug 27, 2024

vossmjp requested a review from aepanchi August 27, 2024 13:13

pavelkumbrasev mentioned this pull request Aug 28, 2024

Improvements for task_group API #1498

Draft

13 tasks

akukanov reviewed Sep 9, 2024

Base automatically changed from dev/vossmjp/rfcs to master September 26, 2024 14:02

aleksei-fedotov reviewed Oct 15, 2024

vossmjp and others added 9 commits December 4, 2024 17:34

Removed deprecated as cause for archive

18ad843

Added task_group_dynamic_dependencies RFC

5404632

Addressed several comments on RFC

e1b964a

Updated add_dependency figure

416d638

Updated naming in task_group_dynamic_dependencies RFC and added more details

66710d2

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

1b32c34

Co-authored-by: Aleksei Fedotov <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

aceddbd

Co-authored-by: Aleksei Fedotov <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

19ac8d8

Co-authored-by: Aleksei Fedotov <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

ff7e366

Co-authored-by: Aleksei Fedotov <[email protected]>

vossmjp force-pushed the dev/vossmjp/rfc_task_group_dynamic_dependencies branch from 579bcd8 to ff7e366 Compare December 4, 2024 23:37

vossmjp marked this pull request as ready for review December 4, 2024 23:40

akukanov reviewed Dec 5, 2024

akukanov reviewed Dec 5, 2024

akukanov reviewed Dec 5, 2024

aepanchi reviewed Dec 20, 2024

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

11b26d6

Co-authored-by: Alexandra <[email protected]>

vossmjp added the RFC label Jan 9, 2025

vossmjp and others added 14 commits January 30, 2025 13:21

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

5a27a6c

Co-authored-by: Alexandra <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

97ef660

Co-authored-by: Alexandra <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

83eec0f

Co-authored-by: Alexandra <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

cbacc32

Co-authored-by: Alexandra <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

9681b66

Co-authored-by: Alexandra <[email protected]>

Update rfcs/proposed/task_group_dynamic_dependencies/README.md

304faf1

Co-authored-by: Alexandra <[email protected]>

Response to reviews

746ef99

Response to reviews

f69b18c

Additional wording changes for dynamic tasks RFC

c373fbb

Apply suggestions from code review

d2432ab

Co-authored-by: Alexandra <[email protected]>

Apply suggestions from code review

9b433be

Co-authored-by: Alexandra <[email protected]>

Apply suggestions from code review

bc78aca

Co-authored-by: Alexandra <[email protected]>

Changes to address reviews

91deaf2

Addressed API comments for dynamic task API RFC

eeb66ba

vossmjp removed the documentation label Jan 31, 2025

Clarified what a valid task_handle can represent

3884430

vossmjp requested review from aleksei-fedotov, akukanov, tobiasweinzierl80 and kboyarinov and removed request for tobiasweinzierl80 and pavelkumbrasev January 31, 2025 21:06

akukanov reviewed Feb 1, 2025

kboyarinov reviewed Feb 3, 2025

Apply suggestions from code review

52bb751

Co-authored-by: Alexey Kukanov <[email protected]> Co-authored-by: Konstantin Boyarinov <[email protected]>

kboyarinov reviewed Feb 6, 2025
