Use event to handle distributed row gatherer by yhmtsai · Pull Request #1882 · ginkgo-project/ginkgo

yhmtsai · 2025-07-08T14:41:56Z

With the current distributed row gatherer , we does the following:
local row gatherer (prepare data for mpi) -> synchronize (it is required by mpi) -> submit mpi ialltoallv -> submit local spmv
Ginkgo will do local spmv and mpi ialltoallv.
However, the local spmv is forced to wait for synchronize() and then submit its kernel, which lead the gap between gpu activity.

Another approach is introduced in this pr to use event
local row gatherer -> record event -> submit local spmv -> wait the event -> submit mpi ialltoallv.
the submission local spmv does not need to wait for local row gatherer to finish, but it will give additional overhead during record event.

For those does not support AsyncEvent, we just synchronize when creating it.

something from the profiler

The local spmv starts earlier (yellow box for gpu activity and gree for cpu). The MPI communication starts later now (pink box). It gives better performance because local spmv still covers the mpi communication. If mpi communication is not covered by the local spmv, this will give slower performance. I will say it is okay-ish now. This situation means it is network bandwidth bound now, which is slow compared to other bandwidth.
We can improve it by introducing thread but it will lead another discussion whether we allow thread in ReferenceExecutor.

There are two approach might also help this situation - stream-aware mpi or async execution.
stream-aware mpi needs more study like using isend/irecv implement all_to_all_v + row gatherer.
async execution I have done something for async schwarz which touching some design question on executor, so it lead larger and longer discussion. Thus, I do not touch it here.

MarcelKoch · 2025-11-13T10:59:44Z

                                           ptr_param<LinOp> x,
                                           array<char>& workspace) const;

+    std::shared_ptr<const Event> apply_prepare(ptr_param<const LinOp> b,


Note: make these protected and add the distributed matrix as a friend.

MarcelKoch

I would suggest changing the arguments to apply_prepare.

MarcelKoch · 2025-11-24T08:58:59Z

+    auto ev = apply_prepare(this->rg.get(), b, x, workspace);
+    // we modify the workspace to all 0
+    workspace.fill(static_cast<char>(0));
+    this->exec->synchronize();
+    auto req = apply_finalize(this->rg.get(), b, x, ev, workspace);
+    req.wait();


Just for my understanding, this tests that the send buffer can be overwritten after the apply_prepare, right?

yes (but require on the same stream). It checks the apply finalize indeed uses the workspace

yhmtsai requested a review from MarcelKoch July 8, 2025 14:41

yhmtsai self-assigned this Jul 8, 2025

ginkgo-bot added reg:build This is related to the build system. reg:testing This is related to testing. reg:benchmarking This is related to benchmarking. mod:all This touches all Ginkgo modules. labels Jul 8, 2025

yhmtsai force-pushed the event_row_gatherer branch from 2daf422 to a3f291a Compare July 8, 2025 16:45

yhmtsai added the 1:ST:ready-for-review This PR is ready for review label Jul 10, 2025

yhmtsai force-pushed the event_row_gatherer branch 2 times, most recently from 846d8d4 to d65428b Compare July 11, 2025 13:49

yhmtsai force-pushed the event_row_gatherer branch from d65428b to 85dfc7b Compare August 14, 2025 22:19

MarcelKoch added this to the Ginkgo 1.11 milestone Nov 13, 2025

MarcelKoch reviewed Nov 13, 2025

View reviewed changes

yhmtsai force-pushed the event_row_gatherer branch from 85dfc7b to 92f9715 Compare November 14, 2025 14:27

MarcelKoch requested changes Nov 24, 2025

View reviewed changes

yhmtsai force-pushed the event_row_gatherer branch from 92f9715 to f32c492 Compare November 25, 2025 09:50

yhmtsai requested a review from MarcelKoch November 26, 2025 08:40

MarcelKoch approved these changes Nov 26, 2025

View reviewed changes

Comment thread core/base/executor.cpp

yhmtsai force-pushed the event_row_gatherer branch 2 times, most recently from 4bd5529 to f13e720 Compare November 26, 2025 12:31

yhmtsai added 1:ST:ready-to-merge This PR is ready to merge. and removed 1:ST:ready-for-review This PR is ready for review labels Nov 26, 2025

yhmtsai added 8 commits November 27, 2025 06:34

add event to avoid local apply also need to wait for synchronize();

d0a4523

move record to function and implment the other backend event

e53e53e

fix the benchmark output

5ed9cde

also use event in distribtued advanced spmv

16a4ba4

use new approach only when they have event

9abc604

avoid using core function from backend

98910e4

sycl event wait_and_throw is not const func

9a282c9

add test and char instantiation on array fill

11c5c70

yhmtsai added 4 commits November 27, 2025 06:34

move the apply_prepare/finalize to protected and Event to detail

df0957e

remove unused argument

d8de521

also do it in col_scale

8a2de92

fix missing record event log in solver

4fe9d4a

yhmtsai force-pushed the event_row_gatherer branch from f13e720 to 4fe9d4a Compare November 27, 2025 05:34

yhmtsai merged commit 57d59b4 into develop Nov 27, 2025
13 of 17 checks passed

yhmtsai deleted the event_row_gatherer branch November 27, 2025 17:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use event to handle distributed row gatherer#1882

Use event to handle distributed row gatherer#1882
yhmtsai merged 12 commits into
developfrom
event_row_gatherer

yhmtsai commented Jul 8, 2025

Uh oh!

MarcelKoch Nov 13, 2025

Uh oh!

MarcelKoch left a comment

Uh oh!

Uh oh!

Uh oh!

MarcelKoch Nov 24, 2025

Uh oh!

yhmtsai Nov 26, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yhmtsai commented Jul 8, 2025

Uh oh!

MarcelKoch Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

MarcelKoch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

MarcelKoch Nov 24, 2025

Choose a reason for hiding this comment

Uh oh!

yhmtsai Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants