[dbsp] Use serde_json to serialize data for Broadcast operator.#6395

Merged: blp merged 1 commit into main from broadcast-serde on Jun 4, 2026

@blp blp commented Jun 4, 2026

Member

The Broadcast operator was using rmp_serde to serialize and deserialize data for exchange across hosts, for reasons that have been lost to the mists of time. This serialization corrupted some of our data; for example, it caused an array of integers to become an array of arrays of stringified integers (!).

This commit fixes it by using serde_json instead of rmp_serde. Since this was the only use of rmp_serde, it drops that dependency.
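
A round-trip check is one way to catch this class of corruption before data crosses hosts. The sketch below is illustrative only and is not taken from the Feldera code base: `encode`, `decode`, and `round_trips` are hypothetical names, and the hand-written codec is a toy stand-in for a real serializer such as serde_json, used so the example depends only on the standard library.

```rust
// Hypothetical sketch: these names are illustrative and do not come from
// the Broadcast operator or from serde_json.

/// Encodes a slice of integers as a JSON-style array such as `[1,-2,3]`.
/// Toy stand-in for a real serializer.
fn encode(values: &[i64]) -> String {
    let items: Vec<String> = values.iter().map(|v| v.to_string()).collect();
    format!("[{}]", items.join(","))
}

/// Decodes the format produced by `encode`, rejecting anything that is not
/// a flat array of integers.
fn decode(text: &str) -> Result<Vec<i64>, String> {
    let inner = text
        .trim()
        .strip_prefix('[')
        .and_then(|rest| rest.strip_suffix(']'))
        .ok_or_else(|| format!("expected a bracketed array, got {:?}", text))?;
    if inner.trim().is_empty() {
        return Ok(Vec::new());
    }
    inner
        .split(',')
        .map(|item| {
            item.trim()
                .parse::<i64>()
                .map_err(|e| format!("bad integer {:?}: {}", item, e))
        })
        .collect()
}

/// Returns true when decoding the encoded value yields the original data.
fn round_trips(values: &[i64]) -> bool {
    matches!(decode(&encode(values)), Ok(decoded) if decoded.as_slice() == values)
}
```

With a check like this in a unit test, a payload whose shape changed in transit (for instance a nested array of strings in place of a flat array of integers) fails to decode or fails the equality comparison instead of silently propagating.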

Describe Manual Test Plan

I tested that a multihost pipeline with adaptive joins enabled didn't fail immediately on startup.

Thanks to @ryzhyk for debugging help.

Signed-off-by: Ben Pfaff <blp@feldera.com>
@blp blp requested a review from ryzhyk June 4, 2026 16:19
@blp blp self-assigned this Jun 4, 2026
@blp blp added the DBSP core (Related to the core DBSP library), rust (Pull requests that update Rust code), and multihost (Related to multihost or distributed pipelines) labels Jun 4, 2026
@blp blp enabled auto-merge June 4, 2026 16:21
@blp blp added this pull request to the merge queue Jun 4, 2026
@github-merge-queue github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 4, 2026
@blp blp added this pull request to the merge queue Jun 4, 2026
Merged via the queue into main with commit 555f34d Jun 4, 2026
1 check passed
@blp blp deleted the broadcast-serde branch June 4, 2026 18:53