feat(coprocessor): implement a retry mechanism for pbs computations #1245

goshawk-3 · 2025-11-04T12:13:07Z

fixes: https://github.com/zama-ai/fhevm-internal/issues/542

PBS computation can be in one of the following states:

- completed: squash_noise finished and the ct128 was inserted into the DB
- TransientErr: squash_noise finished but the ct128 could not be inserted
- PermanentErr: either the squash_noise computation or decompress_ct failed

Only TransientErr is retried, and it is retried indefinitely.
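To make the retry rule concrete, here is a minimal sketch of the three states and a loop that retries only on a transient error. It is not the code from this PR: the `PbsOutcome` enum, the `run_with_retry` function, and the attempt counter are hypothetical names chosen for illustration, and the sketch does not say whether a retry repeats the whole computation or only the DB insert.

```rust
/// Outcome of one PBS computation attempt, mirroring the three states
/// listed in the description. The type name is an assumption.
#[derive(Debug, PartialEq)]
enum PbsOutcome {
    /// squash_noise finished and the ct128 was stored.
    Completed,
    /// The result was computed but could not be stored; worth retrying.
    TransientErr(String),
    /// The computation itself failed; retrying would not help.
    PermanentErr(String),
}

/// Hypothetical driver: calls `attempt` until it reports something other
/// than a transient error, and returns that outcome with the attempt count.
/// A real worker would likely sleep or back off between attempts.
fn run_with_retry<F>(mut attempt: F) -> (PbsOutcome, u32)
where
    F: FnMut(u32) -> PbsOutcome,
{
    let mut tries = 0;
    loop {
        tries += 1;
        match attempt(tries) {
            // Unbounded retry, but only for the transient case.
            PbsOutcome::TransientErr(_) => continue,
            // Completed and PermanentErr both end the loop.
            done => return (done, tries),
        }
    }
}
```

The closure receives the attempt number so a caller or test can simulate a DB insert that fails a fixed number of times before succeeding.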

Additionally, both squash_noise and decompress_ct are wrapped so that they return an error where they would otherwise panic. This avoids livelocks similar to the one noticed some time ago in the zkproof worker.
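The wrapping described above can be sketched with `std::panic::catch_unwind` from the standard library. The name `with_panic_guard` appears in one of this PR's commit titles, but the signature, the `String` error type, and the message extraction below are assumptions, not the actual implementation.

```rust
use std::panic::{self, AssertUnwindSafe};

/// Runs `f` and converts a panic into an `Err` carrying the panic message.
/// Assumed signature; the helper in the PR may differ.
/// Note: this only works when panics unwind (not with `panic = "abort"`).
fn with_panic_guard<T, F>(f: F) -> Result<T, String>
where
    F: FnOnce() -> T,
{
    // AssertUnwindSafe tells the compiler we accept responsibility for any
    // state the closure may leave half-updated if it panics.
    panic::catch_unwind(AssertUnwindSafe(f)).map_err(|payload| {
        // Panic payloads are usually a &str or a String.
        if let Some(s) = payload.downcast_ref::<&str>() {
            s.to_string()
        } else if let Some(s) = payload.downcast_ref::<String>() {
            s.clone()
        } else {
            "unknown panic".to_string()
        }
    })
}
```

With a guard like this, a panicking call surfaces as an ordinary error that the caller can classify (for example as a permanent error) instead of unwinding through the worker.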

coprocessor/fhevm-engine/fhevm-engine-common/src/utils.rs

coprocessor/fhevm-engine/sns-worker/src/executor.rs

rudy-6-4

Can a small test be added? Otherwise, an e2e run posted in a PR comment would give reviewers more confidence.

goshawk-3 · 2025-11-05T11:36:32Z

> Can a small test be added? Otherwise, an e2e run posted in a PR comment would give reviewers more confidence.

We'll probably need to mock the squash_noise call to simulate erroneous behavior and also force a DB insert-query failure, but that's not trivial. Let me think about it.

PBS computation can be in one of the following states:
- completed - squash_noise finished and the ct128 was inserted into the DB
- TransientErr - squash_noise finished but the ct128 couldn't be inserted
- PermanentErr - squash_noise computation failed.
An infinite retry is done only for TransientErr

cla-bot bot added the cla-signed label Nov 4, 2025

goshawk-3 force-pushed the georgi/sns/minor-fixes branch 3 times, most recently from 4dd0220 to b33d326 Compare November 4, 2025 15:57

rudy-6-4 reviewed Nov 5, 2025

coprocessor/fhevm-engine/fhevm-engine-common/src/utils.rs (resolved)

rudy-6-4 reviewed Nov 5, 2025

coprocessor/fhevm-engine/sns-worker/src/executor.rs (resolved)

rudy-6-4 reviewed Nov 5, 2025

coprocessor/fhevm-engine/sns-worker/src/executor.rs (resolved)

rudy-6-4 previously approved these changes Nov 5, 2025

goshawk-3 force-pushed the georgi/sns/minor-fixes branch from b0f0036 to 5e9ea06 Compare November 5, 2025 11:46

goshawk-3 marked this pull request as ready for review November 5, 2025 15:01

goshawk-3 requested a review from a team as a code owner November 5, 2025 15:01

goshawk-3 mentioned this pull request Nov 5, 2025

chore(coprocessor): return error if update cts table fails #1265

Merged

goshawk-3 added 6 commits November 6, 2025 08:51

chore(coprocessor): update sqlx cache

dd07f2f

chore(coprocessor): update table:pbs_computation index

2876e4b

chore(coprocessor): panic-guard decopress and squash_noise calls to a…

4a1bdf1

…void livelock

chore(coprocessor): move with_panic_guard into the common crate

606bb8e

chore(coprocessor): remove set_status func

7e7c3a9

goshawk-3 force-pushed the georgi/sns/minor-fixes branch from 5e9ea06 to 7e7c3a9 Compare November 6, 2025 06:51

goshawk-3 dismissed rudy-6-4’s stale review via 7e7c3a9 November 19, 2025 15:22

3 participants
