[Access] POC Collection Syncing #8154

zhangchiqing · 2025-11-14T19:50:37Z

Close #8121

This PR addresses two issues:

Problem 1: Slow startup when Access Node is far behind
When an Access Node is far behind, startup is slow because it attempts to download collections for all finalized heights with missing collections, causing extended downtime.
Solution: Switched to a job queue that focuses on the next missing height to fetch. This enables faster startup, reduces downtime, and speeds up catch-up.
Problem 2: Lock contention from dual collection syncing
Collections can be synced from either collection nodes or execution nodes. Currently, the Access Node syncs from both. Because the storage function uses a lock, concurrent syncing causes both procedures to block each other.
Solution: Prioritize syncing from execution nodes via execution data syncing, and allow only one sync procedure at a time. Syncing from collection nodes is only enabled when execution data syncing is turned off. This reduces lock contention, speeds up indexing, and reduces load on collection nodes.

Collection Indexing Refactoring POC

Finalization block processor: Moved from ingestion2 to the finalized_indexer engine. It indexes guarantees to determine finalized transactions.
Receipts ingestion: Extracted from the ingestion engine into a dedicated ingest_receipt engine. In factor, we don't need this engine, because the follower engine is already storing each receipts for verified blocks.
Execution data indexing: Removed collection indexing. It now only indexes registers and events.
Collection indexing architecture: Collection indexing is handled by two engines in engine/access/collection_sync, with only one enabled at startup. This structure supports a future hybrid mode.
Execution data-based collection indexing (when execution data indexing is enabled): The engine/access/collection_sync/execution_data_index/processor.go engine indexes collections from execution data. It receives notifications when new execution data is downloaded and indexes the collections from that data.
Collection node-based collection fetching (when execution data indexing is disabled): The engine/access/collection_sync/fetcher/engine.go engine fetches collections from collection nodes and indexes them.

…ncing

zhangchiqing · 2025-12-09T00:48:58Z

module/state_synchronization/requester/execution_data_requester.go

-	// `e.consumers`.
-	// Note: the `e.consumers` will be guaranteed to receive at least one `OnExecutionDataFetched` event
-	// for each sealed block in consecutive block height order.
-	e.notificationConsumer, err = jobqueue.NewComponentConsumer(


I simplified the requester by removing the notification consumer entirely. Instead, we are just calling e.distributor.OnExecutionDataReceived().

Note, the OnExecutionDataReceived used to be called with the execution data read from storage, but actually, no consumer actually make use the execution data, because the consumer will read execution data with their next unprocessed index to ensure data for all heights are processed.

zhangchiqing added 30 commits November 6, 2025 15:31

add missing collection queue

d14ed10

add ingestion2 collection syncer

12339f5

update ingestion2 engine

a7a0253

fix tests

6a9ccd5

update ingestion2 and indexer

4277a72

update access node builder

59684ee

simplify the job queue

3353177

add execution data processor

92fe356

update exectuion data processor

79076c4

update exectuion data processor

c4ffce4

update exectuion data processor factory

aefe0f2

refactor collection sync

7ff6772

refactor with fetcher

c723fc9

simplify the finalizer

69f2da9

add component to finalized indexer processor

c9f1ffa

rename syncer to fetcher

1c43552

add comment

66e639b

refactor last full block height

28d459a

make job consumer LastProcessedIndex and Size to be non blocking

96caf34

fix for observer

31889f8

fix backend test

eb4f9f5

fix execution script test

3b5fc1d

fix indexer tests

06b83b0

fix lint

743d56d

add metrics

91fe006

update transaction and collections storage

52149f6

fix benchmark tool

ef5635b

add logs

11f32a2

fix lint

9ee10b7

add logs

c4a867c

zhangchiqing added 29 commits December 2, 2025 17:10

add retry

65245e9

check storage before retry

0728e39

simplify RetryFetchingMissingCollections

ee346e2

Bug fix: failed Unicast calls no longer count as attempts

f6dacb4

add retry interval config

49876a9

report correct height

75a28bb

add retry interval config

5b17ae2

log indexer look up

c8f71b9

indexer remove redundant header reads

8de563a

add flag to disable bitswap bloom cache

91299c2

improve logging

199a84a

add flag to disable bitswap bloom cache

9c3c5d9

Merge branch 'master' into leo/disable-bitswap-bloom-cache

8e70d15

fix lint

15450bf

log block height

c4e8228

warn about large iteration

d71309e

requester to return the cached store

38ef51d

fix lint

d1250a3

fix mock

8e51aae

Merge branch 'master' into leo/collection-syncing

e5e3421

fix tests

484ff18

revert to use execution data cache

75d04b9

add retry

5b07e92

remove notification data

f08edf4

remove notification consumer

e2d29da

revert blob change

bbc700f

adjustable bloom cache

01324d6

disable access node builder bloom cache

27f6a59

Merge branch 'leo/disable-bitswap-bloom-cache' into leo/collection-sy…

da0c021

…ncing

zhangchiqing commented Dec 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Access] POC Collection Syncing #8154

[Access] POC Collection Syncing #8154

Uh oh!

zhangchiqing commented Nov 14, 2025 •

edited

Loading

Uh oh!

zhangchiqing Dec 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Access] POC Collection Syncing #8154

Are you sure you want to change the base?

[Access] POC Collection Syncing #8154

Uh oh!

Conversation

zhangchiqing commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Collection Indexing Refactoring POC

Uh oh!

zhangchiqing Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

zhangchiqing commented Nov 14, 2025 •

edited

Loading