feat: adaptive subtreeData skip during catchup when txs exist locally by freemans13 · Pull Request #539 · bsv-blockchain/teranode

freemans13 · 2026-02-27T10:18:11Z

Problem

During catchup, subtree validation downloads subtreeData files (full serialized transactions, hundreds of MB each) from peers for every subtree in every block. In environments like dev-scale where a tx-blaster sends each transaction to both nodes, the catching-up node already has all these transactions locally in its UTXO store. Downloading subtreeData in this scenario is pure waste — bandwidth, time, and memory spent re-fetching data we already have.

Solution

For each subtree, stream the tx hashes and batch-check UTXO existence. If all txs exist locally, skip the subtreeData download entirely. If any are missing, abort the check immediately and switch to subtreeData for that subtree and all remaining ones in the block. Subtrees already validated via subtree-only are not re-downloaded.

A shared atomic flag across concurrent goroutines ensures that once a miss is found, every subsequent subtree goes straight to full fetch with zero probing overhead. This is applied across all three catchup code paths: block validation pre-fetch, CheckBlockSubtrees, and the streaming processor pipeline.

Changes

File	Change
`pkg/utxocheck/existence.go`	New shared helper `CheckAllTxsExistInUTXO()` — batch-checks tx hashes via `BatchDecorate`, skips coinbase, short-circuits on first miss
`pkg/utxocheck/existence_test.go`	9 unit tests covering all branches
`services/blockvalidation/get_blocks.go`	New `fetchBlockSubtreesAdaptive()` with per-subtree UTXO probe; wired into `blockWorker()`
`services/blockvalidation/get_blocks_test.go`	Updated error message assertion for renamed function
`services/subtreevalidation/check_block_subtrees.go`	Added `needsSubtreeData` atomic flag + UTXO check before subtreeData download
`services/subtreevalidation/streaming_processor.go`	New `loadSubtreeOnly()` helper + UTXO check in `streamAndFilterSubtrees()`

Error handling

UTXO store errors → log warning, fall back to subtreeData (optimization is advisory)
Partial misses → once any subtree has missing txs, all remaining subtrees skip the UTXO check
Race/eviction safety → ValidateSubtreeInternal has its own fallback via processMissingTransactions() if txs are somehow missing at validation time despite passing the existence check

Test plan

go build passes for all modified packages
go vet clean
make lint — 0 issues
pkg/utxocheck — 9/9 unit tests pass
services/subtreevalidation — all tests pass
services/blockvalidation — all catchup tests pass (TestBlockWorker, TestFetchBlocksConcurrently, TestFetchSubtreeDataForBlock, TestCatchup_*)
Manual test on dev-scale: during catchup, logs should show "skipping subtreeData download" for subtrees where txs exist locally

🤖 Generated with Claude Code

During catchup, subtree validation downloads massive subtreeData files from peers. When both nodes have received the same transactions (e.g. dev-scale with tx-blaster), the catching-up node already has all txs in its UTXO store. This optimization checks UTXO existence per-subtree before downloading subtreeData, skipping the download entirely when all txs exist locally. Uses a shared atomic flag so that once any subtree has missing txs, all remaining subtrees skip the probe and go straight to full subtreeData fetch, minimizing wasted bandwidth. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

github-actions · 2026-02-27T10:19:08Z

🤖 Claude Code Review

Status: Complete

✅ No new issues found

The implementation is well-designed with proper error handling and fallback mechanisms. The atomic flag pattern correctly closes the race window with the re-check at line 449 in get_blocks.go.

History:

✅ Fixed: Race condition in atomic flag pattern - The code now includes a re-check of the needsSubtreeData flag after fetching the subtree (line 449), which closes the race window where multiple goroutines could proceed to UTXO checks before any set the flag.

github-actions · 2026-02-27T10:19:58Z

services/blockvalidation/get_blocks.go

+
+		g.Go(func() error {
+			// If flag is already set, go straight to full fetch
+			if needsSubtreeData.Load() {


The atomic flag pattern here has a race condition. When multiple goroutines check needsSubtreeData.Load() at line 436 before any have called .Store(true), they all proceed to the UTXO check. If one goroutine finds a missing tx and sets the flag, other goroutines that already passed the check will still download subtreeData even though the flag is now set.

Impact: The optimization won't be as effective as intended - some goroutines will unnecessarily download subtreeData even after the flag is set.

Solution: This is an inherent limitation of the optimization pattern and may be acceptable since it still provides bandwidth savings. However, if you want stricter behavior, consider checking the flag again before calling fetchAndStoreSubtreeData at lines 458 and 464.

Update: ✅ Fixed - The code now includes a re-check of the flag at line 449 after fetching the subtree, which closes the race window. This ensures goroutines that are in-flight will check the flag again before proceeding to download subtreeData.

Add a second flag check after the subtree-hash fetch completes but before starting the expensive UTXO BatchDecorate call. This closes the window where a goroutine passes the initial check, another goroutine sets the flag while the first is fetching hashes, and the first goroutine then does an unnecessary UTXO existence check. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The UTXO check in check_block_subtrees.go and streaming_processor.go caused validation to be skipped entirely when all txs existed in the UTXO store. This broke tests using the SQL backend (which doesn't store the Creating field) and would also skip block-level validation in production. The optimization is kept only in fetchBlockSubtreesAdaptive (pre-fetch phase), which correctly skips subtreeData downloads while letting the validation pipeline run fully. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

sonarqubecloud · 2026-02-27T11:10:29Z

Quality Gate failed

Failed conditions
66.3% Coverage on New Code (required ≥ 80%)
8.3% Duplication on New Code (required ≤ 3%)

See analysis details on SonarQube Cloud

github-actions bot reviewed Feb 27, 2026

View reviewed changes

freemans13 self-assigned this Feb 27, 2026

freemans13 and others added 2 commits February 27, 2026 10:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: adaptive subtreeData skip during catchup when txs exist locally#539

feat: adaptive subtreeData skip during catchup when txs exist locally#539
freemans13 wants to merge 3 commits intobsv-blockchain:mainfrom
freemans13:feat/adaptive-subtreedata-skip-during-catchup

freemans13 commented Feb 27, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 27, 2026 •

edited

Loading

Uh oh!

github-actions bot Feb 27, 2026 •

edited

Loading

Uh oh!

sonarqubecloud bot commented Feb 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

freemans13 commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Changes

Error handling

Test plan

Uh oh!

github-actions bot commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Feb 27, 2026

Quality Gate failed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

freemans13 commented Feb 27, 2026 •

edited

Loading

github-actions bot commented Feb 27, 2026 •

edited

Loading

github-actions bot Feb 27, 2026 •

edited

Loading