cmd/geth:Implement freezer truncation as a subcommand(fixes #31135) #31351

sivaratrisrinivas · 2025-03-11T05:03:12Z

Hii, I have implemented a new subcommand geth db truncate-freezer that truncates the freezer at the merge block, keeping headers but removing bodies. This addresses issue #31135.

These are the Implementation Details

Added a new subcommand truncate-freezer to the db command
Consolidated flags into reusable variables
Implemented the truncation logic that:
- Finds the merge block using binary search
- Preserves headers and hashes before truncation
- Truncates all tables
- Re-inserts the preserved headers and hashes

This implementation follows the same pattern as the existing prune-history command and uses the same underlying truncateAncientStore method.

Fixes #31135

MariusVanDerWijden · 2025-03-11T08:55:39Z

cmd/geth/dbcmd.go

+
+	for low <= high {
+		mid := (low + high) / 2
+		header := rawdb.ReadHeader(db, rawdb.ReadCanonicalHash(db, mid), mid)
+		if header == nil {
+			return fmt.Errorf("header %d not found", mid)
+		}
+
+		if header.Difficulty.Sign() == 0 {
+			// This is a post-merge block, look earlier
+			high = mid - 1
+			mergeBlock = mid
+			found = true
+		} else {
+			// This is a pre-merge block, look later
+			low = mid + 1
+		}
+	}


You could do something like this here

Suggested change

for low <= high {

mid := (low + high) / 2

header := rawdb.ReadHeader(db, rawdb.ReadCanonicalHash(db, mid), mid)

if header == nil {

return fmt.Errorf("header %d not found", mid)

}

if header.Difficulty.Sign() == 0 {

// This is a post-merge block, look earlier

high = mid - 1

mergeBlock = mid

found = true

} else {

// This is a pre-merge block, look later

low = mid + 1

}

}

sort.Search(*headNumber, func(index int) bool {

header := rawdb.ReadHeader(db, rawdb.ReadCanonicalHash(db, index), index)

if header == nil {

panic(fmt.Sprintf("header %d not found", index))

}

return header.Difficulty.Sign() == 0

})

Can't we just assume the merge block is hardcoded to ethereum mainnet value? ~~, and assume that we are running on a pre-merged chain?~~

s1na · 2025-03-11T13:41:17Z

cmd/geth/dbcmd.go

+
+		// First, read all headers up to the merge block
+		log.Info("Reading headers to preserve them", "count", mergeBlock)
+		headers := make([][]byte, mergeBlock)


So this will be 15 Gbs of memory on mainnet

s1na · 2025-03-11T13:45:16Z

cmd/geth/dbcmd.go

+		}
+
+		// Re-insert the headers and hashes
+		log.Info("Re-inserting headers", "count", len(headers))


I think this will not work. Yes you can re-insert the headers, but the freezer has the assumption that all of its tables have the same length. So upon next boot it will truncate the headers to match that of blocks, receipts etc.

It seems we have two options:

relax this assumption in the freezer

Move the headers and hashes to a fresh freezer

Thankyou for pointing out the issues.

…nfigurable batch size

sivaratrisrinivas · 2025-03-12T04:07:23Z

I've updated the PR with optimizations that address all the concerns raised:

1. Memory Usage Issue

Implemented batch processing with a configurable batch size (default: 10,000)
Added progress reporting to show status during processing
Extracted batch processing logic into dedicated helper functions

2. Freezer Table Length Assumption

Created a temporary freezer specifically for headers and hashes
Completely truncate the original freezer (to zero) before re-inserting headers
Added proper error handling when copying between freezers

3. Hardcoded Merge Block Values

Added useHardcodedMergeFlag flag
Created knownMergeBlocks map with network-specific merge block numbers
Extracted merge block detection into a dedicated function

Additional Improvements

Better Code Organization: Broke down the large function into smaller, focused helper functions
Enhanced User Experience: Added progress reporting for long-running operations
Improved Error Handling: Using %w for error wrapping
Configurability: Made batch size configurable via command-line flag

Please let me know your thoughts on it.

…instead of progress reporter

sivaratrisrinivas · 2025-03-14T05:19:06Z

Hii @s1na , I fixed the issues you have mentioned and made some modifications to the codebase to make it more readable and maintainable. Can you please review it and share your feedback.

fjl · 2025-03-14T10:07:30Z

We decided to work on our own version of this. Thank you for your contribution!

cmd/geth:Implement freezer truncation as a subcommand

8907443

MariusVanDerWijden reviewed Mar 11, 2025

View reviewed changes

s1na reviewed Mar 11, 2025

View reviewed changes

Optimize freezer truncation: batch processing, progress reporting, co…

19d510a

…nfigurable batch size

sivaratrisrinivas added 7 commits March 11, 2025 23:18

Fix go.mod: correct Go version format from 1.23.0 to 1.23

c874466

Restore Go version to 1.23.0 to match main repository

3408f1d

Fix progress reporting in freezer truncation to use standard logging …

6942807

…instead of progress reporter

cmd/geth: Fix type mismatches in dbcmd_test.go between uint64 and int

39765c1

Fix linting issues in dbcmd files

cbd75d1

Optimize merge block detection with sort.Search

ad8884c

Fix linting issue: remove unnecessary type conversion

0bfcdcd

fjl closed this Mar 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cmd/geth:Implement freezer truncation as a subcommand(fixes #31135) #31351

cmd/geth:Implement freezer truncation as a subcommand(fixes #31135) #31351

sivaratrisrinivas commented Mar 11, 2025 •

edited

Loading

MariusVanDerWijden Mar 11, 2025

jwasinger Mar 11, 2025 •

edited

Loading

s1na Mar 11, 2025

s1na Mar 11, 2025

s1na Mar 11, 2025

sivaratrisrinivas Mar 12, 2025

sivaratrisrinivas commented Mar 12, 2025

sivaratrisrinivas commented Mar 14, 2025

fjl commented Mar 14, 2025

cmd/geth:Implement freezer truncation as a subcommand(fixes #31135) #31351

cmd/geth:Implement freezer truncation as a subcommand(fixes #31135) #31351

Conversation

sivaratrisrinivas commented Mar 11, 2025 • edited Loading

These are the Implementation Details

MariusVanDerWijden Mar 11, 2025

Choose a reason for hiding this comment

jwasinger Mar 11, 2025 • edited Loading

Choose a reason for hiding this comment

s1na Mar 11, 2025

Choose a reason for hiding this comment

s1na Mar 11, 2025

Choose a reason for hiding this comment

s1na Mar 11, 2025

Choose a reason for hiding this comment

sivaratrisrinivas Mar 12, 2025

Choose a reason for hiding this comment

sivaratrisrinivas commented Mar 12, 2025

1. Memory Usage Issue

2. Freezer Table Length Assumption

3. Hardcoded Merge Block Values

Additional Improvements

sivaratrisrinivas commented Mar 14, 2025

fjl commented Mar 14, 2025

sivaratrisrinivas commented Mar 11, 2025 •

edited

Loading

jwasinger Mar 11, 2025 •

edited

Loading