Batch-optimization #3313
Conversation
Force-pushed from 2f7214a to 63b5512
That sounds really promising, and this already looks super clean! 🤩 Is this ready for review, or are you doing continued work here?
Not quite yet! I wrote more tests that try it with different trie settings, and not all of them are passing yet. Close, though!
Force-pushed from be5f023 to d1d9ee2
OK, ready for review! 👍
Hi there, I have now gone roughly through the commits and gotten a broad overview of the changes! This looks super clean regarding the execution, with a great and easy-to-review separation into clear commits; generally I am really excited about this work! 🤩
Let's please find a way to do this with substantially fewer code additions, though. +588 lines is far too much for a PR with a one-feature change, and this introduces too much code redundancy and generally new code to care about. There should be a way to do it with less, e.g. by adding additional optional parameters.
I do not have the full picture yet of what takes what, and where, and why 😋; we can discuss later in the day on the call or after that asynchronously.
Force-pushed from f42329e to 18d053a
This already looks a lot better, or rather "more compact". Let me know when this should get another look! 🙂
Force-pushed from cec3ce0 to 964b0ec
If this feels small enough to you, then it is ready for another review.
Force-pushed from 3455a3a to e075991
Rebased this via the UI.
🙏 🙏 🙏 🙏 🙏 🙏
This looks absolutely fabulous (without exaggeration)!!! 👍 🙂
Really dramatically clean now, super reduced, and great to review.
One longer basic comment follows on the overall API design, or rather on the options needed to switch the new functionality on and off.
Let me know if you have questions! 🙂
```ts
}
const sortedOps = orderBatch(ops, keyTransform)
let stack: TrieNode[] = []
const stackPathCache: Map<string, TrieNode> = new Map()
```
We need to make both the ordering and the caching optional for the `batch()` method, and therefore provide extra configuration parameters:
- Ordering: The reason is that in certain cases the ordering has an effect on the outcome. If there is first a put for a key and then a delete for the same key, this is obviously different than having it the other way around; the same holds for two puts for the same key with different values. So the ordering needs to be deactivated by default (also for backwards compatibility), and the user would need to activate it manually.
- Caching: The reason caching also needs an option, and also needs to be deactivated by default, is that one does not know how big a `batch()` operation will be (only the user knows, at best). So the cache might get "overloaded", i.e. there is a risk of out-of-memory situations.
I would suggest that we add a dedicated options dictionary with its own type `BatchOpts` in `types.ts` for this (seems worth it), and that we then add boolean flags `sortOperations` (or so) and `activatePathCache` (or so).
My tendency actually would be to also include `skipKeyTransform`, be a bit hacky, and change the function signature to `async batch(ops: BatchDBOp[], skipKeyTransformOrOptsDict?: boolean | BatchOpts)`, so that we can do a switch, i.e. a type check, on the parameter and either assign `skipKeyTransform` directly or use the options dict.
Then we can use the method directly with `trie.batch(ops, opts)` (or `trie.batch(ops, { sortOperations: true })`) and do not always need to channel in the `skipKeyTransform` default.
(Let me know if this has serious downsides I am overlooking, though. I would think this should still be backwards compatible for people.)
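A minimal sketch of how that signature handling could look (the flag names come from the suggestion above; the op shape is simplified and the exact runtime check is an assumption, not the final API):

```ts
// Simplified op shape for the sketch; the real BatchDBOp lives in the util package
type BatchDBOp =
  | { type: 'put'; key: Uint8Array; value: Uint8Array }
  | { type: 'del'; key: Uint8Array }

export interface BatchOpts {
  sortOperations?: boolean // opt-in: sort ops by key before applying
  activatePathCache?: boolean // opt-in: cache traversed node paths
  skipKeyTransform?: boolean // carried over from the current signature
}

async function batch(
  ops: BatchDBOp[],
  skipKeyTransformOrOptsDict?: boolean | BatchOpts,
): Promise<void> {
  // A plain typeof check distinguishes the legacy boolean argument
  // from the new options dict, keeping old call sites working
  const opts: BatchOpts =
    typeof skipKeyTransformOrOptsDict === 'boolean'
      ? { skipKeyTransform: skipKeyTransformOrOptsDict }
      : skipKeyTransformOrOptsDict ?? {}
  // ... apply ops using opts.sortOperations / opts.activatePathCache / opts.skipKeyTransform
}
```

Old call sites like `trie.batch(ops, true)` would then behave exactly as before.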
I did some further research into the effects of this in the context of other modules, and I agree about tweaking the options for how batch works based on the situation.
There are definitely circumstances in which it would be faster to simply iterate through the batch like we currently do, instead of sorting and caching. I also wonder how the results would change if we used different sorting and trie-walking algorithms. If there's something useful there, it could be another option in `batch()`.
I think any user with a bit of dyslexia will get dizzy trying to decipher that signature, though.
Thinking out loud here about the cache memory-overload issue.
The way we currently do the sorting, multiple batch ops for the same key remain in their original order relative to each other. So the order of multiple put or del operations for the same key should not be affected, and the end result should be the same.
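For illustration, this is just the stability property of the sort; a minimal sketch (op shape simplified, comparing raw key bytes):

```ts
type BatchDBOp =
  | { type: 'put'; key: Uint8Array; value: Uint8Array }
  | { type: 'del'; key: Uint8Array }

// Lexicographic byte comparison for keys
function compareBytes(a: Uint8Array, b: Uint8Array): number {
  const len = Math.min(a.length, b.length)
  for (let i = 0; i < len; i++) {
    if (a[i] !== b[i]) return a[i] - b[i]
  }
  return a.length - b.length
}

// Array.prototype.sort is required to be stable since ES2019, so two
// ops with an identical key keep their submitted relative order:
// [put k1, del k1] stays [put k1, del k1] after sorting
function orderBatchSketch(ops: BatchDBOp[]): BatchDBOp[] {
  return [...ops].sort((a, b) => compareBytes(a.key, b.key))
}
```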
This would for sure only be temporary until the next breaking releases (maybe autumn or so?); then we would remove the boolean variant again. This would have the advantage, though, that people could already move to the new API now and would not need to adjust again along the breaking releases, so I think I would still be a fan of taking this version. 🙂 (We can also pick these things up on the call.)
I am unsure, TBH. But it might very well be that it's a non-issue. I would be OK with trying it and, if things don't hold up for users in certain scenarios, eventually pushing another release with an additional flag.
I haven't looked at the sorting algorithm yet, but at the moment I find it hard to imagine a sorting algorithm so non-deterministic (in some sense; maybe "dynamic" might be a better word) that not one of the cases A, B and B, A is sorted in a way that changes behavior. 🤔 Not sure, maybe I'll have some additional time for a closer look at the algorithm later on.
Can this then please get an update, or otherwise please let me know where we still need to clarify things! 🙂 So, I would assume we now do:
Let me know if there is further need to discuss certain points; you can also already independently push the things which are unambiguous.
Continues work on optimizations described by @holgerd77 in #3293
`trie.batch()`

1. Externalized to `trie/batch.ts`

`trie.batch()` will now call an externalized function called `batch`, imported from `batch.ts`. `batch.ts` also contains helper functions and modified trie operations related to `batch`.

2. Return updated stack

`trie._updateNode` and `trie._createInitialNode` have been modified to return the stack of nodes after updating. This will be utilized in the batch optimization.
3. `orderBatch()`

`orderBatch()` is a helper function which sorts a batch of trie operations by key nibbles. This sorting allows for an optimized approach to updating trie nodes.
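A sketch of the nibble ordering idea (illustrative only, not the actual `orderBatch` code):

```ts
// Each key byte splits into two 4-bit nibbles, the alphabet of trie paths
function bytesToNibbles(key: Uint8Array): number[] {
  const nibbles: number[] = []
  for (const byte of key) {
    nibbles.push(byte >> 4, byte & 0x0f)
  }
  return nibbles
}

// Sorting keys lexicographically by nibbles means consecutive ops tend
// to share long path prefixes, which the path cache can then reuse
function compareNibbles(a: number[], b: number[]): number {
  const len = Math.min(a.length, b.length)
  for (let i = 0; i < len; i++) {
    if (a[i] !== b[i]) return a[i] - b[i]
  }
  return a.length - b.length
}
```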
4. Custom `_put` and `_del` functions

Modified versions of the trie operations `_put` and `_del` were added to `batch.ts`. These functions accept additional `path` parameters and return the modified `stack` of nodes following a trie update. This allows for caching of recently visited nodes within the `batch` operation, which reduces the amount of trie walking (`findPath()` calls) necessary for updates.
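In signature terms, the shape could look roughly like this (hypothetical declarations for illustration; the real functions in `batch.ts` operate on the trie's internal node types):

```ts
type Nibbles = number[]
// Stand-in for the trie's internal node union (branch/extension/leaf)
type TrieNode = unknown

// Unlike the regular trie methods, these variants receive the already
// traversed path and hand back the updated node stack for caching
declare function _put(
  key: Uint8Array,
  value: Uint8Array,
  path: { stack: TrieNode[]; remainingNibbles: Nibbles },
): Promise<TrieNode[]>

declare function _del(
  key: Uint8Array,
  path: { stack: TrieNode[]; remainingNibbles: Nibbles },
): Promise<TrieNode[]>
```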
5. `batchPut`

`batchPut` is a modified version of `trie.put` which acts as an intermediary between the `batch` method and the modified `_put` and `_del` methods.
6. `batch`

The refactored `batch` method keeps the same parameters, so no `batch` calls need to be modified elsewhere in the codebase. Using a `stackPathCache: Map<string, TrieNode>`, `_batch` will keep track of traversed node paths and nodes: with each trie update, the updated path from the new root to the new leaf will be stored in a map by its path nibbles. `_batch` will call `batchPut` for each `op` with the added parameters of the cached stack nodes and the nibble remainder of the next key.
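Putting the pieces together, a sketch of the cached batch loop (the helper signatures are assumptions based on the description above, not the actual `_batch` code):

```ts
type TrieNode = unknown
type BatchDBOp =
  | { type: 'put'; key: Uint8Array; value: Uint8Array }
  | { type: 'del'; key: Uint8Array }

// Assumed helpers, see the sketches above
declare function bytesToNibbles(key: Uint8Array): number[]
declare function batchPut(op: BatchDBOp, stack: TrieNode[]): Promise<TrieNode[]>

async function batchSketch(sortedOps: BatchDBOp[]): Promise<void> {
  // Updated nodes keyed by their nibble path from the root
  const stackPathCache: Map<string, TrieNode> = new Map()

  for (const op of sortedOps) {
    const nibbles = bytesToNibbles(op.key)

    // Reassemble the deepest cached stack along this key's path; since
    // ops are sorted, adjacent keys share prefixes and cache hits are likely
    const stack: TrieNode[] = []
    for (let i = 1; i <= nibbles.length; i++) {
      const node = stackPathCache.get(nibbles.slice(0, i).join(','))
      if (node === undefined) break
      stack.push(node)
    }

    // Continue the walk from the cached stack instead of the root,
    // saving findPath() calls, then cache the updated path
    // (prefix bookkeeping simplified; real paths advance by node key lengths)
    const updatedStack = await batchPut(op, stack)
    for (let i = 0; i < updatedStack.length; i++) {
      stackPathCache.set(nibbles.slice(0, i + 1).join(','), updatedStack[i])
    }
  }
}
```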
Benchmarks:

This refactored `batch` process completes 33% faster than before!