cip: Batched Anchor Data Structure #74

oed · 2020-11-12T14:10:32Z

PR for CIP-69.
Discussion #69

CIPs/CIP-69/CIP-69.md

stbrody · 2020-11-12T16:07:49Z

CIPs/CIP-69/CIP-69.md

+```js
+const { BloomFilter } = require('bloom-filters')
+const filterEntries = [...] // An array of strings
+const errorRate = 0.04 // 4 % error rate


Why .04? Was that chosen arbitrarily or based on some calculations about the size of the resulting filter?

Kind of arbitrarily. Just took it out of the readme of the bloom-filters package.

Ah I see. I guess I had been thinking this was supposed to be an example of how someone might consume the information stored in the bloomFilter field of the TreeMetadata to reconstitute the bloom filter and use it, but I guess this is more like example code of how we might build the bloom filter before persisting it in the TreeMetadata. In that case, is this example even really relevant/useful to include in this document?

i don't think it hurts to include?

Only risk is confusion/distraction from the main focus of how to use/consume this structure. Probably not a big deal either way though

stbrody · 2020-11-12T16:10:48Z

CIPs/CIP-69/CIP-69.md

+
+type TreeMetaData struct {
+  numEntries Int
+  bloomType String


I wonder if we want more than just a string for specifying the bloomType here. In order to consume the bloom filter users will have to know the arguments used to construct it (either size of the bloom filter and number of hash functions, or target number of items and false positive rate). To provide that to them we'll either need a public mapping of the bloomType string to the arguments used to construct it (so that we could change the bloomType string if we ever wanted to change the arguments we're using to construct the bloom filter, even if the underlying bloom filter library we're using doesn't change) or we need to encode those arguments here. I think putting them here probably makes more sense

Actually we have to put it here, because there's no way to know how many entries are in the bloom filter otherwise. Different anchor merkle trees could wind up with a different number of filter entries, and if we're generating the bloom filter based on the exact known number of entries, then that's going to change the size of the filter.

The library we intend on using will export to a format which includes this information: https://github.com/Callidon/bloom-filters#export-and-import
So the info will be in the bloomFilter property.

CIPs/CIP-69/CIP-69.md

stbrody · 2020-11-16T19:52:21Z

CIPs/CIP-69/CIP-69.md

 2. `schema` - if multiple documents have the same topic, sort by the *schema*
-3. `DID` - if multiple documents have the same topic and schema, extract and sorty by the DID from the `kid` in the signed record
+3. `controllers` - if multiple documents have the same topic and schema, sort by the first controller, then subsequent ones


For this to work right, we'll have to enforce that the controllers array in records is always stored sorted, regardless of the order the user specifies them in. Should we make a story for this now so we don't forget? Is there an epic for supporting multiple controllers we could put it in?

I guess that's true, unless you already know the order from a previous version of the document. I don't think there is an epic yet, but feel free to create one!

ceramicnetwork/js-ceramic#514

CIPs/CIP-69/CIP-69.md

stbrody · 2020-11-16T19:58:46Z

CIPs/CIP-69/CIP-69.md

+```js
+const { BloomFilter } = require('bloom-filters')
+const filterEntries = [...] // An array of strings
+const errorRate = 0.04 // 4 % error rate


Ah I see. I guess I had been thinking this was supposed to be an example of how someone might consume the information stored in the bloomFilter field of the TreeMetadata to reconstitute the bloom filter and use it, but I guess this is more like example code of how we might build the bloom filter before persisting it in the TreeMetadata. In that case, is this example even really relevant/useful to include in this document?

CIPs/CIP-69/CIP-69.md

oed · 2020-11-18T09:13:27Z

@stbrody Pushed some updates here.

stbrody · 2020-11-18T19:41:56Z

LGTM!

stbrody · 2020-12-03T19:35:23Z

CIPs/CIP-69/CIP-69.md

+type TreeMetadata struct {
+  numEntries Int
+  bloomType String
+  bloomFilter {String:Any}


Can't seem to make a mult-line suggestion in github but how about instead of separate top-level bloomType and bloomFilter fields we have a single top-level bloomFilter field whose value is an object with a type and a data field? So then we can add more tree metadata in the future and it's not mixed-in with bloom filter specific data

cip: Batched Anchor Data Structure

b04f40d

oed requested review from stbrody and simonovic86 November 12, 2020 14:10

oed self-assigned this Nov 12, 2020

oed mentioned this pull request Nov 12, 2020

Discussion: Batched Anchor Data Structure #69

Open

stbrody reviewed Nov 12, 2020

View reviewed changes

oed added 2 commits November 12, 2020 20:25

chore: change topic to collection

161e7ea

fix: updates

f29c701

stbrody reviewed Nov 16, 2020

View reviewed changes

fix: Rename topic to 'family'

650737c

stbrody approved these changes Nov 18, 2020

View reviewed changes

stbrody reviewed Dec 3, 2020

View reviewed changes

fix: update link to tree implementation

954c8f8

oed merged commit 33414a6 into master Dec 10, 2020

oed deleted the cip/batched-anchors branch February 16, 2021 08:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cip: Batched Anchor Data Structure #74

cip: Batched Anchor Data Structure #74

oed commented Nov 12, 2020

stbrody Nov 12, 2020

oed Nov 13, 2020

stbrody Nov 16, 2020

oed Nov 18, 2020

stbrody Nov 18, 2020

stbrody Nov 12, 2020

stbrody Nov 12, 2020

oed Nov 13, 2020

stbrody Nov 16, 2020

oed Nov 18, 2020 •

edited

Loading

stbrody Nov 18, 2020

stbrody Nov 16, 2020

oed commented Nov 18, 2020

stbrody commented Nov 18, 2020

stbrody Dec 3, 2020

oed Dec 4, 2020

cip: Batched Anchor Data Structure #74

cip: Batched Anchor Data Structure #74

Conversation

oed commented Nov 12, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oed Nov 18, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oed commented Nov 18, 2020

stbrody commented Nov 18, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oed Nov 18, 2020 •

edited

Loading