-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
some assumptions about additive indexes are not inline with real world usage #14
Comments
This seems to be so as per the intended meaning of additive_indexes. It should indicate that these indexes can be considered disconnected from each other. Data in one will not affect the other. Hence you can reuse them when calculating information for a certain set of allowed_groups. |
When reading the documentation, I came across a similar misinterpretation. Maybe this gives some guidance where the docs might become confusing. The question that spawned after reading the docs was:
Where the answer was
This answer is quite clear, but this case is not mentioned (as far as I can see) anywhere in the docs In the documentation, the following line was the main reason for confusion:
A mention when breaking the advice to avoid overlapping data, would make it clear that it is okay to do so in these cases. |
@x-m-el Updated the README with the suggestions you gave. Can you have another look? |
Yes, the extra text makes it clear for me 👍 |
When searching mu-search currently assumes that if it can find or create an index for each group specified as allowed group and that this should suffice. (based on my understanding of get_request_indexes )
This does unfortunately does not always hold, a very common case is information from the public allowed group and a private group being combined in the rdf properties of one document. In this case both the public index and private index lack information, whereas a non additive index for both groups would hold this information.
An example config.json demonstrating this case
and following quads:
would result in documents without mandatees if additive indexes are enabled.
We should at the very least address this issue in the readme with a note where we describing additive indexes and recommend not use them if this case occurs. A combination of eager indexing, with a proper configuration of the eager indexing groups and additive indexes might work and should also be documented if so.
We can also change the assumption and change our understanding of additive indexes. We should then define what an additive index is and what indexes are expected to be created when indexes are created non eagerly. if we create all possible permutations this could cause the creation of a lot of indexes.
The text was updated successfully, but these errors were encountered: