Add "needle in the haystack" queries

The JSONBench dataset contains fields with big number of unique values (aka high-cardinality fields):

- did (aka user_id)
- commit.cid (aka commit_id)
- commit.record.subject.cid

Sometimes it is needed to find all the rows for the particular rarely seen value of some field. For example, to find all the rows generated by some user. Then the following query can be used for JSONBench data:

```sql
SELECT count(*) FROM bluesky WHERE data.did = 'did:plc:stwikwzlk2mepaagokthylry'
```

Another practical query is to select a row for the given commit_id:

```sql
SELECT * FROM bluesky WHERE data.commit.cid = 'bafyreielfqkpggsdqwtbtg5tyh7iqytp64paevfjbeufnw6kc7sgmjemhm'
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add "needle in the haystack" queries #8

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add "needle in the haystack" queries #8

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions