feat: 99% credible set validation during `study_locus_validation` #765

d0choa · 2024-09-16T14:00:07Z

Includes:

Filter credible sets by 95% confidence intervals
Removes unnecessary filters in other steps

addramir · 2024-09-16T16:04:14Z

We previously discussed 99% CS, not 95% to be more comprehensive. And actually removing it from coloc and everything else.
Moreover, this filter doesn't work for our susie credible sets, because we don't really populate thes columns (we know that these CSs are 99%), both columns is95CredibleSet and is99CredibleSet are Null.

d0choa · 2024-09-16T16:09:12Z

It's doable, but we must fix a few other problems then. Let's discuss this in person

DSuveges · 2024-09-17T08:39:36Z

Removes unnecessary filters in other steps

I like the idea of having one single point in the process where things get dropped. Although pruning the locus object would is radically different from the validation of other datasets (as it would not lead to any new flagged objects and the filtered out tags would not get anywhere unlike invalid studies or study loci. They just disappear.) It would make sense to have it in the validation and would make the resulting datasets consistent across all applications.

@addramir

Moreover, this filter doesn't work for our susie credible sets, because we don't really populate thes columns (we know that these CSs are 99%), both columns is95CredibleSet and is99CredibleSet are Null.

To be honest, I don't really like this inconsistency. Would it be possible to make all cred.set dataset similar? Similarly, if we don't care about different levels of confidence, and would keep everything 99%, we can just drop the column from the schema.

d0choa · 2024-09-17T09:22:20Z

I need to adjust this PR based on the new decisions described on the ticket opentargets/issues#3468

d0choa · 2024-09-18T12:56:57Z

Implementing the new decisions described on the ticket opentargets/issues#3468 is pretty simple.

Now, all credible sets are annotated when we try to filter them. That would make all the logic work as long as the locus contains a populated posteriorProbability column. (including SuSie credible sets)

We need to remember that the current PICS results are filtered to 95%, so much of this will not have an effect until we re-run PICS.

DSuveges

All makes sense:

Annotating PICSed credible sets upon creation.
Filtering method has the logic to do the annotation as well.
Filtering is happening at the validation step.
As the dataset is already filtered, coloc doesn't need to apply filter anymore.

d0choa added 2 commits September 16, 2024 14:47

feat: study locus validation filters for 95% credible sets

1fdaeca

revert: no longer needed to filter for credible set interval

7d9e964

d0choa requested a review from addramir September 16, 2024 14:00

github-actions bot added size-S Method Step Feature labels Sep 16, 2024

d0choa added 2 commits September 18, 2024 09:23

Merge branch 'dev' into do_credible_set_95_validation

47f8210

feat: annotate credible sets before filter them

690b492

github-actions bot added the Dataset label Sep 18, 2024

d0choa and others added 2 commits September 18, 2024 11:52

docs: adding more context here

41a6c21

Merge branch 'dev' into do_credible_set_95_validation

eaf7089

d0choa marked this pull request as ready for review September 18, 2024 12:53

d0choa changed the title ~~feat: 95% credible set validation during study_locus_validation~~ feat: 99% credible set validation during study_locus_validation Sep 18, 2024

d0choa requested a review from DSuveges September 18, 2024 12:58

chore: merge dev

0a42029

github-actions bot removed the Method label Sep 24, 2024

chore: merge dev

48e9e91

DSuveges approved these changes Sep 24, 2024

View reviewed changes

Merge branch 'dev' into do_credible_set_95_validation

a3510f9

d0choa merged commit 84d6638 into dev Sep 24, 2024
5 checks passed

d0choa deleted the do_credible_set_95_validation branch September 24, 2024 15:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: 99% credible set validation during `study_locus_validation` #765

feat: 99% credible set validation during `study_locus_validation` #765

d0choa commented Sep 16, 2024

addramir commented Sep 16, 2024

d0choa commented Sep 16, 2024

DSuveges commented Sep 17, 2024 •

edited

Loading

d0choa commented Sep 17, 2024 •

edited

Loading

d0choa commented Sep 18, 2024 •

edited

Loading

DSuveges left a comment

feat: 99% credible set validation during study_locus_validation #765

feat: 99% credible set validation during study_locus_validation #765

Conversation

d0choa commented Sep 16, 2024

addramir commented Sep 16, 2024

d0choa commented Sep 16, 2024

DSuveges commented Sep 17, 2024 • edited Loading

d0choa commented Sep 17, 2024 • edited Loading

d0choa commented Sep 18, 2024 • edited Loading

DSuveges left a comment

Choose a reason for hiding this comment

feat: 99% credible set validation during `study_locus_validation` #765

feat: 99% credible set validation during `study_locus_validation` #765

DSuveges commented Sep 17, 2024 •

edited

Loading

d0choa commented Sep 17, 2024 •

edited

Loading

d0choa commented Sep 18, 2024 •

edited

Loading