Use ClinVar submissions file (3/N) by pj-sullivan · Pull Request #277 · diskin-lab-chop/AutoGVP

pj-sullivan · 2026-01-30T21:28:51Z

Purpose/implementation Section

What feature is being added or bug is being addressed?

Replacing the input ClinVar file with the ClinVar submissions file that has been filtered and cleaned in a prior step.

What was your approach?

Combined the cavatica and custom input scripts into one as the differences between them no longer exist.

What GitHub issue does your pull request address?

Closes #275

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Check the output for any new columns we don't want, or missing columns. Currently aware the ClinVar star value is missing as it is not supplied in the submissions file, but I can add that into the select submissions script.

Which areas should receive a particularly close look?

Is there anything that you want to discuss further?

Documentation Checklist

The function has examples to showcase the usage
Added a vignette

…-clinvar

rjcorb

I've done a few test runs on PBTA samples and only noticed minor discrepancies between variant calls that should be resolved with ticket #278. I pushed a few changes I made during my test runs. I think we'll just want to double-check variant names in these scripts, and will also need to update the README with new example commands.

rjcorb · 2026-02-05T20:16:16Z

run_autogvp.sh

 ## default files
 variant_summary_file="$BASEDIR/data/ClinVar-selected-submissions.tsv"


Suggested change

rjcorb · 2026-02-05T20:18:48Z

scripts/02-annotate_variants.R

Suggested change

output_tab_abr_file <- paste0(output_name, ".annotations_report.abridged.tsv")

Take ClinSig when not conflicting (4/N)

rjcorb

I tested this again using some PBTA samples, and, as before, we only see discrepancies due to including the P/LP and B/LB classifications. I left one minor suggested change, and I also just noticed that there is a typo in the resolve-clinvar-intepretations.R script name :).

I will approve with merge contingent on updates above, and I also think we should have @rebkau review. @rebkau would you be able to test this on some of the NBL samples? Here was my process:

Run sample(s) through AutoGVP on main branch:

bash run_autogvp.sh --vcf=<vcf_file> \
--filter_criteria=<filtering_criteria>' \
--clinvar=data/clinvar_20260104.vcf.gz \
--intervar=<intervar_path> \
--multianno=<multianno_path> \
--autopvs1=<autopvs1_path> \
--outdir=results \
--out=<name> \
--selected_clinvar_submissions=refs/ClinVar-selected-submissions.tsv

note you would need to run the select-clinVar-submissions.R Rscript beforehand

Rscript scripts/select-clinVar-submissions.R --variant_summary data/variant_summary_20260104.txt.gz --submission_summary data/submission_summary_20260104.txt.gz --conceptID_list refs/clinvar_cancer_concept_ids_20260130.txt --outdir refs/

Switch to this branch and run new resolve-clinvar-intepretations.R script

Rscript scripts/resolve-clinvar-intepretations.R --variant_summary data/variant_summary_20260104.txt.gz --submission_summary data/submission_summary_20260104.txt.gz --conceptID_list refs/clinvar_cancer_concept_ids_20260130.txt --outdir refs/

Run AutoGVP on this branch:

bash run_autogvp.sh --vcf=<vcf_file> \
--filter_criteria=<filtering_criteria>' \
--intervar=<intervar_path> \
--multianno=<multianno_path> \
--autopvs1=<autopvs1_path> \
--outdir=results \
--out=<name> \
--sample_id=<sample_id> \
--resolved_clinvar=refs/resolved-clinvar-interpretations.tsv

Compare pathogenicity calls between runs. The only differences I noticed were cases where a variant was called P or LP or B or LB originally (in main), but called P/LP or B/LB in the branch results. That is an expected change, since we are no longer trying to resolve these calls and just taking the call as is when it's reported in ClinVar.

pj-sullivan added 3 commits January 28, 2026 16:11

replace clinvar input with selected submissions

9af35f6

update input arguments

260e36b

Merge branch 'pj-sullivan/select-submissions' into pj-sullivan/filter…

8526324

…-clinvar

pj-sullivan requested a review from rjcorb January 30, 2026 21:28

pj-sullivan self-assigned this Jan 30, 2026

pj-sullivan changed the base branch from main to pj-sullivan/select-submissions January 30, 2026 21:29

pj-sullivan changed the title ~~Use ClinVar submissions file~~ Use ClinVar submissions file (3/N) Jan 30, 2026

pj-sullivan and others added 2 commits January 30, 2026 21:29

Apply code style changes

eab7924

clean up testing lines

018fed5

Base automatically changed from pj-sullivan/select-submissions to main February 2, 2026 15:36

pj-sullivan added 2 commits February 2, 2026 14:39

remove different versions of 02-annotate

de669a0

update run script with edited input files

1bc3e3d

pj-sullivan marked this pull request as ready for review February 2, 2026 20:17

jharenza requested a review from rebkau February 5, 2026 18:06

update select_submissions variable in scripts

ef54f03

rjcorb reviewed Feb 5, 2026

View reviewed changes

pj-sullivan and others added 6 commits February 10, 2026 15:11

use variant clinsig instead of submission

591e8da

update to use variant clinsig

c9655da

rename resolve clinvar

dad9ce6

update links to resolve clinvar script

0fd64df

Merge pull request #279 from diskin-lab-chop/pj-sullivan/clinsig

da85820

Take ClinSig when not conflicting (4/N)

Apply code style changes

b1bfe1f

rjcorb self-requested a review February 24, 2026 14:52

rjcorb approved these changes Feb 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use ClinVar submissions file (3/N)#277

Use ClinVar submissions file (3/N)#277
pj-sullivan wants to merge 14 commits intomainfrom
pj-sullivan/filter-clinvar

pj-sullivan commented Jan 30, 2026 •

edited

Loading

Uh oh!

rjcorb left a comment

Uh oh!

rjcorb Feb 5, 2026

Uh oh!

rjcorb Feb 5, 2026

Uh oh!

rjcorb left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		## default files
		variant_summary_file="$BASEDIR/data/ClinVar-selected-submissions.tsv"


	output_tab_abr_file <- paste0(output_name, ".annotations_report.abridged.tsv")

Conversation

pj-sullivan commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose/implementation Section

What feature is being added or bug is being addressed?

What was your approach?

What GitHub issue does your pull request address?

Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.

Which areas should receive a particularly close look?

Is there anything that you want to discuss further?

Documentation Checklist

Uh oh!

rjcorb left a comment

Choose a reason for hiding this comment

Uh oh!

rjcorb Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

rjcorb Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

rjcorb left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pj-sullivan commented Jan 30, 2026 •

edited

Loading