-
Notifications
You must be signed in to change notification settings - Fork 194
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About the usage of vg deconstruct #4356
Comments
Yeah, this is expected and super annoying. The issue is that the snarl decomposition, which determines the sites in the VCF, is somewhat dependent on the "rooting" of the snarl tree. And this rooting can be insconsistent between a graph and subgraph, when computed independently. One work-around might be to use |
I think the main obstacle to doing something like that was UI; we'd need to come up with a way for the user to explain how they want their snarls rooted and then bring it all the way through, and we'd probably want to have it in anything that can need to make snarls. Now that we have a stronger notion of reference-sense paths, it might be possible to prefer those for anchoring by default? |
Hi, I am wondering if it is possible to use |
The idea now is for using haplotype sampling only for mapping. You still call all samples on the full graph (with which the sampled GAMs are compatible) |
Hi @glennhickey , Thanks for the suggestion, I have tried calling variants on the full graph. I ran When considering variants with INDEL length> 20 (i.e.,
It is really great to see most of variants of |
Hi!
I used vg chunk to extract subgraphs from a pan-genome graph and then used vg deconstruct to obtain VCF files for both the pan-genome graph and the subgraph. However, I noticed that the VCF file for the subgraph contains variants that are not present in the pan-genome graph. Is this expected?
Thank you very much!
The text was updated successfully, but these errors were encountered: