Recommendation for AT-high and very repetitive genome #261

Open
V-JJ opened this issue May 31, 2023 · 0 comments
V-JJ commented May 31, 2023

Hello @ruanjue!

We have to assemble a genome with the following features:

  • Genome size: ~2Gb
  • High repetitiveness: ~78.68% (inferred from a closely related species)
  • High AT content: ~72.36% (inferred from a closely related species)
  • PacBio HiFi sequencing: 130x coverage

We've looked through some of the existing issues, such as #239, and came up with the following tests, on which we'd like your opinion or recommendations. Each test has two steps: (a) assembly with wtdbg2; (b) polishing with wtdbg-cns (with minimap2 as the read mapper).

List of possible tests/ideas:

  1. Default options:
       • wtdbg2 -g 2.05g -t 24 -x sq
       • wtdbg2 -g 2.05g -t 24 -x ccs
  2. Add the -R parameter:
       • wtdbg2 -g 2.05g -t 24 -x sq -R
  3. Increase the -s parameter (0.5 or 0.7, as in "Parameters for triploid HIFI" #239):
       • wtdbg2 -g 2.05g -t 24 -x sq -s 0.5
  4. Vary -L, increasing it to keep only the longest reads.
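
For reference, the tests listed above could be laid out as a small dry-run script that only prints the wtdbg2 command for each run. This is a minimal sketch, not a settled protocol: the read file name (`hifi_reads.fq.gz`), the output prefixes, and the `-L` value are placeholders we made up for illustration, and `-i`/`-fo` are added only because the commands above omit the input and output arguments.

```shell
#!/bin/sh
# Dry run: print one wtdbg2 command per test instead of executing it.
# READS, the output prefixes and MIN_LEN are hypothetical placeholders.
GENOME_SIZE=2.05g
THREADS=24
READS=hifi_reads.fq.gz
MIN_LEN=10000   # placeholder for the -L test; no value has been chosen yet

# build_cmd PREFIX [extra wtdbg2 options...]
# Prints the full command line for one test run.
build_cmd() {
    prefix=$1
    shift
    echo "wtdbg2 $* -g $GENOME_SIZE -t $THREADS -i $READS -fo $prefix"
}

build_cmd t1_sq   -x sq                  # test 1: defaults, sq preset
build_cmd t1_ccs  -x ccs                 # test 1: defaults, ccs preset
build_cmd t2_R    -x sq -R               # test 2: add -R
build_cmd t3_s05  -x sq -s 0.5           # test 3: -s 0.5
build_cmd t3_s07  -x sq -s 0.7           # test 3: -s 0.7
build_cmd t4_L    -x sq -L "$MIN_LEN"    # test 4: raise -L
```

Keeping each run under its own output prefix should make the resulting assemblies easy to compare side by side once the commands are actually executed.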

Any advice would be appreciated.
Thanks in advance!
