-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CCS data assemble far too small #260
Comments
Then I used the default parameter and did not choose the -x option like this : (please ignore the different name of input, they are actually the same file ) If the kmer distribution is not good, please kill me and adjust -k, -p, and -KCannot get a good distribution anyway, should adjust -S -s, also -A -e in assembly** PROC_STAT(0) **: real 3155.775 sec, user 6807.670 sec, sys 931.790 sec, maxrss 118617004.0 kB, maxvsize 189408212.0 kB this time seemed to get a much better assemble result : |
Please have a look at #259 |
Thank you for your reply. I tried hifiasm too but the software ran for weeks and didnot output any log and files which seemed to be abnormal for a 600m genome. So I really need your help. Do you have any other suggestions for wtdbg2? |
Please check your fastq data first, and read the #259 again. |
Dear prof. Ruan
I assembled a plant genome (~600m ~1.94% heterozygosity) based on ~400G Pacbio Sequel II CCS data with the followed line:
wtdbg2 -t 0 -x ccs -g 600m -i ccs23.fastq.gz -o beichai34 -e 2
The kmer distribution was like this:
|
|
|
|
|
|
|
|
|
||
||
||
||
||
|||
|||
||||
|||||
|||||||
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
********************** 1 - 201 **********************
Quatiles:
10% 20% 30% 40% 50% 60% 70% 80% 90% 95%
1 2 4 6 11 20 55 269 1779 9742
** PROC_STAT(0) **: real 2439.237 sec, user 6326.840 sec, sys 773.720 sec, maxrss 95177552.0 kB, maxvsize 130789572.0 kB
[Wed Apr 19 12:10:56 2023] - high frequency kmer depth is set to 13776
[Wed Apr 19 12:10:56 2023] - Total kmers = 728629161
[Wed Apr 19 12:10:56 2023] - average kmer depth = 7
[Wed Apr 19 12:10:56 2023] - 368836231 low frequency kmers (<2)
[Wed Apr 19 12:10:56 2023] - 4011 high frequency kmers (>13776)
Finally obtained only 4 contigs TOT 54784.
How to adjust the parameter to get a reliable output in this case ? thanks alot.
The text was updated successfully, but these errors were encountered: