You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
License permits unrestricted use (educational + commercial)
Please also take note of the reviewer guidelines below to facilitate a smooth review process.
FOR REVIEWERS:
.dockstore.yml: file is present and aligned with creator metadata in workflow. ORCID identifiers are strongly encouraged in creator metadata. The .dockstore.yml file is required to run tests
Workflow is sufficiently generic to be used with lab data and does not hardcode sample names, reference data and can be run without reading an accompanying tutorial.
In workflow: annotation field contains short description of what the workflow does. Should start with This workflow does/runs/performs … xyz … to generate/analyze/etc …
In workflow: workflow inputs and outputs have human readable names (spaces are fine, no underscore, dash only where spelling dictates it), no abbreviation unless it is generally understood. Altering input or output labels requires adjusting these labels in the the workflow-tests.yml file as well
In workflow: name field should be human readable (spaces are fine, no underscore, dash only where spelling dictates it), no abbreviation unless generally understood
Workflow folder: prefer dash (-) over underscore (_), prefer all lowercase. Folder becomes repository in iwc-workflows organization and is included in TRS id
Readme explains what workflow does, what are valid inputs and what outputs users can expect. If a tutorial or other resources exist they can be linked. If a similar workflow exists in IWC readme should explain differences with existing workflow and when one might prefer one workflow over another
Changelog contains appropriate entries
Large files (> 100 KB) are uploaded to zenodo and location urls are used in test file
WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)TrimmomaticPE: Started with arguments: -phred33 input_forward.fastqsanger.gz input_reverse.fastqsanger.gz input_forward.fastqsanger.gz.paired input_forward.fastqsanger.gz.unpaired input_reverse.fastqsanger.gz.paired input_reverse.fastqsanger.gz.unpaired LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36Multiple cores found: Using 4 threadsInput Read Pairs: 125000 Both Surviving: 125000 (100.00%) Forward Only Surviving: 0 (0.00%) Reverse Only Surviving: 0 (0.00%) Dropped: 0 (0.00%)TrimmomaticPE: Completed successfullyvsearch v2.27.1_linux_x86_64, 15.6GB RAM, 4 coreshttps://github.com/torognes/vsearchMerging reads 100% 125000 Pairs 69309 Merged (55.4%) 55691 Not merged (44.6%)Pairs that failed merging due to various reasons: 39021 too few kmers found on same diagonal 265 multiple potential alignments 12101 alignment score too low, or score drop too high 295 overlap too short 4009 staggered read pairsStatistics of all reads: 92.44 Mean read lengthStatistics of merged reads: 109.97 Mean fragment length 40.60 Standard deviation of fragment length 0.07 Mean expected error in forward sequences 0.06 Mean expected error in reverse sequences 0.04 Mean expected error in merged sequences 0.01 Mean observed errors in merged region of forward sequences 0.00 Mean observed errors in merged region of reverse sequences 0.01 Mean observed errors in merged regionWARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)INFO:root:DIAMOND blastx alignmentINFO:root:Running: diamond blastx -q deeparg_short_reads_output/SR.clean -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4/database/v2/features -k 1000 --id 80.0 --sensitive -e 1e-10 -a deeparg_short_reads_output/SR.clean.deeparg.alignINFO:root:Input:deeparg_short_reads_output/SR.clean output:deeparg_short_reads_output/SR.clean.deeparg model: deepARG_SS, Input type: blastxINFO:root:parsing output file diamond view -a deeparg_short_reads_output/SR.clean.deeparg.align.daa -o deeparg_short_reads_output/SR.clean.deeparg.align.daa.tsvdiamond v2.1.10.164 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of TuebingenDocumentation, support and updates available at http://www.diamondsearch.orgPlease cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)#CPU threads: 4Loading subject IDs... [0s]Scoring parameters: (Matrix=blosum62 Lambda=0.267 K=0.041 Penalties=11/1)DB sequences = 12279DB sequences used = 642DB letters = 5088898#Target sequences to report alignments for: 25Generating output... [0.001s]INFO:root:Loading deep learning model ...INFO:root:loading gene lengthsINFO:root:Loading sample to analyze0reads [00:00, ?reads/s]1237reads [00:00, 244169.33reads/s]INFO:root:Predicting ARG-like reads: Running deepARG_SS model version v2INFO:root:input dataset is splitted into chunks of 10000 reads0chunks [00:00, ?chunks/s]1chunks [00:07, 7.44s/chunks]1chunks [00:07, 7.44s/chunks]INFO:root:Predicting ARGs 0%| | 0/136 [00:00<?, ?it/s]100%|██████████| 136/136 [00:00<00:00, 41731.31it/s]180691 reads; of these: 180691 (100.00%) were unpaired; of these: 180399 (99.84%) aligned 0 times 0 (0.00%) aligned exactly 1 time 292 (0.16%) aligned >1 times0.16% overall alignment rate
Standard Output:
Step 1: Trimming and QC using Trimmomatic['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 2: Merging paired end reads using Vsearch['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 3: Run DeepARG-SS to identify ARG-like readsdeeparg predict --type nucl --model SS -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4 -i deeparg_short_reads_output/SR.clean -o deeparg_short_reads_output/SR.clean.deeparg --arg-alignment-identity 80.0 --min-prob 0.8 --arg-alignment-evalue 1e-10Step 4: Quantification of ARG-like countssort -k1,1 -k2,2n deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG | bedtools merge -c 12,5 -o sum,distinct >deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG.mergedStep 5: Normalize to 16S rRNAs - this may take a while ...Total number of 16S Reads in the sample: 178
WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)TrimmomaticPE: Started with arguments: -phred33 input_forward.fastqsanger.gz input_reverse.fastqsanger.gz input_forward.fastqsanger.gz.paired input_forward.fastqsanger.gz.unpaired input_reverse.fastqsanger.gz.paired input_reverse.fastqsanger.gz.unpaired LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36Multiple cores found: Using 4 threadsInput Read Pairs: 125000 Both Surviving: 125000 (100.00%) Forward Only Surviving: 0 (0.00%) Reverse Only Surviving: 0 (0.00%) Dropped: 0 (0.00%)TrimmomaticPE: Completed successfullyvsearch v2.27.1_linux_x86_64, 15.6GB RAM, 4 coreshttps://github.com/torognes/vsearchMerging reads 100% 125000 Pairs 69309 Merged (55.4%) 55691 Not merged (44.6%)Pairs that failed merging due to various reasons: 39021 too few kmers found on same diagonal 265 multiple potential alignments 12101 alignment score too low, or score drop too high 295 overlap too short 4009 staggered read pairsStatistics of all reads: 92.44 Mean read lengthStatistics of merged reads: 109.97 Mean fragment length 40.60 Standard deviation of fragment length 0.07 Mean expected error in forward sequences 0.06 Mean expected error in reverse sequences 0.04 Mean expected error in merged sequences 0.01 Mean observed errors in merged region of forward sequences 0.00 Mean observed errors in merged region of reverse sequences 0.01 Mean observed errors in merged regionWARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)INFO:root:DIAMOND blastx alignmentINFO:root:Running: diamond blastx -q deeparg_short_reads_output/SR.clean -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4/database/v2/features -k 1000 --id 80.0 --sensitive -e 1e-10 -a deeparg_short_reads_output/SR.clean.deeparg.alignINFO:root:Input:deeparg_short_reads_output/SR.clean output:deeparg_short_reads_output/SR.clean.deeparg model: deepARG_SS, Input type: blastxINFO:root:parsing output file diamond view -a deeparg_short_reads_output/SR.clean.deeparg.align.daa -o deeparg_short_reads_output/SR.clean.deeparg.align.daa.tsvdiamond v2.1.10.164 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of TuebingenDocumentation, support and updates available at http://www.diamondsearch.orgPlease cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)#CPU threads: 4Loading subject IDs... [0s]Scoring parameters: (Matrix=blosum62 Lambda=0.267 K=0.041 Penalties=11/1)DB sequences = 12279DB sequences used = 642DB letters = 5088898#Target sequences to report alignments for: 25Generating output... [0.001s]INFO:root:Loading deep learning model ...INFO:root:loading gene lengthsINFO:root:Loading sample to analyze0reads [00:00, ?reads/s]1237reads [00:00, 256164.41reads/s]INFO:root:Predicting ARG-like reads: Running deepARG_SS model version v2INFO:root:input dataset is splitted into chunks of 10000 reads0chunks [00:00, ?chunks/s]1chunks [00:07, 7.95s/chunks]1chunks [00:07, 7.95s/chunks]INFO:root:Predicting ARGs 0%| | 0/136 [00:00<?, ?it/s]100%|██████████| 136/136 [00:00<00:00, 40366.95it/s]180691 reads; of these: 180691 (100.00%) were unpaired; of these: 180399 (99.84%) aligned 0 times 0 (0.00%) aligned exactly 1 time 292 (0.16%) aligned >1 times0.16% overall alignment rate
Standard Output:
Step 1: Trimming and QC using Trimmomatic['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 2: Merging paired end reads using Vsearch['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 3: Run DeepARG-SS to identify ARG-like readsdeeparg predict --type nucl --model SS -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4 -i deeparg_short_reads_output/SR.clean -o deeparg_short_reads_output/SR.clean.deeparg --arg-alignment-identity 80.0 --min-prob 0.8 --arg-alignment-evalue 1e-10Step 4: Quantification of ARG-like countssort -k1,1 -k2,2n deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG | bedtools merge -c 12,5 -o sum,distinct >deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG.mergedStep 5: Normalize to 16S rRNAs - this may take a while ...Total number of 16S Reads in the sample: 178
WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)TrimmomaticPE: Started with arguments: -phred33 input_forward.fastqsanger.gz input_reverse.fastqsanger.gz input_forward.fastqsanger.gz.paired input_forward.fastqsanger.gz.unpaired input_reverse.fastqsanger.gz.paired input_reverse.fastqsanger.gz.unpaired LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36Multiple cores found: Using 4 threadsInput Read Pairs: 125000 Both Surviving: 125000 (100.00%) Forward Only Surviving: 0 (0.00%) Reverse Only Surviving: 0 (0.00%) Dropped: 0 (0.00%)TrimmomaticPE: Completed successfullyvsearch v2.27.1_linux_x86_64, 15.6GB RAM, 4 coreshttps://github.com/torognes/vsearchMerging reads 100% 125000 Pairs 69309 Merged (55.4%) 55691 Not merged (44.6%)Pairs that failed merging due to various reasons: 39021 too few kmers found on same diagonal 265 multiple potential alignments 12101 alignment score too low, or score drop too high 295 overlap too short 4009 staggered read pairsStatistics of all reads: 92.44 Mean read lengthStatistics of merged reads: 109.97 Mean fragment length 40.60 Standard deviation of fragment length 0.07 Mean expected error in forward sequences 0.06 Mean expected error in reverse sequences 0.04 Mean expected error in merged sequences 0.01 Mean observed errors in merged region of forward sequences 0.00 Mean observed errors in merged region of reverse sequences 0.01 Mean observed errors in merged regionWARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)INFO:root:DIAMOND blastx alignmentINFO:root:Running: diamond blastx -q deeparg_short_reads_output/SR.clean -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4/database/v2/features -k 1000 --id 80.0 --sensitive -e 1e-10 -a deeparg_short_reads_output/SR.clean.deeparg.alignINFO:root:Input:deeparg_short_reads_output/SR.clean output:deeparg_short_reads_output/SR.clean.deeparg model: deepARG_SS, Input type: blastxINFO:root:parsing output file diamond view -a deeparg_short_reads_output/SR.clean.deeparg.align.daa -o deeparg_short_reads_output/SR.clean.deeparg.align.daa.tsvdiamond v2.1.10.164 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of TuebingenDocumentation, support and updates available at http://www.diamondsearch.orgPlease cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)#CPU threads: 4Loading subject IDs... [0s]Scoring parameters: (Matrix=blosum62 Lambda=0.267 K=0.041 Penalties=11/1)DB sequences = 12279DB sequences used = 642DB letters = 5088898#Target sequences to report alignments for: 25Generating output... [0.001s]INFO:root:Loading deep learning model ...INFO:root:loading gene lengthsINFO:root:Loading sample to analyze0reads [00:00, ?reads/s]1237reads [00:00, 191721.01reads/s]INFO:root:Predicting ARG-like reads: Running deepARG_SS model version v2INFO:root:input dataset is splitted into chunks of 10000 reads0chunks [00:00, ?chunks/s]1chunks [00:06, 6.69s/chunks]1chunks [00:06, 6.69s/chunks]INFO:root:Predicting ARGs 0%| | 0/136 [00:00<?, ?it/s]100%|██████████| 136/136 [00:00<00:00, 40937.66it/s]180691 reads; of these: 180691 (100.00%) were unpaired; of these: 180399 (99.84%) aligned 0 times 0 (0.00%) aligned exactly 1 time 292 (0.16%) aligned >1 times0.16% overall alignment rate
Standard Output:
Step 1: Trimming and QC using Trimmomatic['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 2: Merging paired end reads using Vsearch['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 3: Run DeepARG-SS to identify ARG-like readsdeeparg predict --type nucl --model SS -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4 -i deeparg_short_reads_output/SR.clean -o deeparg_short_reads_output/SR.clean.deeparg --arg-alignment-identity 80.0 --min-prob 0.8 --arg-alignment-evalue 1e-10Step 4: Quantification of ARG-like countssort -k1,1 -k2,2n deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG | bedtools merge -c 12,5 -o sum,distinct >deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG.mergedStep 5: Normalize to 16S rRNAs - this may take a while ...Total number of 16S Reads in the sample: 178
WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)TrimmomaticPE: Started with arguments: -phred33 input_forward.fastqsanger.gz input_reverse.fastqsanger.gz input_forward.fastqsanger.gz.paired input_forward.fastqsanger.gz.unpaired input_reverse.fastqsanger.gz.paired input_reverse.fastqsanger.gz.unpaired LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36Multiple cores found: Using 4 threadsInput Read Pairs: 125000 Both Surviving: 125000 (100.00%) Forward Only Surviving: 0 (0.00%) Reverse Only Surviving: 0 (0.00%) Dropped: 0 (0.00%)TrimmomaticPE: Completed successfullyvsearch v2.27.1_linux_x86_64, 15.6GB RAM, 4 coreshttps://github.com/torognes/vsearchMerging reads 100% 125000 Pairs 69309 Merged (55.4%) 55691 Not merged (44.6%)Pairs that failed merging due to various reasons: 39021 too few kmers found on same diagonal 265 multiple potential alignments 12101 alignment score too low, or score drop too high 295 overlap too short 4009 staggered read pairsStatistics of all reads: 92.44 Mean read lengthStatistics of merged reads: 109.97 Mean fragment length 40.60 Standard deviation of fragment length 0.07 Mean expected error in forward sequences 0.06 Mean expected error in reverse sequences 0.04 Mean expected error in merged sequences 0.01 Mean observed errors in merged region of forward sequences 0.00 Mean observed errors in merged region of reverse sequences 0.01 Mean observed errors in merged regionWARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)INFO:root:DIAMOND blastx alignmentINFO:root:Running: diamond blastx -q deeparg_short_reads_output/SR.clean -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4/database/v2/features -k 1000 --id 80.0 --sensitive -e 1e-10 -a deeparg_short_reads_output/SR.clean.deeparg.alignINFO:root:Input:deeparg_short_reads_output/SR.clean output:deeparg_short_reads_output/SR.clean.deeparg model: deepARG_SS, Input type: blastxINFO:root:parsing output file diamond view -a deeparg_short_reads_output/SR.clean.deeparg.align.daa -o deeparg_short_reads_output/SR.clean.deeparg.align.daa.tsvdiamond v2.1.10.164 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of TuebingenDocumentation, support and updates available at http://www.diamondsearch.orgPlease cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)#CPU threads: 4Loading subject IDs... [0s]Scoring parameters: (Matrix=blosum62 Lambda=0.267 K=0.041 Penalties=11/1)DB sequences = 12279DB sequences used = 642DB letters = 5088898#Target sequences to report alignments for: 25Generating output... [0.002s]INFO:root:Loading deep learning model ...INFO:root:loading gene lengthsINFO:root:Loading sample to analyze0reads [00:00, ?reads/s]1237reads [00:00, 150833.01reads/s]INFO:root:Predicting ARG-like reads: Running deepARG_SS model version v2INFO:root:input dataset is splitted into chunks of 10000 reads0chunks [00:00, ?chunks/s]1chunks [00:07, 7.03s/chunks]1chunks [00:07, 7.03s/chunks]INFO:root:Predicting ARGs 0%| | 0/136 [00:00<?, ?it/s]100%|██████████| 136/136 [00:00<00:00, 39418.52it/s]180691 reads; of these: 180691 (100.00%) were unpaired; of these: 180399 (99.84%) aligned 0 times 0 (0.00%) aligned exactly 1 time 292 (0.16%) aligned >1 times0.16% overall alignment rate
Standard Output:
Step 1: Trimming and QC using Trimmomatic['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 2: Merging paired end reads using Vsearch['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 3: Run DeepARG-SS to identify ARG-like readsdeeparg predict --type nucl --model SS -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4 -i deeparg_short_reads_output/SR.clean -o deeparg_short_reads_output/SR.clean.deeparg --arg-alignment-identity 80.0 --min-prob 0.8 --arg-alignment-evalue 1e-10Step 4: Quantification of ARG-like countssort -k1,1 -k2,2n deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG | bedtools merge -c 12,5 -o sum,distinct >deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG.mergedStep 5: Normalize to 16S rRNAs - this may take a while ...Total number of 16S Reads in the sample: 178
Unexpected HTTP status code: 400: {"err_msg":"Workflow cannot be run because input step '14' (DeepARG database) is not optional and no input provided.","err_code":0}
Unexpected HTTP status code: 400: {"err_msg":"Workflow cannot be run because input step '14' (DeepARG database) is not optional and no input provided.","err_code":0}
WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)TrimmomaticPE: Started with arguments: -phred33 input_forward.fastqsanger.gz input_reverse.fastqsanger.gz input_forward.fastqsanger.gz.paired input_forward.fastqsanger.gz.unpaired input_reverse.fastqsanger.gz.paired input_reverse.fastqsanger.gz.unpaired LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36Multiple cores found: Using 4 threadsInput Read Pairs: 125000 Both Surviving: 125000 (100.00%) Forward Only Surviving: 0 (0.00%) Reverse Only Surviving: 0 (0.00%) Dropped: 0 (0.00%)TrimmomaticPE: Completed successfullyvsearch v2.27.1_linux_x86_64, 15.6GB RAM, 4 coreshttps://github.com/torognes/vsearchMerging reads 100% 125000 Pairs 69309 Merged (55.4%) 55691 Not merged (44.6%)Pairs that failed merging due to various reasons: 39021 too few kmers found on same diagonal 265 multiple potential alignments 12101 alignment score too low, or score drop too high 295 overlap too short 4009 staggered read pairsStatistics of all reads: 92.44 Mean read lengthStatistics of merged reads: 109.97 Mean fragment length 40.60 Standard deviation of fragment length 0.07 Mean expected error in forward sequences 0.06 Mean expected error in reverse sequences 0.04 Mean expected error in merged sequences 0.01 Mean observed errors in merged region of forward sequences 0.00 Mean observed errors in merged region of reverse sequences 0.01 Mean observed errors in merged regionWARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)INFO:root:DIAMOND blastx alignmentINFO:root:Running: diamond blastx -q deeparg_short_reads_output/SR.clean -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4/database/v2/features -k 1000 --id 80.0 --sensitive -e 1e-10 -a deeparg_short_reads_output/SR.clean.deeparg.alignINFO:root:Input:deeparg_short_reads_output/SR.clean output:deeparg_short_reads_output/SR.clean.deeparg model: deepARG_SS, Input type: blastxINFO:root:parsing output file diamond view -a deeparg_short_reads_output/SR.clean.deeparg.align.daa -o deeparg_short_reads_output/SR.clean.deeparg.align.daa.tsvdiamond v2.1.10.164 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of TuebingenDocumentation, support and updates available at http://www.diamondsearch.orgPlease cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)#CPU threads: 4Loading subject IDs... [0s]Scoring parameters: (Matrix=blosum62 Lambda=0.267 K=0.041 Penalties=11/1)DB sequences = 12279DB sequences used = 642DB letters = 5088898#Target sequences to report alignments for: 25Generating output... [0.001s]INFO:root:Loading deep learning model ...INFO:root:loading gene lengthsINFO:root:Loading sample to analyze0reads [00:00, ?reads/s]1237reads [00:00, 215168.33reads/s]INFO:root:Predicting ARG-like reads: Running deepARG_SS model version v2INFO:root:input dataset is splitted into chunks of 10000 reads0chunks [00:00, ?chunks/s]1chunks [00:07, 7.25s/chunks]1chunks [00:07, 7.25s/chunks]INFO:root:Predicting ARGs 0%| | 0/136 [00:00<?, ?it/s]100%|██████████| 136/136 [00:00<00:00, 34214.57it/s]180691 reads; of these: 180691 (100.00%) were unpaired; of these: 180399 (99.84%) aligned 0 times 0 (0.00%) aligned exactly 1 time 292 (0.16%) aligned >1 times0.16% overall alignment rate
Standard Output:
Step 1: Trimming and QC using Trimmomatic['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 2: Merging paired end reads using Vsearch['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 3: Run DeepARG-SS to identify ARG-like readsdeeparg predict --type nucl --model SS -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4 -i deeparg_short_reads_output/SR.clean -o deeparg_short_reads_output/SR.clean.deeparg --arg-alignment-identity 80.0 --min-prob 0.8 --arg-alignment-evalue 1e-10Step 4: Quantification of ARG-like countssort -k1,1 -k2,2n deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG | bedtools merge -c 12,5 -o sum,distinct >deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG.mergedStep 5: Normalize to 16S rRNAs - this may take a while ...Total number of 16S Reads in the sample: 178
WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)TrimmomaticPE: Started with arguments: -phred33 input_forward.fastqsanger.gz input_reverse.fastqsanger.gz input_forward.fastqsanger.gz.paired input_forward.fastqsanger.gz.unpaired input_reverse.fastqsanger.gz.paired input_reverse.fastqsanger.gz.unpaired LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36Multiple cores found: Using 4 threadsInput Read Pairs: 125000 Both Surviving: 125000 (100.00%) Forward Only Surviving: 0 (0.00%) Reverse Only Surviving: 0 (0.00%) Dropped: 0 (0.00%)TrimmomaticPE: Completed successfullyvsearch v2.27.1_linux_x86_64, 15.6GB RAM, 4 coreshttps://github.com/torognes/vsearchMerging reads 100% 125000 Pairs 69309 Merged (55.4%) 55691 Not merged (44.6%)Pairs that failed merging due to various reasons: 39021 too few kmers found on same diagonal 265 multiple potential alignments 12101 alignment score too low, or score drop too high 295 overlap too short 4009 staggered read pairsStatistics of all reads: 92.44 Mean read lengthStatistics of merged reads: 109.97 Mean fragment length 40.60 Standard deviation of fragment length 0.07 Mean expected error in forward sequences 0.06 Mean expected error in reverse sequences 0.04 Mean expected error in merged sequences 0.01 Mean observed errors in merged region of forward sequences 0.00 Mean observed errors in merged region of reverse sequences 0.01 Mean observed errors in merged regionWARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string./usr/local/lib/python2.7/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module. "downsample module has been moved to the theano.tensor.signal.pool module.")/usr/local/lib/python2.7/site-packages/sklearn/cross_validation.py:41: DeprecationWarning: This module was deprecated in version 0.18 in favor of the model_selection module into which all the refactored classes and functions are moved. Also note that the interface of the new CV iterators are different from that of this module. This module will be removed in 0.20. "This module will be removed in 0.20.", DeprecationWarning)INFO:root:DIAMOND blastx alignmentINFO:root:Running: diamond blastx -q deeparg_short_reads_output/SR.clean -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4/database/v2/features -k 1000 --id 80.0 --sensitive -e 1e-10 -a deeparg_short_reads_output/SR.clean.deeparg.alignINFO:root:Input:deeparg_short_reads_output/SR.clean output:deeparg_short_reads_output/SR.clean.deeparg model: deepARG_SS, Input type: blastxINFO:root:parsing output file diamond view -a deeparg_short_reads_output/SR.clean.deeparg.align.daa -o deeparg_short_reads_output/SR.clean.deeparg.align.daa.tsvdiamond v2.1.10.164 (C) Max Planck Society for the Advancement of Science, Benjamin Buchfink, University of TuebingenDocumentation, support and updates available at http://www.diamondsearch.orgPlease cite: http://dx.doi.org/10.1038/s41592-021-01101-x Nature Methods (2021)#CPU threads: 4Loading subject IDs... [0s]Scoring parameters: (Matrix=blosum62 Lambda=0.267 K=0.041 Penalties=11/1)DB sequences = 12279DB sequences used = 642DB letters = 5088898#Target sequences to report alignments for: 25Generating output... [0.002s]INFO:root:Loading deep learning model ...INFO:root:loading gene lengthsINFO:root:Loading sample to analyze0reads [00:00, ?reads/s]1237reads [00:00, 280375.79reads/s]INFO:root:Predicting ARG-like reads: Running deepARG_SS model version v2INFO:root:input dataset is splitted into chunks of 10000 reads0chunks [00:00, ?chunks/s]1chunks [00:07, 7.17s/chunks]1chunks [00:07, 7.17s/chunks]INFO:root:Predicting ARGs 0%| | 0/136 [00:00<?, ?it/s]100%|██████████| 136/136 [00:00<00:00, 35676.11it/s]180691 reads; of these: 180691 (100.00%) were unpaired; of these: 180399 (99.84%) aligned 0 times 0 (0.00%) aligned exactly 1 time 292 (0.16%) aligned >1 times0.16% overall alignment rate
Standard Output:
Step 1: Trimming and QC using Trimmomatic['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 2: Merging paired end reads using Vsearch['/usr/local/bin', '/usr/local/lib/python27.zip', '/usr/local/lib/python2.7', '/usr/local/lib/python2.7/plat-linux2', '/usr/local/lib/python2.7/lib-tk', '/usr/local/lib/python2.7/lib-old', '/usr/local/lib/python2.7/lib-dynload', '/usr/local/lib/python2.7/site-packages']Step 3: Run DeepARG-SS to identify ARG-like readsdeeparg predict --type nucl --model SS -d /cvmfs/data.galaxyproject.org/byhand/deeparg/deeparg_1.0.4 -i deeparg_short_reads_output/SR.clean -o deeparg_short_reads_output/SR.clean.deeparg --arg-alignment-identity 80.0 --min-prob 0.8 --arg-alignment-evalue 1e-10Step 4: Quantification of ARG-like countssort -k1,1 -k2,2n deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG | bedtools merge -c 12,5 -o sum,distinct >deeparg_short_reads_output/SR.clean.deeparg.mapping.ARG.mergedStep 5: Normalize to 16S rRNAs - this may take a while ...Total number of 16S Reads in the sample: 178
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
FOR CONTRIBUTOR:
FOR REVIEWERS:
This workflow does/runs/performs … xyz … to generate/analyze/etc …namefield should be human readable (spaces are fine, no underscore, dash only where spelling dictates it), no abbreviation unless generally understood-) over underscore (_), prefer all lowercase. Folder becomes repository in iwc-workflows organization and is included in TRS id