Multiple EC_number tags for CDSs, lower case initials in /product descriptions for prokka-1.9.1-testing by aleimba · Pull Request #18 · tseemann/prokka

aleimba · 2014-05-30T12:55:17Z

New pull request to include multiple EC_number tags per CDS with version prokka-1.9.1-testing, plus some bug fixes.

For this purpose the database format was adapted to include several EC_numbers separated by a semicolon, e.g.:

K9NBS6 3.5.1.13;3.5.1.14;3.5.1.4~~~aam~~~Acylamidase
MTEQNLHWLSATEMAASVASNNLSPNEIAEAMIQRVDAVNPS...
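To illustrate the header layout shown above, here is a minimal Python sketch that splits such a line into its fields. It is not code from prokka (which is written in Perl); the names `DbHeader` and `parse_header` are hypothetical, and the field order (ID, EC numbers, gene, product) is read off the example header.

```python
from typing import List, NamedTuple


class DbHeader(NamedTuple):
    accession: str
    ec_numbers: List[str]
    gene: str
    product: str


def parse_header(line: str) -> DbHeader:
    """Parse 'ID EC[;EC...]~~~gene~~~product' into its fields."""
    # Tolerate a leading FASTA '>' if the header still carries one.
    line = line.lstrip(">").strip()
    accession, _, annotation = line.partition(" ")
    # The annotation has three '~~~'-separated fields: EC numbers, gene, product.
    ec_field, gene, product = annotation.split("~~~", 2)
    # Several EC numbers are separated by semicolons; an empty field yields no ECs.
    ec_numbers = [ec for ec in ec_field.split(";") if ec]
    return DbHeader(accession, ec_numbers, gene, product)


header = parse_header("K9NBS6 3.5.1.13;3.5.1.14;3.5.1.4~~~aam~~~Acylamidase")
print(header.ec_numbers)
```

Each element of `ec_numbers` would then become its own /EC_number tag on the CDS.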

The new format is produced by the necessary database preparation scripts:

prokka-biocyc_to_fasta_db
prokka-genbank_to_fasta_db
prokka-genpept_to_fasta_db
prokka-hamap_to_hmm
prokka-uniprot_to_fasta_db

prokka can handle this format now. As a result, a new UniProt database in /db/kingdom/Bacteria/sprot is included.

Additionally, a couple of bug fixes:

Adapted the usage of prokka-hamap_to_hmm (it now concatenates the result files to 'HAMAP.hmm') and prokka-uniprot_to_fasta_db (redirect stdout to '> sprot').
/product description initials are changed to lowercase (as instructed by NCBI), except for 'Rossman' and 'Willebrand', see https://github.com/aleimba/prokka/blob/master/bin/prokka#L1177-1178
I didn't understand the if-condition in line 58 of prokka-uniprot_to_fasta_db. What is 'if (1)' for? https://github.com/Victorian-Bioinformatics-Consortium/prokka/blob/master/bin/prokka-uniprot_to_fasta_db#L58 I removed the condition.
Included Shaun's 'Remove stray space from locus_tag' pull request (#15), so there's no conflict between our pull requests.
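The lowercasing rule from the list above could look roughly like the following Python sketch. This is not the regex used in prokka; it assumes that "initials" means the first letter of every word that is written as one capital followed only by lowercase letters, so that acronyms and mixed-case tokens stay untouched. The function name `lowercase_initials` is hypothetical.

```python
import re

# Proper names that keep their capital (the two exceptions named in this pull request).
KEEP_CAPITALIZED = {"Rossman", "Willebrand"}

# A whole word made of one capital letter followed only by lowercase letters.
# Acronyms and mixed-case tokens such as "DNA", "III" or "CoA" do not match.
CAPITALIZED_WORD = re.compile(r"\b[A-Z][a-z]+\b")


def lowercase_initials(product: str) -> str:
    """Lowercase the initial of ordinary capitalized words in a description."""
    def fix(match):
        word = match.group(0)
        if word in KEEP_CAPITALIZED:
            return word
        return word[0].lower() + word[1:]

    return CAPITALIZED_WORD.sub(fix, product)


print(lowercase_initials("Von Willebrand Factor"))
print(lowercase_initials("DNA Polymerase III Subunit"))
```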

This might warrant a new version, so I changed it to prokka-1.9.2-testing. ChangeLog.txt is updated as well.

aleimba · 2014-05-30T13:12:36Z

Forgot to mention: in prokka, the /gene and /EC_number feature tags are first removed (if it's a hypothetical protein) and only subsequently added, see:
https://github.com/Victorian-Bioinformatics-Consortium/prokka/blob/master/bin/prokka#L915-931
Thus, I changed the order to add the tags first and remove them afterwards if the protein is hypothetical, see:
https://github.com/aleimba/prokka/blob/master/bin/prokka#L919-934
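The reordering described above can be sketched in Python as follows. This is an illustration only, not the Perl code behind the links; the function `annotate`, the dict-based feature representation, and the key names are all hypothetical.

```python
HYPO = "hypothetical protein"


def annotate(feature: dict, hit: dict) -> dict:
    """Add tags from a database hit first, then strip them if hypothetical."""
    tags = dict(feature)
    tags["product"] = hit.get("product") or HYPO

    # Step 1: add /gene and one /EC_number entry per EC number from the hit.
    if hit.get("gene"):
        tags["gene"] = hit["gene"]
    if hit.get("ec_numbers"):
        tags["EC_number"] = list(hit["ec_numbers"])

    # Step 2: only afterwards remove both tags for hypothetical proteins.
    # Running this step before step 1 would let the tags be added again.
    if tags["product"] == HYPO:
        tags.pop("gene", None)
        tags.pop("EC_number", None)
    return tags


hit = {"product": HYPO, "gene": "yabC", "ec_numbers": ["1.1.1.1"]}
print(annotate({"locus_tag": "X_00001"}, hit))
```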

aleimba added 2 commits May 30, 2014 13:47

Enable multiple EC_number tags; lower case initials in /product descriptions

a134f08

Update ChangeLog.txt

7a2d819

aleimba added 3 commits June 24, 2014 14:53

minor syntax change in regex to lower case initials in /product desc

7bd434e

bug fix for option '--hypo' in 'prokka-genbank_to_fasta_db'

88bd053

fixed translation in prokka and prokka-genbank_to_fasta_db

55be7d3

aleimba mentioned this pull request Dec 4, 2014

*.faa result files methionine/stop codon #54

Closed

aleimba added 3 commits December 4, 2014 13:12

typo in last commit for "prokka-genbank_to_fasta_db"

13a4066

fixed some more syntax errors in 'prokka-genbank_to_fasta_db', jeez

73e2947

added '-' in front of translation option 'complete' bug fixes for consistency

9e94787

aleimba mentioned this pull request Mar 4, 2015

Prokka version not in .LOG file #81

Closed

aleimba wants to merge 8 commits into tseemann:master from aleimba:master
