BGC2Tree pipeline

For creating phylogenetic trees of homologous proteins from biosynthetic gene clusters

Requirements: (Tested in WSL2 Ubuntu 20.04.2 LTS)

Example local usage:

nextflow run main.nf --in data/example_query.fasta --db data/gbk_records

Remote usage:

nextflow run jbwdn/bgc_2_tree --in your_local_query.fasta --db your_local_direcory_of_gbks

Description:

1. Accept a query fasta containing 1 sequence and a path to a folder containing .gbk records
2. Using phmmer find a potential homolog from each record and save all into a fasta with labels
3. Perform protein sequence alignment using MUSCLE
4. Use alignment to calculate a phylogenetic Tree using FastTree
5. Paths to the three output files (homologs, alignment & tree) printed to stdout

See Nextflow and conda docs for installation instructions.

Built using this template.

References:

pHMMER: Eddy2009 (10.1142/9781848165632_0019)
MUSCLE: Edgar2004 (10.1186/1471-2105-5-113)
FastTree: Price2010 (10.1371/journal.pone.0009490)

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
bin		bin
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.nf		main.nf
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BGC2Tree pipeline

References:

About

Releases

Packages

Languages

License

JBwdn/BGC_2_Tree

Folders and files

Latest commit

History

Repository files navigation

BGC2Tree pipeline

References:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages