Metahit is a cutting-edge metagenomic data processing pipeline.
Metahit requires only a x86 computer with enough RAM (64GB+ is recommended) to support the in-memory operations.
Metahit is supported and tested in Linux systems only.
Run bash setup.sh
to install external dependenceies.
Run conda env create -f env.yaml
in root directory
Run conda activate metahit_env
to activate conda env.
RUN conda env remove -n metahit_env
to delete env
Then, in the conda environment, directly run bash demo.sh
and the pipeline will be triggered.
The folder (db_setup
) contains three scripts to download and set up databases for CheckM, CheckM2, and GTDB-Tk. By default, each script downloads the database into a database
folder in your current working directory. You can optionally provide a custom path.
-
CheckM database downloading:
checkm_db.sh
bash db_setup/checkm_db.sh [DB_DIR]
-
CheckM2 database downloading:
checkm2_db.sh
bash db_setup/checkm2_db.sh [DB_DIR]
-
GTDB-TK database downloading:
tdbtk_db.sh
bash db_setup/gtdbtk_db.sh [DB_DIR] [RELEASE_VERSION]
Example:
bash db_setup/gtdbtk_db.sh /your/custom/path 214
bash demo.sh
Hang Yu([email protected])