B1

This file summarises the procedure used for the analysis of the B1 fastq files deposited by Cattonaro et al on the ENA: SRP163096 Fastq files were retrived for sample B1 (3,830,083 reads). Analysis were run using SeqBox ecosystem Beccuti et al. Fastq fle R1 and R2 were mapped on hg38 human genome using bwa software Jo and Koh.

library(docker4seq)
bwa(group = "docker", fastq.folder = getwd(),
  scratch.folder = "/data/scratch", genome.folder = "/data/genomes/hg38bwa", genome.name = "genome.fa",
  seq.type = "pe", threads = 8, sample.id = "B1")

Hg38 indexing was generated using the bwaIndex functin implemented in docker4seq package Kulkarni et al 2018

library(docker4seq)
bwaIndex(group="docker", genome.folder=getwd(), genome.url="ftp://ftp.ensembl.org/pub/release-94/fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna.toplevel.fa.gz", mode="General")

File dedup_reads.stats.xlsx contains the statistics on mapping produced with SAMTOOLS with the command samtools idxstats

As output the docker4seq bwa functin extract also the unmapped reads: sorted_unmapped_R1.fastq.gz, sorted_unmapped_R2.fastq.gz

The unmapped files were analysed with kraken2 using the 8GB Kraken 2 Database built from the Refseq bacteria, archaea, and viral libraries and the GRCh38 human genome

The kraken2 was embedded in a docker container (docker.io/repbioinfo/kraken.2019.01) and analysis was run using default parameters with the following commands:

ID=$(docker run  -i -t -v /data/genomes/minikraken:/reference -v /data/corvelva/B2:/data -v /data/scratch:/scratch -d docker.io/repbioinfo/kraken.2019.01 /bin/bash)
docker attach $ID
# passing as parameters the two unmapped fastq coming from the bwa mapping to human genome and the numbero threads
/bin/kraken_run.sh sorted_unmapped_R1.fastq.gz sorted_unmapped_R2.fastq.gz 8

The output of kraken is available in the file B1_kraken2.report.xlsx

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
B1_kraken2.report.xlsx		B1_kraken2.report.xlsx
README.md		README.md
dedup_reads.stats.xlsx		dedup_reads.stats.xlsx
fig1.png		fig1.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

B1

About

Releases

Packages

kendomaniac/B1

Folders and files

Latest commit

History

Repository files navigation

B1

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages