 | MACS PeakCalling | MACS3 is a peak calling tool generally used on ChIP seq data to identify transcript factor binding sites. | - .bam alignment files
- compare_sheet.csv (see README)
|
 | featureCounts | This capsule will run featureCounts from the Subreads R package to generate an expression matrix. | - Gene annotation .gtf file
- .bam alignments
|
 | HOMER | Homer contains a useful, all-in-one program for performing peak annotation called annotatePeaks.pl. This capsule uses annotatePeaks.pl to annotate *.bed coordinates with gene features. | - .bed files containing peaks
- Genome reference .fasta
- Gene annotation .gtf file.
|
 | Gene Enrichment Analysis (GEA) | This capsule presents a user-friendly Streamlit application designed to facilitate gene enrichment analysis. The analysis results are sourced from reliable and widely-used platforms, namely g-profiler and Panther. | - File containing gene names
|
 | GATK RNAseq short variant discovery (SNPs + Indels) | Based on GATK RNASeq short variant discovery pipeline. Takes in alignments and outputs vcf containing SNPs and indels. | |
 | Delly somatic complete analysis | Structural variant (SV) prediction to discover, genotype and visualize deletions, tandem duplications, inversions and translocations at single-nucleotide resolution in short-read massively parallel sequencing data of somatic cells. | - Genome reference .fasta
- .bam DNA alignment files
|
 | Delly germline complete analysis | Structural variant (SV) prediction to discover, genotype and visualize deletions, tandem duplications, inversions and translocations at single-nucleotide resolution in short-read massively parallel sequencing data of germline cells. | - Genome reference .fasta
- .bam DNA alignment files
|
 | ART-Simulation-Illumina | ART is a set of simulation tools to generate synthetic next-generation sequencing reads. | - .fasta containing the sequence to simulate reads from
|
 | PySpark and EMR Serverless | This capsule runs an example PySpark job on EMR Serverless. | - NOAA Global Surface Summary of Day dataset
|