quantmsdiann

Introduction

quantmsdiann is a bigbio bioinformatics pipeline, built following nf-core guidelines, for quantitative mass spectrometry analysis using DIA-NN. It supports Data-Independent Acquisition (DIA) workflows including label-free, plexDIA (mTRAQ, SILAC, Dimethyl), phosphoproteomics with site localization, and Bruker timsTOF/PASEF data.

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a portable manner. It uses Docker/Singularity containers making results highly reproducible. The Nextflow DSL2 implementation of this pipeline uses one container per process, making it easy to maintain and update software dependencies.

Pipeline summary

The pipeline takes SDRF metadata and mass spectrometry data files (.raw, .mzML, .d, .dia) as input and performs:

Input validation — SDRF parsing and validation via sdrf-pipelines
File preparation — RAW to mzML conversion (ThermoRawFileParser), indexing, Bruker .d handling (tdf2mzml)
In-silico spectral library generation — deep learning-based prediction, or use a user-provided library (--diann_speclib)
Preliminary analysis — per-file calibration and mass accuracy estimation (parallelized)
Empirical library assembly — consensus library from preliminary results with RT profiling
Individual analysis — per-file search with the empirical library (parallelized)
Final quantification — protein/peptide/gene group matrices with cross-run normalization
MSstats conversion — DIA-NN report to MSstats-compatible format
Quality control — interactive QC report via pmultiqc

Supported DIA-NN Versions

Version	Profile	Container	Key features
1.8.1 (default)	`diann_v1_8_1`	`docker.io/biocontainers/diann:v1.8.1_cv1`	Core DIA analysis, TSV output
2.1.0	`diann_v2_1_0`	`ghcr.io/bigbio/diann:2.1.0`	Native .raw support, Parquet output
2.2.0	`diann_v2_2_0`	`ghcr.io/bigbio/diann:2.2.0`	Speed optimizations (up to 1.6x on HPC)
2.3.2	`diann_v2_3_2`	`ghcr.io/bigbio/diann:2.3.2`	DDA support (beta), InfinDIA, up to 9 var mods

Switch versions with e.g. -profile diann_v2_2_0,docker. See the DIA-NN Version Selection guide and full parameter reference for details.

Quick start

Note

If you are new to Nextflow and nf-core, please refer to this page on how to set up Nextflow.

Run with test data:

nextflow run bigbio/quantmsdiann -profile test_dia,docker --outdir results

Run with your own data:

nextflow run bigbio/quantmsdiann \
    --input 'experiment.sdrf.tsv' \
    --database 'proteins.fasta' \
    --outdir './results' \
    -profile docker

Run with a specific DIA-NN version:

nextflow run bigbio/quantmsdiann \
    --input 'experiment.sdrf.tsv' \
    --database 'proteins.fasta' \
    --outdir './results' \
    -profile docker,diann_v2_2_0

Warning

Please provide pipeline parameters via the CLI or Nextflow -params-file option. Custom config files specified with -c must only be used for tuning process resource specifications, not for defining parameters.

Documentation

Usage — How to run the pipeline, input formats, optional outputs, and custom configuration
Parameters — Complete reference of all pipeline parameters organised by category
Output — Description of all output files produced by the pipeline

Credits

quantmsdiann is developed and maintained by:

Yasset Perez-Riverol (EMBL-EBI)
Dai Chengxin (Beijing Proteome Research Center)
Julianus Pfeuffer (Freie Universitat Berlin)
Vadim Demichev (Charite Universitaetsmedizin Berlin)
Qi-Xuan Yue (Chongqing University of Posts and Telecommunications)

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

Citation

If you use quantmsdiann in your research, please cite:

Dai et al. "quantms: a cloud-based pipeline for quantitative proteomics" (2024). DOI: 10.5281/zenodo.15573386

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 133 Commits
.claude		.claude
.github		.github
assets		assets
conf		conf
docs		docs
modules		modules
subworkflows		subworkflows
tests		tests
workflows		workflows
.gitattributes		.gitattributes
.gitignore		.gitignore
.nf-core.yml		.nf-core.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.prettierrc.yml		.prettierrc.yml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CITATIONS.md		CITATIONS.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
main.nf		main.nf
modules.json		modules.json
nextflow.config		nextflow.config
nextflow_schema.json		nextflow_schema.json
nf-test.config		nf-test.config
ro-crate-metadata.json		ro-crate-metadata.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

quantmsdiann

Introduction

Pipeline summary

Supported DIA-NN Versions

Quick start

Documentation

Credits

Contributions and Support

Citation

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

quantmsdiann

Introduction

Pipeline summary

Supported DIA-NN Versions

Quick start

Documentation

Credits

Contributions and Support

Citation

License

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages