Pipeline Tutorial

This tutorial explains each Pipeline feature while building an RNA sequencing pipeline from scratch. The chapter list below shows which features are covered, so you can skip to the most relevant parts. Each Capsule is available in the Code Ocean Apps Library, and all Data Assets have been made public, so you can follow along and build the Pipeline yourself.

{% embed url="https://www.loom.com/share/bc5c20e4388a4bc9ac26595f59da4b0f?sid=1e3177fe-ce09-4ac9-b0ee-4946b0de7820" %}

Chapters

00:00 Pipeline overview
02:24 Connecting Capsules
06:01 Adding Data Assets
06:25 main.nf explained
07:42 Requirements for using a Capsule in a Pipeline
08:28 Considerations when designing a Capsule for a Pipeline
14:17 Understanding connection types (parallelization)
29:20 Map Paths: source and destination paths
33:10 Capsule Settings
34:53 Pipeline App Panel
37:30 Pipeline settings (cache, IAM roles, error strategies)
44:35 Running the Pipeline
47:30 Nextflow Artifacts
50:24 Debugging strategies
53:00 Writing Nextflow and nf-core Pipelines

Data Assets

Import all three Data Assets used in the tutorial into your deployment so that they can be used without an IAM role.

Paired End Reads (8GB)

Bucket Name: codeocean-public-data

Path: example_datasets/rna-seq-tutorial/GSE157194_reads/

STAR Index (28GB)

Bucket Name: codeocean-public-data

Path: example_datasets/STAR_GRCh38_GENCODE_Release_21_Index/star_index/

Annotation File (1GB)

Bucket Name: codeocean-public-data

Path: genomes/hg38_Annotation/
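
The bucket names and paths above can be combined into full S3 URIs when importing the Data Assets. The following is a minimal sketch in Python, not part of the tutorial itself: the `DataAsset` class, the `s3_uri` method, and the `total_size_gb` helper are hypothetical names introduced for illustration, and the sizes are the approximate values listed above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataAsset:
    """Hypothetical record for one tutorial Data Asset (illustration only)."""
    name: str
    bucket: str
    path: str
    size_gb: int  # approximate size as listed in this tutorial

    def s3_uri(self) -> str:
        # Join bucket and path into an s3:// URI, avoiding a doubled slash.
        return f"s3://{self.bucket}/{self.path.lstrip('/')}"


# Bucket names and paths copied from the Data Assets section above.
TUTORIAL_ASSETS = [
    DataAsset(
        "Paired End Reads",
        "codeocean-public-data",
        "example_datasets/rna-seq-tutorial/GSE157194_reads/",
        8,
    ),
    DataAsset(
        "STAR Index",
        "codeocean-public-data",
        "example_datasets/STAR_GRCh38_GENCODE_Release_21_Index/star_index/",
        28,
    ),
    DataAsset(
        "Annotation File",
        "codeocean-public-data",
        "genomes/hg38_Annotation/",
        1,
    ),
]


def total_size_gb(assets):
    """Sum the approximate sizes, e.g. to check available storage first."""
    return sum(asset.size_gb for asset in assets)


if __name__ == "__main__":
    for asset in TUTORIAL_ASSETS:
        print(f"{asset.name} ({asset.size_gb}GB): {asset.s3_uri()}")
    print(f"Total: about {total_size_gb(TUTORIAL_ASSETS)}GB")
```

If the bucket permits anonymous listing (an assumption, not confirmed here), each printed URI could be inspected with a tool such as the AWS CLI, for example `aws s3 ls --no-sign-request <uri>`.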
