Skip to content

Latest commit

 

History

History
66 lines (43 loc) · 3.32 KB

File metadata and controls

66 lines (43 loc) · 3.32 KB
description A video walkthrough of all Code Ocean Pipeline features.
metaLinks
alternates

Pipeline Tutorial

This tutorial provides a comprehensive explanation of each Pipeline feature while building an RNA Sequencing pipeline from scratch. The list of chapters shows what features are covered and can be used to skip to the most relevant parts. Each Capsule is available on the Code Ocean Apps Library and all Data Assets have been made public allowing you to follow along and build the Pipeline yourself.

{% embed url="https://www.loom.com/share/bc5c20e4388a4bc9ac26595f59da4b0f?sid=1e3177fe-ce09-4ac9-b0ee-4946b0de7820" %}

Chapters

  • 00:00 Pipeline overview
  • 02:24 Connecting Capsules
  • 06:01 Adding Data Assets
  • 06:25 main.nf explained
  • 07:42 Requirements for using a Capsule in a Pipeline
  • 08:28 Considerations when designing a Capsule for a Pipeline
  • 14:17 Understanding connection types (parallelization)
  • 29:20 Map Paths: source and destination paths
  • 33:10 Capsule Settings
  • 34:53 Pipeline App Panel
  • 37:30 Pipeline settings (cache, IAM roles, error strategies)
  • 44:35 Running the Pipeline
  • 47:30 Nextflow Artifacts
  • 50:24 Debugging strategies
  • 53:00 Writing Nextflow and nf-core Pipelines

Data Assets

All 3 Data Assets used in the tutorial should be imported to your deployment to ensure they can be used without an IAM role.

  1. Paired End Reads
  2. STAR Index
  3. Annotation File

Paired End Reads (8GB)

Bucket Name: codeocean-public-data

Path: example_datasets/rna-seq-tutorial/GSE157194_reads/

STAR Index (28GB)

Bucket Name: codeocean-public-data

Path: example_datasets/STAR_GRCh38_GENCODE_Release_21_Index/star_index/

Annotation File (1GB)

Bucket Name: codeocean-public-data

Path: genomes/hg38_Annotation/