Skip to content
Merged
Show file tree
Hide file tree
Changes from 12 commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
f012b14
Align MHC typing column names across sdrf-terms.tsv, adoc templates, …
jonasscheid Feb 23, 2026
dcf0f6c
Merge pull request #801 from jonasscheid/fix/mhc-allele-typing-incons…
ypriverol Feb 26, 2026
cc490f2
Add PDF specification v1.1.0-dev
github-actions[bot] Feb 26, 2026
497f28b
Minor updates of related with inmunopeptidomics
ypriverol Feb 26, 2026
1aa7a46
small changes in the spec
ypriverol Feb 26, 2026
5ed203b
Merge branch 'dev' of https://github.com/bigbio/proteomics-sample-met…
ypriverol Feb 26, 2026
99cab9f
Add PDF specification v1.1.0-dev
github-actions[bot] Feb 26, 2026
d778705
Merge branch 'master' of https://github.com/bigbio/proteomics-sample-…
ypriverol Feb 27, 2026
f75edd1
Add PDF specification v1.1.0-dev
github-actions[bot] Feb 27, 2026
9cf6a6f
Minor changes inmunopeptidomics
ypriverol Feb 27, 2026
26692d6
Merge branch 'dev' of https://github.com/bigbio/proteomics-sample-met…
ypriverol Feb 27, 2026
61b7005
Add PDF specification v1.1.0-dev
github-actions[bot] Feb 27, 2026
90c5906
remove dda-acquisition in favor of ms-proteomics
ypriverol Feb 28, 2026
2d69de2
Add PDF specification v1.1.0-dev
github-actions[bot] Feb 28, 2026
0da5252
Major updates on syn with the current templates
ypriverol Mar 7, 2026
0cfab58
Add PDF specification v1.1.0-dev
github-actions[bot] Mar 7, 2026
1b78f70
Merge branch 'master' of https://github.com/bigbio/proteomics-sample-…
ypriverol Mar 7, 2026
2e8d2b4
fix examples
ypriverol Mar 7, 2026
7f1e290
Merge branch 'dev' of https://github.com/bigbio/proteomics-sample-met…
ypriverol Mar 7, 2026
a1eaf60
Add PDF specification v1.1.0-dev
github-actions[bot] Mar 7, 2026
8b8e736
Update aome of the issues after new schemas
ypriverol Mar 8, 2026
42316e7
Merge branch 'dev' of https://github.com/bigbio/proteomics-sample-met…
ypriverol Mar 8, 2026
ee174b3
Add PDF specification v1.1.0-dev
github-actions[bot] Mar 8, 2026
0f517a7
Updating the exisintg annotated projects
ypriverol Mar 8, 2026
32484e4
Add PDF specification v1.1.0-dev
github-actions[bot] Mar 8, 2026
ee777b0
feat: add template inheritance resolver module
ypriverol Mar 8, 2026
cf933b7
feat: add CSS styles for template pages
ypriverol Mar 8, 2026
753b6c7
feat: add Jinja2 template for per-template HTML pages
ypriverol Mar 8, 2026
23af6d1
feat: add per-template HTML page generator
ypriverol Mar 8, 2026
b64adc5
feat: add auto-generated template section for index.html
ypriverol Mar 8, 2026
2e2dc7b
feat: update build pipeline for YAML-driven template pages
ypriverol Mar 8, 2026
719c11b
fix: improve template page rendering and index layout
ypriverol Mar 8, 2026
1e8c415
major changes for the web generation
ypriverol Mar 8, 2026
015769c
Merge branch 'dev' of https://github.com/bigbio/proteomics-sample-met…
ypriverol Mar 8, 2026
dbfb270
major changes to the web and clean specification
ypriverol Mar 8, 2026
921e1f5
Merge pull request #804 from ypriverol/feat/yaml-driven-template-docs
ypriverol Mar 8, 2026
d9c0a75
Add PDF specification v1.1.0-dev
github-actions[bot] Mar 8, 2026
0fac979
minor changes
ypriverol Mar 8, 2026
28a49b4
Add PDF specification v1.1.0-dev
github-actions[bot] Mar 8, 2026
5248c75
update examples
ypriverol Mar 9, 2026
5b1197b
Add PDF specification v1.1.0-dev
github-actions[bot] Mar 9, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions llms.txt
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@
- sdrf-proteomics/templates/dda-acquisition/README.adoc - DDA Acquisition: dissociation method, collision energy
- sdrf-proteomics/templates/dia-acquisition/README.adoc - DIA Acquisition: scan windows, isolation width
- sdrf-proteomics/templates/single-cell/README.adoc - Single-Cell Proteomics: cell isolation, carrier proteome
- sdrf-proteomics/templates/immunopeptidomics/README.adoc - Immunopeptidomics: MHC class, HLA typing
- sdrf-proteomics/templates/immunopeptidomics/README.adoc - Immunopeptidomics: MHC protein complex, MHC typing
- sdrf-proteomics/templates/crosslinking/README.adoc - Crosslinking MS: crosslinker reagents
- sdrf-proteomics/templates/metaproteomics/README.adoc - Metaproteomics: environmental and microbiome samples
- sdrf-proteomics/templates/olink/README.adoc - Olink: proximity extension assays
Expand Down Expand Up @@ -62,7 +62,7 @@ Machine-readable YAML definitions used by sdrf-pipelines for validation. Each te
- sdrf-proteomics/sdrf-templates/crosslinking/1.1.0/crosslinking.sdrf.tsv - Crosslinking example
- sdrf-proteomics/sdrf-templates/single-cell/1.0.0/single-cell.yaml - Single-Cell (experiment layer): cell isolation, carrier proteome
- sdrf-proteomics/sdrf-templates/single-cell/1.0.0/single-cell.sdrf.tsv - Single-Cell example
- sdrf-proteomics/sdrf-templates/immunopeptidomics/1.0.0-dev/immunopeptidomics.yaml - Immunopeptidomics (experiment layer): MHC class, HLA typing
- sdrf-proteomics/sdrf-templates/immunopeptidomics/1.0.0-dev/immunopeptidomics.yaml - Immunopeptidomics (experiment layer): MHC protein complex, MHC typing
- sdrf-proteomics/sdrf-templates/metaproteomics/1.0.0-dev/metaproteomics.yaml - Metaproteomics (experiment layer): environmental and microbiome samples
- sdrf-proteomics/sdrf-templates/metaproteomics/1.0.0-dev/metaproteomics.sdrf.tsv - Metaproteomics example
- sdrf-proteomics/sdrf-templates/olink/1.0.0/olink.yaml - Olink (experiment layer): proximity extension assays
Expand Down
Binary file modified psi-document/sdrf-proteomics-specification-v1.1.0-dev.pdf
Binary file not shown.
Original file line number Diff line number Diff line change
Expand Up @@ -79,7 +79,7 @@ Add extra columns for specific experimental workflows:
|DIA-specific parameters. link:templates/dia-acquisition/README.adoc[View template]

|**immunopeptidomics**
|HLA typing and related metadata. link:templates/immunopeptidomics/README.adoc[View template]
|MHC typing and related metadata. link:templates/immunopeptidomics/README.adoc[View template]

|**crosslinking**
|XL-MS specific columns. link:templates/crosslinking/README.adoc[View template]
Expand Down
42 changes: 38 additions & 4 deletions sdrf-proteomics/README.adoc
Original file line number Diff line number Diff line change
Expand Up @@ -117,7 +117,7 @@ For the complete versioning strategy — including template versioning, ontology
[[reserved-words]]
=== Reserved words

There are general scenarios where cell values cannot be provided with actual data. The following reserved words MUST be used in these cases:
There are general scenarios where cell values cannot be provided with actual data. The following reserved words MUST be used in these cases. Reserved words MUST be all lowercase (e.g., `not available`, NOT `Not Available` or `Not available`):

- **not available**: In some cases, the column is mandatory in the format, but for some samples the corresponding value is unknown or could not be determined. In those cases, users SHOULD use *not available*.
- **not applicable**: In some cases, the column is mandatory, but for some samples the corresponding value or concept does not apply. In those cases, users SHOULD use *not applicable*.
Expand Down Expand Up @@ -258,9 +258,9 @@ The value for each property, (e.g. characteristics, comment, factor value) corre

- **Ontology url (Computer readable)**: Users can provide the corresponding URI (Uniform Resource Identifier) of the ontology/CV term as a value. This is recommended for enriched files where the user does not want to use intermediate tools to map from free text to ontology/CV terms.

- **Key=value representation (Human and Computer readable)**: The current representation aims to provide a mechanism to represent the complete information of the ontology/CV term including Accession, Name and other additional properties. In the key=value pair representation, the Value of the property is represented as an Object with multiple properties, where the key is one of the properties of the object and the value is the corresponding value for the particular key. An example of key value pairs is post-translational modification (see link:templates/ms-proteomics.html#_protein_modifications[Protein Modifications]):
- **Key=value representation (Human and Computer readable)**: The current representation aims to provide a mechanism to represent the complete information of the ontology/CV term including Accession, Name and other additional properties. In the key=value pair representation, the Value of the property is represented as an Object with multiple properties, where the key is one of the properties of the object and the value is the corresponding value for the particular key. The key order MUST be `NT` (name) first, followed by `AC` (accession), then any additional keys. An example of key value pairs is post-translational modification (see link:templates/ms-proteomics.html#_protein_modifications[Protein Modifications]):

NT=Glu->pyro-Glu;MT=fixed;PP=Anywhere;AC=Unimod:27;TA=E
NT=Glu->pyro-Glu;AC=Unimod:27;MT=fixed;PP=Anywhere;TA=E

[[validation]]
== Validating SDRF Files
Expand Down Expand Up @@ -420,6 +420,8 @@ For detailed documentation of sample preparation and MS/MS fragmentation propert
- **Fractionation**: fractionation method (used with `comment[fraction identifier]`)
- **Fragmentation**: collision energy, dissociation method

NOTE: For HCD (Higher-energy C-trap Dissociation), the canonical accession is https://www.ebi.ac.uk/ols4/ontologies/ms/classes/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FMS_1000422[MS:1000422 - beam-type collision-induced dissociation]. Use `NT=beam-type collision-induced dissociation;AC=MS:1000422` or the short label `HCD`. Do not use PRIDE:0000590 or MS:1002481.

[[data-acquisition-method]]
=== Proteomics data acquisition method

Expand Down Expand Up @@ -501,6 +503,38 @@ Templates follow a layered hierarchy:

Child templates inherit all columns from parents and may add new columns or strengthen requirements (e.g., `optional` → `required`).

=== Template Combination Rules

Templates within the same layer are **mutually exclusive** - you MUST choose exactly one from each applicable layer:

|===
|Layer |Mutually Exclusive Templates |Rule

|TECHNOLOGY
|`ms-proteomics` vs `affinity-proteomics`
|Choose one (REQUIRED)

|SAMPLE
|`human` vs `vertebrates` vs `invertebrates` vs `plants`
|Choose one based on organism (RECOMMENDED)

|EXPERIMENT (acquisition)
|`dda-acquisition` vs `dia-acquisition`
|Choose one based on method (OPTIONAL)

|EXPERIMENT (affinity platform)
|`olink` vs `somascan`
|Choose one if using affinity-proteomics (OPTIONAL)
|===

Templates from different layers can be freely combined. Common valid combinations:

- `ms-proteomics` + `human` + `dda-acquisition` (human DDA proteomics)
- `ms-proteomics` + `human` + `immunopeptidomics` (human immunopeptidomics)
- `ms-proteomics` + `vertebrates` + `cell-lines` (mouse cell line proteomics)
- `ms-proteomics` + `human` + `crosslinking` (human crosslinking MS)
- `affinity-proteomics` + `human` + `olink` (human Olink)

=== Specifying Templates in SDRF Files

Declare templates using `comment[sdrf template]` columns. Only list leaf templates (parents are implied). When using multiple templates, add multiple columns with the same name. Two formats are supported:
Expand Down Expand Up @@ -544,7 +578,7 @@ sample_1 ... human v1.1.0 crosslinking v1.0.0
- link:templates/dia-acquisition/README.adoc[DIA Acquisition] - scan windows, isolation width, spectral library
- link:templates/cell-lines/README.adoc[Cell Lines] - Cellosaurus integration
- link:templates/single-cell/README.adoc[Single-Cell] - cell isolation, carrier proteome
- link:templates/immunopeptidomics/README.adoc[Immunopeptidomics] - MHC class, HLA typing
- link:templates/immunopeptidomics/README.adoc[Immunopeptidomics] - MHC protein complex, MHC typing
- link:templates/crosslinking/README.adoc[Crosslinking MS] - crosslinker reagents
- link:templates/metaproteomics/README.adoc[Metaproteomics] - environmental sample type

Expand Down
12 changes: 3 additions & 9 deletions sdrf-proteomics/metadata-guidelines/sample-metadata.adoc
Original file line number Diff line number Diff line change
Expand Up @@ -163,17 +163,12 @@ For model organisms, use specific stages when applicable: `embryonic day 14` (mo

For samples without disease, the terminology matters for standardization:

**`normal`** (PATO:0000461) - **Recommended**
**`normal`** (PATO:0000461) - **REQUIRED term for healthy samples**

- Standard term in pathology ("normal tissue" vs "tumor tissue")
- Well-defined ontology mapping to PATO:0000461
- Well-defined ontology mapping to https://www.ebi.ac.uk/ols4/ontologies/pato/classes/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FPATO_0000461[PATO:0000461]
- Widely used in existing proteomics datasets

**`healthy`** (SIO:001012) - **Accepted alternative**

- More intuitive for clinical/human samples
- Has valid ontology support (Semanticscience Integrated Ontology)
- Validators should accept both `normal` and `healthy`
- This is the single canonical term for healthy/control samples

[[WARNING]]
====
Expand Down Expand Up @@ -252,7 +247,6 @@ Both `healthy_1` and `adjacent_1` have `normal` as their disease state (no tumor
|Disease |SDRF Value |Ontology Link

|Healthy/no disease |normal |https://www.ebi.ac.uk/ols4/ontologies/pato/classes/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FPATO_0000461[PATO:0000461]
|Healthy (alternative) |healthy |https://www.ebi.ac.uk/ols4/ontologies/sio/classes/http%253A%252F%252Fsemanticscience.org%252Fresource%252FSIO_001012[SIO:001012]
|Breast cancer |breast carcinoma |https://www.ebi.ac.uk/ols4/ontologies/mondo/classes/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FMONDO_0004989[MONDO:0004989]
|===

Expand Down
Loading