Skip to content

[hca dcp] Refresh CELLxGENE → HCA project mapping #4847

@frano-m

Description

@frano-m

Summary

Run the get-cellxgene-projects-hca script to regenerate site-config/hca-dcp/ma-dev/scripts/out/cellxgene-projects.json with the current CELLxGENE → HCA project ID mappings.

The file was last refreshed in #4361 (commit 60a7a6eb, 2025-01-31) — roughly 16 months old, so any CELLxGENE collections added or removed since then are missing or stale in the explorer.

What the script does

site-config/hca-dcp/ma-dev/scripts/get-cellxgene-projects.ts:

  1. Fetches https://api.cellxgene.cziscience.com/curation/v1/collections
  2. Parses each collection's links[].link_url for HCA project URLs (data.humancellatlas.org/projects/<id>)
  3. Sorts the resulting {cellxgeneId, hcaProjectId} pairs by hcaProjectId (for nice diffs)
  4. Writes the JSON to site-config/hca-dcp/ma-dev/scripts/out/cellxgene-projects.json

Steps

npm run get-cellxgene-projects-hca
git diff site-config/hca-dcp/ma-dev/scripts/out/cellxgene-projects.json

If there are changes, commit them on a chore: branch and open a PR (mirroring #4361's pattern).

Acceptance criteria

  • Script runs successfully
  • cellxgene-projects.json updated if upstream collections changed
  • PR opened with the regenerated file (or issue closed as no-change if upstream returned the same mappings)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions