Rank nearest neighbors in embedding space #211

ziw-liu · 2024-11-21T01:02:25Z

An ideal cell state representation has the following properties:

* When the cell state does not change, embeddings of the same cell at neighboring time points should be more similar to each other than to embeddings of other cells (continuity).
* When the cell state does change, the embeddings should change substantially, so that different cell states can be expressed (dynamic range).

We observed these properties visually in UMAPs of embeddings from time-regularized models. However, Euclidean distances in UMAP space carry no global meaning, so the observations cannot be quantified by directly comparing UMAP coordinates. This PR provides methods that describe embedding similarity in a way close to how UMAP is computed: for each sample, every other sample is ranked as its $k$-th nearest neighbor. The displacement in this neighborhood (the rank $k$ of the sample at $t_{i+1}$ among the neighbors of the sample at $t_i$) can then be used to describe the fluctuation of embeddings in a way that preserves latent space topology.
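The ranking idea described above can be sketched roughly as follows. This is a minimal illustration, not the code added in this PR: the function names `rank_neighbors` and `track_rank_displacement` are hypothetical, and cosine distance is assumed as the metric.

```python
import numpy as np


def rank_neighbors(embeddings: np.ndarray) -> np.ndarray:
    """Return ranks where ranks[i, j] = k means sample j is the
    k-th nearest neighbor of sample i (each sample has rank 0 to itself).

    Assumption: cosine distance is used as the dissimilarity measure.
    """
    normed = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    distances = 1.0 - normed @ normed.T
    # Force every sample to be its own rank-0 neighbor.
    np.fill_diagonal(distances, -np.inf)
    # order[i, k] is the index of the k-th nearest neighbor of sample i.
    order = np.argsort(distances, axis=1, kind="stable")
    n_samples = embeddings.shape[0]
    ranks = np.empty_like(order)
    rows = np.arange(n_samples)[:, None]
    # Invert the permutation: ranks[i, order[i, k]] = k.
    ranks[rows, order] = np.arange(n_samples)[None, :]
    return ranks


def track_rank_displacement(ranks: np.ndarray, track: np.ndarray) -> np.ndarray:
    """For a track given as sample indices in time order, return the rank
    of the sample at t_{i+1} among the neighbors of the sample at t_i."""
    return ranks[track[:-1], track[1:]]
```

A low value along a track means the next time point stays in the immediate neighborhood (continuity), while a jump to a high rank suggests a large change in the embedding. Because only the ordering of distances is used, the result does not depend on the absolute scale of the distances.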

Example from ALFI dataset:

ziw-liu · 2024-11-21T01:10:10Z

Using standardized features (949df3c) changes the angular distance, but doesn't really change the shape of the rankings curve:
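As a rough illustration of what standardizing the features means here, the sketch below z-scores each feature dimension before computing a distance. The exact scaling used in 949df3c is not shown in this thread, so per-feature z-scoring and the use of cosine distance as the angular measure are both assumptions, and the helper names are hypothetical.

```python
import numpy as np


def standardize(features: np.ndarray) -> np.ndarray:
    """Z-score each feature (column) to zero mean and unit variance.

    Assumption: this mirrors the "standardized features" mentioned above.
    """
    mean = features.mean(axis=0)
    std = features.std(axis=0)
    # Avoid division by zero for constant features.
    std = np.where(std == 0, 1.0, std)
    return (features - mean) / std


def cosine_distance(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine distance between two vectors, assumed here as the angular measure."""
    return 1.0 - float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
```

Standardizing shifts and rescales every dimension, so the distance values themselves can change, while the ordering of neighbors may remain largely similar.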

edyoshikun

Works like a charm! Also super helpful and thanks for this elegant implementation.

* methods to rank nearest neighbors in embeddings
* example script to plot state change of a single track
* test using scaled features

ziw-liu added 2 commits November 20, 2024 16:50

methods to rank nearest neighbors in embeddings

5fc7515

example script to plot state change of a single track

efbf1f5

ziw-liu added the enhancement (New feature or request) and representation (Representation learning, SSL) labels Nov 21, 2024

ziw-liu requested a review from edyoshikun November 21, 2024 01:03

test using scaled features

949df3c

edyoshikun approved these changes Nov 22, 2024

ziw-liu marked this pull request as ready for review November 27, 2024 22:47

ziw-liu merged commit 987874f into main Nov 27, 2024
4 checks passed

ziw-liu deleted the rank-neighbors branch November 27, 2024 22:48

edyoshikun pushed a commit that referenced this pull request Dec 18, 2024

Rank nearest neighbors in embedding space (#211)

351bfac

* methods to rank nearest neighbors in embeddings
* example script to plot state change of a single track
* test using scaled features

edyoshikun pushed a commit that referenced this pull request Dec 19, 2024

Rank nearest neighbors in embedding space (#211)

a5af9e5

* methods to rank nearest neighbors in embeddings
* example script to plot state change of a single track
* test using scaled features
