Consensus Clustering for Robust Bioinformatics Analysis

Clustering plays an important role in many bioinformatics applications, including protein function prediction, population genetics, and gene expression analysis. The results of most clustering algorithms are sensitive to variations in the input data, to the choice of clustering algorithm and its parameters, and to the individual dataset. Consensus clustering (CC) is an extension to clustering algorithms that aims to construct a robust result from those clustering features that are invariant under these sources of variation. As part of CC, stability scores indicate how reliable the resulting clustering is. Here, we present a review that structures the CC approaches in the literature into three principal types, introduce and illustrate the concept of stability scores, and demonstrate the use of CC in applications to simulated and real-world gene expression datasets.
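
To make the idea concrete, the following is a minimal sketch of one common flavour of CC, in which the input data are perturbed by subsampling. It is written in plain Python and is not the API of the ConsensusClustering package; the function names (`kmeans_1d`, `consensus_matrix`, `within_cluster_consensus`), the toy data, and the choice of k-means as the base algorithm are all assumptions made for illustration. The score computed at the end is one simple way to summarise stability, not necessarily the definition used in the manuscript.

```python
import random

def kmeans_1d(values, k, rng, n_iter=20):
    """Plain Lloyd's k-means on a list of floats; returns one label per value."""
    centers = rng.sample(values, k)
    labels = [0] * len(values)
    for _ in range(n_iter):
        # Assign every value to its nearest center.
        labels = [min(range(k), key=lambda c: abs(v - centers[c])) for v in values]
        # Move each center to the mean of its members (unchanged if empty).
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centers[c] = sum(members) / len(members)
    return labels

def consensus_matrix(values, k, n_runs=50, frac=0.8, seed=0):
    """Entry (i, j): fraction of runs in which i and j were clustered together,
    counted only over the runs whose subsample contained both of them."""
    rng = random.Random(seed)
    n = len(values)
    together = [[0] * n for _ in range(n)]
    sampled = [[0] * n for _ in range(n)]
    for _ in range(n_runs):
        idx = rng.sample(range(n), int(frac * n))   # perturb the data by subsampling
        labels = kmeans_1d([values[i] for i in idx], k, rng)
        for a, i in enumerate(idx):
            for b, j in enumerate(idx):
                sampled[i][j] += 1
                together[i][j] += labels[a] == labels[b]
    return [[together[i][j] / sampled[i][j] if sampled[i][j] else 0.0
             for j in range(n)] for i in range(n)]

def within_cluster_consensus(M, members):
    """Illustrative stability score (an assumption): mean consensus over member pairs."""
    pairs = [(i, j) for i in members for j in members if i < j]
    return sum(M[i][j] for i, j in pairs) / len(pairs)

# Toy data (hypothetical): one feature, two well-separated groups of 10 samples.
values = [0.1 * i for i in range(10)] + [5 + 0.1 * i for i in range(10)]
M = consensus_matrix(values, k=2)
print(within_cluster_consensus(M, list(range(10))))      # first group
print(within_cluster_consensus(M, list(range(10, 20))))  # second group
```

Consensus entries near 1 or 0 mark pairs of samples that are consistently grouped together or apart across the perturbed runs, while intermediate values flag pairs whose assignment depends on the perturbation. Rerunning the sketch with a different `k` is a quick way to see how the matrix changes when the requested number of clusters does not match the structure of the data.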

An easy-to-use tutorial is provided in Tutorial.ipynb.

The package is available on CRAN.

Citation

Please cite the following manuscript:

Behnam Yousefi, Benno Schwikowski, "Consensus Clustering for Robust Bioinformatics Analysis," bioRxiv (2024).

