Skip to content
Change the repository type filter

All

    Repositories list

    • TeX
      MIT License
      0001Updated Feb 21, 2025Feb 21, 2025
    • Blazing-Fast Bioinformatic Operations on Python DataFrames
      Python
      Apache License 2.0
      17184Updated Feb 20, 2025Feb 20, 2025
    • Apache License 2.0
      0000Updated Feb 20, 2025Feb 20, 2025
    • A set of native implementation of common bioinformatics algorithms to be used as Arrow-Datafusion or SeQuiLa (Apache Spark) extensions.
      Rust
      Apache License 2.0
      00180Updated Feb 12, 2025Feb 12, 2025
    • Benchmarks of various genomic ranges operations
      Jupyter Notebook
      Apache License 2.0
      1011Updated Feb 12, 2025Feb 12, 2025
    • Self service for Data Science labs
      HCL
      Apache License 2.0
      48021Updated Jan 18, 2025Jan 18, 2025
    • Jupyter Notebook
      Apache License 2.0
      46531Updated Jan 18, 2025Jan 18, 2025
    • Apache DataFusion Comet Spark Accelerator
      Rust
      Apache License 2.0
      182001Updated Nov 24, 2024Nov 24, 2024
    • Jupyter Notebook
      1100Updated Oct 27, 2024Oct 27, 2024
    • phenodb

      Public
      Serverless vector database for deep phenotyping
      Apache License 2.0
      0000Updated Aug 29, 2024Aug 29, 2024
    • Fine-tuning LLaMA 2 for rare disease concept normalization
      Jupyter Notebook
      3000Updated Aug 9, 2024Aug 9, 2024
    • sequila

      Public
      SeQuiLa: Distributed analytics for genomics based on Apache Spark!
      HTML
      Apache License 2.0
      710238Updated Aug 2, 2024Aug 2, 2024
    • PhenoGPT

      Public
      Jupyter Notebook
      MIT License
      6000Updated May 16, 2024May 16, 2024
    • Launcher shortcuts for classic Jupyter Notebook & JupyterLab
      Python
      BSD 3-Clause "New" or "Revised" License
      11000Updated Feb 26, 2024Feb 26, 2024
    • PhenoTagger
      GAP
      MIT License
      16000Updated Jan 24, 2024Jan 24, 2024
    • rnafusion

      Public
      RNA-seq analysis pipeline for detection gene-fusions
      Nextflow
      MIT License
      104000Updated Dec 1, 2023Dec 1, 2023
    • rust-bio

      Public
      This library provides implementations of many algorithms and data structures that are useful for bioinformatics. All provided implementations are rigorously tested via continuous integration.
      Rust
      MIT License
      204000Updated Nov 18, 2023Nov 18, 2023
    • coitrees

      Public
      A very fast interval tree data structure
      Rust
      MIT License
      8000Updated Nov 10, 2023Nov 10, 2023
    • iitii

      Public
      Implicit Interval Tree with Interpolation Index
      Jupyter Notebook
      Apache License 2.0
      4000Updated Nov 9, 2023Nov 9, 2023
    • A little benchmarking tool for Python
      Python
      MIT License
      6000Updated Oct 15, 2023Oct 15, 2023
    • Python
      2000Updated Aug 24, 2023Aug 24, 2023
    • ds-images

      Public
      Shell
      Apache License 2.0
      00154Updated May 25, 2023May 25, 2023
    • sparkseq

      Public
      Scala
      Apache License 2.0
      0000Updated Mar 5, 2023Mar 5, 2023
    • popgen

      Public
      Scala
      0000Updated Mar 5, 2023Mar 5, 2023
    • Scala
      Apache License 2.0
      0000Updated Mar 5, 2023Mar 5, 2023
    • disq

      Public
      A library for manipulating bioinformatics sequencing formats in Apache Spark
      Java
      MIT License
      11100Updated Jan 29, 2023Jan 29, 2023
    • cannoli

      Public
      Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
      Scala
      Apache License 2.0
      17000Updated Jan 29, 2023Jan 29, 2023
    • Python
      2000Updated Jan 17, 2023Jan 17, 2023
    • SeQuiLa recipes, examples and other cloud-related content
      HCL
      Apache License 2.0
      1300Updated Nov 15, 2022Nov 15, 2022
    • pysequila

      Public
      Python wrapper for SeQuiLa: Distributed analytics for genomics based on Apache Spark!
      HTML
      Apache License 2.0
      1233Updated Nov 4, 2022Nov 4, 2022