JosepSampe

Josep Sampé JosepSampe

PhD in Computer Engineering. Researcher interested in distributed systems, cloud computing, data analytics and software engineering.

29 followers · 14 following

Barcelona

Achievements

Organizations

Starred repositories

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,911 2,211 Updated Feb 1, 2025

awslabs / multi-agent-orchestrator

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

Python 4,574 373 Updated Mar 27, 2025

apache / incubator-xtable

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 1,006 167 Updated Mar 11, 2025

dbos-inc / dbos-transact-py

Ultra-Lightweight Durable Execution in Python

Python 614 22 Updated Mar 28, 2025

duckdb / duckdb

DuckDB is an analytical in-process SQL database management system

C++ 28,065 2,185 Updated Mar 27, 2025

CODAIT / stocator

Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.

Java 114 72 Updated May 17, 2024

apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,708 2,390 Updated Mar 28, 2025

tcort / markdown-link-check

checks all of the hyperlinks in a markdown text to determine if they are alive or dead

JavaScript 612 120 Updated Mar 26, 2025

coder / coder

Provision remote development environments via Terraform

Go 9,293 824 Updated Mar 28, 2025

dask / dask-expr

Python 89 27 Updated Jan 21, 2025

Qbeast-io / qbeast-spark

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Scala 225 21 Updated Jan 24, 2025

delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,910 1,799 Updated Mar 27, 2025

clusterlink-net / clusterlink

A Gateway for connecting application services in different domains, networks, and cloud infrastructures

Go 17 18 Updated Mar 10, 2025

gorilla-llm / gorilla-cli

LLMs for your CLI

Python 1,333 76 Updated May 29, 2024

bloomberg / memray

Memray is a memory profiler for Python

Python 13,812 400 Updated Mar 25, 2025

kubernetes-sigs / kueue

Kubernetes-native Job Queueing

Go 1,685 308 Updated Mar 27, 2025

volcano-sh / volcano

A Cloud Native Batch System (Project under CNCF)

Go 4,541 1,041 Updated Mar 27, 2025

kubestellar / kubeflex

A flexible and scalable platform for running Kubernetes control plane APIs.

Go 57 18 Updated Mar 26, 2025

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,933 1,069 Updated Mar 27, 2025

sustainablecomputing / caspian

Go 14 1 Updated May 28, 2024

kubeflow / pipelines

Machine Learning Pipelines for Kubeflow

Python 3,788 1,704 Updated Mar 27, 2025

cmu-db / optd-original

CMU-DB's Cascades optimizer framework

Rust 396 28 Updated Jan 6, 2025

holoviz / panel

Panel: The powerful data exploration & web app framework for Python

Python 5,122 540 Updated Mar 27, 2025

IBM / Anonymized-ETL-Flow-Datasets-for-FSM

Anonymized version of six datasets taken from IBM's DataStage™ production systems and can be used for frequent subgraph mining

Python 8 Updated Jan 28, 2024

kdash-rs / kdash

A simple and fast dashboard for Kubernetes

Rust 2,216 85 Updated Mar 12, 2025

opengeos / leafmap

A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment

Python 3,307 407 Updated Mar 26, 2025

stern / stern

⎈ Multi pod and container log tailing for Kubernetes -- Friendly fork of https://github.com/wercker/stern

Go 3,747 132 Updated Mar 23, 2025

project-codeflare / multi-cluster-app-dispatcher

Holistic job manager on Kubernetes

Go 114 64 Updated Feb 20, 2024

intel / platform-aware-scheduling

Enabling Kubernetes to make pod placement decisions with platform intelligence.

Go 174 45 Updated Jan 29, 2025

linkedin / transport

A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.

Java 299 72 Updated Jan 12, 2024

Josep Sampé JosepSampe

Organizations

Starred repositories

Serverless

multicloud

object-storage

cloud-computing

Kubernetes

Python

Java

Awesome Lists