Skip to content

Conversation

@selmanozleyen
Copy link
Member

hi,

Here is my implementation of sepal with CUDA. It needs revising regarding the parameters passed to the kernels but the algorithm works regardless. The whole simulation is in C++. It uses global memory but I tried to make the access sequential as possible. For now the speedup is seen when the gene count increases but it's still faster than CPU. I didn't do time comparisons yet because the speed of CPU implementation might change with this: scverse/squidpy#1035

@selmanozleyen selmanozleyen self-assigned this Sep 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant