Skip to content

Conversation

@Sireesha-Upenn
Copy link

@Sireesha-Upenn Sireesha-Upenn commented Sep 23, 2020

Repo link

Features implemented :

  • Cpu scan
  • thrust scan
  • gpu naive parallel scan
  • gpu work efficient parallel scan
  • Stream compaction on cpu and gpu

I tried using the shared memory optimization for the work efficient implementation. You can check the result by un-commenting #define SHARED_MEMORY in main.cpp. I am getting an error when trying to write to shared memory in dev_kernWriteToSharedMemory. I will update this repo later on if I figure it out.
Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant