API/GPU/UCX: Create GPU-side Device API #704

michal-shalev · 2025-08-20T14:27:54Z

What?

Add GPU device function stubs for the UCX backend, templated on coordination level (thread, warp, block, grid), covering memory transfers, signal transfers, and status checking.

Why?

To establish the foundation for GPU-initiated transfers.
See also: #705

How?

Added five templated device functions in nixl_device.cuh that return NIXL_ERR_NOT_SUPPORTED as placeholders:

- nixlGpuPostSingleWriteXferReq: memory transfers
- nixlGpuPostSignalXferReq: signal transfers
- nixlGpuPostPartialWriteXferReq: partial transfers
- nixlGpuPostWriteXferReq: full transfers
- nixlGpuGetXferStatus: status checking
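
For readers unfamiliar with the pattern, here is a minimal sketch of what a stub templated on coordination level might look like. Everything in it is an assumption made for illustration: the `sketch` namespace, the `Level` and `Status` enums, the `XferReq` handle, and both function signatures are hypothetical and are not the declarations in nixl_device.cuh, which this description does not show. The `__device__` fallback macro is only there so the sketch also compiles as plain C++.

```cpp
// Hypothetical sketch: names, enums, and parameters are illustrative
// assumptions, not the actual declarations in nixl_device.cuh.
#include <cstddef>

// Fall back to plain C++ when not compiled with a CUDA compiler.
#ifndef __CUDACC__
#define __device__
#endif

namespace sketch {

// Assumed coordination levels, mirroring thread/warp/block/grid above.
enum class Level { THREAD, WARP, BLOCK, GRID };

// Assumed status codes; only the "not supported" placeholder matters here.
enum class Status { SUCCESS = 0, ERR_NOT_SUPPORTED = -1 };

// Opaque handle standing in for whatever request object the host prepares.
struct XferReq;

// Stub for posting a single write. The level is a template parameter so a
// later implementation could specialize on it at compile time; for now
// every level reports "not supported".
template <Level level = Level::THREAD>
__device__ inline Status postSingleWrite(XferReq* /*req*/, std::size_t /*bytes*/) {
    return Status::ERR_NOT_SUPPORTED;
}

// Stub for polling a request; also a placeholder at every level.
template <Level level = Level::THREAD>
__device__ inline Status getXferStatus(XferReq* /*req*/) {
    return Status::ERR_NOT_SUPPORTED;
}

} // namespace sketch
```

With stubs of this shape, a caller can already be written against the final call sites (for example `sketch::postSingleWrite<sketch::Level::WARP>(req, bytes)`) and handle the not-supported status until the backend is filled in.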

copy-pr-bot · 2025-08-20T14:27:58Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

github-actions · 2025-08-20T14:28:03Z

👋 Hi michal-shalev! Thank you for contributing to ai-dynamo/nixl.

Your PR reviewers will review your contribution then trigger the CI to test your changes.

🚀

src/api/cuda/ucx/nixl_device.cuh

yosefe · 2025-08-25T06:59:43Z

/build

yosefe · 2025-08-28T15:42:03Z

/build

michal-shalev · 2025-08-28T15:57:00Z

/build

Signed-off-by: Michal Shalev <[email protected]>

michal-shalev · 2025-08-28T16:06:26Z

/build

michal-shalev requested a review from a team as a code owner August 20, 2025 14:27

pull-request-size bot added the size/L label Aug 20, 2025

github-actions bot added the external-contribution label Aug 20, 2025

michal-shalev requested a review from yosefe August 20, 2025 14:28

michal-shalev self-assigned this Aug 20, 2025

michal-shalev requested review from brminich, iyastreb, tvegas1, ovidiusm, tstamler and aranadive August 20, 2025 14:29

michal-shalev force-pushed the create-device-api branch from 398fee0 to de2f4cf Compare August 20, 2025 14:46

michal-shalev mentioned this pull request Aug 20, 2025

Create host-side GPU transfer request API #705

Merged

michal-shalev requested a review from mkhazraee August 21, 2025 08:43

brminich reviewed Aug 21, 2025

michal-shalev force-pushed the create-device-api branch 3 times, most recently from 5c32b17 to 3d2a8a4 Compare August 24, 2025 14:26

copy-pr-bot bot temporarily deployed to SWX_AWS August 24, 2025 14:26 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 24, 2025 14:26 Inactive

copy-pr-bot bot temporarily deployed to SWX_AWS August 24, 2025 14:26 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 24, 2025 14:29 Inactive

michal-shalev changed the title from "API/CUDA/UCX: Create Device API" to "API/GPU/UCX: Create Device API" Aug 25, 2025

michal-shalev force-pushed the create-device-api branch from 3d2a8a4 to f0ce524 Compare August 25, 2025 19:20

copy-pr-bot bot temporarily deployed to SWX_AWS August 25, 2025 19:20 Inactive

copy-pr-bot bot temporarily deployed to SWX_AWS August 28, 2025 15:38 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 15:41 Inactive

yosefe approved these changes Aug 28, 2025

yosefe enabled auto-merge (squash) August 28, 2025 15:43

michal-shalev force-pushed the create-device-api branch from 6509ac0 to 9d5cd2e Compare August 28, 2025 15:55

copy-pr-bot bot temporarily deployed to SWX_AWS August 28, 2025 15:55 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 15:55 Inactive

copy-pr-bot bot temporarily deployed to SWX_AWS August 28, 2025 15:55 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 15:56 Inactive

michal-shalev force-pushed the create-device-api branch from 9d5cd2e to bde1fab Compare August 28, 2025 15:56

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 15:56 Inactive

copy-pr-bot bot had a problem deploying to SWX_AWS August 28, 2025 15:56 Failure

copy-pr-bot bot temporarily deployed to SWX_AWS August 28, 2025 15:56 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 16:00 Inactive

API/GPU/UCX: Create Device API

e6c93d2

Signed-off-by: Michal Shalev <[email protected]>

michal-shalev force-pushed the create-device-api branch from bde1fab to e6c93d2 Compare August 28, 2025 16:06

copy-pr-bot bot temporarily deployed to SWX_AWS August 28, 2025 16:06 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 16:06 Inactive

copy-pr-bot bot temporarily deployed to SWX_AWS August 28, 2025 16:06 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 16:12 Inactive

yosefe merged commit add1824 into ai-dynamo:main Aug 28, 2025
16 of 17 checks passed
