Implement __dlpack__ dunder for pylibcudf columns #18566

Draft: wants to merge 1 commit into branch-25.06

Conversation

seberg (Contributor) commented Apr 24, 2025

This implements the `__dlpack__` dunder (and `__dlpack_device__`), which could then also be forwarded to libcudf columns.

There is a bit of a clash with the old dlpack implementation. It is similar, but also different: the old one is table-centric and always copies, while this one is column-centric and copies only if requested.
Thus, I kept it as a detail API.
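For context, a minimal sketch of the protocol contract this targets (the keyword-only signature follows the Python array API standard; `ColumnLike` and everything inside it are hypothetical stand-ins, not the actual pylibcudf implementation):

```python
from enum import IntEnum


class DLDeviceType(IntEnum):
    # Small subset of dlpack's DLDeviceType enum, for illustration.
    kDLCPU = 1
    kDLCUDA = 2


class ColumnLike:
    """Hypothetical stand-in for a pylibcudf Column."""

    def __dlpack_device__(self):
        # Report where the data lives as (device_type, device_id).
        return (DLDeviceType.kDLCUDA, 0)

    def __dlpack__(self, *, stream=None, max_version=None,
                   dl_device=None, copy=None):
        # Must return a PyCapsule wrapping a DLManagedTensor (or its
        # versioned variant).  `stream` is the consumer's stream so the
        # producer can order its work; `copy=True` requests a copy,
        # `copy=False` forbids one, and `copy=None` lets the producer
        # decide (the "copies only if requested" behavior above).
        raise NotImplementedError("illustration only")
```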

The `from_dlpack()` can/should be extended to support at least 1-D objects that implement `__dlpack__`.
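A rough sketch of what that extension could look like (`_from_dlpack_capsule` is a hypothetical helper standing in for the existing capsule-based path):

```python
def from_dlpack(obj):
    # Hypothetical dispatch: accept any object implementing __dlpack__,
    # not just a raw DLManagedTensor capsule.
    if hasattr(obj, "__dlpack__"):
        capsule = obj.__dlpack__()
    else:
        capsule = obj
    return _from_dlpack_capsule(capsule)  # hypothetical capsule-based path
```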

One of the more complex things here is the stream synchronization; unfortunately, it seems very hard to test reliably in practice. (My attempts didn't produce a failure when they should have.)


Marking as draft, since some discussion/thought is likely needed. For one, while the C++ code exists here, I am not sure I believe in exposing this in C++ (at this time). And it might also work to create a helper/intermediate object rather than doing it all here?

(Also wondering if it wouldn't be easier to just vendor the dlpack header?)

CC @vyasr, since I think you were interested in this.

copy-pr-bot bot commented Apr 24, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.


@github-actions github-actions bot added libcudf Affects libcudf (C++/CUDA) code. Python Affects Python cuDF API. CMake CMake build issue pylibcudf Issues specific to the pylibcudf package labels Apr 24, 2025
@seberg seberg added non-breaking Non-breaking change improvement Improvement / enhancement to an existing function labels Apr 24, 2025
Matt711 (Contributor) commented Apr 24, 2025

Just noting a relevant issue I've worked on recently.

vyasr (Contributor) commented Apr 25, 2025

Thanks Sebastian! Yes, this is definitely of interest. Probably the most relevant similar things that have happened recently are #18402 and #15370, the corresponding Arrow protocols.

> One of the more complex things here is the stream synchronization; unfortunately, it seems very hard to test reliably in practice. (My attempts didn't produce a failure when they should have.)

Possibly because everything in cudf executes on the default stream anyway. You would have to explicitly request a different (non-blocking) stream to test this, I think. Not sure if you tried that.
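Something along these lines, perhaps (a sketch only, using CuPy; whether it reliably exposes a missing synchronization is exactly the open question above):

```python
import cupy as cp

# Enqueue work on a non-blocking, non-default stream so the producer
# cannot rely on default-stream ordering.
stream = cp.cuda.Stream(non_blocking=True)
with stream:
    data = cp.arange(10_000_000, dtype=cp.float64)
    data = cp.sqrt(data)  # keep the stream busy

# A consumer would then pass its own stream to __dlpack__ so the
# producer can synchronize, e.g. (with `col` a hypothetical column):
# capsule = col.__dlpack__(stream=stream.ptr)
```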

> Marking as draft, since some discussion/thought is likely needed. For one, while the C++ code exists here, I am not sure I believe in exposing this in C++ (at this time). And it might also work to create a helper/intermediate object rather than doing it all here?

I'm not sure that I understand. The dlpack spec is intended to support C-level interfacing as well as Python, so are you just asking whether libcudf wants to do that?

> (Also wondering if it wouldn't be easier to just vendor the dlpack header?)

As in vendoring vs cloning as part of the build? We don't vendor too much in RAPIDS, but the dlpack header is small enough that it could probably be justified.

seberg (Contributor, Author) commented Apr 25, 2025

> The dlpack spec is intended to support C-level interfacing as well as Python

Yeah, unfortunately we don't have as clearly defined an interface for exchange in C/C++. So I think it makes sense to make it public (with a slightly different signature).
I.e. the C++ function here asks the caller to do the ownership tracking, and that may always be the case (you could add an overload taking a `shared_ptr<column>` as input).

But yeah, we could just expose it outside `detail`; I might make two modifications:

  1. Skip the `to_host` and `copy` arguments.
  2. Add a `DLVersion max_version` struct to be passed in.

Maybe it is actually safer to make it a `to_dlpack_v1` function in C++.

> You would have to explicitly request a different (non-blocking) stream

Yeah, I was using a second non-blocking cupy stream. But I should try mixing host/device copies and kernel launches, rather than two kernel launches, to improve the chance of seeing something, maybe.
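For illustration, the kind of mix meant here (a sketch assuming the kernel runs on a non-blocking stream while the host copy is issued on the default stream):

```python
import cupy as cp

stream = cp.cuda.Stream(non_blocking=True)
with stream:
    a = cp.zeros(50_000_000)
    a += 1.0  # kernel launch on the non-blocking stream

# Device-to-host copy enqueued on the (default) null stream: without
# correct cross-stream ordering, this could still observe the zeros.
host = a.get(stream=cp.cuda.Stream.null)
assert (host == 1.0).all()  # races are flaky; this may spuriously pass
```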

> As in vendoring vs cloning as part of the build?

Yeah, most projects do, and I think it should be a build-time and not a runtime dependency in conda. But I can also just move it there. DLPack should take care not to break ABI in unexpected ways. A possible ABI change may also make the `to_dlpack_v1` naming clearer.

vyasr (Contributor) commented Apr 28, 2025

I would be fine with vendoring the header. dlpack is simpler than arrow, and we are also not using a large helper library like nanoarrow, which is a moving target. Vendoring a single header for this would probably be simpler and also address issues like #12175.

Do we need to support multiple versions? Can we just go straight to the newer versioned dlpack structs?

seberg (Contributor, Author) commented Apr 29, 2025

> Do we need to support multiple versions? Can we just go straight to the newer versioned dlpack structs?

Maybe we should, just to keep things simple and since nobody has complained much about the lack yet.
Roll-out of versioned support is still in progress (e.g. torch doesn't have it yet, while cupy/numpy have supported it for long enough now).
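For reference, the negotiation the versioned protocol adds is small; a sketch of the producer side, with hypothetical helper names:

```python
def __dlpack__(self, *, stream=None, max_version=None, copy=None):
    # Per the DLPack spec, consumers pass max_version=(major, minor).
    # Producers return a "dltensor_versioned" capsule when the consumer
    # understands DLPack >= 1.0, and fall back to the legacy "dltensor"
    # capsule otherwise.
    if max_version is not None and max_version[0] >= 1:
        return self._make_versioned_capsule(stream)  # hypothetical helper
    return self._make_legacy_capsule(stream)         # hypothetical helper
```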

> I would be fine with vendoring the header.

👍

vyasr (Contributor) commented May 7, 2025

Practically speaking, I think Arrow data interchange is more valuable for cudf than dlpack, especially on the export side. Since nested types involve multiple buffers, using dlpack for that data requires looping over each buffer, which is really only supportable from Python at the pylibcudf level, since we don't expose those buffers in the public pandas- or polars-like APIs. We're still building out the pylibcudf API for real public consumption, so if we can get our dlpack ducks in a row in time for that, I think that would be sufficient.

@vyasr vyasr moved this to In Progress in cuDF Python May 21, 2025
Labels: CMake, improvement, libcudf, non-breaking, pylibcudf, Python
Projects: cuDF Python (Status: In Progress)