Utilize Amazon S3 condition write to support concurrent write #2843

Open
Cpaulyz opened this issue Sep 3, 2024 · 7 comments
Labels: binding/rust, enhancement, on-hold

Comments

Cpaulyz commented Sep 3, 2024

Description

Hi, I noticed that Amazon S3 now supports conditional writes (https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-s3-conditional-writes/). Is it possible to use this feature to replace dynamodb-lock and support concurrent writes natively?
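
For context, here is a minimal sketch of the put-if-absent semantics that a conditional write provides, and of how it could arbitrate which writer creates a given commit file without an external lock. It is an in-memory stand-in built only on the Rust standard library. `InMemoryStore`, `put_if_absent`, `PutError`, and `commit_key` are hypothetical names for illustration; this is not the S3 API and not how delta-rs implements it.

```rust
use std::collections::HashMap;
use std::sync::Mutex;

/// Returned when the key already exists (standing in for the store
/// rejecting a conditional write).
#[derive(Debug, PartialEq)]
pub enum PutError {
    AlreadyExists,
}

/// Hypothetical in-memory stand-in for an object store.
#[derive(Default)]
pub struct InMemoryStore {
    objects: Mutex<HashMap<String, Vec<u8>>>,
}

impl InMemoryStore {
    /// Writes `body` under `key` only if nothing is stored there yet.
    /// The existence check and the insert happen under one lock, so the
    /// pair is atomic, which is the property a conditional write gives.
    pub fn put_if_absent(&self, key: &str, body: &[u8]) -> Result<(), PutError> {
        let mut objects = self.objects.lock().unwrap();
        if objects.contains_key(key) {
            return Err(PutError::AlreadyExists);
        }
        objects.insert(key.to_string(), body.to_vec());
        Ok(())
    }

    /// Returns a copy of the object stored under `key`, if any.
    pub fn get(&self, key: &str) -> Option<Vec<u8>> {
        self.objects.lock().unwrap().get(key).cloned()
    }
}

/// Builds a commit file key from a zero-padded, 20-digit version number.
pub fn commit_key(version: u64) -> String {
    format!("_delta_log/{:020}.json", version)
}

fn main() {
    let store = InMemoryStore::default();
    let key = commit_key(1);

    // The first writer creates the commit; the second is rejected and
    // would have to retry against a later version.
    assert_eq!(store.put_if_absent(&key, b"writer-a"), Ok(()));
    assert_eq!(
        store.put_if_absent(&key, b"writer-b"),
        Err(PutError::AlreadyExists)
    );
    assert_eq!(store.get(&key), Some(b"writer-a".to_vec()));
}
```

The point of the sketch is that the store itself decides the winner, so no separate lock table is needed to prevent two writers from overwriting the same commit.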

Cpaulyz added the enhancement label Sep 3, 2024
thomasfrederikhoeck (Contributor) commented

I guess the following upstream issue needs to be closed first: apache/arrow-rs#6285

rtyler added the binding/rust label Sep 5, 2024
rtyler self-assigned this Sep 5, 2024
ion-elgreco added the on-hold label Sep 7, 2024
danielgafni commented Sep 29, 2024

Just to confirm, this won't allow parallel writing (for example, writing 100 partitions at once), but will remove the dependency on DynamoDB as the locking mechanism, right?

Cpaulyz (Author) commented Oct 7, 2024

> Just to confirm, this won't allow parallel writing (for example, writing 100 partitions at once), but will remove the dependency on DynamoDB as the locking mechanism, right?

Yes. I think so.
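
As a rough sketch of the part the conditional write would take over from the lock: each writer tries to create the next commit version, and a writer that loses the race retries at a later version, so commits end up serialized with one version per writer. The example uses only the Rust standard library. `CommitLog`, `try_commit`, `commit_with_retry`, and `run_writers` are hypothetical names, and the retry loop is deliberately simplified. A real writer would presumably re-read the log and check for conflicting changes before retrying, which this sketch does not model.

```rust
use std::collections::HashSet;
use std::sync::{Arc, Mutex};
use std::thread;

/// Hypothetical stand-in for a log directory on a store that offers an
/// atomic put-if-absent. Only the set of claimed versions is tracked.
#[derive(Default)]
struct CommitLog {
    versions: Mutex<HashSet<u64>>,
}

impl CommitLog {
    /// Returns true if this call created the commit for `version`,
    /// false if another writer had already created it.
    fn try_commit(&self, version: u64) -> bool {
        self.versions.lock().unwrap().insert(version)
    }
}

/// Keeps trying successive versions until one conditional write succeeds,
/// then returns the version this writer obtained.
fn commit_with_retry(log: &CommitLog, start: u64) -> u64 {
    let mut version = start;
    while !log.try_commit(version) {
        // Lost the race for this version: move on to the next one.
        version += 1;
    }
    version
}

/// Spawns `n` writers that all start from version 0 and returns the
/// versions they ended up committing, sorted ascending.
fn run_writers(n: u64) -> Vec<u64> {
    let log = Arc::new(CommitLog::default());
    let handles: Vec<_> = (0..n)
        .map(|_| {
            let log = Arc::clone(&log);
            thread::spawn(move || commit_with_retry(&log, 0))
        })
        .collect();

    let mut committed: Vec<u64> = handles
        .into_iter()
        .map(|handle| handle.join().unwrap())
        .collect();
    committed.sort();
    committed
}

fn main() {
    // Every writer gets a distinct version and no version is skipped.
    let committed = run_writers(8);
    assert_eq!(committed, (0..8).collect::<Vec<u64>>());
    println!("committed versions: {:?}", committed);
}
```

Which thread gets which version varies from run to run, but the sorted set of versions is always contiguous, because a writer only moves past a version that some other writer has already claimed.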

rtyler (Member) commented Oct 7, 2024

If you're up for doing some experimentation (on non-production workloads), I believe that our conditional put support will "just work" for S3.

rtyler added this to the Rust v1.0.0 milestone Oct 8, 2024
satyanmandavilli commented Nov 10, 2024

apache/arrow-rs#6285 is closed now. Is there any update on this issue? Looking forward to removing the DynamoDB dependency.

gianghq commented Nov 12, 2024

Keen on this as well.

ion-elgreco (Collaborator) commented

> apache/arrow-rs#6285 is closed now. Is there any update on this issue? Looking forward to removing the DynamoDB dependency.

You will have to wait for the next arrow release that aligns with datafusion and object store. Even then, there are still bugs in datafusion that will stop us from actually upgrading for some time.
