Skip to content

Conversation

@ehigham
Copy link
Member

@ehigham ehigham commented Aug 1, 2025

This PR adds EXPERIMENTAL support for query-on-batch-in-batch by running the driver process locally while offloading partition computation to Batch. This differs from the traditional "batch" backend which requires an additional driver job.

By adding a new experimental feature, my hope is to commit this small proof of concept and then improve it incrementally.

Key changes:

  • Added pyServiceBackend method to create a ServiceBackend from Python
  • Added proper interrupt handling in ServiceBackend to cancel jobs when interrupted
  • Implemented retry logic for job group creation to handle situations where multiple driver jobs create batch updates concurrently.
  • Added a new experimental Python API for initializing Hail with a local driver and Batch backend

Security Assessment

This change has no impact on the hail batch service in GCP.

Copy link
Member Author

ehigham commented Aug 1, 2025

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

This stack of pull requests is managed by Graphite. Learn more about stacking.

@ehigham ehigham marked this pull request as ready for review August 1, 2025 05:16
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 9803dda to 4900338 Compare August 1, 2025 14:58
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from 970b53e to 491b8dc Compare August 1, 2025 14:58
@ehigham ehigham force-pushed the ehigham/experimental-init branch 2 times, most recently from bfb03ff to 969af37 Compare August 1, 2025 22:16
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from 0fab4fd to 9d9d912 Compare August 1, 2025 22:16
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 969af37 to 65ae981 Compare August 2, 2025 04:21
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from 9d9d912 to 2e55e71 Compare August 2, 2025 04:21
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 65ae981 to b53a9fb Compare August 2, 2025 04:59
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from 2e55e71 to c97beef Compare August 2, 2025 04:59
@ehigham ehigham force-pushed the ehigham/experimental-init branch 2 times, most recently from cced9ef to f4eebb5 Compare August 4, 2025 16:57
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from a35e964 to 1b2e3ed Compare August 4, 2025 16:57
@ehigham ehigham force-pushed the ehigham/experimental-init branch from f4eebb5 to 8f3b9da Compare August 4, 2025 17:21
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from 1b2e3ed to d6df5c0 Compare August 4, 2025 17:21
@ehigham ehigham marked this pull request as draft August 4, 2025 18:52
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 8f3b9da to cd9fa56 Compare August 4, 2025 21:06
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from d6df5c0 to 11d12b1 Compare August 4, 2025 21:06
@ehigham ehigham force-pushed the ehigham/experimental-init branch 2 times, most recently from 9ec4390 to 18a198a Compare August 5, 2025 19:08
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from 11d12b1 to a492cc5 Compare August 5, 2025 19:08
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 18a198a to 4b0a84a Compare September 15, 2025 20:42
@ehigham ehigham force-pushed the ehigham/tmpdir-per-query branch from a492cc5 to 4c46605 Compare September 15, 2025 20:42
Base automatically changed from ehigham/tmpdir-per-query to main September 15, 2025 21:36
@ehigham ehigham force-pushed the ehigham/experimental-init branch 2 times, most recently from e2262ff to 7b85699 Compare October 27, 2025 19:08
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch 2 times, most recently from 73ace30 to 035da72 Compare October 27, 2025 20:14
@ehigham ehigham force-pushed the ehigham/experimental-init branch 2 times, most recently from 52bac9f to 2298818 Compare October 28, 2025 03:10
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch from 035da72 to 7699b2f Compare October 28, 2025 03:10
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 2298818 to c3f6f77 Compare October 28, 2025 19:39
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch from 7699b2f to 58666e6 Compare October 28, 2025 19:39
@ehigham ehigham force-pushed the ehigham/experimental-init branch from c3f6f77 to e0b0e08 Compare October 29, 2025 16:20
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch from 58666e6 to 800a403 Compare October 29, 2025 16:20
@ehigham ehigham force-pushed the ehigham/experimental-init branch from e0b0e08 to 6e30672 Compare October 29, 2025 16:32
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch from 800a403 to a2a0217 Compare October 29, 2025 16:32
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 6e30672 to 8b200ea Compare October 30, 2025 14:59
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch from a2a0217 to 8b81868 Compare October 30, 2025 14:59
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 8b200ea to effc98f Compare October 30, 2025 15:18
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch 2 times, most recently from 9219faf to 6f58b61 Compare October 30, 2025 19:26
@ehigham ehigham force-pushed the ehigham/experimental-init branch 3 times, most recently from a0050e0 to 64c5584 Compare October 31, 2025 19:12
@ehigham ehigham marked this pull request as ready for review October 31, 2025 19:23
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 64c5584 to 7f247d9 Compare November 3, 2025 15:21
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch from 6f58b61 to e7b4dea Compare November 3, 2025 15:21
@ehigham ehigham force-pushed the ehigham/experimental-init branch 2 times, most recently from d46195e to 44d1bc5 Compare November 3, 2025 22:01
@ehigham ehigham force-pushed the ehigham/experimental-init branch from 44d1bc5 to 71be1cd Compare November 4, 2025 20:26
@ehigham ehigham force-pushed the ehigham/query-cloud-credentials branch from e7b4dea to a0ce020 Compare November 4, 2025 20:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants