@yuandrew (Contributor) commented Dec 16, 2025

What was changed

Always send the shutdown_worker RPC, and decouple disabling eager workflow start from worker heartbeat unregistration during worker shutdown.

Why?

The shutdown_worker RPC doesn't indicate that the worker has fully shut down, only that shutdown has started. The server (and other observers) can tell that a worker has fully shut down by checking that no heartbeat has arrived within the heartbeat interval after the ShuttingDown status was reported.
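
As a rough illustration of that rule, here is a minimal sketch of the detection check; the type and field names (WorkerInfo, last_heartbeat_at, heartbeat_interval) are hypothetical and not taken from this PR or the server:

```rust
use std::time::{Duration, Instant};

// Hypothetical types for illustration only; not part of this PR.
enum WorkerStatus {
    Running,
    ShuttingDown,
}

struct WorkerInfo {
    status: WorkerStatus,
    last_heartbeat_at: Instant,
    heartbeat_interval: Duration,
}

impl WorkerInfo {
    /// A worker counts as fully shut down once it has reported ShuttingDown
    /// and no heartbeat has arrived within one heartbeat interval.
    fn is_fully_shut_down(&self, now: Instant) -> bool {
        matches!(self.status, WorkerStatus::ShuttingDown)
            && now.duration_since(self.last_heartbeat_at) > self.heartbeat_interval
    }
}
```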

Checklist

  1. Closes

  2. How was this tested:

  3. Any docs updates needed?

Note

Always send the shutdown_worker RPC and refactor worker unregistration into two steps (disable eager start, then finalize heartbeat cleanup), updating APIs and tests accordingly.

  • Worker shutdown behavior
    • Always calls shutdown_worker RPC during shutdown; sets status to WorkerStatus::ShuttingDown.
    • Removes client-side mutation of heartbeat status in shutdown_worker; client only fills common heartbeat fields.
    • finalize_shutdown now calls workers().finalize_unregister(...) after shutdown completes.
  • Client worker registry (slot providers/heartbeat)
    • Replaces unregister_worker with a two-step API (see the sketch after this list):
      • unregister_slot_provider(worker_instance_key) to disable eager workflow start early.
      • finalize_unregister(worker_instance_key) to remove the worker from all_workers and the heartbeat manager; errors if the worker is still present in slot_providers.
    • Worker::initiate_shutdown and replace_client updated to use the new two-step flow.
  • Tests and mocks
    • Update unit/integration tests to use unregister_slot_provider then finalize_unregister; add worker_unregister_order test for enforcement.
    • Expect shutdown_worker to be invoked once (success or best-effort failure tolerated); heartbeat status expectations changed to ShuttingDown.
    • Minor formatting updates in poll_buffer.rs log asserts.
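
To make the two-step registry flow concrete, here is a minimal sketch assuming a simplified registry; the real sdk-core signatures (key types, return types, the heartbeat manager) may differ:

```rust
use std::collections::{HashMap, HashSet};

// Simplified sketch of the two-step unregistration; not the actual sdk-core code.
struct WorkerRegistry {
    slot_providers: HashSet<u64>,  // workers still eligible for eager workflow start
    all_workers: HashMap<u64, ()>, // worker_instance_key -> worker handle (simplified)
}

impl WorkerRegistry {
    /// Step 1 (from initiate_shutdown): stop offering this worker for eager
    /// workflow start, but keep it registered and heartbeating.
    fn unregister_slot_provider(&mut self, worker_instance_key: u64) {
        self.slot_providers.remove(&worker_instance_key);
    }

    /// Step 2 (from finalize_shutdown, after shutdown_worker was sent): drop
    /// the worker from the registry and heartbeat tracking. Errors if step 1
    /// was never performed.
    fn finalize_unregister(&mut self, worker_instance_key: u64) -> Result<(), String> {
        if self.slot_providers.contains(&worker_instance_key) {
            return Err("slot provider still registered; call unregister_slot_provider first".into());
        }
        self.all_workers.remove(&worker_instance_key);
        Ok(())
    }
}
```

The ordering mirrors the description above: initiate_shutdown disables eager start early, and finalize_shutdown only removes the worker (and its heartbeat) once the shutdown_worker RPC has gone out.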

Written by Cursor Bugbot for commit 1849b16. This will update automatically on new commits.

@yuandrew marked this pull request as ready for review December 17, 2025 19:46
@yuandrew requested a review from a team as a code owner December 17, 2025 19:46
     .workers()
-    .unregister_worker(self.worker_instance_key);
+    .unregister_slot_provider(self.worker_instance_key);
 }
Cursor Bugbot commented:

Bug: Shutdown status not set on initiation

initiate_shutdown no longer updates self.status to WorkerStatus::ShuttingDown. Callers that use initiate_shutdown to begin shutdown (before awaiting shutdown/finalize_shutdown) will keep sending heartbeats with Running, delaying/obscuring shutdown signaling and breaking server-side “seen ShuttingDown then no heartbeat” detection.


@yuandrew (Contributor, Author) replied:

We want the ShuttingDown state to be set when we send the shutdown_worker RPC call.

// This is a best-effort call and we can still shut down the worker if it fails
match self.client.shutdown_worker(sticky_name, heartbeat).await {
    Err(err)
        if !matches!(
Cursor Bugbot commented:

Bug: Empty sticky queue sent on shutdown

shutdown() now always calls shutdown_worker and uses unwrap_or_default() for sticky_name, which becomes an empty string when no sticky queue is used (e.g., max_cached_workflows == 0 or workflow polling is disabled). If the server treats an empty sticky_task_queue as invalid once implemented, this can cause noisy warnings and failed shutdown signaling.
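
For reference, a tiny self-contained illustration of the behavior being flagged; the variable is hypothetical, standing in for the Option the diff unwraps:

```rust
fn main() {
    // With no sticky queue (e.g. the workflow cache is disabled), the sticky name is None...
    let sticky_name: Option<String> = None;
    // ...so unwrap_or_default() yields an empty string, which is what the RPC would then carry.
    let sent = sticky_name.unwrap_or_default();
    assert_eq!(sent, "");
}
```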


@yuandrew (Contributor, Author) replied:

This is intentional; we want to always send shutdown_worker, not just when a sticky queue is in use.

@Sushisource (Member) left a comment:

Looking good to me (aside from a few tests that look like they need updating), just one minor thing:

slot_vec.retain(|info| info.worker_id != worker_instance_key);
if slot_vec.is_empty() {
    self.slot_providers.remove(&slot_key);
if let Some(slot_vec) = self.slot_providers.get(&slot_key) {
@Sushisource (Member) commented:

We just did this check above, no? Could this ever happen?
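
For context, one way the second lookup could be folded into the first (a sketch only, not the PR's code; names mirror the snippet above):

```rust
use std::collections::{hash_map::Entry, HashMap};

struct SlotInfo {
    worker_id: u64,
}

// Sketch: resolve the key once via the entry API, then retain and
// conditionally remove. Not the PR's code; names mirror the snippet above.
fn remove_worker_slot(
    slot_providers: &mut HashMap<String, Vec<SlotInfo>>,
    slot_key: &str,
    worker_instance_key: u64,
) {
    if let Entry::Occupied(mut slot_vec) = slot_providers.entry(slot_key.to_owned()) {
        slot_vec.get_mut().retain(|info| info.worker_id != worker_instance_key);
        if slot_vec.get().is_empty() {
            slot_vec.remove();
        }
    }
}
```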
