UN-1824 [FIX] Return HTTP 409 when tool image not found in container registry #1757

pk-zipstack · 2026-01-27T11:13:03Z

What

Return HTTP 409 Conflict instead of HTTP 200/422 when a tool image is not found in the container registry during API deployment execution

Why

Previously, when a tool image was not available in the container registry, the API returned either HTTP 200 with execution_status: "ERROR" in the response body, or HTTP 422 with a generic error message.
This made it difficult for API consumers to distinguish configuration issues (tool not deployed) from execution errors (tool ran but failed).
HTTP 409 Conflict better represents a situation where the platform state conflicts with what the request requires.

How

Added ToolImageNotFoundError exception in runner to catch Docker image pull failures
Moved container config creation inside try block in runner.py to properly catch the exception during image pull
Added error_code field to RunnerContainerRunResponse DTO to propagate specific error types
Added ToolNotFoundInRegistryError exception in tool-sandbox that's raised when error_code matches
Added ToolNotFoundInRegistry API exception (HTTP 409) in backend
Updated api_deployment_views.py to detect "not found in container registry" pattern in both top-level and file-level errors
Fixed _process_final_output to preserve original error message when secondary exceptions occur during finalization
Skip retry attempts for tool not found errors (configuration issue, not transient)
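
The propagation chain listed above can be sketched roughly as follows. This is an illustration under assumptions, not the PR's actual code: the `TOOL_IMAGE_NOT_FOUND` value, the reduced field set of `RunnerContainerRunResponse`, and the helpers `raise_for_error_code` and `run_with_retries` are hypothetical; only the exception and DTO names come from the description.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Assumed error-code value; the real constant in the PR may differ.
TOOL_IMAGE_NOT_FOUND = "TOOL_IMAGE_NOT_FOUND"


class ToolNotFoundInRegistryError(Exception):
    """Raised in tool-sandbox when the runner reports a missing tool image."""


@dataclass
class RunnerContainerRunResponse:
    """Simplified stand-in for the DTO; only the fields needed here."""

    status: str
    error: Optional[str] = None
    error_code: Optional[str] = None


def raise_for_error_code(response: RunnerContainerRunResponse) -> None:
    """Hypothetical helper: turn a matching error_code into an exception."""
    if response.error_code == TOOL_IMAGE_NOT_FOUND:
        raise ToolNotFoundInRegistryError(response.error or "tool image not found")


def run_with_retries(
    run_tool: Callable[[], RunnerContainerRunResponse], max_attempts: int = 3
) -> RunnerContainerRunResponse:
    """Hypothetical retry loop that never retries a missing-image error."""
    last_error: Optional[Exception] = None
    for _ in range(max_attempts):
        try:
            response = run_tool()
            raise_for_error_code(response)
            return response
        except ToolNotFoundInRegistryError:
            # Configuration issue, not transient: retrying cannot help.
            raise
        except RuntimeError as exc:
            # Stand-in for transient failures that are worth retrying.
            last_error = exc
    raise RuntimeError("all attempts failed") from last_error
```

Carrying a machine-readable `error_code` alongside the message lets each layer decide how to react without parsing text, which is why the retry loop can bail out on the first attempt.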

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)

No breaking changes for normal workflows: only the HTTP status code changes, from 200/422 to 409, for this specific error condition.
API consumers that check for specific HTTP status codes may need to handle 409 for tool configuration errors.
The response body structure remains unchanged.
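
For API consumers, handling the new status code might look something like the sketch below. It is only an illustration: `classify_execution_response` and its category labels are hypothetical, and it assumes `execution_status` is a top-level key of the response body, which this description does not confirm.

```python
from typing import Any, Mapping


def classify_execution_response(status_code: int, body: Mapping[str, Any]) -> str:
    """Hypothetical consumer-side helper: label an API deployment response."""
    if status_code == 409:
        # Per this PR description: the tool image is missing from the registry.
        return "tool_configuration_error"
    if status_code == 422:
        return "execution_error"
    if status_code == 200 and body.get("execution_status") == "ERROR":
        # Earlier behaviour described under "Why": an error inside a 200 body.
        return "execution_error"
    if 200 <= status_code < 300:
        return "ok"
    return "unexpected_error"
```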

Database Migrations

N/A

Env Config

N/A

Relevant Docs

N/A

Related Issues or PRs

Dependencies Versions

Notes on Testing

Set an invalid tool image tag in backend .env:
```
STRUCTURE_TOOL_IMAGE_TAG=0.0.999
```
Restart backend and worker containers
Execute an API deployment that uses the Structure tool
Verify the response returns HTTP 409 with the error message containing "not found in container registry"

Screenshots

Checklist

I have read and understood the Contribution Guidelines.

coderabbitai · 2026-01-27T11:13:29Z

Summary by CodeRabbit

Release Notes

Bug Fixes
- Improved error handling for missing tool container images. The system now correctly identifies when a tool image is not found in the registry and returns appropriate server-side error responses with detailed error messages.
- Enhanced error detection across API endpoints to distinguish between server-side errors (missing tools) and client-side errors (invalid requests), returning correct HTTP status codes.
New Features
- Added more granular error reporting with error codes for better diagnostic information when tool deployments fail.

Walkthrough

Add end-to-end detection and propagation of "tool image not found in registry" across API, runner, tool-sandbox, and workflow layers; introduce new exceptions and error_code field, map these conditions to a 500 server error, and adjust execution/status handling and metadata pruning.

Changes

Cohort / File(s)	Summary
API exceptions & views `backend/api_v2/exceptions.py`, `backend/api_v2/api_deployment_views.py`	Add `ToolNotFoundInRegistry` (500) and `TOOL_NOT_FOUND_PATTERNS`; add `contains_tool_not_found_error(response)`; centralize detection in POST/GET flows, map tool-not-found -> 500, other exec errors -> 422; add result_acknowledged and completion/metadata pruning logic; extend `PresignedURLFetchError` constructors.
Runner client & exceptions `runner/src/unstract/runner/exception.py`, `runner/src/unstract/runner/clients/docker_client.py`, `runner/src/unstract/runner/runner.py`	Add `ToolImageNotFoundError`; detect image-pull failures (ImageNotFound, API 404, pull stream "error") and raise it; catch and convert to structured runner error responses with `error_code`; adjust run flow, command formatting, sidecar handling, and cleanup.
Tool sandbox DTO & helpers `unstract/tool-sandbox/src/unstract/tool_sandbox/dto.py`, `.../exceptions.py`, `.../helper.py`	Add `error_code` field to `RunnerContainerRunResponse`; add `ToolNotFoundInRegistryError` exception; parse runner HTTP/json error bodies and embedded responses to raise `ToolNotFoundInRegistryError` when `error_code` matches.
Workflow manager / execution helpers `backend/workflow_manager/workflow_v2/file_execution_tasks.py`, `unstract/workflow-execution/src/unstract/workflow_execution/tools_utils.py`	Detect and short-circuit on tool-not-found exceptions: log and return standardized error results; prevent retries for tool-not-found; preserve and propagate original processing errors during finalization.

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant API as API Deployment
    participant Sandbox as Tool-Sandbox
    participant Runner as Runner/Docker
    participant Registry as Container Registry

    Client->>API: POST execution request
    activate API
    API->>Sandbox: request tool run
    activate Sandbox
    Sandbox->>Runner: HTTP request to runner
    activate Runner
    Runner->>Registry: pull tool image
    activate Registry
    Registry-->>Runner: ImageNotFound / 404 / pull stream error
    deactivate Registry
    Runner->>Runner: raise ToolImageNotFoundError -> return structured error (error_code)
    Runner-->>Sandbox: HTTP error with JSON including error_code
    deactivate Runner
    Sandbox->>API: raise ToolNotFoundInRegistryError (propagate)
    deactivate Sandbox
    API->>API: contains_tool_not_found_error() detects pattern
    API-->>Client: 500 Internal Server Error with status/result
    deactivate API

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 45.45% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately identifies the main change: HTTP 409 return status for tool image not found errors, which aligns with the primary objective of the PR.
Description check	✅ Passed	The PR description comprehensively covers all required template sections with detailed explanations of What, Why, How, and includes explicit confirmation of no breaking changes for normal workflows.

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@runner/src/unstract/runner/clients/docker_client.py`:
- Around line 177-205: The pull-stream error handling in the client.api.pull
loop currently raises ToolImageNotFoundError for any stream "error"; update the
logic in the loop that inspects line.get("error") so it discriminates based on
errorDetail and message: extract error_msg = line.get("error") and err_detail =
line.get("errorDetail", {}), then if err_detail.get("code") == 404 or "manifest
unknown" in error_msg.lower() or "not found" in error_msg.lower() raise
ToolImageNotFoundError(repository, image_tag); otherwise log the full error
(include error_msg and err_detail) and re-raise or propagate a generic exception
(so auth 401, rate-limit 429, network errors are not misclassified as
not-found); keep using image_name_with_tag, repository, image_tag and
ToolImageNotFoundError to locate the code to change.
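
The discrimination logic requested above could be sketched as follows. This is a hedged illustration rather than the code in `docker_client.py`: the `ToolImageNotFoundError` constructor signature is inferred from the comment, while `ImagePullError` and `check_pull_stream_line` are hypothetical names introduced for the example.

```python
import logging
from typing import Any, Mapping

logger = logging.getLogger(__name__)


class ToolImageNotFoundError(Exception):
    """Stand-in for the runner exception; the constructor signature is assumed."""

    def __init__(self, repository: str, image_tag: str) -> None:
        super().__init__(
            f"Tool image {repository}:{image_tag} not found in container registry"
        )
        self.repository = repository
        self.image_tag = image_tag


class ImagePullError(Exception):
    """Hypothetical generic error for pull failures that are not 'not found'."""


def check_pull_stream_line(
    line: Mapping[str, Any], repository: str, image_tag: str
) -> None:
    """Raise a suitable exception if a decoded pull-stream line reports an error."""
    error_msg = line.get("error")
    if not error_msg:
        return
    err_detail = line.get("errorDetail") or {}
    lowered = str(error_msg).lower()
    if (
        err_detail.get("code") == 404
        or "manifest unknown" in lowered
        or "not found" in lowered
    ):
        raise ToolImageNotFoundError(repository, image_tag)
    # Auth (401), rate-limit (429) and network errors must not look like not-found.
    logger.error("Image pull failed: %s (detail: %s)", error_msg, err_detail)
    raise ImagePullError(str(error_msg))
```

With the Docker SDK for Python, such a function would typically be called on each dict yielded by `client.api.pull(repository, tag=image_tag, stream=True, decode=True)`.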

🧹 Nitpick comments (1)

backend/api_v2/api_deployment_views.py (1)

54-88: Prefer explicit error_code checks over string matching.
Since downstream results now carry error_code, checking it directly avoids brittle text matching and future message changes.

♻️ Suggested refinement

-    if isinstance(response, dict):
-        error = response.get("error")
-        result = response.get("result", [])
-    else:
-        error = getattr(response, "error", None)
-        result = getattr(response, "result", []) or []
+    if isinstance(response, dict):
+        error = response.get("error")
+        error_code = response.get("error_code")
+        result = response.get("result", [])
+    else:
+        error = getattr(response, "error", None)
+        error_code = getattr(response, "error_code", None)
+        result = getattr(response, "result", []) or []
+
+    if error_code == ToolNotFoundInRegistry.ERROR_CODE:
+        return True
...
-            if isinstance(item, dict):
-                file_error = item.get("error", "")
+            if isinstance(item, dict):
+                if item.get("error_code") == ToolNotFoundInRegistry.ERROR_CODE:
+                    return True
+                file_error = item.get("error", "")
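
Put together, the suggested refinement might look like the self-contained sketch below. It is not the code from `api_deployment_views.py`: the `TOOL_NOT_FOUND_ERROR_CODE` value, the contents of `TOOL_NOT_FOUND_PATTERNS`, and the `_field` and `_is_tool_not_found` helpers are assumptions made for illustration.

```python
from typing import Any

# Assumed values; the PR's actual constant and patterns may differ.
TOOL_NOT_FOUND_ERROR_CODE = "TOOL_IMAGE_NOT_FOUND"
TOOL_NOT_FOUND_PATTERNS = ("not found in container registry",)


def _field(obj: Any, name: str, default: Any = None) -> Any:
    """Read a field from either a dict or an attribute-style object."""
    if isinstance(obj, dict):
        return obj.get(name, default)
    return getattr(obj, name, default)


def _is_tool_not_found(obj: Any) -> bool:
    """Prefer the explicit error_code; fall back to message matching."""
    if _field(obj, "error_code") == TOOL_NOT_FOUND_ERROR_CODE:
        return True
    message = str(_field(obj, "error") or "").lower()
    return any(pattern in message for pattern in TOOL_NOT_FOUND_PATTERNS)


def contains_tool_not_found_error(response: Any) -> bool:
    """Check the top-level response and each file-level result."""
    if _is_tool_not_found(response):
        return True
    results = _field(response, "result") or []
    if not isinstance(results, (list, tuple)):
        return False
    return any(_is_tool_not_found(item) for item in results)
```

Keeping the string patterns as a fallback means responses produced before `error_code` existed are still detected, while new responses no longer depend on the exact wording of the message.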

runner/src/unstract/runner/clients/docker_client.py

chandrasekharan-zipstack

LGTM for the most part. Please confirm whether the 409 status code should be 500 here instead:

runner -> backend: it can be 409
backend -> user: it needs to be 500
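
The split suggested here, a 409 between internal services and a 500 towards the API user, could be sketched like this. It is a framework-free illustration: the `ToolNotFoundInRegistry` class is a simplified stand-in for the backend exception, and `translate_runner_status` is a hypothetical helper.

```python
from http import HTTPStatus


class ToolNotFoundInRegistry(Exception):
    """Simplified stand-in for the backend API exception (no web framework)."""

    # Status the API user would see.
    status_code = HTTPStatus.INTERNAL_SERVER_ERROR


def translate_runner_status(runner_status: int, detail: str) -> None:
    """Hypothetical boundary: a 409 from the runner becomes a 500 for users."""
    if runner_status == HTTPStatus.CONFLICT:
        raise ToolNotFoundInRegistry(detail)
```

One reading of the request is that a missing tool image is a platform-side problem the caller cannot fix, so a 5xx is the more fitting signal at the public boundary.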

backend/api_v2/api_deployment_views.py

github-actions · 2026-01-28T08:12:28Z

Test Results

Summary

✅ Runner Tests: 11 passed, 0 failed (11 total)
✅ SDK1 Tests: 66 passed, 0 failed (66 total)

Runner Tests - Full Report

filepath	function	passed	SUBTOTAL
runner/src/unstract/runner/clients/test_docker.py	test_logs	1	1
runner/src/unstract/runner/clients/test_docker.py	test_cleanup	1	1
runner/src/unstract/runner/clients/test_docker.py	test_cleanup_skip	1	1
runner/src/unstract/runner/clients/test_docker.py	test_client_init	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_image_exists	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_image	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_container_run_config	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_container_run_config_without_mount	1	1
runner/src/unstract/runner/clients/test_docker.py	test_run_container	1	1
runner/src/unstract/runner/clients/test_docker.py	test_get_image_for_sidecar	1	1
runner/src/unstract/runner/clients/test_docker.py	test_sidecar_container	1	1
TOTAL		11	11

SDK1 Tests - Full Report

sonarqubecloud · 2026-01-28T08:12:35Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

pk-zipstack added 5 commits January 27, 2026 13:51

Return HTTP 409 when tool container not present

c34d0a1

Moved get_container_run_config() and _get_sidecar_container_config() …

35c5766

…calls inside the try block in runner.py

Preserved error from runner to backend

42260b7

preserve original error message from runner

26ad3dd

Revert changes to uv.lock

5baac9c

pk-zipstack requested review from a team, chandrasekharan-zipstack, muhammad-ali-e and ritwik-g January 27, 2026 11:13

pk-zipstack self-assigned this Jan 27, 2026

[pre-commit.ci] auto fixes from pre-commit.com hooks

2a4188f

for more information, see https://pre-commit.ci

coderabbitai bot reviewed Jan 27, 2026

runner/src/unstract/runner/clients/docker_client.py

chandrasekharan-zipstack reviewed Jan 27, 2026

backend/api_v2/api_deployment_views.py

pk-zipstack and others added 3 commits January 28, 2026 13:35

Changed 409 conflict to 500 internal server error

7aa9ed1

moved contains_tool_not_found_error to api_v2 of backend

71db84b

[pre-commit.ci] auto fixes from pre-commit.com hooks

9db220d

for more information, see https://pre-commit.ci
