Skip to content

Conversation

kacpersaw
Copy link
Contributor

Closes #119

This PR adds error handling and retry functionality to the log streaming by adding exponential backoff retry logic for failed log operations.

Changes

  • Added retry mechanism with exponential backoff - automatically retries failed log operations, starting at 1 second delay and doubling up to 30 seconds max
  • Retry logic for failed API operations - triggers retries for PostLogSource and Agent dRPC client failures to handle connection issues
  • Per-agent retry state tracking - built retry state struct for each agent with individual timer and delay management
  • Automatic retry cleanup - clears retry state on successful operations and proper cleanup on agent deletion
  • Tests

With this PR, users will see more reliable logs from pod startup in their Coder workspace, including pod events before container startup that were previously not visible.

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Pod events before container startup are not visible in Coder UI
1 participant