@mpaulucci
Collaborator

Motivation
All failing workflows show "Unknown job" in the Slack alert message.

Description
The culprit was how we gathered the job list: we queried the generic listJobsForWorkflowRun, which sometimes returns an empty set for the latest attempt of a run. By switching to the attempt-specific endpoint and paging it ourselves, the script now pulls the jobs that actually failed before formatting the message, so the alert should finally show the real job names instead of defaulting to “Unknown job.”
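As a rough illustration of the approach described above, the sketch below pages through the attempt-specific endpoint by hand and then picks out the failed job names. It is not the code from this PR: the helper names `listJobsForAttempt` and `failedJobNames` are hypothetical, and `client` is assumed to be an Octokit-style object such as the `github` object that actions/github-script exposes.

```javascript
// Sketch only: helper names are hypothetical, not taken from the PR.
// `client` is assumed to be an Octokit-style object (for example the
// `github` object available inside an actions/github-script step).
async function listJobsForAttempt(client, { owner, repo, run_id, attempt_number }) {
  const perPage = 100; // largest page size the REST API accepts
  const jobs = [];
  for (let page = 1; ; page += 1) {
    const { data } = await client.rest.actions.listJobsForWorkflowRunAttempt({
      owner,
      repo,
      run_id,
      attempt_number,
      per_page: perPage,
      page,
    });
    jobs.push(...data.jobs);
    // A short (or empty) page means there is nothing left to fetch.
    if (data.jobs.length < perPage) break;
  }
  return jobs;
}

// Keep only the failed jobs; fall back to the placeholder when none are found.
function failedJobNames(jobs) {
  const names = jobs
    .filter((job) => job.conclusion === 'failure')
    .map((job) => job.name);
  return names.length > 0 ? names : ['Unknown job'];
}
```

Inside a workflow step, the run ID and attempt number would typically come from the triggering event payload; which fields the real script reads is not shown in this conversation.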

@mpaulucci mpaulucci requested a review from a team as a code owner October 29, 2025 12:07
Copilot AI review requested due to automatic review settings October 29, 2025 12:07
@github-actions github-actions bot added the L1 Ethereum client label Oct 29, 2025
Contributor

Copilot AI left a comment

Pull Request Overview

This PR improves the resilience and error handling of GitHub Actions workflows. The changes prevent matrix job failures from canceling other jobs and enhance the failure alert system to handle workflow retries more accurately.

  • Added fail-fast: false to the daily snapsync workflow to allow all matrix jobs to run independently
  • Refactored the failure alert script to fetch jobs per attempt with fallback logic for better retry handling
  • Switched from pagination utility to manual pagination for more control over job collection
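The "fallback logic" mentioned in the list above is not spelled out in this conversation. One plausible shape, offered purely as an assumption, is to try several job sources in order and keep the first one that returns anything. The helper name `firstNonEmptyJobs` is hypothetical.

```javascript
// Sketch only: one possible fallback strategy, not the PR's actual code.
// `sources` is an ordered array of async functions, each resolving to a
// (possibly empty) array of job objects.
async function firstNonEmptyJobs(sources) {
  for (const fetchJobs of sources) {
    try {
      const jobs = await fetchJobs();
      if (jobs.length > 0) return jobs;
    } catch (err) {
      // Treat a failed lookup like an empty one and move on to the next source.
      console.warn(`job lookup failed: ${err.message}`);
    }
  }
  return []; // every source was empty or failed
}
```

A caller could pass the attempt-specific lookup first and a broader lookup second, so the alert only falls back to a placeholder name when every source comes back empty.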

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File | Description
.github/workflows/daily_snapsync.yaml | Added fail-fast: false strategy to prevent matrix job cancellation
.github/workflows/common_failure_alerts.yaml | Refactored job collection logic to support per-attempt job fetching with fallback and manual pagination


@mpaulucci mpaulucci changed the title from ci(l1): Fix "Unknown job" job message when workflow fails. to ci(l1): fix "Unknown job" job message when workflow fails. Oct 29, 2025

Labels

L1 Ethereum client

3 participants