Implement CI improvements outlined in #3925 #5301

galargh · 2024-05-31T13:09:03Z

Resolves #3925

Hi! I've noticed that you identified a couple of issues with your CI setup that you wanted addressed in #3925, and I'd love to help :)

As per the issue description, I included all of the improvements in a single PR - I aimed to contain each change in a separate commit to help you review them, but I'm also happy to split them up into separate PRs if it makes it easier for you.

Breakdown

For all workflows, include the workflow file itself as part of the files that trigger it

✅ e27c940
I ensured that all the workflows that use paths filters include the workflow file itself in the filter value.

Don't use $default-branch. This variable is meant to be used only in templates. Using it in workflows is a mistake.

✅ af96f4d
I replaced each occurrence of $default-branch with main. I've also tried to search for other similar instances of template syntax usage but didn't find any.

Parallelize the "lint" workflow: linting the code and linting the website should be done in parallel.

✅ c56528a
I parallelized the Lint workflow using the matrix strategy. After the changes, the workflow consists of 2 jobs, Lint and Lint (docs). I noticed that linting root requires building custom eslint plugins while it's not necessary for the docs. I also decided to check dependency versions only when linting root.

When I looked at the workflow, I noticed that linting was duplicated in three other workflows: Check docs site, Comment on lint failure and Pre-release tests.

❓ ac06514
I removed linting from the Check docs site workflow. After the changes, it only builds the docs. Let me know if that's OK.

❓ ac06514
I also modified Comment on lint failure workflow. As I understand, it was triggered on pull_request_target to enable commenting on a pull request. The same can be achieved with a workflow triggered by other workflow_runs. I modified the workflow to be triggered whenever Lint completes. The updated workflow continues only if the Lint workflow was triggered by a pull request and if it failed during one of the linting steps. This eliminates the duplication of work while maintaining the desired functionality.

Improve running time, if possible. Review how caches are used, consider moving some things to only run on main or in scheduled workflows, etc.

🟡 f750bb7
I only performed a basic review of the cache setup. The one thing I noticed was that we might be overriding the pnpm cache sometimes - e.g. when we build only a single package, the resulting cache might not have the dependencies other packages require. To counteract that, I extracted the env setup (pnpm install, node install, cache handling) to a helper action. Then, I applied the action wherever the pnpm cache was used. I configured it to work in restore-only mode by default, and I added a new Populate cache workflow, which is responsible for populating the cache. All in all, I wouldn't expect this to speed up the runtimes significantly, but it does create an environment for more intentional cache use.

🟡 8927236
I did a light review and cleanup of workflow triggers, but there's definitely more room for improvement here.

As for other ideas for runtime improvements, setting up the cache restore to work with partial key matches might be an interesting idea. It should be simpler to implement now that there's only a single place that deals with the pnpm cache.

The usage of self-hosted runners might be interesting, too. Especially as a means to increase the availability of the runners (e.g. package builds might require ~70 runners for changes lower down the stack while the concurrency limit in GitHub on a free plan is 20).

Expose built packages (.tar.gz files that can be installed) as artifacts.

🔴 I didn't address this in this PR. Would you mind clarifying the intention here? Is it to package up the dist dirs after package builds and expose them as artifacts? If so, I'd be happy to add it to the PR or create a follow-up one.

Consider using templates to make the workflows more maintainable.

🟢 8de529a
I'd like to propose an alternative to templating. I turned all the package build (except core) workflows into a single one that uses a build matrix. The matrix is dynamically constructed at the beginning of the run to determine which packages need to be built and tested. I retrieve this information from the pnpm lockfile because it seemed simple enough. In this iteration, I decided to only look at the direct package dependencies, which seems to be sufficient as well.

The advantage of this approach is that there is only one workflow file to maintain, the dependencies between packages do not have to be hardcoded (e.gI discovered that hardhat-truffle5 depends on hardhat-web3, but modifying the latter wouldn't trigger the former's build before), and new packages will be pulled in automatically (e.g. I discovered that hardhat-toolbox-viem wasn't being built before). The tradeoff is that the package builds are now delayed by the package discovery job (< 10s). Also, there is only one workflow to inspect in the Actions tab, which means fewer switches when inspecting a workflow run, but also, the workflow run history is collective.

➕ 322f13c
Finally, I updated the GitHub Actions versions to ensure the latest ones are used. This will help clear any node version deprecation warnings you might have been seeing.

That's it for now; let me know what you think :)

Testing

Create a PR from this branch to the fork's default branch and verify all the workflows run as expected - Test the implementation of CI improvements galargh/hardhat#1
Merge the PR and verify all the workflows run as expected - galargh@ad02d41
Modify a subset of packages, and verify only them and those that depend on them are tested in a PR - Touch a couple of packages galargh/hardhat#2
Merge the PR modifying a subset of packages and verify only them and those that depend on them are tested - galargh@93944c6
Introduce a linting error and verify the comment is correctly posted on the PR - Introduce a linting error galargh/hardhat#4
Merge a PR with a lint error and verify the comment workflow is correctly skipped - https://github.com/galargh/hardhat/actions/runs/9315362156
Verify workflows triggered on pull_request can use cache populated on PR branch
Verify workflows triggered on pull_request can use cache populated on default branch
Verify workflows triggered on push can use cache populated on the default branch

changeset-bot · 2024-05-31T13:09:07Z

⚠️ No Changeset found

Latest commit: 684653b

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

vercel · 2024-05-31T13:09:07Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
hardhat	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jul 29, 2024 1:13pm

alcuadrado · 2024-05-31T13:19:29Z

Hey! Thanks for submitting this pr! I'll take a look at this either today or over the weekend

galargh · 2024-05-31T13:24:57Z

No rush, take your time :) And if there's anything that needs some more clarification, let me know

alcuadrado · 2024-06-05T20:49:58Z

This is really interesting. I'm still processing and reading through some parts of it, as this makes me read more docs than a usual PR, but there are some really good things here.

The commit that I'm not sure it works as intended is the one modifying Comment on lint failure. This runs on pull_request_target to improve on a typical situation where an external contributor sends a PR but it doesn't pass the linter, and they don't notice it until we "approve and run".

I'm still reading about the cache strategy, that's pretty cool.

The matrix thing, I actually had done something really similar in our main development branch! The implementation is different, but I think it's the same intention: https://github.com/NomicFoundation/hardhat/blob/v-next/.github/workflows/v-next-ci.yml

alcuadrado · 2024-06-05T20:50:17Z

I'll continue reviewing this tomorrow

galargh · 2024-06-08T08:40:37Z

This is really interesting. I'm still processing and reading through some parts of it, as this makes me read more docs than a usual PR, but there are some really good things here.

I can definitely relate, GitHub Docs do have a tendency to pull one in 😅

The commit that I'm not sure it works as intended is the one modifying Comment on lint failure. This runs on pull_request_target to improve on a typical situation where an external contributor sends a PR but it doesn't pass the linter, and they don't notice it until we "approve and run".

I see! Thank you for providing this additional bit of context. It definitely makes much more sense to me now, and I understand the original intent better.

My objective was to secure the repository against security breaches caused by script injection, and in this case the easiest way to achieve that was by avoiding pull_request_target usage. However, that's not the only way.

🆕 1911acf
I have now restored the pull_request_target trigger in the linting workflow, but I added a couple of changes to help minimise the possible attack vector.

I split the lint and comment jobs so that we can grant minimal permissions (just contents read) to the job that checks out unvetted code.
I ensured the checkouts do not persist any git credentials on disk.
I ensured that sensitive files (such as actions definitions, for example) are sourced from the default branch rather than the PR merge commit.

In this setup, it would still be possible to execute arbitrary code on a runner, but it shouldn't have access to sensitive data. The GitHub token can only read the repo which is public already anyway, and to even obtain it one would have to scrape the memory. As for secrets, GitHub claims that unreferenced ones are wiped from memory altogether.

I tested the new setup on a PR in my fork. The workflow run triggered on pull_request_target successfully posted a comment to the PR, while the one triggered on pull_request skipped that step as expected.

The matrix thing, I actually had done something really similar in our main development branch! The implementation is different, but I think it's the same intention: https://github.com/NomicFoundation/hardhat/blob/v-next/.github/workflows/v-next-ci.yml

That's awesome! Yes, that looks exactly like what I was going for, too. One important thing I tried to preserve on this branch was the "laziness" of the package builds since there are quite a few combinations to consider, but the principle is the same 😄

Thank you so much for spending the time reviewing my proposals, greatly appreciated 🙇

…ments

galargh · 2024-07-31T13:27:23Z

I'm closing this PR as we agreed to:

Split it into smaller chunks to make it easier to review & merge.
Target the v-next branch instead.

galargh added 8 commits May 31, 2024 09:36

Include the workflow files in the paths trigger filters

e27c940

Do not use template syntax in the branches trigger filters

af96f4d

Parallelize the Lint workflow

c56528a

Do not duplicate linting work

ac06514

Use matrix strategy to test packages

8de529a

Upgrade outdated GitHub actions

322f13c

Save cache only in Populate cache workflow

f750bb7

Review workflow triggers

8927236

github-actions bot assigned kanej May 31, 2024

github-actions bot added the status:triaging label May 31, 2024

vercel bot deployed to Preview May 31, 2024 13:10 View deployment

alcuadrado self-requested a review May 31, 2024 13:19

alcuadrado assigned alcuadrado and unassigned kanej Jun 5, 2024

Restore pull_request_target trigger in lint workflow

1911acf

vercel bot deployed to Preview June 8, 2024 08:41 View deployment

Merge remote-tracking branch 'origin/main' into chore/3925-ci-improve…

6e86198

…ments

vercel bot deployed to Preview June 25, 2024 09:00 View deployment

vercel bot deployed to Preview July 11, 2024 20:01 View deployment

vercel bot deployed to Preview July 12, 2024 06:01 View deployment

Merge remote-tracking branch 'origin/main' into chore/3925-ci-improve…

684653b

…ments

galargh force-pushed the chore/3925-ci-improvements branch from e69f90b to 684653b Compare July 29, 2024 13:11

vercel bot deployed to Preview July 29, 2024 13:13 View deployment

galargh closed this Jul 31, 2024

galargh mentioned this pull request Jul 31, 2024

ci(v3): update GitHub actions and review workflow triggers #5559

Merged

3 tasks

galargh deleted the chore/3925-ci-improvements branch July 31, 2024 13:28

This was referenced Aug 1, 2024

ci(v2): remove duplicated lint jobs/steps #5562

Merged

ci(v3): improve usage of github actions cache #5563

Merged

ci(v3): test and lint v-next packages lazily #5564

Merged

ci(v2): parameterise v2 packages ci to minimise duplication #5565

Merged

github-actions bot locked as resolved and limited conversation to collaborators Oct 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement CI improvements outlined in #3925 #5301

Implement CI improvements outlined in #3925 #5301

galargh commented May 31, 2024 •

edited

Loading

changeset-bot bot commented May 31, 2024 •

edited

Loading

vercel bot commented May 31, 2024 •

edited

Loading

alcuadrado commented May 31, 2024

galargh commented May 31, 2024

alcuadrado commented Jun 5, 2024

alcuadrado commented Jun 5, 2024

galargh commented Jun 8, 2024 •

edited

Loading

galargh commented Jul 31, 2024

Implement CI improvements outlined in #3925 #5301

Implement CI improvements outlined in #3925 #5301

Conversation

galargh commented May 31, 2024 • edited Loading

Breakdown

Testing

changeset-bot bot commented May 31, 2024 • edited Loading

⚠️ No Changeset found

vercel bot commented May 31, 2024 • edited Loading

alcuadrado commented May 31, 2024

galargh commented May 31, 2024

alcuadrado commented Jun 5, 2024

alcuadrado commented Jun 5, 2024

galargh commented Jun 8, 2024 • edited Loading

galargh commented Jul 31, 2024

galargh commented May 31, 2024 •

edited

Loading

changeset-bot bot commented May 31, 2024 •

edited

Loading

vercel bot commented May 31, 2024 •

edited

Loading

galargh commented Jun 8, 2024 •

edited

Loading