-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
requeue recently fetched components #390
Comments
I really like the idea of doing it lazily, even if we end up running a job as well. |
Updating with some thinking... The webhooks from the crawler to the service tell the service about all new tool runs. In that run output we have the version of the tools that were used. The service should just keep a running max. Then when it sees a definition that has an older tool, it can requeue that component. Take care to handle startup conditions where the service does not yet have a value for a given tool. Also note that some tools can have different version numbers depending on the type/provider being crawled. For example, the ClearlyDefined tool for Crates may be 1.2.0 and fro NPMs may be 1.9.0. So the key for the table is type/provider/tools |
@AlexWebYourmind this is a good issue to have a service side dev look at. |
Maybe once we determine that the fetched definition represents an update opportunity, we should also set a response header for the CDN to not cache it, and/or signal the caller (maybe an X-header, or another seemingly out-of-band indication). |
@ignacionr that makes sense. good call |
We have cases now where the definitions are being built from reasonably old harvested data and as a result the definitions are substantially out of date. Previously we talked about having the service check the current crawler tool levels and requeue any definition based on older tools. We'd still return the old definition but soon enough the new one would be ready.
Thoughts:
Summary:
thoughts @dabutvin ?
The text was updated successfully, but these errors were encountered: