Solver pod failing because of expired SSL certificate for https://tensorflow.pypi.thoth-station.ninja/index/manylinux2010/AVX2/simple/ #5195
Labels
kind/bug
Categorizes issue or PR as related to a bug.
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
sig/stack-guidance
Categorizes an issue or PR as relevant to SIG Stack Guidance.
triage/accepted
Indicates an issue or PR is ready to be actively worked on.
Describe the bug
When trying to solve the
torchvision
package on the stage environment, some of the solver pods fail because of an expired SSL certificate for our tensorflow package index https://tensorflow.pypi.thoth-station.ninja/index/manylinux2010/AVX2/simple/.The expired certificate causes a too large number of retries to reach the index URL and thus the pod to exceed the amount of time it can run on its node.
As tensorflow builds are no longer maintained by Thoth, we should either delete this package index or renew the certificate to allow fetching the necessary dependencies from there during solver runs.
To Reproduce
Schedule
torchvision
to be solved on the stage user API.Expected behavior
Solver can fetch all necessary dependencies.
Additional context
From one of the solver pods in stage environment:
The text was updated successfully, but these errors were encountered: