-
Notifications
You must be signed in to change notification settings - Fork 26
ARAX-Shepherd is failing TestCase_4 en masse #2676
Description
In the latest test run of the Refactor:
https://arax.ncats.io/?systest=281
We are failing a lot of Test Case 4 that BTE and Aragorn apparently pass:
And apparently we are causing the ARS results to fail as well.
Looking at the results:
https://arax.ci.transltr.io/?r=56b0f4a1-ba29-4c96-8f7f-93a20c151942
It would appear that while Aragorn and BTE are only returning 4 answers (and thus easy for them to duck this huge list of "NeverShow"s, ARAX is returning 500 answers, and thus more vulnerable to NeverShows
https://arax.ci.transltr.io/?r=b6469a57-3f6d-43ef-94a8-75b17282a2f1
For example, MMP3 is a NeverShow
It is result number 330.
It would appear that Retriever is providing this edge.
Since Retriever is the primary knowledge source, this is presumably a subclass reasoning edge. But due to bug #2662 we cannot see the support graph.
I suspect resolving this problem would result in much better Refactor test scores for ARAX-Shepherd as well as ARS (since many FAILs would be turned into PASSes).
We pass these in CI with legacy KPs:
Only 14 results in legacy CI:
https://arax.ci.transltr.io/?r=5ffc9544-4f84-4d6f-aafc-73cd59beae98
and does not include MMP3 et al.
It would seem that ARAX-Shepherd is failing many tests because it is getting NeverShow information from Retriever
This would seem like an impactful mystery to solve....