ARAX-Shepherd is failing TestCase_4 en masse #2676

@edeutsch

In the latest test run of the Refactor:
https://arax.ncats.io/?systest=281

Image

We are failing many Test Case 4 tests that BTE and Aragorn apparently pass:

Image

And apparently we are causing the ARS results to fail as well.
Looking at the results:
https://arax.ci.transltr.io/?r=56b0f4a1-ba29-4c96-8f7f-93a20c151942

Image

It would appear that while Aragorn and BTE are only returning 4 answers (and thus can easily duck this huge list of "NeverShow"s), ARAX is returning 500 answers and is thus more vulnerable to NeverShows.

https://arax.ci.transltr.io/?r=b6469a57-3f6d-43ef-94a8-75b17282a2f1

Image Image

For example, MMP3 is a NeverShow.
It is result number 330.

Image
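To check this kind of thing across a whole response rather than by eye, something like the following rough sketch could help. It assumes the response has been downloaded as TRAPI JSON (with `message.results[].node_bindings`), and that the test's NeverShow list is available as a set of CURIEs. The function name, the `EX:` CURIEs, and the tiny example response are all made up for illustration and are not taken from the actual test harness.

```python
def find_never_show_hits(response, never_show_ids):
    """Return (rank, curie) pairs for results bound to a NeverShow node.

    Rank is the 1-based position of the result in message.results.
    """
    hits = []
    results = response.get("message", {}).get("results", [])
    for rank, result in enumerate(results, start=1):
        # node_bindings maps each query node key to a list of bindings
        for bindings in result.get("node_bindings", {}).values():
            for binding in bindings:
                if binding.get("id") in never_show_ids:
                    hits.append((rank, binding["id"]))
    return hits


# Hypothetical stand-in for a real response; the CURIEs are placeholders.
example_response = {
    "message": {
        "results": [
            {"node_bindings": {"n0": [{"id": "EX:DISEASE"}], "n1": [{"id": "EX:GENE_A"}]}},
            {"node_bindings": {"n0": [{"id": "EX:DISEASE"}], "n1": [{"id": "EX:MMP3"}]}},
        ]
    }
}
never_show = {"EX:MMP3", "EX:GENE_B"}

print(find_never_show_hits(example_response, never_show))
```

Run against the real ARAX response, the ranks it reports would show how deep in the 500 results the NeverShows sit, which may matter if the test only looks at the top N.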

It would appear that Retriever is providing this edge.
Since Retriever is the primary knowledge source, this is presumably a subclass reasoning edge. But due to bug #2662 we cannot see the support graph.
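Until the support graph is visible, the provenance can at least be read straight from the knowledge graph. Below is a minimal sketch, assuming the response follows the TRAPI layout where each edge in `message.knowledge_graph.edges` carries a `sources` list whose entries have `resource_id` and `resource_role`. The function name, edge IDs, CURIEs, and infores identifiers in the example are hypothetical.

```python
def primary_sources_for_node(response, curie):
    """Map edge id -> primary knowledge source for edges touching `curie`."""
    edges = (
        response.get("message", {})
        .get("knowledge_graph", {})
        .get("edges", {})
    )
    found = {}
    for edge_id, edge in edges.items():
        # Only consider edges where the node is the subject or the object
        if curie not in (edge.get("subject"), edge.get("object")):
            continue
        for source in edge.get("sources", []):
            if source.get("resource_role") == "primary_knowledge_source":
                found[edge_id] = source.get("resource_id")
    return found


# Hypothetical knowledge graph fragment; all identifiers are placeholders.
example_kg_response = {
    "message": {
        "knowledge_graph": {
            "edges": {
                "e1": {
                    "subject": "EX:MMP3",
                    "object": "EX:DISEASE",
                    "sources": [
                        {"resource_id": "infores:example-retriever",
                         "resource_role": "primary_knowledge_source"},
                        {"resource_id": "infores:example-aggregator",
                         "resource_role": "aggregator_knowledge_source"},
                    ],
                },
                "e2": {
                    "subject": "EX:GENE_A",
                    "object": "EX:DISEASE",
                    "sources": [
                        {"resource_id": "infores:example-kp",
                         "resource_role": "primary_knowledge_source"},
                    ],
                },
            }
        }
    }
}

print(primary_sources_for_node(example_kg_response, "EX:MMP3"))
```

Running this for each NeverShow node would show whether the offending edges all share the same primary source.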

I suspect resolving this problem would result in much better Refactor test scores for ARAX-Shepherd as well as ARS (since many FAILs would be turned into PASSes).

We pass these in CI with legacy KPs:

Image

Only 14 results in legacy CI:
https://arax.ci.transltr.io/?r=5ffc9544-4f84-4d6f-aafc-73cd59beae98

That result set does not include MMP3 et al.

It would seem that ARAX-Shepherd is failing many tests because it is getting NeverShow results from Retriever.

This would seem like an impactful mystery to solve....
