Unify llm judges into a single prepare file #1696
base: main
Hi @martinscooper,
How does that sound to you?
Then, if a user needs a more specific source for the context field, they could specify it manually for their use case.
from unitxt import evaluate, load_dataset
from unitxt.blocks import Task, TaskCard
from unitxt.llm_as_judge import CreateYesNoCriteriaFromString
from unitxt.loaders import LoadFromDictionary

data = {
"test": [
{
"question": "How is the weather?",
Why was this example changed? Is this intentional?
No, I will remove it
Signed-off-by: Martín Santillán Cooper <[email protected]>
67103ef to 9d77d49
@yoavkatz @elronbandel I applied the fix.
This PR moves the judges in prepare/metrics/llm_as_judge/direct/llama_3_3_70b_instruct_adherence_completeness.py to prepare/metrics/llm_as_judge/llm_as_judge.py so that:
llm_as_judge_contants.py
@lilacheden the default context fields of the adherence metric's instructions field seem a bit too specific. Do you think we could simplify them?
@elronbandel I tried setting those context field values using the square bracket notation, but it says it is malformed. Could you remind me if dictionaries are supported there?
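
To make the context-field discussion in this thread concrete, here is a minimal plain-Python sketch of the idea: a judge falls back to a simple default set of context fields, and a user who needs a more specific source passes a mapping by hand. It does not use the unitxt API. `resolve_context`, `DEFAULT_CONTEXT_FIELDS`, and the `instructions` value in the sample instance are hypothetical names and data chosen only for illustration.

```python
from typing import Any, Dict, List, Optional, Union

# Hypothetical default, not taken from unitxt: a deliberately simple fallback.
DEFAULT_CONTEXT_FIELDS: List[str] = ["question"]


def resolve_context(
    task_data: Dict[str, Any],
    context_fields: Optional[Union[List[str], Dict[str, str]]] = None,
) -> Dict[str, Any]:
    """Build the context shown to a judge from one instance's task data.

    A list keeps each field under its own name. A dict maps the name shown
    to the judge to the task_data key the value is read from.
    """
    if context_fields is None:
        context_fields = DEFAULT_CONTEXT_FIELDS
    if isinstance(context_fields, list):
        context_fields = {name: name for name in context_fields}
    return {shown: task_data[source] for shown, source in context_fields.items()}


# Illustrative instance; the "instructions" value is made up for this sketch.
instance = {"question": "How is the weather?", "instructions": "Answer briefly."}

# Default behaviour: only the simple default field is used.
default_context = resolve_context(instance)

# Manual override: the user names a more specific source for their use case.
custom_context = resolve_context(instance, {"user_request": "instructions"})
```

With this shape the default stays generic, and anything use-case specific lives in the caller's explicit mapping rather than in the metric definition.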