
Conversation

@jadie1 (Contributor) commented May 26, 2025

Enables comparative regression alignment to multi-attribute targets by adding a component step that, for each probe:

  1. Finds the attribute with the smaller predicted delta and removes it from the predicted scores
  2. Alters the alignment target to include only the more relevant (larger-delta) attribute
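The filtering step described above can be sketched roughly as follows. This is a minimal sketch assuming the shapes suggested by the snippets quoted later in this review; the function name and the example choice/attribute names are made up, not the exact PR code:

```python
import numpy as np

def filter_to_most_relevant_attribute(attribute_prediction_scores):
    """Keep only the attribute whose mean predicted delta between the
    two choices is largest; drop the other attribute from the scores."""
    choices = list(attribute_prediction_scores.keys())
    if len(choices) > 2:
        raise NotImplementedError(
            "Relevance filtering not implemented for more than two choices.")
    choiceA, choiceB = choices
    attributes = {key for inner in attribute_prediction_scores.values() for key in inner}

    # Find the attribute with the largest absolute mean delta between choices
    best_attr, max_delta = None, -np.inf
    for attr in attributes:
        delta = abs(np.array(attribute_prediction_scores[choiceA][attr]).mean()
                    - np.array(attribute_prediction_scores[choiceB][attr]).mean())
        if delta > max_delta:
            best_attr, max_delta = attr, delta

    # Rebuild the score dict with only the winning attribute
    return {choice: {best_attr: scores[best_attr]}
            for choice, scores in attribute_prediction_scores.items()}

# Hypothetical probe: affiliation separates the choices far more than merit
scores = {
    "Treat Patient A": {"affiliation": [0.9, 0.8], "merit": [0.5]},
    "Treat Patient B": {"affiliation": [0.1, 0.2], "merit": [0.6]},
}
filtered = filter_to_most_relevant_attribute(scores)
```

Here the affiliation delta (|0.85 - 0.15| = 0.7) beats the merit delta (0.1), so only affiliation survives in the filtered scores.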

Example call to run zeroshot:

run_align_system +experiment=phase2_june_collab/multi_attribute_pipeline_zeroshot_comparative_regression +alignment_target=june2025/ADEPT-June2025-affiliation_merit-0.0_0.0.yaml

Example call to run fewshot:

run_align_system +experiment=phase2_june_collab/multi_attribute_pipeline_fewshot_comparative_regression_loo +alignment_target=june2025/ADEPT-June2025-affiliation_merit-0.0_0.0.yaml

@jadie1 marked this pull request as ready for review May 26, 2025 20:49
@eveenhuis (Contributor) left a comment

Overall LGTM, I'd be fine with merging as is. Just a couple of nits/sanity checks

# Two or more attributes -> keep the one with the largest delta
else:
    if len(attribute_prediction_scores.keys()) > 2:
        raise RuntimeError("Relevance filtering not implemented for more than two choices.")

Nit: David recommended I raise NotImplementedError for a similar check

                                 alignment_target):
    # If there are two non-medical attributes, removes the one with smaller delta
    attributes = list({key for inner in attribute_prediction_scores.values() for key in inner})

I assume we're looping all choice predictions in case different choices have different predictions? Do we think that'll actually happen?

choiceA, choiceB = list(attribute_prediction_scores.keys())
max_delta = -np.inf
for attr in attributes:
    delta = abs(np.array(attribute_prediction_scores[choiceA][attr]).mean() - np.array(attribute_prediction_scores[choiceB][attr]).mean())

I know at one point we potentially had to handle both a single prediction and a list of predictions. Do we still need to do that? (If not yayyyyy simpler code :))
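For what it's worth, the `np.array(...).mean()` idiom in the quoted line already normalizes both shapes, so a single prediction and a list of predictions reduce the same way without extra branching (the values below are illustrative):

```python
import numpy as np

# np.array() wraps a scalar as a 0-d array and a list as a 1-d array;
# .mean() reduces both to a single float, so no special-casing is needed.
single = np.array(0.7).mean()            # one prediction
multiple = np.array([0.6, 0.8]).mean()   # a list of predictions
```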

max_delta = -np.inf
for attr in attributes:
    delta = abs(np.array(attribute_prediction_scores[choiceA][attr]).mean() - np.array(attribute_prediction_scores[choiceB][attr]).mean())
    if delta > max_delta:

This will only keep the first one if there's a tie. I think that's fine because I don't know what we'd actually do if there was a tie (plus that seems unlikely), but just double checking you didn't have a different set of assumptions
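To make the tie behavior concrete, here is a toy illustration (attribute names and deltas are made up): with a strict `>` comparison, a later attribute with an equal delta never displaces the current best, so the first attribute encountered wins the tie. Note that when `attributes` comes from a set comprehension, as in the quoted code, "first" is not a guaranteed order.

```python
import numpy as np

# Hypothetical tied deltas; a fixed list order is used for the illustration.
deltas = {"affiliation": 0.3, "merit": 0.3}

best_attr, max_delta = None, -np.inf
for attr in ["affiliation", "merit"]:
    # Strict '>' means an equal delta does NOT replace the current best,
    # so the first attribute seen keeps the win on a tie.
    if deltas[attr] > max_delta:
        best_attr, max_delta = attr, deltas[attr]
```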

Comment on lines +204 to +208
# Update predicted scores to only have more relevant attribute
filtered_attribute_prediction_scores = {choiceA: {}, choiceB: {}}
for keep_attr in keep_attributes:
    filtered_attribute_prediction_scores[choiceA][keep_attr] = attribute_prediction_scores[choiceA][keep_attr]
    filtered_attribute_prediction_scores[choiceB][keep_attr] = attribute_prediction_scores[choiceB][keep_attr]

(nit)

Suggested change (old → new):

    # Update predicted scores to only have more relevant attribute
    filtered_attribute_prediction_scores = {choiceA: {}, choiceB: {}}
    for keep_attr in keep_attributes:
        filtered_attribute_prediction_scores[choiceA][keep_attr] = attribute_prediction_scores[choiceA][keep_attr]
        filtered_attribute_prediction_scores[choiceB][keep_attr] = attribute_prediction_scores[choiceB][keep_attr]

    # Update predicted scores to only have more relevant attribute
    filtered_attribute_prediction_scores = {
        choice: {
            keep_attr: attribute_prediction_scores[choice][keep_attr]
            for keep_attr in keep_attributes
        }
        for choice in attribute_prediction_scores.keys()
    }

4 participants