Skip to content

adding model and dataset flags to harness#18

Merged
andreea-alexandru merged 8 commits intofhe-benchmarking:mainfrom
stamcenter:harness-add-params
Mar 23, 2026
Merged

adding model and dataset flags to harness#18
andreea-alexandru merged 8 commits intofhe-benchmarking:mainfrom
stamcenter:harness-add-params

Conversation

@stamcenter
Copy link
Contributor

@stamcenter stamcenter commented Mar 19, 2026

This pull request extends the harness interface by introducing two new parameters:
--model
--dataset

These parameters allow the harness to dynamically select the model and dataset used during submission execution. It also moved the fault mlp model provided in the submission folder into the submissions folder for testing.

@stamcenter stamcenter changed the title Harness add params adding model and dataset flags to harness Mar 19, 2026
Copy link
Contributor

@andreea-alexandru andreea-alexandru left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, I just left a few minor comments.

@andreea-alexandru
Copy link
Contributor

Thank you for the changes! One more thing: while specifying a model that does not exist fails gracefully immediately ([harness]: Model directory [...] not found.), specifying a dataset that doesn't exist goes through building the submission and then throws an exception (raise ValueError(f"Unsupported dataset name: {DATASET_NAME}")). It would be good to have the same behavior from the harness for the latter as well.

Copy link
Contributor

@andreea-alexandru andreea-alexandru left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thanks!

@stamcenter
Copy link
Contributor Author

Thank you

@andreea-alexandru andreea-alexandru merged commit 7d4a6de into fhe-benchmarking:main Mar 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants