Feat/add folmsbee conformer benchmark by lwehrhan · Pull Request #429 · ddmms/ml-peg

lwehrhan · 2026-03-16T16:03:38Z

Pre-review checklist for PR author

PR author must check the checkboxes below when creating the PR.

I've confirmed the contribution guidelines.

Summary

The Folmsbee dataset of low-energy conformers of drug-like molecules. The differences in energy are smaller compared to the Wiggle500 dataset and it features a greater number of molecules. The highest available level of theory for energy evaluations to be used as ground-truth is DLPNO-CCSD(T). This is a test for moving the benchmarks of mlip-audit into this repository. I have included an analysis script for this benchmark, however would like to kindly ask for assistance with building and harmonizing the Dash layout.

Linked issue

Resolves #427

Progress

Calculations
Analysis
Application
Documentation

Testing

New decorators/callbacks

joehart2001 · 2026-03-18T14:22:51Z

Hi @lwehrhan, thank you for your PR and its looking great overall! A few things:

would you be able to share the data file so i can uplaod it to our s3 bucket so i can test the calc and analysis is running as expected?
i have pushed the app and also the metrics.yml, would you be able to check over this metrics file to make sure its correct?

Once we've got the data file uploaded, i think we can make a few changes to the calc script for consistency with similar benchmarks, but i think the changes will be minor.

Just a note, make sure you to fetch any changes i've made before working locally, otherwise your next push may overwrite my changes.

thanks!

joehart2001 · 2026-03-18T14:32:51Z

+    # Add D3 calculator for this test
+    calc = model.add_d3_calculator(calc)
+
+    data_path = Path(__file__).parent / "data" / "folmsbee_dataset.json"


Suggested change

data_path = Path(__file__).parent / "data" / "folmsbee_dataset.json"

data_path = (

download_s3_data(

filename="Folmsbee.zip",

key="inputs/conformers/Folmsbee/Folmsbee.zip",

)

/ "Folmsbee"

)

joehart2001 · 2026-03-18T14:33:08Z

+import pytest
+from tqdm import tqdm
+
+from ml_peg.models.get_models import load_models


Suggested change

from ml_peg.models.get_models import load_models

from ml_peg.calcs.utils.utils import download_s3_data

from ml_peg.models.get_models import load_models

joehart2001 · 2026-04-13T14:17:09Z

Hey @lwehrhan, we've just merged a PR so that you can tag your benchmark with the mlip-auditl, add your logo and have your own dedicated tab (PR #434). Please see the framework credit tags docs

…b.com/lwehrhan/ml-peg into feat/add-folmsbee-conformer-benchmark

lwalew · 2026-05-19T08:01:31Z

+        Name of model and model object to get calculator.
+    """
+    model_name, model = mlip
+    model.default_dtype = "float64"


question: What's the reason for this?

lwalew · 2026-05-19T08:14:30Z

+
+    (out_path / "model_output.json").write_text(
+        benchmark.model_output.model_dump_json()
+    )


question: Why not json.dump?

lwalew · 2026-06-02T15:59:13Z

+        results[model_name] = mae(
+            conformer_energies["ref"], conformer_energies[model_name]
+        )
+    return results


IIUC, here we are putting all conformers into a single flat list then taking the mae, whereas in the upstream benchmark we compute per-molecule MAEs which we then average to give avg_mae.

lwalew · 2026-06-02T16:03:14Z

+            i = int(conf_str)
+            molecule = result_by_name[mol_name]
+
+            results[model_name].append(float(molecule.predicted_energy_profile[i]))


This might be None if a molecule had an unsupported element.

lwalew · 2026-06-02T16:07:07Z


 [tool.uv.sources]
 asemolec = { git = "https://github.com/imagdau/aseMolec.git" }
+mlipaudit = { git = "https://github.com/instadeepai/MLIPAudit.git", branch = "mlpeg-migration" }


remark: Note that this will have to be udpated to main at some point, either once everything is migrated or per-benchmark. Probably the former.

leonwehrhan added 3 commits March 10, 2026 18:36

feat: calc folmsbee

b542fd3

fix: units

e82bc26

feat: add analysis script

cd4e9da

alinelena requested review from ElliottKasoar and joehart2001 March 16, 2026 17:14

joehart2001 reviewed Mar 18, 2026

View reviewed changes

add flomsbee app and metrics.yml

e03bf30

joehart2001 added the new benchmark Proposals and suggestions for new benchmarks label Mar 19, 2026

leonwehrhan and others added 5 commits May 12, 2026 16:28

feat: calc folmsbee

e2ab058

fix: units

398fd9b

feat: add analysis script

d682fb0

add flomsbee app and metrics.yml

894c1ef

s3 download for calc, calc analysis and app fixes

efb1ea1

joehart2001 force-pushed the feat/add-folmsbee-conformer-benchmark branch from e03bf30 to efb1ea1 Compare May 12, 2026 15:29

joehart2001 and others added 3 commits May 12, 2026 16:38

add framework details

3fad5b7

Merge branch 'feat/add-folmsbee-conformer-benchmark' of https://githu…

c484172

…b.com/lwehrhan/ml-peg into feat/add-folmsbee-conformer-benchmark

feat: use mlip audit benchmark classes

caf5786

lwalew reviewed May 19, 2026

View reviewed changes

lwalew approved these changes Jun 2, 2026

View reviewed changes

lwalew reviewed Jun 2, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/add folmsbee conformer benchmark#429

Feat/add folmsbee conformer benchmark#429
lwehrhan wants to merge 12 commits into
ddmms:mainfrom
lwehrhan:feat/add-folmsbee-conformer-benchmark

lwehrhan commented Mar 16, 2026 •

edited by joehart2001

Loading

Uh oh!

joehart2001 commented Mar 18, 2026 •

edited

Loading

Uh oh!

joehart2001 Mar 18, 2026

Uh oh!

joehart2001 Mar 18, 2026

Uh oh!

joehart2001 commented Apr 13, 2026

Uh oh!

lwalew May 19, 2026

Uh oh!

lwalew May 19, 2026

Uh oh!

lwalew Jun 2, 2026

Uh oh!

lwalew Jun 2, 2026

Uh oh!

lwalew Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

-    data_path = Path(__file__).parent / "data" / "folmsbee_dataset.json"
+    data_path = (
+        download_s3_data(
+            filename="Folmsbee.zip",
+            key="inputs/conformers/Folmsbee/Folmsbee.zip",
+        )
+        / "Folmsbee"
+    )

	from ml_peg.models.get_models import load_models
	from ml_peg.calcs.utils.utils import download_s3_data
	from ml_peg.models.get_models import load_models

Conversation

lwehrhan commented Mar 16, 2026 • edited by joehart2001 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pre-review checklist for PR author

Summary

Linked issue

Progress

Testing

New decorators/callbacks

Uh oh!

joehart2001 commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

joehart2001 Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

joehart2001 Mar 18, 2026

Choose a reason for hiding this comment

Uh oh!

joehart2001 commented Apr 13, 2026

Uh oh!

lwalew May 19, 2026

Choose a reason for hiding this comment

Uh oh!

lwalew May 19, 2026

Choose a reason for hiding this comment

Uh oh!

lwalew Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

lwalew Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

lwalew Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

lwehrhan commented Mar 16, 2026 •

edited by joehart2001

Loading

joehart2001 commented Mar 18, 2026 •

edited

Loading