
rfc: @ls.pytest.mark.parametrize interface #1199

Draft
baskaryan wants to merge 2 commits into base: main

Conversation

baskaryan (Contributor)

Almost certainly not handling lazy eval correctly, but what do we think of the interface?

  • automatically logs pass/fail feedback based on whether the test passes or fails
  • can also return whatever other feedback you want
@ls.pytest.mark.parametrize("Sample Dataset 3", (lambda x: x))
def test_parametrize(inputs, outputs, reference_outputs) -> list:
    assert inputs == outputs
    return [{"key": "foo", "value": "bar"}]

Some example experiments are here: https://dev.smith.langchain.com/public/e7782ea0-3de5-4352-8cd4-7b2cdbb03e4c/d
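As an illustrative (not authoritative) variant of the same proposed decorator — the import and test name below are assumptions — a test can combine an assertion, which drives the automatic pass/fail feedback, with extra feedback computed from the reference outputs:

import langsmith as ls  # assumed import; ls.pytest is the interface proposed in this PR

@ls.pytest.mark.parametrize("Sample Dataset 3", (lambda x: x))
def test_against_reference(inputs, outputs, reference_outputs) -> list:
    # A raised AssertionError would be logged as failing "pass" feedback.
    assert outputs is not None
    # Any extra feedback is returned alongside the automatic pass/fail entry.
    return [{"key": "exact_match", "score": int(outputs == reference_outputs)}]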

hinthornw (Collaborator) commented Nov 8, 2024

Things I like about this:

  1. Can connect to a dataset
  2. Outputs are fairly localized/transparent
  3. Trace seems sensible (has outputs by default)
  4. Parallelized!
  5. Think you can re-use the scoring helper function if you wanted

Things I don't love about this relative to @unit:

  1. Seems harder to check multi-step things
  2. The actual system is run "outside" the test function
  3. Currently seems to be 1 experiment per unit test? Maybe that is the right equivalence, though I'm not sure
  4. Pytest doesn't like it if you return stuff from the test function (see the sketch after this list)
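On point 4, a minimal standalone illustration (no LangSmith involved): recent pytest versions (7.2+) emit a PytestReturnNotNoneWarning when a test function returns a non-None value, which is exactly the pattern the proposed interface relies on for extra feedback.

def test_returns_feedback():
    # The test passes, but the non-None return triggers pytest's
    # "Expected None, but ... returned ..." warning on pytest >= 7.2.
    assert 1 + 1 == 2
    return [{"key": "foo", "value": "bar"}]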

Review thread on this excerpt from the PR diff:

pass_result = [r for r in eval_results if r.key == "pass"][0]
if not pass_result.score:
    error = pass_result.comment
    pytest.fail(
Contributor

How would you set failure conditions? I assume people don't want to actually fail if any evaluation fails?

Contributor

which might mean allowing customizability on the interface on this

Contributor Author (baskaryan)

This only fails if the actual test raises an error (we need to add a manual pytest.fail for that, because we catch and log all errors in the wrapper, L48). So it is customizable by default.
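For readers following the thread, here is a hedged sketch (not the actual implementation in this PR) of the wrapper behavior described above, assuming a hypothetical log_feedback callable for recording feedback:

import pytest

def run_wrapped_test(test_fn, inputs, outputs, reference_outputs, log_feedback):
    # Hypothetical wrapper: catch and log any error from the test body, always
    # record the automatic "pass" feedback, then surface the failure to pytest.
    error = None
    extra_feedback = []
    try:
        extra_feedback = test_fn(inputs, outputs, reference_outputs) or []
    except Exception as exc:
        error = str(exc)
    log_feedback({"key": "pass", "score": error is None, "comment": error})
    for feedback in extra_feedback:
        log_feedback(feedback)
    if error is not None:
        pytest.fail(error)  # re-raise so pytest still reports the test as failed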
