Conversation

@tristan-f-r
Collaborator

@tristan-f-r tristan-f-r commented Jul 14, 2025

Note

This also contains changes that close #297 to resolve a dependency diamond. I debated splitting that into another PR, but that 'containers' PR would depend on #292 (giving this two degrees of dependency), and the types in PRA#run actually make that change easier to follow and motivate.

This also borrows (though not as a direct dependency) from #286. #286 should be merged after #329 is merged, as there's a good chance that we can just auto-generate documentation from the documentation here instead.

Closes #321, closes #296, and closes #297.

Arguments are now specified as a pydantic BaseModel with attached documentation:

from typing import Optional

from pydantic import BaseModel, ConfigDict

class DominoParams(BaseModel):
    module_threshold: Optional[float] = None
    "the p-value threshold for considering a slice as relevant (optional)"

    slice_threshold: Optional[float] = None
    "the p-value threshold for considering a putative module as final module (optional)"

    model_config = ConfigDict(use_attribute_docstrings=True)
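To make the validation behavior concrete, here is a minimal, self-contained sketch of how pydantic handles such a model. This assumes pydantic v2.7+ (for `use_attribute_docstrings`) and gives the fields `= None` defaults so they are genuinely optional; it is an illustration, not SPRAS's actual model file:

```python
# Self-contained sketch of how pydantic (v2) validates such a parameter model.
from typing import Optional

from pydantic import BaseModel, ConfigDict, ValidationError

class DominoParams(BaseModel):
    module_threshold: Optional[float] = None
    "the p-value threshold for considering a slice as relevant (optional)"

    slice_threshold: Optional[float] = None
    "the p-value threshold for considering a putative module as final module (optional)"

    model_config = ConfigDict(use_attribute_docstrings=True)

params = DominoParams(module_threshold=0.05)
print(params.slice_threshold)  # None (omitted, so the default applies)

try:
    # lax coercion still rejects strings that are not parseable as floats
    DominoParams(module_threshold="not-a-number")
except ValidationError:
    print("rejected: module_threshold must be a float")
```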

When constructing a PRM, this is passed in as a generic:

class DOMINO(PRM[DominoParams]):
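For illustration, a hypothetical sketch of how such a generic base class can be wired up. The `PRM` and `PathLinker` bodies below are invented for this example and are not SPRAS's actual implementations:

```python
# Hypothetical sketch: a generic PRM base class parameterized by its
# pydantic argument model, so run() receives a typed `args` object.
from typing import Generic, TypeVar

from pydantic import BaseModel

T = TypeVar("T", bound=BaseModel)

class PRM(Generic[T]):
    @classmethod
    def run(cls, args: T) -> str:
        # A real implementation would launch the algorithm; here we only
        # show that `args` arrives as a validated model instance.
        return f"running {cls.__name__} with {args!r}"

class PathLinkerParams(BaseModel):
    k: int = 100

class PathLinker(PRM[PathLinkerParams]):
    pass

print(PathLinker.run(PathLinkerParams(k=5)))
```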

For algorithms that don't specify parameters, the Empty type is preferred instead (whose signature is the empty BaseModel with an attached model_config that doesn't allow any other parameters).
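A sketch of what such an Empty model could look like, assuming pydantic v2's `extra="forbid"` is the mechanism used to reject stray parameters:

```python
# Sketch of an Empty parameter model: no fields, and extra="forbid"
# so any unexpected keyword is rejected at validation time.
from pydantic import BaseModel, ConfigDict, ValidationError

class Empty(BaseModel):
    model_config = ConfigDict(extra="forbid")

Empty()  # fine: the algorithm takes no parameters

try:
    Empty(k=100)
except ValidationError as e:
    print("rejected unexpected parameter:", e.error_count(), "error")
```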

This also changes the signature of PRA#run (reflecting the PR title) to be:

# Where T is the TypeVar, which must be bound to pydantic.BaseModel
def run(inputs: dict[str, str | os.PathLike], output_file: str | os.PathLike, args: T, container_settings: ProcessedContainerOptions):

This has the disadvantage that inputs no longer has code completion, but that was probably not something to hide from the developer-user anyway, since we were passing inputs in via kwargs. See:

# This PR is more verbose when passing in arguments, which is a problem for people who
# are directly using PRA#run. However, I don't care about this audience.
PathLinker.run({"nodetypes": TEST_DIR + 'input/sample-in-nodetypes.txt',
                "network": TEST_DIR + 'input/sample-in-net.txt'},
               output_file=OUT_FILE_100,
               args=PathLinkerParams(k=100))

All of this gives us:

  • Encouraged, easily parsable PRM argument documentation
  • Parameter validation
  • A fully specified JSON schema (feat: json schema #358)
  • Default factories, which are used to fix nondeterminism (feat: seeds #335)
  • Parameter types, to be used for parameter tuning to automatically determine which parameters to select
    • (and AST-based range parsing so we can easily divide the step size for parameter tuning)
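As one concrete example of the default-factory point above, a hedged sketch of pinning down nondeterminism with pydantic's `default_factory`. The `seed` field name here is illustrative, not necessarily what #335 uses:

```python
# Illustrative use of default_factory: if the user omits a seed, one is
# drawn and stored on the model, so a run can be reproduced later.
import random

from pydantic import BaseModel, Field

class SeededParams(BaseModel):
    seed: int = Field(default_factory=lambda: random.randrange(2**32))

a = SeededParams()          # seed drawn automatically, but recorded on the model
b = SeededParams(seed=42)   # or pinned explicitly
print(b.seed)  # 42
```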

For reviewers

algorithms.py is the only "dense code" in this PR. That file extends our previous eval system and includes some hard-to-follow workflow code that enables type checking for our config file.

Otherwise, most of this is just refactoring all of the PRA#run calls to meet the new signature, specifying the new parameter models, and adding more parameter documentation.

@tristan-f-r tristan-f-r changed the title Config args refactor!: typed PRA#run Jul 14, 2025
@tristan-f-r tristan-f-r changed the title refactor!: typed PRA#run feat!: typed PRA#run Jul 14, 2025
@tristan-f-r tristan-f-r added enhancement New feature or request needed for benchmarking Priority PRs needed for the benchmarking paper labels Jul 14, 2025
@tristan-f-r tristan-f-r changed the title feat!: typed PRA#run feat!: schema & typed PRA#run Jul 15, 2025
@ntalluri ntalluri requested review from agitter and ntalluri November 19, 2025 21:43
Collaborator

@ntalluri ntalluri left a comment


This is my first pass of this PR.

Could you add an example demonstrating how these files and components fit together for an example algorithm and its configuration file, and show how the new files are used for validation and execution?

Also, for someone looking to integrate a new algorithm into SPRAS, what details or requirements should they be aware of? I’m assuming the new key piece would involve the Pydantic models.

Co-authored-by: Neha Talluri <[email protected]>
@tristan-f-r
Collaborator Author

Could you add an example demonstrating how these files and components fit together for an example algorithm and its configuration file, and show how the new files are used for validation and execution?

I'm confused by this question: we already do this with the present algorithms.

@ntalluri
Collaborator

Could you add an example demonstrating how these files and components fit together for an example algorithm and its configuration file, and show how the new files are used for validation and execution?

I'm confused by this question: we do this with the present algorithms.

What I mean is writing down, in a paragraph, what happens for one example in this PR: spelling out step by step how everything works together.

@tristan-f-r
Collaborator Author

tristan-f-r commented Nov 21, 2025

We have example usage in the PR description 👍

As for implementation details, the important part is the schema:

Algorithm files (e.g. pathlinker.py) do not depend on the schema, but rather depend on schema objects imported by algorithms.py, which is why we also need to separate out containers.py to avoid the dependency diamond.

Collaborator

@agitter agitter left a comment


I can follow the overall design goals. My big picture takeaway is that I can see why we need these changes, but the typing adds indirection and hurts code readability. I don't have a solution for that.

I haven't looked at every pathway reconstruction algorithm and test case update yet, so I'll need to take at least one more pass. I wanted to leave some initial comments.

algorithms.py has some sophisticated Python. I am fairly sure I understand it when reviewing it today. I'm not sure about my ability to troubleshoot things if/when they break.

Collaborator

@agitter agitter left a comment


My only new comments are small.

In the spirit of helping new contributors who encounter this code base in the future, I'm wondering where to capture some of the information about the overall SPRAS design, especially the part that is changing here. Some of the useful information in the original message of this PR, e.g.

Arguments are now specified as a pydantic BaseModel with attached documentation:

provides a guide to how SPRAS works and where to find things. Is there a place to retain that knowledge in the repo? Scrolling through individual files to reconstruct it is going to get harder and harder.

@tristan-f-r
Collaborator Author

provides a guide to how SPRAS works and where to find things. Is there a place to retain that knowledge in the repo? Scrolling through individual files to reconstruct it is going to get harder and harder.

The best place to document that would be the contributing guide. We could increase the sophistication of our AllPairs wrapping example to take in an optional argument, but I'm not sure what that argument would be.


Labels

enhancement: New feature or request
needed for benchmarking: Priority PRs needed for the benchmarking paper
P-high: This is a blocker for many PRs/issues/features
tuning: Workflow-spanning algorithm tuning


Development

Successfully merging this pull request may close these issues.

  • Typed PRA#run
  • [config] nested runs
  • [config] containers

3 participants