-
Notifications
You must be signed in to change notification settings - Fork 25
Update summary.py to include parameter combinations #194
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
31 commits
Select commit
Hold shift + click to select a range
c83370b
new updates
cgsze bdb818f
Merge branch 'update-summary' of https://github.com/cgsze/spras into …
cgsze 320d399
pre-commit
cgsze 753035a
updated filepath indexing
cgsze ed6e4ca
edit test cases
cgsze c4c65e9
Updated input/output test files
cgsze 4893f86
node tables into edge/node files from yaml
cgsze 400df3b
init from file
cgsze a75cb4f
dataset_dict
cgsze aadb0f8
started loading test case
cgsze 83cacd8
resolved mismatched param combos
cgsze 602145c
latest changes
cgsze a1a189b
resolved 3 AttributeErrors
cgsze 4f365f6
resolved TypeError, set all but summary to false
cgsze a6a7f48
debugging summarize_networks returning empty df -> AssertionError
cgsze 3a1c0c6
fixed AssertionErrors
cgsze d0b0b6c
load dataset test case
cgsze 9df48cd
pytest fixes
cgsze 6b7092f
Merge branch 'Reed-CompBio:master' into update-summary
cgsze a7187e0
revert file path changes
cgsze ac99e5e
load dataset_dict test case
cgsze 1823788
added dummy node to test case/summarize_networks
cgsze fb6b1ae
output/ directory issues (to be continued)
cgsze 0d29b43
test_summary.py test cases
cgsze a8563d1
debugging
cgsze c1fc881
set graphspace/cytoscape to true in config files
cgsze 61e7b09
new changes
cgsze 156a9a5
linux style paths
cgsze 4696cdc
Refactor summarization code
agitter f00a42a
Fix test case paths
agitter 1cf27b8
Fix test case path again
agitter File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,4 @@ | ||
| NODEID prize active dummy sources targets | ||
| 0 C 5.7 True NaN NaN True | ||
| 1 A 2.0 True True True NaN | ||
| 2 B NaN NaN NaN NaN NaN |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,10 @@ | ||
| Name Number of nodes Number of edges Number of connected components Nodes in prize Nodes in sources Nodes in targets Nodes in active Nodes in dummy Parameter combination | ||
| test/analysis/input/egfr/tps-egfr-domino-params-V3X4RW7_pathway.txt 48 45 3 27 0 27 27 0 {'slice_threshold': 0.3, 'module_threshold': 0.05} | ||
| test/analysis/input/egfr/tps-egfr-meo-params-GKEDDFZ_pathway.txt 1877 12845 1 621 1 620 621 1 {'local_search': 'Yes', 'max_path_length': 3, 'rand_restarts': 10} | ||
| test/analysis/input/egfr/tps-egfr-omicsintegrator1-params-3THRXWW_pathway.txt 28 20 8 28 1 27 28 1 {'b': 2, 'd': 10, 'g': '1e-3', 'r': 0.01, 'w': 0.1, 'mu': 0.008, 'dummy_mode': 'file'} | ||
| test/analysis/input/egfr/tps-egfr-omicsintegrator1-params-5QH767V_pathway.txt 39 31 8 39 1 38 39 1 {'b': 10, 'd': 10, 'g': '1e-3', 'r': 0.01, 'w': 0.1, 'mu': 0.008, 'dummy_mode': 'file'} | ||
| test/analysis/input/egfr/tps-egfr-omicsintegrator1-params-ITO5EQS_pathway.txt 14 9 5 14 0 14 14 0 {'b': 0.55, 'd': 10, 'g': '1e-3', 'r': 0.01, 'w': 0.1, 'mu': 0.008, 'dummy_mode': 'file'} | ||
| test/analysis/input/egfr/tps-egfr-omicsintegrator2-params-EHHWPMD_pathway.txt 593 591 2 531 1 530 531 1 {'b': 2, 'g': 3} | ||
| test/analysis/input/egfr/tps-egfr-omicsintegrator2-params-IV3IPCJ_pathway.txt 704 702 2 616 1 615 616 1 {'b': 4, 'g': 0} | ||
| test/analysis/input/egfr/tps-egfr-pathlinker-params-7S4SLU6_pathway.txt 14 17 1 6 1 5 6 1 {'k': 10} | ||
| test/analysis/input/egfr/tps-egfr-pathlinker-params-TCEMRS7_pathway.txt 25 32 1 11 1 10 11 1 {'k': 20} |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,13 @@ | ||
| Name Number of nodes Number of edges Number of connected components Nodes in prize Nodes in active Nodes in dummy Nodes in sources Nodes in targets Parameter combination | ||
| test/analysis/input/example/data0-allpairs-params-BEH6YB2_pathway.txt 3 2 1 2 2 1 1 1 {'spras_placeholder': 'no parameters'} | ||
| test/analysis/input/example/data0-domino-params-V3X4RW7_pathway.txt 0 0 0 0 0 0 0 0 {'slice_threshold': 0.3, 'module_threshold': 0.05} | ||
| test/analysis/input/example/data0-meo-params-GKEDDFZ_pathway.txt 3 2 1 2 2 1 1 1 {'max_path_length': 3, 'local_search': 'Yes', 'rand_restarts': 10} | ||
| test/analysis/input/example/data0-mincostflow-params-SZPZVU6_pathway.txt 3 2 1 2 2 1 1 1 {'flow': 1, 'capacity': 1} | ||
| test/analysis/input/example/data0-omicsintegrator1-params-E3LSEZQ_pathway.txt 3 2 1 2 2 1 1 1 {'b': 6, 'w': 5.0, 'd': 10, 'dummy_mode': 'file'} | ||
| test/analysis/input/example/data0-omicsintegrator1-params-NFIPHUX_pathway.txt 0 0 0 0 0 0 0 0 {'b': 6, 'w': 0.0, 'd': 10, 'dummy_mode': 'file'} | ||
| test/analysis/input/example/data0-omicsintegrator1-params-SU2S63Y_pathway.txt 3 2 1 2 2 1 1 1 {'b': 5, 'w': 0.0, 'd': 10, 'dummy_mode': 'file'} | ||
| test/analysis/input/example/data0-omicsintegrator1-params-V26JBGX_pathway.txt 0 0 0 0 0 0 0 0 {'b': 5, 'w': 5.0, 'd': 10, 'dummy_mode': 'file'} | ||
| test/analysis/input/example/data0-omicsintegrator2-params-EHHWPMD_pathway.txt 0 0 0 0 0 0 0 0 {'b': 2, 'g': 3} | ||
| test/analysis/input/example/data0-omicsintegrator2-params-IV3IPCJ_pathway.txt 3 2 1 2 2 1 1 1 {'b': 4, 'g': 0} | ||
| test/analysis/input/example/data0-pathlinker-params-6SWY7JS_pathway.txt 3 2 1 2 2 1 1 1 {'k': 200} | ||
| test/analysis/input/example/data0-pathlinker-params-VQL7BDZ_pathway.txt 3 2 1 2 2 1 1 1 {'k': 100} |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,137 @@ | ||
| # The length of the hash used to identify a parameter combination | ||
| hash_length: 7 | ||
|
|
||
| # Specify the container framework. Current supported versions include 'docker' and | ||
| # 'singularity'. If container_framework is not specified, SPRAS will default to docker. | ||
| container_framework: docker | ||
|
|
||
| # Only used if container_framework is set to singularity, this will unpack the singularity containers | ||
| # to the local filesystem. This is useful when PRM containers need to run inside another container, | ||
| # such as would be the case in an HTCondor/OSPool environment. | ||
| # NOTE: This unpacks singularity containers to the local filesystem, which will take up space in a way | ||
| # that persists after the workflow is complete. To clean up the unpacked containers, the user must | ||
| # manually delete them. | ||
| unpack_singularity: false | ||
|
|
||
| # Allow the user to configure which container registry containers should be pulled from | ||
| # Note that this assumes container names are consistent across registries, and that the | ||
| # registry being passed doesn't require authentication for pull actions | ||
| container_registry: | ||
| base_url: docker.io | ||
| # The owner or project of the registry | ||
| # For example, "reedcompbio" if the image is available as docker.io/reedcompbio/allpairs | ||
| owner: reedcompbio | ||
|
|
||
| algorithms: | ||
| - name: "pathlinker" | ||
| params: | ||
| include: true | ||
| run1: | ||
| k: range(100,201,100) | ||
|
|
||
| - name: "omicsintegrator1" | ||
| params: | ||
| include: true | ||
| run1: | ||
| b: [5, 6] | ||
| w: np.linspace(0,5,2) | ||
| d: [10] | ||
| dummy_mode: ["file"] | ||
|
|
||
| - name: "omicsintegrator2" | ||
| params: | ||
| include: true | ||
| run1: | ||
| b: [4] | ||
| g: [0] | ||
| run2: | ||
| b: [2] | ||
| g: [3] | ||
|
|
||
| - name: "meo" | ||
| params: | ||
| include: true | ||
| run1: | ||
| max_path_length: [3] | ||
| local_search: ["Yes"] | ||
| rand_restarts: [10] | ||
|
|
||
| - name: "mincostflow" | ||
| params: | ||
| include: true | ||
| run1: | ||
| flow: [1] # The flow must be an int | ||
| capacity: [1] | ||
|
|
||
| - name: "allpairs" | ||
| params: | ||
| include: true | ||
|
|
||
| - name: "domino" | ||
| params: | ||
| include: true | ||
| run1: | ||
| slice_threshold: [0.3] | ||
| module_threshold: [0.05] | ||
|
|
||
|
|
||
| # Here we specify which pathways to run and other file location information. | ||
| # DataLoader.py can currently only load a single dataset | ||
| # Assume that if a dataset label does not change, the lists of associated input files do not change | ||
| datasets: | ||
| - | ||
| # Labels can only contain letters, numbers, or underscores | ||
| label: data0 | ||
| node_files: ["node-prizes.txt", "sources.txt", "targets.txt"] | ||
| # DataLoader.py can currently only load a single edge file, which is the primary network | ||
| edge_files: ["network.txt"] | ||
| # Placeholder | ||
| other_files: [] | ||
| # Relative path from the spras directory | ||
| data_dir: "input" | ||
|
|
||
| gold_standards: | ||
| - | ||
| # Labels can only contain letters, numbers, or underscores | ||
| label: gs0 | ||
| node_files: ["gs_nodes0.txt"] | ||
| data_dir: "input" | ||
| # List of dataset labels to compare with the specific gold standard dataset | ||
| dataset_labels: ["data0"] | ||
|
|
||
| # If we want to reconstruct then we should set run to true. | ||
| reconstruction_settings: | ||
| #set where everything is saved | ||
| locations: | ||
| #place the save path here | ||
| reconstruction_dir: "output" | ||
| run: true | ||
|
|
||
| analysis: | ||
| # Create one summary per pathway file and a single summary table for all pathways for each dataset | ||
| summary: | ||
| include: true | ||
| # Create output files for each pathway that can be visualized with GraphSpace | ||
| graphspace: | ||
| include: true | ||
| # Create Cytoscape session file with all pathway graphs for each dataset | ||
| cytoscape: | ||
| include: true | ||
| # Machine learning analysis (e.g. clustering) of the pathway output files for each dataset | ||
| ml: | ||
| # ml analysis per dataset | ||
| include: false | ||
| # adds ml analysis per algorithm output | ||
| # only runs for algorithms with multiple parameter combinations chosen | ||
| aggregate_per_algorithm: true | ||
| # specify how many principal components to calculate | ||
| components: 2 | ||
| # boolean to show the labels on the pca graph | ||
| labels: true | ||
| # 'ward', 'complete', 'average', 'single' | ||
| # if linkage: ward, must use metric: euclidean | ||
| linkage: 'ward' | ||
| # 'euclidean', 'manhattan', 'cosine' | ||
| metric: 'euclidean' | ||
| evaluation: | ||
| include: false | ||
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.