This repository was archived by the owner on Apr 25, 2023. It is now read-only.

enhanced parallelization #33

@bryantChhun

Description


Problem

Each module (run_patch.py, run_segmentation.py, etc.) launches a process to parallelize across sites, but we often have a large dataset for a given site (high Z, T, or C count) and are not able to parallelize within a site, e.g. across individual timepoints.

Possible solutions

Each module currently uses its own worker class and the Python multiprocessing library to spawn new processes. If we switch to a process pool (either concurrent.futures.ProcessPoolExecutor or multiprocessing.Pool), we can pass a list of parameters to the executor, which will spawn and manage the worker processes on its own.
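A minimal sketch of what that could look like, assuming a hypothetical per-site worker function `process_site` in place of the current worker classes (the function name and parameter tuples are placeholders, not the actual module API):

```python
from concurrent.futures import ProcessPoolExecutor


def process_site(site_params):
    """Hypothetical per-task worker; would wrap the existing
    per-site processing done by run_patch.py / run_segmentation.py."""
    site_name, timepoint = site_params
    # ... actual per-site / per-timepoint work would go here ...
    return f"{site_name}_t{timepoint}"


def run_all(sites, max_workers=4):
    # The pool spawns worker processes and distributes the parameter
    # list across them; executor.map preserves input order.
    with ProcessPoolExecutor(max_workers=max_workers) as executor:
        return list(executor.map(process_site, sites))


if __name__ == "__main__":
    params = [("A1", 0), ("A1", 1), ("B2", 0)]
    print(run_all(params))
```

Because the task list is just parameter tuples, the same pool could cover both across-site and within-site (per-timepoint) parallelism by flattening the (site, timepoint) combinations into one list.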

Questions

This touches on two questions:

  • What data structure will we use? If we keep a single array (currently .npy), it needs to support concurrent writes. Alternatively, should patches be written to individual files?
  • At exactly what level do we parallelize the data? Should it be possible at the finest level (Z, T, C)?
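One way the two questions interact: writing each patch to its own file sidesteps concurrent writes to a single .npy array entirely, which in turn makes finest-level (T, Z, C) parallelism straightforward. A hedged sketch, where `extract_patch` and the output file layout are hypothetical stand-ins for the real patch logic:

```python
import itertools
import os
from concurrent.futures import ProcessPoolExecutor

import numpy as np


def extract_patch(task):
    """Hypothetical finest-level task: one (t, z, c) index triple.
    Writes its result to its own file, so no shared-array locking
    or concurrent-write support is needed."""
    t, z, c, out_dir = task
    patch = np.zeros((64, 64))  # placeholder for the real extraction
    path = os.path.join(out_dir, f"patch_t{t}_z{z}_c{c}.npy")
    np.save(path, patch)
    return path


def run(shape_tzc, out_dir, max_workers=4):
    os.makedirs(out_dir, exist_ok=True)
    # Flatten the full (T, Z, C) index space into independent tasks.
    tasks = [(t, z, c, out_dir)
             for t, z, c in itertools.product(*map(range, shape_tzc))]
    with ProcessPoolExecutor(max_workers=max_workers) as executor:
        return list(executor.map(extract_patch, tasks))
```

The trade-off is file count: per-patch files scale as T x Z x C per site, so downstream consumers would need a loader that reassembles them (or a format like zarr that supports chunked concurrent writes natively).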
