
[WIP] Benchmarking custom models #92

Closed

@prateekdesai04 (Contributor) commented Feb 5, 2025

Issue #, if available:

Description of changes:

The current version has warm pools implemented, so if max_concurrent_jobs is 30 and 90 jobs are launched, the remaining 60 will be queued and SageMaker instances will be reused.
This has been tested on D244_F3_C1530_30 over all available folds [0, 1, 2], for the LightGBM_c1_BAG_L1_Reproduced_AWS config.
max_concurrent_jobs should be less than the account limit, which is 34 for now.
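
A minimal sketch of the queueing idea described above, assuming new jobs are held back while the number of InProgress SageMaker training jobs is at the limit; `launch_one_job` and the polling loop are hypothetical and not the PR's actual implementation:

```python
import time
import boto3

sagemaker = boto3.client("sagemaker")

def count_running_jobs(name_prefix: str) -> int:
    """Count InProgress training jobs whose names start with the experiment prefix."""
    resp = sagemaker.list_training_jobs(
        NameContains=name_prefix, StatusEquals="InProgress", MaxResults=100
    )
    return len(resp["TrainingJobSummaries"])

def launch_with_limit(tasks, launch_one_job, name_prefix: str, max_concurrent_jobs: int = 30):
    """Hold back launches so at most max_concurrent_jobs run at once; warm pools reuse the instances."""
    for task in tasks:
        while count_running_jobs(name_prefix) >= max_concurrent_jobs:
            time.sleep(30)  # wait for a slot to free up
        launch_one_job(task)  # hypothetical per-task launch function
```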

To set up and run:

  1. Copy the tabflow folder into a parent directory. NOTE: this parent directory must also contain the tabrepo, autogluon-benchmark, and autogluon-bench folders; make sure all three are installed before installing tabflow.
  2. If you change anything in autogluon or tabrepo, you will need to re-build the image: navigate to the parent folder, then tabflow/docker, and run ./build_docker.sh {ecr_repo_name} {tag} {source_account} {target_account} {region} (AWS credentials required).
  3. In your IDE, make the necessary changes inside launch_jobs.py, e.g. enter the Docker image URI you just pushed to ECR under DOCKER_IMAGE_ALIASES; see the sketch after this list. (I plan to make these args in future edits.)
  4. Assuming you are in the parent folder, run pip install tabflow.
  5. Input your AWS credentials.
  6. Read the Example below on how to run.
  7. If you want to import any new model, import it in evaluate.py.
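
For step 3, a minimal sketch of the intended edit in launch_jobs.py, assuming DOCKER_IMAGE_ALIASES is a plain dict mapping a short alias to an ECR image URI (the alias and URI below are placeholders, not real values):

```python
# launch_jobs.py (sketch): map a short alias to the image URI pushed to ECR in step 2.
# "my-image" and the URI are placeholders; substitute your own account, region, repo name and tag.
DOCKER_IMAGE_ALIASES = {
    "my-image": "123456789012.dkr.ecr.us-east-1.amazonaws.com/{ecr_repo_name}:{tag}",
}
```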

Example:

To run one or several datasets over certain folds (datasets and folds are space-separated):
tabflow --datasets Australian --folds 0 1 --methods_file ~/method_configs.yaml --s3_bucket test-bucket --experiment_name test-experiment --max-concurrent-jobs 30 --wait

To run all datasets in a context over all folds for that context:
tabflow --datasets run_all --folds -1 --methods_file ~/method_configs.yaml --s3_bucket test-bucket --experiment_name test-experiment --max-concurrent-jobs 30 --wait

Note:

  1. For new experiment_names, caching won't come into play (a sketch of the caching check follows below)
  2. max_concurrent_jobs must always be less than your account limit; expect failures otherwise
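
A minimal sketch of the caching idea in note 1, assuming results are cached in S3 under a key derived from the experiment name, dataset, fold, and method; the key layout and helper below are hypothetical, not tabflow's actual scheme:

```python
import boto3
from botocore.exceptions import ClientError

s3 = boto3.client("s3")

def is_cached(bucket: str, experiment: str, dataset: str, fold: int, method: str) -> bool:
    """Return True if a results.pkl already exists for this task (hypothetical key layout)."""
    key = f"{experiment}/{dataset}/{fold}/{method}/results.pkl"
    try:
        s3.head_object(Bucket=bucket, Key=key)
        return True
    except ClientError as e:
        if e.response["Error"]["Code"] in ("404", "NoSuchKey"):
            return False
        raise

# A new experiment_name changes the key prefix, so nothing is found and every job runs.
```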

To Do (mostly in priority order):

  1. Add requirements.txt or pyproject.toml [x]
  2. Handle more than 100 ListTrainingJobs results using exponential back-off to avoid throttling [x]
  3. Do a code clean-up and modularize everything [x] (this is also incremental, based on feedback)
  4. Log every task; fetch logs from SageMaker and save them to S3 along with results.pkl [x]
  5. Multi-threading for instantaneous job launch [WIP] (see the sketch after this list)
  6. Clean up Docker instructions, add wait flags and misc. items [x]
  7. Get results from S3, store from local to S3, and other convenience functions
  8. Give args for Dockerfile name, build, etc.; add a Docker building step to the pipeline
  9. Change the date-time format of the experiment; currently it is not in sorted order in S3 (if required, not necessary)
  10. Adopt the model register implementation from tabrepo when available [x]
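
For item 5, a minimal sketch of the multi-threaded launch idea, assuming a per-task launch function; `launch_one_job` is hypothetical and not tabflow's actual API. A thread pool lets job submissions overlap instead of being issued strictly one at a time:

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def launch_all(tasks, launch_one_job, max_workers: int = 8):
    """Submit SageMaker training jobs from a thread pool so launches overlap.

    `tasks` is an iterable of (dataset, fold, method) tuples and `launch_one_job`
    is a hypothetical callable that creates one training job and returns its name.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(launch_one_job, *task): task for task in tasks}
        for future in as_completed(futures):
            task = futures[future]
            try:
                results[task] = future.result()
            except Exception as exc:  # keep launching the rest even if one submission fails
                results[task] = exc
    return results
```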

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
