Update README

ttanida · ttanida · commit 42a8a027d2e7 · 2023-03-01T23:42:41.000+01:00
diff --git a/README.md b/README.md
@@ -18,32 +18,6 @@ The automatic generation of radiology reports has the potential to assist radiol
 
 After the setup, run "**python create_dataset.py**" in src/dataset/ to create training, val and test csv files, in which each row contains specific information about a single image. See doc string of create_dataset.py for more details.
 
-## Training
+## Training and Testing
 
-The full model is trained in 3 training stages:
-
-1. Object detector
-2. Object detector + abnormality classification module + region selection module
-3. Full model end-to-end
-
-### Object detector
-
-For training the object detector, specify the training configurations (e.g. batch size etc.) in lines 32 - 49 of src/object_detector/training_script_object_detector.py, then run "**python training_script_object_detector.py**".
-The weights of the trained object detector model will be stored in the folder specified in src/path_datasets_and_weights.py
-
-### Object detector + abnormality classification module + region selection module
-
-For the second training stage, first specify the path to the best trained object detector in report generation model (see line 26 of src/full_model/report_generation_model.py), such that the trained object detector will be trained together with the 2 binary classifiers.
-Next, specify the run configurations in src/full_model/run_configurations.py. In particular, set "**PRETRAIN_WITHOUT_LM_MODEL = True**",
-such that the language model is fully excluded from training. See doc string of src/full_model/run_configurations.py for more details.
-Start training by running "**python train_full_model.py**" in src/full_model/.
-
-### Full model
-
-For the third training stage, again adjust the run configurations in src/full_model/run_configurations.py (e.g., the batch size may have to be lowered, since the full model requires a lot of memory). In particular, set "**PRETRAIN_WITHOUT_LM_MODEL = False**", such that the full model is trained end-to-end. Next, specify the checkpoint of the best pre-trained model of training stage 2 in the main function (line 567) of src/full_model/train_full_model.py, such this pre-trained model is loaded at beginning of training. Start training by running "**python train_full_model.py**" in src/full_model/.
- 
-During each training stage, the validation metrics and other useful information (such as images with bounding boxes and generated sentences etc.) are logged to tensorboard files saved in the corresponding run folders (specified in path_datasets_and_weights.py). Additionally, for the 3rd training stage, txt files with generated reports and sentences are saved in the run folders. 
-
-## Testing
-
-Specify the run and checkpoint of the best trained full model to be tested in lines 40 - 41 of src/full_model/test_set_evaluation.py, then run "**python test_set_evaluation.py**". Txt files with the test set scores (and generated reports/sentences) will be saved in src/.
+Please read README_TRAIN_TEST.md for specific information on training and testing the model.
diff --git a/README_TRAIN_TEST.md b/README_TRAIN_TEST.md
@@ -0,0 +1,29 @@
+## Training
+
+The full model is trained in 3 training stages:
+
+1. Object detector
+2. Object detector + abnormality classification module + region selection module
+3. Full model end-to-end (i.e. all 4 modules together)
+
+### Object detector
+
+To train the object detector, specify the training configurations (e.g. batch size) in lines 32 - 49 of src/object_detector/training_script_object_detector.py, then run "**python training_script_object_detector.py**".
+The weights of the trained object detector model will be stored in the folder specified in src/path_datasets_and_weights.py.
+
+### Object detector + abnormality classification module + region selection module
+
+For the second training stage, first specify the path to the best trained object detector in the report generation model (see line 26 of src/full_model/report_generation_model.py), so that the trained object detector is trained together with the 2 binary classifiers.
+Next, specify the run configurations in src/full_model/run_configurations.py. In particular, set "**PRETRAIN_WITHOUT_LM_MODEL = True**",
+so that the language model is fully excluded from training. See the docstring of src/full_model/run_configurations.py for more details.
+Start training by running "**python train_full_model.py**" in src/full_model/.
+
+### Full model
+
+For the third training stage, again adjust the run configurations in src/full_model/run_configurations.py (e.g., the batch size may have to be lowered, since the full model requires a lot of memory). In particular, set "**PRETRAIN_WITHOUT_LM_MODEL = False**", so that the full model is trained end-to-end. Next, specify the checkpoint of the best pre-trained model of training stage 2 in the main function (line 567) of src/full_model/train_full_model.py, so that this pre-trained model is loaded at the beginning of training. Start training by running "**python train_full_model.py**" in src/full_model/.
+ 
+During each training stage, the validation metrics and other useful information (such as images with bounding boxes and generated sentences) are logged to TensorBoard files saved in the corresponding run folders (specified in path_datasets_and_weights.py). Additionally, for the 3rd training stage, txt files with generated reports and sentences are saved in the run folders.
+
+## Testing
+
+Specify the run and checkpoint of the best trained full model to be tested in lines 40 - 41 of src/full_model/test_set_evaluation.py, then run "**python test_set_evaluation.py**". Txt files with the test set scores (and generated reports/sentences) will be saved in src/.
diff --git a/src/full_model/run_configurations.py b/src/full_model/run_configurations.py
@@ -16,7 +16,8 @@
 (2) Object detector + binary classifiers training:
     - load best object detector weights from step (1) into the object detector in the __init__ method in src/full_model/report_generation_model.py
     - set PRETRAIN_WITHOUT_LM_MODEL = True in this file
-    - make sure that in the main function of src/full_model/train_full_model.py, no other weights are loaded into the instantiated ReportGenerationModel
+    - make sure that in the main function of src/full_model/train_full_model.py,
+    no other weights are loaded into the instantiated ReportGenerationModel (i.e. make sure that line 567 is commented out)
     - pre-train full model without language model with src/full_model/train_full_model.py
 
 (3) Full model training: