# Real-time face detector

This repository contains training scripts for a lightweight SSD-based face detector. The detector uses a MobileNetV2 backbone and a single SSD head with manually designed anchors. As a result, it has a computational complexity of 0.51 GMACs and 1.03 M parameters.

## Requirements

* Ubuntu 16.04
* Python 3.5 or 3.6 (3.6 is preferable)
* PyTorch 1.1

## Prerequisites

1. Download the mmdetection submodule: `git submodule update --init --recommend-shallow external/mmdetection`
2. Download the [WIDER Face](http://shuoyang1213.me/WIDERFACE/) dataset and unpack it to the `data` folder.
3. Annotations in the VOC format can be found in this [repo](https://github.com/sovrasov/wider-face-pascal-voc-annotations.git). Move the annotation files from the `WIDER_train_annotations` and `WIDER_val_annotations` folders to the `Annotations` folders inside the corresponding `WIDER_train` and `WIDER_val` directories. The annotation lists `val.txt` and `train.txt` should also be copied to `data/WIDERFace` from `WIDER_train_annotations` and `WIDER_val_annotations`. The resulting directory structure should look like this:

```
object_detection
├── tools
├── data
│   ├── WIDERFace
│   │   ├── WIDER_train
│   │   │   ├── 0--Parade
│   │   │   ├── ...
│   │   │   ├── Annotations
│   │   ├── WIDER_val
│   │   │   ├── 0--Parade
│   │   │   ├── ...
│   │   │   ├── Annotations
│   │   ├── val.txt
│   │   ├── train.txt
```
4. Create a virtual environment: `bash init_venv.sh`

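After the steps above, the layout can be sanity-checked with a short script like the following (a hypothetical helper, not part of this repository; the root path is an assumption based on the tree above):

```python
import os

# Paths expected by the directory tree above; the root is an assumption.
EXPECTED = [
    "WIDER_train/Annotations",
    "WIDER_val/Annotations",
    "val.txt",
    "train.txt",
]

def missing_paths(root="data/WIDERFace"):
    """Return the expected paths that do not exist under root."""
    return [p for p in EXPECTED if not os.path.exists(os.path.join(root, p))]

if __name__ == "__main__":
    missing = missing_paths()
    print("Layout OK" if not missing else "Missing: " + ", ".join(missing))
```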
## Training

1. Download the pre-trained MobileNetV2 weights `mobilenet_v2.pth.tar` from [https://github.com/tonylins/pytorch-mobilenet-v2](https://github.com/tonylins/pytorch-mobilenet-v2). Move the weights file to the `snapshots` folder.
2. To train the detector on a single GPU, run in a terminal:
```bash
python3 ../../external/mmdetection/tools/train.py \
    ../../external/mmdetection/configs/wider_face/mobilenetv2_tiny_ssd300_wider_face.py
```

## Validation

1. To dump detections, run in a terminal:
```bash
python3 ../../external/mmdetection/tools/test.py \
    ../../external/mmdetection/configs/wider_face/mobilenetv2_tiny_ssd300_wider_face.py \
    <CHECKPOINT> \
    --out result.pkl
```
2. Then run:
```bash
python3 ../../external/mmdetection/tools/voc_eval.py \
    result.pkl \
    ../../external/mmdetection/configs/wider_face/mobilenetv2_tiny_ssd300_wider_face.py
```
One should observe 0.305 AP on the validation set. For more detailed results and a comparison with vanilla SSD300, see `../../external/mmdetection/configs/wider_face/README.md`.

## Conversion to OpenVINO format

1. Convert the PyTorch model to the ONNX format by running the following in a terminal:
```bash
python3 tools/onnx_export.py \
    ../../external/mmdetection/configs/wider_face/mobilenetv2_tiny_ssd300_wider_face.py \
    <CHECKPOINT> \
    face_detector.onnx
```
This command produces `face_detector.onnx`.
2. Convert the ONNX model to the OpenVINO format with the Model Optimizer:
```bash
python3 <OpenVINO_INSTALL_DIR>/deployment_tools/model_optimizer/mo.py \
    --input_model face_detector.onnx \
    --scale 255 \
    --reverse_input_channels
```
This produces the model `face_detector.xml` and weights `face_detector.bin` in single-precision floating-point format (FP32). The obtained model expects a normalized image in planar BGR format.

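Since the model consumes a normalized planar BGR image, the input transform can be sketched as below (a NumPy-only illustration; the function and variable names are invented for this example). Note that `--scale 255` and `--reverse_input_channels` bake these steps into the IR itself, so this shows the equivalent manual preprocessing for the original ONNX model:

```python
import numpy as np

def to_planar_bgr_blob(frame_rgb):
    """HxWx3 RGB uint8 frame -> 1x3xHxW float32 blob in [0, 1], BGR order."""
    bgr = frame_rgb[:, :, ::-1]            # RGB -> BGR
    blob = bgr.astype(np.float32) / 255.0  # normalize to [0, 1]
    blob = blob.transpose(2, 0, 1)         # HWC -> CHW (planar)
    return blob[np.newaxis]                # add batch dimension

frame = np.zeros((300, 300, 3), dtype=np.uint8)
print(to_planar_bgr_blob(frame).shape)  # (1, 3, 300, 300)
```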
## Python demo

To run the demo, connect a webcam and execute the following command:
```bash
python3 tools/detection_live_demo.py \
    ../../external/mmdetection/configs/wider_face/mobilenetv2_tiny_ssd300_wider_face.py \
    <CHECKPOINT> \
    --cam_id 0
```

## Estimate theoretical computational complexity

To get per-layer computational complexity estimates, run the following command:
```bash
python3 tools/count_flops.py \
    ../../external/mmdetection/configs/wider_face/mobilenetv2_tiny_ssd300_wider_face.py
```

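For intuition about where figures like 0.51 GMACs come from, the MAC count of a single convolution layer can be computed by hand. The sketch below is a generic illustration (the layer shown is hypothetical, not taken from this detector):

```python
def conv2d_macs(c_in, c_out, kernel, h_out, w_out, groups=1):
    """Multiply-accumulates for a 2D convolution: each output element
    costs (c_in / groups) * kernel^2 multiplications."""
    return (c_in // groups) * kernel * kernel * c_out * h_out * w_out

# Hypothetical layer: 3x3 conv, 32 -> 64 channels, 150x150 output map.
print(conv2d_macs(32, 64, 3, 150, 150) / 1e9, "GMACs")  # 0.41472 GMACs
```

Summing this quantity over every layer (plus the cheaper elementwise and pooling operations) is what `count_flops.py`-style tools do to report the total model complexity.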
## Fine-tuning

* The annotations have to be in the [PASCAL VOC](http://host.robots.ox.ac.uk/pascal/VOC/voc2012/index.html#data) format.
* The fine-tuning steps are the same as step 2 of training; just edit the initial checkpoint and data paths in the detector config.
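
mmdetection configs are plain Python files, so the edits mentioned above are ordinary assignments. A hedged sketch (the field names and paths below are illustrative and depend on the mmdetection version pinned in `external/mmdetection`):

```python
# Illustrative fragment of an mmdetection-style config; adjust to match
# the actual fields in mobilenetv2_tiny_ssd300_wider_face.py.
data_root = 'data/my_face_dataset/'   # VOC-format fine-tuning data
load_from = 'snapshots/<CHECKPOINT>'  # initial checkpoint to start from
resume_from = None
```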