This repository provides a Python implementation of an efficient multi-task learning (MTL) architecture for dense prediction tasks. The repository builds on Swin-Transformer and reuses some modules from Multi-Task-Learning-PyTorch.
We use the same data (PASCAL-Context and NYUD-v2) as ATRC. You can download the data from: PASCALContext.tar.gz, NYUDv2.tar.gz
Then extract the datasets:

```
tar xfvz NYUDv2.tar.gz
tar xfvz PASCALContext.tar.gz
```
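If you are unsure about the `tar` flags used above, they can be sanity-checked on a throwaway archive first; this snippet is purely illustrative and uses temporary names (`demo_src`, `demo.tar.gz`) that are not part of the repo:

```shell
# Build a tiny gzipped tarball, then extract it with the same flags as above
mkdir -p demo_src && echo "hello" > demo_src/file.txt
tar cfz demo.tar.gz demo_src
rm -rf demo_src
tar xfvz demo.tar.gz   # x=extract, f=file, v=verbose, z=gzip
cat demo_src/file.txt
```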
- Clone the repo:

  ```
  git clone https://github.com/scale-lab/E-MTL.git
  cd E-MTL
  ```

- Create a virtual environment with Python 3.9 or later:

  ```
  python -m venv env
  source env/bin/activate
  ```

- Install the requirements:

  ```
  pip install -r requirements.txt
  ```
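Since the repo requires Python 3.9 or later, it can help to fail early if the active interpreter is too old. This one-liner is an illustrative check, not part of the repo (it assumes `python3` is on your `PATH`):

```shell
# Illustrative: abort with an AssertionError if the interpreter is older than 3.9
python3 -c 'import sys; assert sys.version_info >= (3, 9), sys.version'
echo "Python 3.9+ detected"
```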
To train a model, run:

```
python -m torch.distributed.launch \
  --nproc_per_node 1 \
  --master_port 12345 \
  main.py --cfg {CONFIG.yaml} \
  --pascal {PASCAL_DATA_DIR} \
  --tasks {TASK_NAMES} \
  --batch-size 64 \
  --ckpt-freq 10 \
  --epoch 200 \
  [--resume-backbone {SWIN_PRETRAINED.pth}]
```
where:

- `CONFIG.yaml` is the path of the desired model configuration; check `model.args` for an example.
- `PASCAL_DATA_DIR` is the path of the downloaded PASCAL dataset.
- `TASK_NAMES` is the name of the desired tasks. Available tasks for the PASCAL dataset are `semseg`, `normals`, `sal`, and `human_parts`. For example, to create a model that performs semantic segmentation and saliency distillation, `TASK_NAMES` should be set to `semseg,sal`.
- `SWIN_PRETRAINED.pth` is the path to the pretrained Swin Transformer backbone. Pretrained Swin Transformer backbones can be downloaded from their repo. For example, to download pretrained Swin Tiny, use:

  ```
  wget https://github.com/SwinTransformer/storage/releases/download/v1.0.0/swin_tiny_patch4_window7_224.pth
  ```
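Because `--tasks` takes a comma-separated string, a typo in a task name is easy to make. The sketch below shows one way to validate such a string before launching; the `parse_tasks` helper is hypothetical and not part of the repo, only the four task names come from the documentation above:

```python
# Hypothetical helper: validate a comma-separated --tasks string for the
# PASCAL dataset. The valid task names are taken from the README; the
# function itself is illustrative, not repo code.
VALID_PASCAL_TASKS = {"semseg", "normals", "sal", "human_parts"}

def parse_tasks(task_names: str) -> list[str]:
    """Split e.g. 'semseg,sal' into ['semseg', 'sal'], rejecting unknown names."""
    tasks = [t.strip() for t in task_names.split(",") if t.strip()]
    unknown = set(tasks) - VALID_PASCAL_TASKS
    if unknown:
        raise ValueError(f"Unknown task(s): {sorted(unknown)}")
    return tasks

print(parse_tasks("semseg,sal"))  # ['semseg', 'sal']
```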
To evaluate a pretrained model, run:

```
python -m torch.distributed.launch \
  --nproc_per_node 1 \
  --master_port 12345 \
  main.py --cfg {CONFIG.yaml} \
  --pascal {PASCAL_DATA_DIR} \
  --tasks {TASK_NAMES} \
  --batch-size 64 \
  --resume {PRETRAINED.pth} \
  --eval
```
where:

- `CONFIG.yaml` is the path of the desired model configuration; check `model.args` for an example.
- `PASCAL_DATA_DIR` is the path of the downloaded PASCAL dataset.
- `TASK_NAMES` is the name of the desired tasks. Available tasks for the PASCAL dataset are `semseg`, `normals`, `sal`, and `human_parts`. For example, to create a model that performs semantic segmentation and saliency distillation, `TASK_NAMES` should be set to `semseg,sal`.
- `PRETRAINED.pth` is the path to the pretrained E-MTL model.
Since the release commit is squashed, the GitHub contributors tab doesn't reflect the authors' contributions. The following authors contributed equally to this codebase:
MIT License. See the LICENSE file.