Bridging the Gap to Real-World Object-Centric Learning

This is the UNOFFICIAL PyTorch implementations of papers Bridging the Gap to Real-World Object-Centric Learning [1] published in ICLR 2023 and Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames [2] published in ICML 2023.

This repository extends the original DINOSAUR model using Invariant Slot Attention and DINOv2 backbone.

Training

After installing COCO dataset to /path/to/coco and Pascal VOC to /path/to/pascal_voc12, you can train the model:

torchrun --master_port=12345 --nproc_per_node=#GPUs \
train.py \
--dataset {coco, pascal_voc12} \
--root /path/to/{coco, pascal_voc12} \
--model_save_path /path/to/save \
--encoder "dinov2-vitb-14" \
--ISA

Results

COCO

This is the expected object discovery performance of the models using the MLP decoder in COCO

Model	mBOⁱ	mBO^c	mIoU	FG-ARI
DINOSAUR (reported)	26.1	30.0	-	39.4
DINOSAUR (my reproduction)	28.0	31.7	20.4	40.2
DINOSAUR + DINOv2	30.3	34.3	22.3	44.9
DINOSAUR + DINOv2 + ISA	30.9	34.4	22.7	45.8

Pascal VOC

This is the expected object discovery performance of the models using the MLP decoder in Pascal VOC

Model	mBOⁱ	mBO^c	mIoU	FG-ARI
DINOSAUR (reported)	39.3	40.8	-	24.6
DINOSAUR (my reproduction)	39.1	42.9	23.3	26.1
DINOSAUR + DINOv2 + ISA	37.3	40.7	22.4	24.3

Acknowledgments

This work is built on the official implementation of SOLV, "Self-supervised Object-Centric Learning for Videos" published in NeurIPS 2023.

References

[1] Seitzer, Maximilian, et al. "Bridging the gap to real-world object-centric learning.", ICLR 2023

[2] Biza, Ondrej, et al. "Invariant slot attention: Object discovery with slot-centric reference frames.", ICML 2023

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
datasets		datasets
figures		figures
models		models
LICENSE		LICENSE
README.md		README.md
read_args.py		read_args.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bridging the Gap to Real-World Object-Centric Learning

Training

Results

COCO

Pascal VOC

Acknowledgments

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Bridging the Gap to Real-World Object-Centric Learning

Training

Results

COCO

Pascal VOC

Acknowledgments

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages