Gabriel Moreira, Manuel Marques, Joao Costeira, Alexander G Hauptmann
Published at AISTATS 2025
Abstract:
Learning image representations that capture rich semantic relationships remains a significant challenge. Existing approaches either are contrastive, lacking robust theoretical guarantees, or struggle to effectively represent the partial orders inherent to structured visual-semantic data. In this paper, we introduce a nuclear norm-based loss function, grounded in the same information-theoretic principles that have proved effective in self-supervised learning. We present a theoretical characterization of this loss, demonstrating that, in addition to promoting class orthogonality, it encodes the spectral geometry of the data within a subspace lattice. This geometric representation allows us to associate logical propositions with subspaces, ensuring that our learned representations adhere to a predefined symbolic structure.
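As a rough illustration of the idea, the sketch below shows one plausible form of a nuclear norm-based objective: compressing each class into a low-dimensional subspace while keeping the batch as a whole high-rank. The sign convention, normalization, and function name are assumptions for illustration and may differ from the loss actually used in the paper.

```python
# Hedged sketch of a nuclear norm-based loss (illustrative, not the paper's exact formulation).
import torch

def nuclear_norm_loss(embeddings: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """embeddings: (N, d) L2-normalized features; labels: (N,) integer class ids."""
    # Nuclear norm of the full batch: large when embeddings span many directions.
    total = torch.linalg.matrix_norm(embeddings, ord="nuc")
    per_class = embeddings.new_zeros(())
    for c in labels.unique():
        block = embeddings[labels == c]  # rows belonging to one class
        # Small per-class nuclear norm => the class concentrates in a low-dimensional subspace.
        per_class = per_class + torch.linalg.matrix_norm(block, ord="nuc")
    # Minimizing per-class norms while maximizing the batch norm pushes
    # class subspaces toward mutual orthogonality.
    return per_class - total
```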
The ability to explicitly represent negations is a key feature of this subspace representation. The figure shows that large-scale systems such as CLIP, whose representations are not structured by design, fail to handle negations, among other queries, whereas once a subspace structure is imposed, a negation is naturally represented by the orthogonal complement.
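To make the orthogonal-complement view concrete, the sketch below scores a query against the negation of a concept by measuring its residual outside that concept's subspace. The basis matrix, function name, and scoring rule are assumptions for illustration; the paper's retrieval procedure may differ.

```python
# Hedged sketch: negation of a concept as the orthogonal complement of its subspace.
import torch

def negation_score(query: torch.Tensor, basis: torch.Tensor) -> torch.Tensor:
    """query: (d,) embedding; basis: (d, k) orthonormal basis of the concept subspace.

    Returns a value in [0, 1]; values near 1 mean the query lies almost entirely
    in the orthogonal complement, i.e. it matches "not concept".
    """
    proj = basis @ (basis.T @ query)   # projection onto the concept subspace
    residual = query - proj            # component in the orthogonal complement
    return residual.norm() / query.norm()
```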
To train on CelebA, run:

python ./train.py --config-name=celeb-a general.name="name_of_experiment"
