Explaining Recurrent Attention Models

(The code has been adapted from https://github.com/kevinzakka/recurrent-visual-attention)

A Recurrent Attention Model (RAM) actively selects and observes a sequence of patches in an image to make a prediction. Unlike a deep convolutional network, a hard-attention model makes it explicit which regions of the image contributed to the prediction. To infer the glimpses and explain the model qualitatively, we build a Variational Autoencoder (VAE) on the final hidden state of the recurrent units and visualize the reconstruction of the image after each glimpse is processed. We also show quantitatively that the model encodes latent statistics of the entire image from a sequence of patches by evaluating the expected information gain (EIG) over the classification output after each glimpse. These analyses are demonstrated on the MNIST and Cluttered MNIST datasets. Finally, we study whether these statistics improve when the reward of the underlying reinforcement learning algorithm, which decides where to look next, is shaped. On MNIST, the new reward structure outperforms the original one from the paper in terms of information gain, but no improvement was observed in terms of expected information gain.
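As a concrete illustration of the quantitative analysis, the sketch below shows one common way to measure information gain over the classification output after each glimpse, namely the reduction in entropy of the predicted class distribution. This is a minimal, self-contained example rather than the repository's evaluation code; the entropy-reduction definition and the class_probs_per_glimpse input format are assumptions.

import numpy as np

def entropy(p, eps=1e-12):
    # Shannon entropy (in nats) of categorical distributions along the last axis.
    p = np.clip(p, eps, 1.0)
    return -np.sum(p * np.log(p), axis=-1)

def information_gain_per_glimpse(class_probs_per_glimpse):
    # class_probs_per_glimpse: (num_glimpses, num_classes) softmax outputs after
    # glimpses 1..T (assumed input format). Returns H(p_t) - H(p_{t+1}) per step,
    # i.e. how much each additional glimpse sharpens the prediction.
    h = entropy(np.asarray(class_probs_per_glimpse))
    return h[:-1] - h[1:]

# Toy example: a 10-class prediction that sharpens over three glimpses.
probs = np.array([
    np.full(10, 0.1),              # uniform after glimpse 1
    [0.3, 0.3] + [0.05] * 8,       # two candidates left after glimpse 2
    [0.9, 0.02] + [0.01] * 8,      # confident after glimpse 3
])
print(information_gain_per_glimpse(probs))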

Model Description

In the original RAM paper (Mnih et al., 2014), the attention problem is modeled as the sequential decision process of a goal-directed agent interacting with a visual environment. The agent is built around a recurrent neural network: at each time step, it processes the sensor data, integrates information over time, and chooses how to act and where to deploy its sensor at the next time step.
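To make this loop concrete, below is a minimal PyTorch sketch of a single recurrent attention step, structured after the description above. The module names and sizes (the glimpse encoder, GRU core, 8x8 patches, and so on) are illustrative assumptions and do not mirror the classes in this repository.

import torch
import torch.nn as nn

class RAMStep(nn.Module):
    # One time step of a recurrent attention agent (illustrative sketch only).
    def __init__(self, glimpse_dim=128, hidden_dim=256, num_classes=10):
        super().__init__()
        # Glimpse network: encodes the extracted patch together with its location.
        self.glimpse_net = nn.Sequential(nn.Linear(8 * 8 + 2, glimpse_dim), nn.ReLU())
        # Core RNN: integrates glimpse features over time into a hidden state.
        self.core = nn.GRUCell(glimpse_dim, hidden_dim)
        # Location network: proposes where to deploy the sensor next.
        self.locator = nn.Linear(hidden_dim, 2)
        # Action network: classifies from the current hidden state.
        self.classifier = nn.Linear(hidden_dim, num_classes)

    def forward(self, patch, loc, hidden):
        # patch: flattened 8x8 glimpse, loc: (x, y) in [-1, 1], hidden: previous state.
        g = self.glimpse_net(torch.cat([patch, loc], dim=-1))
        hidden = self.core(g, hidden)
        next_loc = torch.tanh(self.locator(hidden))   # where to look next
        class_logits = self.classifier(hidden)        # current belief about the label
        return hidden, next_loc, class_logits

# One step on a batch of 4 glimpses.
step = RAMStep()
hidden = torch.zeros(4, 256)
patch, loc = torch.rand(4, 64), torch.zeros(4, 2)
hidden, next_loc, logits = step(patch, loc, hidden)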

The data can be downloaded from https://drive.google.com/drive/folders/1D_u1vKUL87Ubhivv8GjmVr2TFRDqw0W9?usp=sharing

Network Description

(Network architecture diagrams; see the images in the repository.)

Usage

Before running the code, create the following folders in the same directory as the code: ckpt, data, logs, models, plots, report, and tests. The easiest way to start training your RAM variant is then to edit the parameters in arguments.py and run:

python main.py

To resume training, run:

python main.py --resume=True

Finally, to test a checkpoint of your model that has achieved the best validation accuracy, run the following command:

python main.py --is_train=False
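The --resume and --is_train flags used above are defined in arguments.py. Below is a rough sketch of how such a file is typically laid out with argparse. Apart from resume and is_train, which appear in the commands above, the parameter names and defaults are hypothetical; check arguments.py for the actual options.

import argparse

def str2bool(v):
    # Parse 'True'/'False'-style values such as --resume=True.
    return str(v).lower() in ("yes", "true", "t", "1")

parser = argparse.ArgumentParser(description="RAM training options (illustrative sketch)")
# Flags referenced in the commands above.
parser.add_argument("--is_train", type=str2bool, default=True,
                    help="train the model (True) or test the best checkpoint (False)")
parser.add_argument("--resume", type=str2bool, default=False,
                    help="resume training from the latest checkpoint in ckpt/")
# Hypothetical model and training knobs one would expect to tune here (assumed names).
parser.add_argument("--num_glimpses", type=int, default=6, help="glimpses taken per image")
parser.add_argument("--patch_size", type=int, default=8, help="side length of each extracted patch")
parser.add_argument("--batch_size", type=int, default=128, help="mini-batch size")

config = parser.parse_args()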

References

Mnih, V., Heess, N., Graves, A., and Kavukcuoglu, K. Recurrent Models of Visual Attention. Advances in Neural Information Processing Systems (NIPS) 2014. https://arxiv.org/abs/1406.6247

Zakka, K. recurrent-visual-attention (PyTorch implementation of RAM). https://github.com/kevinzakka/recurrent-visual-attention
