Seq2Seq Transliteration with and without Attention

Name: Sai Mani Kumar Devathi Roll No: DA24M016

Project Overview

This repository contains two implementations of a character-level Seq2Seq transliteration system (Latin → Devanagari): one using a vanilla RNN encoder–decoder, and the other augmented with an attention mechanism. The goal is to compare performance, inspect predictions, and analyze the gains achieved by attention.

Links

WandB Report: View the full training & evaluation dashboard
GitHub Repository: https://github.com/saimanikumar-da24m016/da6401_assignment3

Repository Structure

|--- assignment_3_attention.ipynb          # Notebook: training + eval with attention
|--- assignment_3_vannila.ipynb            # Notebook: training + eval without attention
|--- model_attention.py                    # Seq2Seq model with attention
|--- model_vannila.py                      # Vanilla Seq2Seq model
|--- train_attention.py                    # Training script for attention model
|--- train_vannila.py                      # Training script for vanilla model
|--- res_attention_predictions/
|    |--- best_model_attention.pt          # Saved checkpoint
|    |--- test_predictions_attention.csv   # Test set predictions with attention
|    |--- visual_examples.csv              # Sample visual examples
|--- res_vannila_predictions/
|    |--- best_model_vanilla.pt            # Saved checkpoint
|    |--- test_predictions_vanilla.csv     # Test set predictions without attention
|--- vocab/
|    |--- best_model.pt                    # Best model for vocab building (optional)
|    |--- src_vocab.json                   # Source vocabulary
|    |--- tgt_vocab.json                   # Target vocabulary
|--- README.md                             # This file

Installation & Setup

Clone the repository:

git clone https://github.com/saimanikumar-da24m016/da6401_assignment3.git
cd da6401_assignment3

Create a Python environment and install dependencies:

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Quickstart

Train Attention Model:

python train_attention.py --config configs/attention.yaml

Train Vanilla Model:

python train_vannila.py --config configs/vanilla.yaml

Evaluate & Visualize:
- Open assignment_3_attention.ipynb or assignment_3_vannila.ipynb in Jupyter.
- Generate accuracy metrics, confusion matrices, and inspect example predictions.

Results & Predictions

The res_attention_predictions/ folder contains the best checkpoint and test predictions for the attention model.
The res_vannila_predictions/ folder contains the same for the vanilla model.
You can compare test_predictions_attention.csv vs. test_predictions_vanilla.csv to see where attention helps.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Seq2Seq Transliteration with and without Attention

Project Overview

Links

Repository Structure

Installation & Setup

Quickstart

Results & Predictions

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
res_attention_predictions		res_attention_predictions
res_vannila_predictions		res_vannila_predictions
vocab		vocab
.gitignore		.gitignore
README.md		README.md
assignment_3_attention.ipynb		assignment_3_attention.ipynb
assignment_3_vannila.ipynb		assignment_3_vannila.ipynb
model_attention.py		model_attention.py
model_vannila.py		model_vannila.py
train_attention.py		train_attention.py
train_vannila.py		train_vannila.py

saimanikumar67/Transliteration-DL

Folders and files

Latest commit

History

Repository files navigation

Seq2Seq Transliteration with and without Attention

Project Overview

Links

Repository Structure

Installation & Setup

Quickstart

Results & Predictions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages