It is a collection of resources and technologies for non-scheduled and endangered Indian languages and, if available, the link to access / download these resources and tools.
NOTE: If you want to contribute some resource / technology to this list, please fill up this form. We will add the contribution here after verifying it.
Magahi Text Corpus - Cick here to access the corpus
Magahi OCR - Available through the author (riteshkrjnu(at)gmail)
A phonemically transcribed lexicon of approximately 16,000 Garo words - Click here to access the lexicon
Part-of-speech annotated corpus of Awadhi - Click here to access the corpus
Online searchable dictionary - Click here to access the dictionary
English-Bhojpuri Parallel Corpus - Contact author (shashwatup9k(at)gmail)
Sankskrit-Bhojpuri Parallel Corpus - Contact author (shagunsinha5(at)gmail)
Hindi-Bhojpuri Machine Translation System - Contact author (rgmishrajuly16(at)gmail or shashwatup9k(at)gmail)
English-Bhojpuri Machine Translation System - Contact author (shashwatup9k(at)gmail)