phylypo/khmer-language-model-ulmfit

khmer-language-model-ulmfit

This repository contains a Python notebook for a Khmer language model built with ULMFiT. We use Khmer Wikipedia as our training data, along with 1000 Khmer news articles from the segmentation-crf-khmer repository.

We save the Wiki data files and the segmented output file. The pre-trained model is not in this repository due to its size, but the notebook contains code to download it from Google Drive.

See the detailed write-up here:

https://medium.com/@phylypo/khmer-language-model-using-ulmfit-b0f8ca4e15be

We created a web interface where you can test the model's next-word prediction. See:

http://ml.tovnah.com/khmer-ulmfit/
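To make the idea of next-word prediction over segmented Khmer text concrete, here is a minimal sketch. It is not the ULMFiT model from the notebook: it swaps in a simple bigram frequency count so that it can run without any trained weights. The function names and the tiny three-sentence corpus are hypothetical and exist only for illustration. The input is assumed to be already segmented into words, as the segmented output file in this repository is.

```python
from collections import Counter, defaultdict


def build_bigram_counts(sentences):
    """Count how often each word follows another in pre-segmented sentences."""
    counts = defaultdict(Counter)
    for tokens in sentences:
        # Pair every token with the token that comes right after it.
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts


def predict_next(counts, word, k=3):
    """Return up to k of the most frequent followers of `word`."""
    followers = counts.get(word, Counter())
    return [w for w, _ in followers.most_common(k)]


# Hypothetical toy corpus, already segmented into words (illustration only).
corpus = [
    ["ខ្ញុំ", "ទៅ", "សាលា"],
    ["ខ្ញុំ", "ទៅ", "ផ្សារ"],
    ["គាត់", "ទៅ", "សាលា"],
]

counts = build_bigram_counts(corpus)
print(predict_next(counts, "ទៅ", k=2))
```

A neural language model such as ULMFiT conditions on the whole preceding context rather than on a single previous word, which is why the notebook's model gives far better suggestions than this count-based sketch would.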
