You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jan 6, 2022. It is now read-only.
Copy file name to clipboardexpand all lines: README.md
+19-5
Original file line number
Diff line number
Diff line change
@@ -1,9 +1,9 @@
1
1
# Konglog: Write your favorite Konglish correctly with Prolog
2
2
3
-
Final Project for CS579 Computational Linguistics, Fall 2021
3
+
Final Project for CS579 Computational Linguistics at KAIST, Fall 2021, by Janggun Lee.
4
4
5
5
## Introduction:
6
-
<!-- TODO -->
6
+
Konglog is an implementation of the [Korean Loanword Orthography](https://kornorms.korean.go.kr/m/m_regltn.do#a) in Prolog and Python. It aims to faifully encode the rules of the orthography, and provide a simple API for all to use.
7
7
8
8
## Dependencies:
9
9
@@ -20,7 +20,7 @@ Final Project for CS579 Computational Linguistics, Fall 2021
* Download the `cmudict` corpus. Run the following simple script. This will download only the nessecary data.
23
+
* Download the `cmudict` corpus, and run the following Python script. This will download only the nessecary data.
24
24
25
25
```python
26
26
import nltk
@@ -29,7 +29,21 @@ nltk.download('cmudict')
29
29
* If the download doesn't start with `[SSL:CERTIFICATE_VERIFY_FAILED]`, check [this comment](https://github.com/gunthercox/ChatterBot/issues/930#issuecomment-322111087) for a solution.
30
30
31
31
## API:
32
-
<!-- TODO -->
32
+
Konglog provides a simple function, `eng_to_kong` that takes in a english word as input, and returns the Konglish translation as output. A very simple example is shown below.
33
+
34
+
```python
35
+
import konglog
36
+
37
+
defmain():
38
+
word ="shrimp"
39
+
40
+
print(konglog.eng_to_kong(word))
41
+
```
33
42
34
43
## Structure:
35
-
<!-- TODO -->
44
+
Konglog has three main steps in its architecture, depicted in the picture below.
45
+

46
+
47
+
1. First, the input word is translated into phonems by looking up the CMU pronounciation dictionary, provided by NLTK.
48
+
2. Second, the phonems are trasnalted into jaem and moems.
49
+
3. Finally, the jaem and moems are combined into one. The tools for this combination are in [unicode.py](unicode.py), and is taken from [`hangulutils`](https://github.com/kaniblu/hangul-utils)
0 commit comments