Commit 04bbd27

Fix documentation errors and spelling
Corrects function usage example for text_dublication_detector.py. Corrects minor spelling errors in project README files.
1 parent 043faf8 commit 04bbd27

5 files changed: +62 -58 lines changed

README.md

Lines changed: 2 additions & 2 deletions
@@ -2,7 +2,7 @@ SinaTools
 ======================
 Open Source Toolkit for Arabic NLP and NLU developed by [SinaLab](http://sina.birzeit.edu/) at Birzeit University. SinaTools is available through Python APIs, command lines, colabs, and online demos.
 
-See the full list of [Available Packages](https://sina.birzeit.edu/sinatools/), which include: (1) [Morphology Tagging](https://sina.birzeit.edu/sinatools/index.html#morph), (2) [Named Entity Recognition (NER)](https://sina.birzeit.edu/sinatools/index.html#ner), (3) [Word Sense Disambiguation (WSD)](https://sina.birzeit.edu/sinatools/index.html#wsd), (4) [Semantic Relatedness](https://sina.birzeit.edu/sinatools/index.html#sr), (5) [Synonymy Extraction and Evaluation](https://sina.birzeit.edu/sinatools/index.html#se), (6) [Relation Extraction](https://sina.birzeit.edu/sinatools/index.html#re), (7) [Utilities](https://sina.birzeit.edu/sinatools/index.html#u) (diacritic-based word matching, Jaccard similarly, parser, tokenizers, corpora processing, transliteration, etc).
+See the full list of [Available Packages](https://sina.birzeit.edu/sinatools/), which include: (1) [Morphology Tagging](https://sina.birzeit.edu/sinatools/index.html#morph), (2) [Named Entity Recognition (NER)](https://sina.birzeit.edu/sinatools/index.html#ner), (3) [Word Sense Disambiguation (WSD)](https://sina.birzeit.edu/sinatools/index.html#wsd), (4) [Semantic Relatedness](https://sina.birzeit.edu/sinatools/index.html#sr), (5) [Synonymy Extraction and Evaluation](https://sina.birzeit.edu/sinatools/index.html#se), (6) [Relation Extraction](https://sina.birzeit.edu/sinatools/index.html#re), (7) [Utilities](https://sina.birzeit.edu/sinatools/index.html#u) (diacritic-based word matching, Jaccard similarity, parser, tokenizers, corpora processing, transliteration, etc).
 
 See [Demo Pages](https://sina.birzeit.edu/sinatools/).
 
@@ -24,7 +24,7 @@ Some modules in SinaTools require some data files and fine-tuned models to be do
 
 Documentation
 --------
-For information, please refer to the [main page](https://sina.birzeit.edu/sinatools) or the [online domuementation](https://sina.birzeit.edu/sinatools/documentation).
+For information, please refer to the [main page](https://sina.birzeit.edu/sinatools) or the [online documentation](https://sina.birzeit.edu/sinatools/documentation).
 
 Citation
 -------

README.rst

Lines changed: 2 additions & 2 deletions
@@ -2,7 +2,7 @@ SinaTools
 ======================
 Open Source Toolkit for Arabic NLP and NLU developed by [SinaLab](http://sina.birzeit.edu/) at Birzeit University. SinaTools is available through Python APIs, command lines, colabs, and online demos.
 
-See the full list of [Available Packages](https://sina.birzeit.edu/sinatools/), which include: (1) [Morphology Tagging](https://sina.birzeit.edu/sinatools/index.html#morph), (2) [Named Entity Recognition (NER)](https://sina.birzeit.edu/sinatools/index.html#ner), (3) [Word Sense Disambiguation (WSD)](https://sina.birzeit.edu/sinatools/index.html#wsd), (4) [Semantic Relatedness](https://sina.birzeit.edu/sinatools/index.html#sr), (5) [Synonymy Extraction and Evaluation](https://sina.birzeit.edu/sinatools/index.html#se), (6) [Relation Extraction](https://sina.birzeit.edu/sinatools/index.html#re), (7) [Utilities](https://sina.birzeit.edu/sinatools/index.html#u) (diacritic-based word matching, Jaccard similarly, parser, tokenizers, corpora processing, transliteration, etc).
+See the full list of [Available Packages](https://sina.birzeit.edu/sinatools/), which include: (1) [Morphology Tagging](https://sina.birzeit.edu/sinatools/index.html#morph), (2) [Named Entity Recognition (NER)](https://sina.birzeit.edu/sinatools/index.html#ner), (3) [Word Sense Disambiguation (WSD)](https://sina.birzeit.edu/sinatools/index.html#wsd), (4) [Semantic Relatedness](https://sina.birzeit.edu/sinatools/index.html#sr), (5) [Synonymy Extraction and Evaluation](https://sina.birzeit.edu/sinatools/index.html#se), (6) [Relation Extraction](https://sina.birzeit.edu/sinatools/index.html#re), (7) [Utilities](https://sina.birzeit.edu/sinatools/index.html#u) (diacritic-based word matching, Jaccard similarity, parser, tokenizers, corpora processing, transliteration, etc).
 
 See [Demo Pages](https://sina.birzeit.edu/sinatools/).
 
@@ -24,7 +24,7 @@ Some modules in SinaTools require some data files and fine-tuned models to be do
 
 Documentation
 --------
-For information, please refer to the [main page](https://sina.birzeit.edu/sinatools) or the [online domuementation](https://sina.birzeit.edu/sinatools/documentation).
+For information, please refer to the [main page](https://sina.birzeit.edu/sinatools) or the [online documentation](https://sina.birzeit.edu/sinatools/documentation).
 
 Citation
 -------

SinaTools.egg-info/PKG-INFO

Lines changed: 54 additions & 50 deletions
@@ -1,50 +1,54 @@
-Metadata-Version: 2.1
-Name: SinaTools
-Version: 0.1.36
-Summary: Open-source Python toolkit for Arabic Natural Understanding, allowing people to integrate it in their system workflow.
-Home-page: https://github.com/SinaLab/sinatools
-License: MIT license
-Description: SinaTools
-======================
-Open Source Toolkit for Arabic NLP and NLU developed by [SinaLab](http://sina.birzeit.edu/) at Birzeit University. SinaTools is available through Python APIs, command lines, colabs, and online demos.
-
-See the full list of [Available Packages](https://sina.birzeit.edu/sinatools/), which include: (1) [Morphology Tagging](https://sina.birzeit.edu/sinatools/index.html#morph), (2) [Named Entity Recognition (NER)](https://sina.birzeit.edu/sinatools/index.html#ner), (3) [Word Sense Disambiguation (WSD)](https://sina.birzeit.edu/sinatools/index.html#wsd), (4) [Semantic Relatedness](https://sina.birzeit.edu/sinatools/index.html#sr), (5) [Synonymy Extraction and Evaluation](https://sina.birzeit.edu/sinatools/index.html#se), (6) [Relation Extraction](https://sina.birzeit.edu/sinatools/index.html#re), (7) [Utilities](https://sina.birzeit.edu/sinatools/index.html#u) (diacritic-based word matching, Jaccard similarly, parser, tokenizers, corpora processing, transliteration, etc).
-
-See [Demo Pages](https://sina.birzeit.edu/sinatools/).
-
-See the [benchmarking](https://www.jarrar.info/publications/HJK24.pdf), which shows that SinaTools outperformed all related toolkits.
-
-Installation
---------
-To install SinaTools, ensure you are using Python version 3.10.8, then clone the [GitHub](git://github.com/SinaLab/SinaTools) repository.
-
-Alternatively, you can execute the following command:
-
-```bash
-pip install sinatools
-```
-
-Installing Models and Data Files
---------
-Some modules in SinaTools require some data files and fine-tuned models to be downloaded. To download these models, please consult the [DataDownload](https://sina.birzeit.edu/sinatools/documentation/cli_tools/DataDownload/DataDownload.html).
-
-Documentation
---------
-For information, please refer to the [main page](https://sina.birzeit.edu/sinatools) or the [online domuementation](https://sina.birzeit.edu/sinatools/documentation).
-
-Citation
--------
-Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia: [SinaTools: Open Source Toolkit for Arabic Natural Language Understanding](http://www.jarrar.info/publications/HJK24.pdf). In Proceedings of the 2024 AI in Computational Linguistics (ACLing 2024), Procedia Computer Science, Dubai. ELSEVIER.
-
-License
---------
-SinaTools is available under the MIT License. See the [LICENSE](https://github.com/SinaLab/sinatools/blob/main/LICENSE) file for more information.
-
-Reporting Issues
---------
-To report any issues or bugs, please contact us at "sina.institute.bzu@gmail.com" or visit [SinaTools Issues](https://github.com/SinaLab/sinatools/issues).
-
-
-Keywords: sinatools
-Platform: UNKNOWN
-Description-Content-Type: text/markdown
+Metadata-Version: 2.1
+Name: SinaTools
+Version: 0.1.36
+Summary: Open-source Python toolkit for Arabic Natural Understanding, allowing people to integrate it in their system workflow.
+Home-page: https://github.com/SinaLab/sinatools
+License: MIT license
+Keywords: sinatools
+Platform: UNKNOWN
+Description-Content-Type: text/markdown
+License-File: LICENSE
+License-File: AUTHORS.rst
+
+SinaTools
+======================
+Open Source Toolkit for Arabic NLP and NLU developed by [SinaLab](http://sina.birzeit.edu/) at Birzeit University. SinaTools is available through Python APIs, command lines, colabs, and online demos.
+
+See the full list of [Available Packages](https://sina.birzeit.edu/sinatools/), which include: (1) [Morphology Tagging](https://sina.birzeit.edu/sinatools/index.html#morph), (2) [Named Entity Recognition (NER)](https://sina.birzeit.edu/sinatools/index.html#ner), (3) [Word Sense Disambiguation (WSD)](https://sina.birzeit.edu/sinatools/index.html#wsd), (4) [Semantic Relatedness](https://sina.birzeit.edu/sinatools/index.html#sr), (5) [Synonymy Extraction and Evaluation](https://sina.birzeit.edu/sinatools/index.html#se), (6) [Relation Extraction](https://sina.birzeit.edu/sinatools/index.html#re), (7) [Utilities](https://sina.birzeit.edu/sinatools/index.html#u) (diacritic-based word matching, Jaccard similarity, parser, tokenizers, corpora processing, transliteration, etc).
+
+See [Demo Pages](https://sina.birzeit.edu/sinatools/).
+
+See the [benchmarking](https://www.jarrar.info/publications/HJK24.pdf), which shows that SinaTools outperformed all related toolkits.
+
+Installation
+--------
+To install SinaTools, ensure you are using Python version 3.10.8, then clone the [GitHub](git://github.com/SinaLab/SinaTools) repository.
+
+Alternatively, you can execute the following command:
+
+```bash
+pip install sinatools
+```
+
+Installing Models and Data Files
+--------
+Some modules in SinaTools require some data files and fine-tuned models to be downloaded. To download these models, please consult the [DataDownload](https://sina.birzeit.edu/sinatools/documentation/cli_tools/DataDownload/DataDownload.html).
+
+Documentation
+--------
+For information, please refer to the [main page](https://sina.birzeit.edu/sinatools) or the [online documentation](https://sina.birzeit.edu/sinatools/documentation).
+
+Citation
+-------
+Tymaa Hammouda, Mustafa Jarrar, Mohammed Khalilia: [SinaTools: Open Source Toolkit for Arabic Natural Language Understanding](http://www.jarrar.info/publications/HJK24.pdf). In Proceedings of the 2024 AI in Computational Linguistics (ACLing 2024), Procedia Computer Science, Dubai. ELSEVIER.
+
+License
+--------
+SinaTools is available under the MIT License. See the [LICENSE](https://github.com/SinaLab/sinatools/blob/main/LICENSE) file for more information.
+
+Reporting Issues
+--------
+To report any issues or bugs, please contact us at "sina.institute.bzu@gmail.com" or visit [SinaTools Issues](https://github.com/SinaLab/sinatools/issues).
+
+
+
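
Note on the PKG-INFO hunk above: besides the spelling fixes, the regenerated file moves the long description out of the `Description:` header field and into the message body after the headers, and adds two `License-File` entries. Since PKG-INFO follows an email-style header layout, a reader can check both layouts with the standard library. The snippet below is only a sketch on an abridged copy of the new metadata (the body text is shortened for illustration), not part of the commit.

```python
from email import message_from_string

# Abridged copy of the regenerated PKG-INFO; the body is shortened for illustration.
PKG_INFO = """\
Metadata-Version: 2.1
Name: SinaTools
Version: 0.1.36
License: MIT license
Description-Content-Type: text/markdown
License-File: LICENSE
License-File: AUTHORS.rst

SinaTools
======================
Open Source Toolkit for Arabic NLP and NLU.
"""

msg = message_from_string(PKG_INFO)

name = msg["Name"]                            # single-valued header
license_files = msg.get_all("License-File")   # repeated header, returned as a list
description = msg.get_payload()               # everything after the first blank line

print(name, license_files)
print(description.splitlines()[0])
```

With the old layout, the same text would instead be read from `msg["Description"]`.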

build/lib/sinatools/utils/text_dublication_detector.py

Lines changed: 2 additions & 2 deletions
@@ -34,7 +34,7 @@ def removal(csv_file, columnName, finalFileName, deletedFileName, similarityThre
 .. code-block:: python
 
 from sinatools.utils.text_dublication_detector import removal
-removal("/path/to/csv/file1", sentences, "/path/to/csv/file2", 0.8)
+removal("/path/to/csv/file1", "sentences", "/path/to/final/file", "/path/to/deleted/file", 0.8)
 """
 
 # Read CSV file
@@ -129,4 +129,4 @@ def textToVector(text):
 # deletedFileName = "Arabic-Oct7-Feb12DeletedSent.csv"
 
 # result = removal(csvFile, columnName, finalFileName, deletedFileName, similarityThreshold)
-# print(result)
+# print(result)

sinatools/utils/text_dublication_detector.py

Lines changed: 2 additions & 2 deletions
@@ -34,7 +34,7 @@ def removal(csv_file, columnName, finalFileName, deletedFileName, similarityThre
 .. code-block:: python
 
 from sinatools.utils.text_dublication_detector import removal
-removal("/path/to/csv/file1", sentences, "/path/to/csv/file2", 0.8)
+removal("/path/to/csv/file1", "sentences", "/path/to/final/file", "/path/to/deleted/file", 0.8)
 """
 
 # Read CSV file
@@ -129,4 +129,4 @@ def textToVector(text):
 # deletedFileName = "Arabic-Oct7-Feb12DeletedSent.csv"
 
 # result = removal(csvFile, columnName, finalFileName, deletedFileName, similarityThreshold)
-# print(result)
+# print(result)
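
Note on the docstring fix above: the old example passed `sentences` as a bare name (a `NameError` unless such a variable exists) and gave only one output path, while the signature in the hunk header takes a column name plus separate final and deleted file names. The sketch below shows why each argument is needed. It is a hypothetical stand-in, not the SinaTools implementation: `removal_sketch`, `text_to_vector`, and `cosine` are illustrative names, and bag-of-words cosine similarity is an assumed technique.

```python
import csv
import math
from collections import Counter


def text_to_vector(text):
    """Bag-of-words term counts (a stand-in; the real textToVector may differ)."""
    return Counter(text.split())


def cosine(a, b):
    """Cosine similarity between two Counter vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def removal_sketch(csv_file, column_name, final_file, deleted_file, threshold):
    """Hypothetical sketch: write kept rows and near-duplicate rows to two files."""
    with open(csv_file, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        fieldnames = reader.fieldnames
        rows = list(reader)

    kept, deleted, kept_vectors = [], [], []
    for row in rows:
        vec = text_to_vector(row[column_name])  # column_name is a string key
        if any(cosine(vec, other) >= threshold for other in kept_vectors):
            deleted.append(row)  # too similar to a row that was already kept
        else:
            kept.append(row)
            kept_vectors.append(vec)

    # One output per destination: the deduplicated rows and the removed rows.
    for path, subset in ((final_file, kept), (deleted_file, deleted)):
        with open(path, "w", newline="", encoding="utf-8") as f:
            writer = csv.DictWriter(f, fieldnames=fieldnames)
            writer.writeheader()
            writer.writerows(subset)
    return len(kept), len(deleted)
```

A call then mirrors the corrected docstring: `removal_sketch("/path/to/csv/file1", "sentences", "/path/to/final/file", "/path/to/deleted/file", 0.8)`.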
