Source of language datasets #37

DonaldTsang · 2019-11-21T07:33:54Z

Where is the source text dataset for the Ngrams of those 52 languages? Would like to see if it is different from wooorm/franc#78 usage of UDHR, and if it is more accurate than them.

mahnunchik · 2020-05-24T09:48:39Z

Ping @FGRibreau

FGRibreau · 2020-05-24T09:57:44Z

As said in the README, the whole database came from https://pear.php.net/package/Text_LanguageDetect :)

FGRibreau closed this as completed May 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Source of language datasets #37

Source of language datasets #37

DonaldTsang commented Nov 21, 2019 •

edited

Loading

mahnunchik commented May 24, 2020

FGRibreau commented May 24, 2020 •

edited

Loading

Source of language datasets #37

Source of language datasets #37

Comments

DonaldTsang commented Nov 21, 2019 • edited Loading

mahnunchik commented May 24, 2020

FGRibreau commented May 24, 2020 • edited Loading

DonaldTsang commented Nov 21, 2019 •

edited

Loading

FGRibreau commented May 24, 2020 •

edited

Loading