-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Recognize both digit & alphabet when fine tune digits #11
Comments
Now you can also use the blacklist config to avoid alphabet.
|
Dear Shreeshrii,
|
will ignore a-z and A-Z only. Punctuation, digits and any other characters in unicharset will be recognized.
Only the digits 0-9 will be recognized.
was only trained on a limited characterset of 0-9
I will have to check, but it was trained for both Alphabet and digits in OCRB font for recognition of ID. Please note that all these are |
Dear Shreeshrii, |
Dear Shreeshrii, |
Try suggestions in
https://groups.google.com/forum/?fromgroups#!searchin/tesseract-ocr/lorenzo%7Csort:date/tesseract-ocr/2uBsbG9XHzI/1Y9QoA37BQAJ
…On Fri, Aug 30, 2019 at 8:53 AM duonghb53 ***@***.***> wrote:
Dear Shreeshrii,
This is image I use and result:
Test.zip
<https://github.com/Shreeshrii/tessdata_shreetest/files/3558152/Test.zip>
Please view it.
Regards,
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#11?email_source=notifications&email_token=ABG37I3KVUS2YU5HY4EBTGTQHCHEXA5CNFSM4IPNQ6Y2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD5QNUBY#issuecomment-526440967>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ABG37I4X3BGLW7UQZUWZVQTQHCHEXANCNFSM4IPNQ6YQ>
.
--
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
|
@nguyenq Quan Is it possible to use digits config file with VietOCR? |
According to its readme file:
|
@nguyenq I try config follow your guide but it still recognize to alphabet. |
Dear @Shreeshrii , |
If your images are skewed, either deskew before feeding to tesseract or train on italic font matching your images. |
Dear Shreeshrii,
I try your guide to fine tune from data_best/eng.datatrained add number font Ocrb but when I get ocrb.datatrained to recognize it still get alphabet & digit.
I don't know how to do same you create digit.datatrained. It only get digit.
Please help me.
Thank you.
The text was updated successfully, but these errors were encountered: