-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
For reproduction steps see the workaround for #1
On the right top is the text 'print' that still isn't found by this script.
python hocrmod.py -l nld+eng+lat -f ~/175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.jpg
missing base hocr file: /home/rmast/175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.hocr, running Tesseract
sort through hocr paragraphs..................................................................!
block out recognized text..................................................................!
look for missed text blocks......!
work through contours.......................................................!
hocr line(s) added: 4
(hocrmod4) rmast@rmast-virtual-machine:~/hocrmod$ grep -i print ../175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.hocr.bak ../175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.hocr
Metadata
Metadata
Assignees
Labels
No labels