
Still doesn't find 'print' #2

@rmast

For reproduction steps, see the workaround for #1.

The text 'print' at the top right of the image still isn't found by this script.

python hocrmod.py -l nld+eng+lat -f ~/175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.jpg
missing base hocr file: /home/rmast/175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.hocr, running Tesseract
sort through hocr paragraphs..................................................................!
block out recognized text..................................................................!
look for missed text blocks......!
work through contours.......................................................!
hocr line(s) added: 4
(hocrmod4) rmast@rmast-virtual-machine:~/hocrmod$ grep -i print ../175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.hocr.bak ../175789293-f39ddfdb-6f3e-4598-8d16-80a1f4a88b36.hocr
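The grep above matches raw lines, markup included. As a rough alternative, the sketch below looks only at the recognized words and returns each match together with its `title` attribute (which in Tesseract's hOCR output carries the bounding box), so it would also show where on the page a hit sits. It assumes words are wrapped in `ocrx_word` spans and that the file is well-formed XHTML; `HocrWordCollector` and `find_word` are hypothetical helper names and are not part of hocrmod.

```python
from html.parser import HTMLParser


class HocrWordCollector(HTMLParser):
    """Collect (text, title) pairs for every ocrx_word span in an hOCR document."""

    def __init__(self):
        super().__init__()
        self.words = []      # finished (text, title) pairs
        self._title = None   # title attribute of the word span currently open
        self._depth = 0      # nesting depth inside that span
        self._chunks = []    # text fragments seen inside that span

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if self._title is not None:
            # Nested markup inside a word, e.g. <strong> or <em>.
            self._depth += 1
        elif "ocrx_word" in (attrs.get("class") or "").split():
            self._title = attrs.get("title") or ""
            self._depth = 1
            self._chunks = []

    def handle_endtag(self, tag):
        if self._title is None:
            return
        self._depth -= 1
        if self._depth == 0:
            self.words.append(("".join(self._chunks).strip(), self._title))
            self._title = None

    def handle_data(self, data):
        if self._title is not None:
            self._chunks.append(data)


def find_word(hocr_text, needle):
    """Return (text, title) for each recognized word containing needle, ignoring case."""
    collector = HocrWordCollector()
    collector.feed(hocr_text)
    collector.close()
    needle = needle.lower()
    return [(text, title) for text, title in collector.words
            if needle in text.lower()]
```

To use it, read the `.hocr` and `.hocr.bak` files into strings and call `find_word(contents, "print")` on each; an empty list means no recognized word contains the search term.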
