Replies: 2 comments
-
Hi @chmaz 👋,
Best regards, |
Beta Was this translation helpful? Give feedback.
0 replies
-
Thanks a lot for the quick answers, |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello!
I am new to docTR and am reading the documentation. Before looking more deeply in the repo, I have two questions to understand better if I could use it for my case:
(1) I work with A4 300 dpi pages, which corresponds to a resolution of 2480 x 3508 pixels (usually black and white scans). Would it be thinkable to train doctr detection and recognition models from scratch with this resolution? (for example the height of the training samples for recognition should internally probably be 64 pixels instead of 32 and so on).
(2) I have a second question, for a custom recognition training dataset, there is one image per word. If we aim to have a training set with several hundred thousands words, it makes a huge number of files inside one folder. Is it possible to have the images in a hierarchy of directories and the json containing relative paths instead of file names?
Sorry if I am asking wrong questions,
Thanks in advance,
Chris
Beta Was this translation helpful? Give feedback.
All reactions