Skip to content

OCR parser update #379

Description

@tenzin3

Description

Currently, the OCR parser stores annotation information in the metadata.json file. However, this annotation data should instead be returned alongside the parsed pecha to the OpenPecha backend, where it can be properly stored.

Expected Output

The OpenPecha backend should be capable of storing both the pecha and all relevant annotation information provided by the OCR parser.

Implementation Steps

  • Create BDRC annotation models.
  • Modify the HOCR parser to return annotation data.
  • Remove the logic that stores annotation information in metadata.json.
  • Modify the Google Vision parser to return annotation data.
  • Ensure annotation data is not stored in metadata.json.

Reviewer

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions