Hi,
I am trying to get Tika working in order to extract text from my documents and improve my search results.
Despite my configuration appearing correct, I keep getting the following logs when importing/creating documents:
2026-05-02T21:15:44Z INF file won't be indexed line=github.com/owncloud/ocis/v2/services/search/pkg/search/service.go:505 path=. service=search
2026-05-02T21:15:44Z INF file exceeds content extraction size limit. skipping. Name=PERSONNEL ResourceID={"opaque_id":"12643457-0e3a-4526-a29e-06a626e313c8","space_id":"d702e443-d7e2-42f3-83a4-75a785391d51","storage_id":"6d1a80d8-c765-4581-9315-a50b843642fe"} line=github.com/owncloud/ocis/v2/services/search/pkg/content/tika.go:66 service=search
These logs occur for all file types — both light and heavy PDFs, with a lot of content or very little.
My Tika configuration is similar to the example retrieved from ocis-full.
Have you perhaps already encountered this issue?
Thank you very much!
Hi,
I am trying to get Tika working in order to extract text from my documents and improve my search results.
Despite my configuration appearing correct, I keep getting the following logs when importing/creating documents:
These logs occur for all file types — both light and heavy PDFs, with a lot of content or very little.
My Tika configuration is similar to the example retrieved from ocis-full.
Have you perhaps already encountered this issue?
Thank you very much!