diff --git a/pages/docs/features/ocr.mdx b/pages/docs/features/ocr.mdx
index bbb01c1df..82449e7e8 100644
--- a/pages/docs/features/ocr.mdx
+++ b/pages/docs/features/ocr.mdx
@@ -128,6 +128,18 @@ ocr:
 
 Support for custom OCR providers and user-defined strategies is planned for future releases.
 
+### 5. Direct Provider OCR
+
+For supported LLM providers (**OpenAI, AzureOpenAI, Anthropic, and Google**) and their respective models, files can now be sent directly to the provider API as message attachments by choosing the `Upload to Provider` option in the file attachment dropdown menu.
+This lets the provider parse the files with its own native OCR implementation.
+
+Currently, all four of these providers support images and PDFs. Google additionally supports audio and video files when used with compatible multimodal models.
+
+
+The `Upload to Provider` option is only available for OpenAI endpoints when the Responses API is in use.
+If you do not see `Upload to Provider` in your chat's attachment dropdown menu, ensure that the Responses API parameter is enabled in the Parameters panel.
+
+
 ## Detailed Configuration
 
 For additional, detailed configuration options, see the [OCR Config Object Structure](/docs/configuration/librechat_yaml/object_structure/ocr).