Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
12 changes: 12 additions & 0 deletions pages/docs/features/ocr.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -128,6 +128,18 @@ ocr:

Support for custom OCR providers and user-defined strategies is planned for future releases.

### 5. Direct Provider OCR

For supported LLM Providers (**OpenAI, AzureOpenAI, Anthropic, and Google**) and their respective models, files can now be sent directly to the provider APIs as message attachments,
allowing the provider to use their own native OCR implementations to parse files using the `Upload to Provider` option in the file attachment dropdownmenu.

Currently all four of the aforementioned providers offer support for images and PDFs, with Google also including support for audio and video files when used in conjunction with compatible multimodal models.

<Callout type="note" title="OpenAI Responses API Caveat" emoji='✏️'>
The Upload to Provider option is only available for OpenAI endpoints when using the Responses API.
If you do not see 'Upload to Provider' as an option in your chat's attachment dropdown menu, ensure that the Responses API parameter is enabled in the Parameters panel.
</Callout>

## Detailed Configuration

For additional, detailed configuration options, see the [OCR Config Object Structure](/docs/configuration/librechat_yaml/object_structure/ocr).
Expand Down