@marevol marevol commented Nov 8, 2025

Replace the OpenSearch Scroll API with the Point in Time (PIT) API across the codebase to improve pagination and performance.

Changes:

  • ESSourceReader: Replace scroll-based pagination with PIT and search_after
  • SuggestIndexer: Update deleteDocumentWords() and deleteQueryWords() to use PIT
  • SuggestUtil: Add createPit() and deletePitContext() methods, update deleteByQuery()
  • SuggestSettings: Replace scrollTimeout with pitKeepAlive
  • SuggestSettingsBuilder: Replace scrollTimeout() with pitKeepAlive()
  • Update all test files to use PIT API
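
To illustrate the PIT plus search_after pattern that the changes above move to, here is a minimal sketch. It does not use the OpenSearch client: `FakeIndex` and its `createPit()`, `deletePit()`, and `search()` methods are hypothetical in-memory stand-ins invented for this example, and they are not the signatures added to `SuggestUtil`. The sketch only shows the control flow: open a PIT, page forward by passing the last sort value as the cursor, and release the PIT in a `finally` block.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

final class PitPaginationSketch {

    /** Hypothetical in-memory stand-in for an index; NOT the OpenSearch client. */
    static final class FakeIndex {
        private final TreeMap<String, String> docs = new TreeMap<>();
        private final Map<String, TreeMap<String, String>> pits = new HashMap<>();
        private int nextPitId = 0;

        void put(String id, String word) { docs.put(id, word); }

        void delete(String id) { docs.remove(id); }

        // Freeze a copy of the current documents, the way a PIT pins a view of the index.
        String createPit() {
            String pitId = "pit-" + (nextPitId++);
            pits.put(pitId, new TreeMap<>(docs));
            return pitId;
        }

        // Release the frozen view; returns false if the PIT was unknown.
        boolean deletePit(String pitId) { return pits.remove(pitId) != null; }

        int openPitCount() { return pits.size(); }

        // Return up to `size` ids in ascending order, strictly after `searchAfter`
        // (null means "start from the beginning").
        List<String> search(String pitId, int size, String searchAfter) {
            TreeMap<String, String> snapshot = pits.get(pitId);
            if (snapshot == null) {
                throw new IllegalStateException("unknown or expired PIT: " + pitId);
            }
            Iterable<String> ids = (searchAfter == null)
                    ? snapshot.keySet()
                    : snapshot.tailMap(searchAfter, false).keySet();
            List<String> page = new ArrayList<>();
            for (String id : ids) {
                if (page.size() == size) {
                    break;
                }
                page.add(id);
            }
            return page;
        }
    }

    // Read every id through one PIT. `betweenPages` simulates writes that
    // happen while pagination is still in progress.
    static List<String> readAll(FakeIndex index, int pageSize, Runnable betweenPages) {
        String pitId = index.createPit();
        try {
            List<String> all = new ArrayList<>();
            String searchAfter = null;
            while (true) {
                List<String> page = index.search(pitId, pageSize, searchAfter);
                if (page.isEmpty()) {
                    break;
                }
                all.addAll(page);
                // The sort value of the last hit becomes the cursor for the next page.
                searchAfter = page.get(page.size() - 1);
                betweenPages.run();
            }
            return all;
        } finally {
            // Always release the PIT, even if a search call throws.
            index.deletePit(pitId);
        }
    }

    public static void main(String[] args) {
        FakeIndex index = new FakeIndex();
        for (String id : List.of("a", "b", "c", "d", "e")) {
            index.put(id, "word-" + id);
        }
        List<String> ids = readAll(index, 2, () -> {
            index.delete("e");
            index.put("f", "word-f");
        });
        System.out.println(ids); // prints [a, b, c, d, e]
    }
}
```

In this sketch the writes between pages change only the live documents, so the reader still sees the five ids that existed when the PIT was opened. With a real cluster the sort would normally need a unique tiebreaker field so that the cursor is unambiguous.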

Benefits of PIT API:

  • More consistent pagination results, because every page is read from the same fixed view of the index
  • Support for bidirectional pagination
  • Better handling of concurrent document changes
  • Not bound to a specific query, unlike the Scroll API
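
The last three benefits can be sketched with a small in-memory model. The `Pit` class below is a hypothetical illustration, not an OpenSearch type: it freezes a sorted copy of the data, lets different queries run against that same copy, and returns to an earlier page simply by reusing an earlier `searchAfter` value. That reading of "bidirectional pagination" is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;
import java.util.stream.Collectors;

final class PitReuseSketch {

    /** Hypothetical frozen view of an index: an immutable, sorted copy of the words. */
    static final class Pit {
        private final List<String> words;

        Pit(List<String> liveWords) {
            this.words = liveWords.stream()
                    .sorted()
                    .collect(Collectors.toUnmodifiableList());
        }

        // Any query can run against the same view; `searchAfter` is the cursor
        // (null means "start from the beginning").
        List<String> search(Predicate<String> query, int size, String searchAfter) {
            return words.stream()
                    .filter(w -> searchAfter == null || w.compareTo(searchAfter) > 0)
                    .filter(query)
                    .limit(size)
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) {
        List<String> live = new ArrayList<>(List.of("cat", "car", "dog", "cow"));
        Pit pit = new Pit(live);
        live.add("cab"); // a concurrent write that the frozen view does not see

        // Two different queries share one view.
        System.out.println(pit.search(w -> w.startsWith("c"), 10, null)); // prints [car, cat, cow]
        System.out.println(pit.search(w -> w.length() == 3, 2, "cat"));   // prints [cow, dog]

        // Going back: reuse an earlier cursor (here, none) to re-read the first page.
        System.out.println(pit.search(w -> w.startsWith("c"), 2, null));  // prints [car, cat]
    }
}
```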
