
sort batch docs by partition to reduce mem usage lookups #6331

Open
PSeitz wants to merge 1 commit into quickwit-oss:main from PSeitz:mem_check

Conversation

Collaborator

@PSeitz PSeitz commented Apr 22, 2026

Sort the incoming ProcessedDocBatch by partition and group the docs with chunk_by, so that the get_or_create_indexed_split lookup and the index_writer mem_usage() probes run once per partition group instead of once per doc.
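
For readers unfamiliar with the pattern, here is a minimal sketch of sort-then-chunk_by grouping. It is not the code from this PR: `Doc`, `Split`, `Indexer`, `get_or_create_split`, and the `lookups`/`mem_probes` counters are hypothetical stand-ins for illustration only. It assumes Rust 1.77 or later, where `slice::chunk_by` is stable.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a processed document (not Quickwit's type).
#[derive(Debug, Clone)]
struct Doc {
    partition: u64,
    num_bytes: usize,
}

/// Hypothetical stand-in for per-partition split state.
#[derive(Debug, Default)]
struct Split {
    num_docs: usize,
    mem_usage: usize,
}

/// Toy indexer that counts how often the per-partition operations run.
#[derive(Debug, Default)]
struct Indexer {
    splits: HashMap<u64, Split>,
    lookups: usize,
    mem_probes: usize,
}

impl Indexer {
    /// Hypothetical lookup; increments a counter so the saving is observable.
    fn get_or_create_split(&mut self, partition: u64) -> &mut Split {
        self.lookups += 1;
        self.splits.entry(partition).or_default()
    }

    fn index_batch(&mut self, mut docs: Vec<Doc>) {
        // Stable sort: docs of the same partition become adjacent and keep
        // their relative order.
        docs.sort_by_key(|doc| doc.partition);

        // chunk_by yields maximal runs of consecutive docs that share a
        // partition; every run is non-empty.
        for group in docs.chunk_by(|a, b| a.partition == b.partition) {
            let partition = group[0].partition;

            // One lookup per partition group instead of one per doc.
            let split = self.get_or_create_split(partition);
            for doc in group {
                split.num_docs += 1;
                split.mem_usage += doc.num_bytes;
            }

            // One memory probe per partition group (here just a field read).
            let _usage = split.mem_usage;
            self.mem_probes += 1;
        }
    }
}
```

With a batch of four docs spread over two partitions, this sketch performs two lookups and two probes rather than four of each. One trade-off to keep in mind under this scheme: memory usage is observed less often, so any limit check based on it reacts at group granularity rather than per doc.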

Description

Describe the proposed changes made in this PR.

How was this PR tested?

Describe how you tested this PR.

