Skip to content

Conversation

nik9000
Copy link
Member

@nik9000 nik9000 commented Sep 14, 2025

Remove the "unknown" size estimates for ESQL types - we were defaulting to 50 bytes each and that's just... always more wrong and making a bad estimate for each type.

Remove the "unknown" size estimates for ESQL types - we were defaulting
to 50 bytes each and that's just... always more wrong and making a bad
estimate for each type.
@elasticsearchmachine elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Sep 14, 2025
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-analytical-engine (Team:Analytics)

@nik9000
Copy link
Member Author

nik9000 commented Sep 14, 2025

I'm marking this as >non-issue because I don't think this is generally going to be visible to users. It makes implementing #134699 easier which is why I opened it now. But it should help out with the unreleased dense_vector field circuit breaking.

Copy link
Member

@dnhatn dnhatn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@nik9000 nik9000 enabled auto-merge (squash) September 15, 2025 14:17
OBJECT(builder().esType("object").unknownSize()),
SOURCE(builder().esType(SourceFieldMapper.NAME).unknownSize()),
OBJECT(builder().esType("object").estimatedSize(1024)),
SOURCE(builder().esType(SourceFieldMapper.NAME).estimatedSize(1024)),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if it is worth having even higher estimate for source?
I imagine logs source documents could easily reach 10k

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah. Probably is. Let me give it 10k or so.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Usually _source is loaded very temporarily - just row by row - in the reader for things like text fields. This just about the METADATA _source thing.

@nik9000 nik9000 merged commit 50e74a4 into elastic:main Sep 19, 2025
34 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Analytics/ES|QL AKA ESQL >non-issue Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) v9.2.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants