-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: support long texts for labels in ElasticsearchDocumentStore
#3346
fix: support long texts for labels in ElasticsearchDocumentStore
#3346
Conversation
ElasticsearchDocumentStore
ElasticsearchDocumentStore
…ElasticsearchDocumentStore
@vblagoje I just merged main into this branch. |
…asticsearchDocumentStore
@anakin87, my apologies - I somehow missed these notifications. I am not an expert in ElasticSearch, let me ask who can understand this PR much better than I do. |
@anakin87 just to make sure: does this represent a schema break? For example: if you write some documents in ES with the code in I'm asking because if it throws exceptions this should be marked as a breaking change and we should warn users about this detail when migrating (and potentially offer some instructions on how to access their documents again) |
@ZanSara great question! Digging into this, I remembered that the method I performed some tests:
So it doesn't seem like a breaking change. |
@anakin87 great! I think it's good to merge then 👍 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🚀
Thank you. |
The stacktrace of that test is quite funny I have to say 😂 |
@ZanSara finally ready to go! |
Related Issues
write_labels
crashes forElasticsearchDocumentStore
for labels with long context #2621Proposed Changes:
As deeply explained in #2621, in label index, I changed field types of
answer
anddocument
fromflattened
tonested
, aligningElasticsearchDocumentStore
withOpensearchDocumentStore
. This way, the text in these fields is tokenized correctly and is no longer considered one immense term.How did you test it?
Added new related test
Checklist