test: lower low boundary for accuracy in `test_calculate_context_similarity_on_non_matching_contexts` #3199

ZanSara · 2022-09-12T08:24:41Z

Related Issues

failing main: https://github.com/deepset-ai/haystack/runs/8300467040?check_suite_focus=true

Proposed Changes:

Lower the lower bound for accuracy in `test_calculate_context_similarity_on_non_matching_contexts` from 0.99 to 0.98.

How did you test it?

The test passes locally (with accuracy=1) and on CI (with accuracy=0.9860646599777034)

Notes for the reviewer

I tried a more refined solution with `pytest.approx`, but it supports equality checks only, not larger-than checks.
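A tolerant larger-than check is still possible without `pytest.approx`, by subtracting the tolerance from the bound before comparing. The snippet below is only a minimal sketch and not the code in this PR: the helper name `at_least` and the `abs_tol` value are hypothetical, while 0.99 and the CI accuracy are the values quoted above.

```python
def at_least(value: float, bound: float, abs_tol: float = 0.0) -> bool:
    """Return True if `value` is at least `bound`, allowing it to fall
    short by up to `abs_tol` (hypothetical helper, for illustration only)."""
    if abs_tol < 0:
        raise ValueError("abs_tol must be non-negative")
    return value >= bound - abs_tol


# Accuracy reported on CI in this PR.
ci_accuracy = 0.9860646599777034

# Strict check against the original bound fails on CI ...
strict_ok = at_least(ci_accuracy, 0.99)

# ... while a small, explicitly chosen tolerance (an assumption) passes.
tolerant_ok = at_least(ci_accuracy, 0.99, abs_tol=0.005)
```

In a test this would read `assert at_least(accuracy, 0.99, abs_tol=0.005)`, which keeps the intended bound visible instead of silently lowering it.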

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added tests that demonstrate the correct behavior of the change
I've used the conventional commit convention for my PR title
~~I documented my code~~
I ran pre-commit hooks and fixed any issue

tstadel

For a quick fix of main this is fine.
However, this is interesting. When did it start to fail?
I suppose a new rapidfuzz version is causing this. Let's make sure this is a feature and not a bug.

tstadel · 2022-09-12T12:40:34Z

Ok, the failing tests are using rapidfuzz 2.8.0. The last passing test run used rapidfuzz 2.6.1.

tstadel

As this is clearly caused by the new rapidfuzz version 2.8.0, let's pin the version to <2.8.0 instead. We also need to create an issue at https://github.com/maxbachmann/RapidFuzz. I'll take care of this. Once that issue has been resolved, or it turns out this is a feature, we can remove the version restriction.
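To illustrate what a `<2.8.0` restriction admits, here is a simplified sketch in plain Python. It is not how the pin is enforced (that happens in `pyproject.toml` through the installer), the helper names `parse_release` and `below_upper_bound` are hypothetical, and the comparison only looks at the numeric release segment, ignoring pre-release and post-release suffixes.

```python
import re
from typing import Tuple


def parse_release(version: str) -> Tuple[int, ...]:
    """Extract the leading numeric release segment, e.g. '2.8.0' -> (2, 8, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"no numeric release segment in {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def below_upper_bound(installed: str, upper: str = "2.8.0") -> bool:
    """Return True if `installed` is strictly below `upper`."""
    a, b = parse_release(installed), parse_release(upper)
    # Pad with zeros so that '2.8' and '2.8.0' compare as equal.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a < b


# Versions mentioned in this conversation.
last_passing_allowed = below_upper_bound("2.6.1")
failing_allowed = below_upper_bound("2.8.0")
```

Comparing integer tuples rather than strings matters here: as strings, "2.10.0" would sort before "2.8.0".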

tstadel

Ok, I looked into the changes in rapidfuzz 2.8.0. This is actually a feature and should let us get rid of the code I wrote to boost split overlaps (the `boost_split_overlap` option of `calculate_context_similarity`). I created an issue for that, as finding a proper threshold will need some tinkering: #3202
Let's reference this issue in the pin comment and merge :-)

tstadel

I just saw that this dependency entry is a duplicate. Let's merge it with the existing one and add a comment about the issue.

pyproject.toml

Change min value

c4f8a64

ZanSara added the type:bug, topic:modeling, and topic:eval labels Sep 12, 2022

ZanSara marked this pull request as ready for review September 12, 2022 09:10

ZanSara requested a review from a team as a code owner September 12, 2022 09:10

ZanSara requested review from vblagoje and tstadel and removed request for a team and vblagoje September 12, 2022 09:10

tstadel approved these changes Sep 12, 2022

tstadel requested changes Sep 12, 2022

revert test change and pin rapidfuzz<2.8.0

b9f59a4

tstadel approved these changes Sep 12, 2022

tstadel requested changes Sep 12, 2022

duplicate

42cf1fe

ZanSara requested a review from tstadel September 12, 2022 15:24

ZanSara mentioned this pull request Sep 12, 2022

refactor: remove pre haystack-1.0 import paths support #3204

Merged

6 tasks

tstadel approved these changes Sep 12, 2022

ZanSara merged commit 49b1c88 into main Sep 13, 2022

ZanSara deleted the fix-context-similarity-test branch September 13, 2022 07:32

anakin87 mentioned this pull request Sep 21, 2022

bug: make ElasticSearchDocumentStore use batch_size in get_documents_by_id #3166

Merged

6 tasks

brandenchan pushed a commit that referenced this pull request Sep 21, 2022

test: lower low boundary for accuracy in `test_calculate_context_simi…

21da03c

…larity_on_non_matching_contexts` (#3199) * Change min value * revert test change and pin rapidfuzz<2.8.0 * duplicate
