Revisit tests #2811

masci · 2022-07-14T10:29:58Z

Problem

At the moment there is some confusion about how test functions are categorized and it's not easy to distinguish between unit and integration tests. Fixing this problem would improve the development experience by making the tests fail fast if there is an issue and by making it more obvious what's the issue.

Proposal

We formally define three scopes for tests in Haystack with different requirements and purposes; newly added tests will follow those requirements while we progressively adapt the existing tests to the new model.

The three scopes are defined as follows:

Unit test
- Tests a single logical concept
- Execution time is a few milliseconds
- Any external resource is mocked
- Always returns the same result
- Can run in any order
- Runs at every commit in draft and ready PRs, automated through pytest
- Can run locally with no additional setup
- Goal: being confident in merging code
Integration test
- Tests a single logical concept
- Execution time is a few seconds
- It uses external resources that must be available before execution
- When using models, cannot use inference
- Always returns the same result or an error
- Can run in any order
- Runs at every commit in ready PRs, automated through pytest
- Can run locally with some additional setup (e.g. Docker)
- Goal: being confident in merging code
End to End (e2e) test
- Tests a sequence of multiple logical concepts
- Execution time has no limits (can be always on)
- Can use inference
- Evaluates the results of the execution or the status of the system
- It uses external resources that must be available before execution
- Can return different results
- Can be dependent on the order
- Can be wrapped into any process execution
- Runs outside the development cycle (nightly or on demand)
- Might not be possible to run locally due to system and hardware requirements
- Goal: being confident in releasing Haystack

Action plan

Note: this planning will be subject to heavy changes as we progress understanding how the tests could be improved.

The text was updated successfully, but these errors were encountered:

ZanSara · 2022-08-03T16:28:18Z

Interesting case of tests on the line between integration and end-to-end: #2903

These tests are e2e according to the definition above, because they initialize a model and do inference on it. However, the nodes are classifiers, so for the purpose of testing, mocking the models is trivial and matter of a simple if-else. So, are these tests better seen as end-to-end due to the inference step, or they should rather be mocked and used as integration tests?

Note that these, in my opinion, are too high level to be considered unit tests, regardless of the presence of the mock or their speed. They are testing most of the node at once and imho this is too wide of a scope.

Add the outcome of #2811 to the developers docs Ideally, newly added tests will follow those requirements while we progressively adapt the existing tests to the new model.

* Update CONTRIBUTING.md Add the outcome of #2811 to the developers docs Ideally, newly added tests will follow those requirements while we progressively adapt the existing tests to the new model. * address review comments

masci · 2022-09-05T16:08:42Z

We decided to split this epic and continue the work in chunks, see the updated issue description. Further conversations will happen in their respective epic.

With the newly reduced scope, I'm going to call this one done and close.

masci added the epic label Jul 14, 2022

masci assigned masci and ZanSara Jul 20, 2022

ZanSara changed the title ~~Revisit unit and integration tests~~ Revisit tests Jul 27, 2022

This was referenced Jul 27, 2022

Add coverage report to PRs #2892

Closed

Run GPU tutorials nightly deepset-ai/haystack-tutorials#33

Closed

Using Docker to speed up installation of dependencies and cache HF models #2894

Closed

masci added the epic:idle Epic not yet started label Jul 28, 2022

ZanSara mentioned this issue Jul 28, 2022

test: document store tests #2906

Closed

2 tasks

masci added the topic:tests label Jul 28, 2022

masci mentioned this issue Aug 1, 2022

Enable Opensearch unit tests in Windows CI #2936

Merged

4 tasks

masci added epic:in-progress Epic is in progress and removed epic:idle Epic not yet started labels Aug 3, 2022

masci added a commit that referenced this issue Sep 5, 2022

Update CONTRIBUTING.md

62b6761

Add the outcome of #2811 to the developers docs Ideally, newly added tests will follow those requirements while we progressively adapt the existing tests to the new model.

This was referenced Sep 5, 2022

docs: add tests types to CONTRIBUTING.md #3158

Merged

Identify and isolate e2e tests #3155

Closed

Identify and separate unit and integration tests in document stores #3156

Closed

Identify and separate unit and integration tests in other packages #3157

Closed

masci closed this as completed Sep 5, 2022

masci added epic:done and removed epic:in-progress Epic is in progress labels Sep 5, 2022

masci mentioned this issue Oct 21, 2022

Document Store test refactoring #3449

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revisit tests #2811

Revisit tests #2811

masci commented Jul 14, 2022 •

edited

Loading

ZanSara commented Aug 3, 2022

masci commented Sep 5, 2022

Revisit tests #2811

Revisit tests #2811

Comments

masci commented Jul 14, 2022 • edited Loading

Problem

Proposal

Action plan

ZanSara commented Aug 3, 2022

masci commented Sep 5, 2022

masci commented Jul 14, 2022 •

edited

Loading