Add `run_batch` for standard pipelines #2595

bogdankostic · 2022-05-24T15:22:14Z

In #2481, we added the possibility to run batches of queries through a Pipeline. This PR makes this possible for standard pipelines as well by adding `run_batch` methods.

julian-risch

Looks very good to me already! 👍 The only change I would like to see made is in RetrieverQuestionGenerationPipeline. Both run and run_batch have the same code. So I'd argue that run should call run_batch internally instead of duplicating the code. What do you think?

julian-risch · 2022-05-25T06:56:38Z

haystack/pipelines/standard_pipelines.py

@@ -495,3 +563,21 @@ def run(self, document_ids: List[str], top_k: int = 5):

        self.document_store.return_embedding = False  # type: ignore
        return similar_documents
+
+    def run_batch(self, document_ids: List[str], top_k: int = 5):  # type: ignore


The code of run_batch is exactly the same as for run here, so I'd say it's better to call run from within run_batch and avoid code duplication.
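To illustrate the suggestion, here is a minimal sketch of the delegation pattern. `ToyMostSimilarDocumentsPipeline` and its `similar_by_id` lookup table are hypothetical stand-ins, not the actual Haystack class or document store; only the `run` and `run_batch` signatures mirror the diff above.

```python
from typing import Dict, List


class ToyMostSimilarDocumentsPipeline:
    """Hypothetical stand-in used only to show the delegation pattern."""

    def __init__(self, similar_by_id: Dict[str, List[str]]):
        # Assumption: a plain dict replaces the real document store lookup.
        self.similar_by_id = similar_by_id

    def run(self, document_ids: List[str], top_k: int = 5) -> List[List[str]]:
        # One result list per requested document id, truncated to top_k.
        return [self.similar_by_id.get(doc_id, [])[:top_k] for doc_id in document_ids]

    def run_batch(self, document_ids: List[str], top_k: int = 5) -> List[List[str]]:
        # run already accepts a list of ids, so run_batch simply delegates
        # instead of repeating the body of run.
        return self.run(document_ids=document_ids, top_k=top_k)
```

With this shape, any later change to `run` is picked up by `run_batch` automatically, which is the point of avoiding the duplicated body.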

julian-risch

LGTM! 👍

bogdankostic and others added 2 commits May 24, 2022 17:19

Add run_batch for standard pipelines

fb95293

Update Documentation & Code Style

7ae3e0d

bogdankostic added the topic:pipeline label May 24, 2022

Fix mypy

25726d4

bogdankostic marked this pull request as ready for review May 24, 2022 15:59

bogdankostic requested a review from julian-risch May 24, 2022 15:59

julian-risch requested changes May 25, 2022


bogdankostic added 3 commits May 25, 2022 17:30

Remove code duplication

341fcab

Merge branch 'master' into run_batch_standard_pipelines

3e944c6

Fix linter

e0b7e05

julian-risch approved these changes May 27, 2022


bogdankostic merged commit 0395533 into master May 27, 2022

bogdankostic deleted the run_batch_standard_pipelines branch May 27, 2022 08:42
