Improve output of the tutorials #1675

ZanSara · 2021-10-28T15:59:22Z

Fixes #1595

review-notebook-app · 2021-10-28T16:33:21Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

…o tutorials_output

…print

…o tutorials_output

…n tTutorial11 (both)

…o tutorials_output

tholor · 2021-11-01T14:30:32Z

docs/_src/tutorials/tutorials/1.md

+
+# from pprint import pprint
+
+# pprint(prediction)


I thought we also wanted to show the actual output here (at least the first answer in the json and then ...) 🤔

Right! I put it in the .py script but somehow forgot to put it here 🤦

…o tutorials_output

brandenchan · 2021-11-03T10:45:38Z

tutorials/Tutorial1_Basic_QA_Pipeline.py

    print_answers(prediction, details="minimal")

+    # Or directly print the object
+    from pprint import pprint
+    pprint(prediction)


In the console output, the minimal printout is followed immediately by the full printout. I think it would be good if we added headings like

Minimal Output ============

and

Full Output ========

So that new users aren't overwhelmed by how much is being printed out

brandenchan · 2021-11-03T10:48:27Z

tutorials/Tutorial1_Basic_QA_Pipeline.py

+    #         Answer(answer='Joffrey', type='extractive', score=0.6753827035427094, }),
+    #         Answer(answer='Robb', type='extractive', score=0.6665983200073242, })
+    #     ],
+    #     'documents': [


Maybe its worth explaining exactly which documents these are. I assume they are the ones returned by the Retriever. But I assume also multiple answers could come from the one document, meaning that some of these candidate documents might not contain an answer.

brandenchan · 2021-11-03T10:54:31Z

tutorials/Tutorial3_Basic_QA_Pipeline_without_Elasticsearch.ipynb

@@ -64,7 +64,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 3,


More general point about Tutorial 3. This is essentially the same as Tutorial 1 except that we're using a different document store. Do we want to make the same printing changes to this tutorial too?

Good point! 👍

brandenchan · 2021-11-03T11:43:09Z

tutorials/Tutorial13_Question_generation.py

+    print(f"\n * Generating questions for document {idx}: {document.content[:50]}...")
+    result = question_generation_pipeline.run(documents=[document])
+
+    print("Generated questions:")


I like these titles but the printout seems a bit dense. Could you add a line break before "Generated questions"?

brandenchan · 2021-11-03T11:46:14Z

tutorials/Tutorial13_Question_generation.py

@@ -46,8 +52,14 @@
 # RetrieverQuestionGenerationPipeline
 retriever = ElasticsearchRetriever(document_store=document_store)
 rqg_pipeline = RetrieverQuestionGenerationPipeline(retriever, question_generator)
+
+print(f"\n * Generating questions for documents matching the query 'Arya Stark'")


We should also create titles for each pipeline we are using and print them to the console so that the outputs don't get confused so easily e.g.

QuestionGenerationPipeline =====================

brandenchan · 2021-11-03T11:49:40Z

Also just want to ask whether you evaluated the necessity of these steps that were suggested in the original issue?

Adjust print_answers / print_documents utils
Adjust the str representation of Document / Answer and utilize this in the tutorials
Use a custom print to only show certain attributes like here

…o tutorials_output

Improve output of Tutorial11 (.py version only)

00b3908

ZanSara marked this pull request as draft October 28, 2021 15:59

ZanSara added 3 commits October 28, 2021 18:20

Improve output of Tutorial10 (.py only)

67baf0f

Slightly improve Tutorial8 (.py only)

2bc14e6

Reduce level of detail of printed answers in Tutorial11 (.ipynb)

638d4f4

github-actions bot and others added 8 commits October 28, 2021 16:37

Add latest docstring and tutorial changes

54cbf18

Improve output printing of tutorial13 (.py only)

abc29ff

Improve output of tutorial13 (.ipynb)

ce3fc38

Add latest docstring and tutorial changes

c9dff47

Improve output of Tutorial14 (.py only)

e421a2e

Merge branch 'tutorials_output' of github.com:deepset-ai/haystack int…

fe179e9

…o tutorials_output

Add the same modifications to the ipynb version of Tutorial14

0148e10

Add the same modifications to the ipynb version of Tutorial14

a2973df

ZanSara marked this pull request as ready for review October 29, 2021 16:40

Add latest docstring and tutorial changes

08ed6c5

ZanSara requested review from brandenchan and tholor October 29, 2021 16:42

ZanSara and others added 10 commits November 1, 2021 09:26

Add a clear message to print_answers in case there are no answers to …

e77b98a

…print

Merge branch 'tutorials_output' of github.com:deepset-ai/haystack int…

c6c0d3e

…o tutorials_output

Clean up Tutorial14 and rename QueryClassifier to MyQueryClassifier i…

289ef13

…n tTutorial11 (both)

Add latest docstring and tutorial changes

293096c

Clear all notebooks' output

27d9ff3

Merge branch 'tutorials_output' of github.com:deepset-ai/haystack int…

cb84923

…o tutorials_output

Add latest docstring and tutorial changes

621a53e

Add more details about how to print the output in Tutorial1

6ea249a

Merge branch 'tutorials_output' of github.com:deepset-ai/haystack int…

77cfee0

…o tutorials_output

Add latest docstring and tutorial changes

fb8f662

tholor reviewed Nov 1, 2021

View reviewed changes

Add output to the first tutorial's last cell

8cf8627

Merge branch 'tutorials_output' of github.com:deepset-ai/haystack int…

52d4a0f

…o tutorials_output

ZanSara marked this pull request as draft November 3, 2021 09:16

Add latest docstring and tutorial changes

d3c7735

brandenchan suggested changes Nov 3, 2021

View reviewed changes

ZanSara added 2 commits November 4, 2021 10:36

Modify repr and str for Document and Answer class

fade01f

Merge branch 'tutorials_output' of github.com:deepset-ai/haystack int…

843bf24

…o tutorials_output

ZanSara closed this Nov 4, 2021

ZanSara deleted the tutorials_output branch November 4, 2021 11:42

ZanSara mentioned this pull request Nov 4, 2021

Improve tutorials' output #1694

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve output of the tutorials #1675

Improve output of the tutorials #1675

ZanSara commented Oct 28, 2021

review-notebook-app bot commented Oct 28, 2021

tholor Nov 1, 2021

ZanSara Nov 3, 2021

brandenchan Nov 3, 2021

brandenchan Nov 3, 2021

brandenchan Nov 3, 2021

ZanSara Nov 4, 2021

brandenchan Nov 3, 2021

brandenchan Nov 3, 2021

brandenchan commented Nov 3, 2021

Improve output of the tutorials #1675

Improve output of the tutorials #1675

Conversation

ZanSara commented Oct 28, 2021

review-notebook-app bot commented Oct 28, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

brandenchan commented Nov 3, 2021