
Add execute_eval_run example to Tutorial 5 #2459

Merged: 6 commits merged into master from execute_eval_run_nb on Jun 13, 2022

Conversation

@tstadel tstadel (Member) commented Apr 26, 2022

Proposed changes:

  • Add section Storing results in MLflow to Tutorial 5

Status (please check what you already did):

  • First draft (up for discussions & feedback)
  • Final code

@review-notebook-app

Check out this pull request on ReviewNB: see visual diffs & provide feedback on Jupyter Notebooks.

@tstadel tstadel marked this pull request as ready for review June 9, 2022 17:21
@tstadel tstadel requested a review from julian-risch June 9, 2022 17:21
@julian-risch julian-risch (Member) left a comment

I tested the changes and the results got stored here: https://public-mlflow.deepset.ai/#/experiments/698. That looks good to me! 👍 @tstadel I will go ahead and merge this PR, but I'll also leave some comments here on how I think we could improve the tutorial further.

One thing I noticed is that the comparison of scores returned by pipeline.eval(add_isolated_node_eval=True) and by reader.eval() is difficult.

reader.eval output is:

Reader Top-4-Accuracy: 99.09208819714657
Reader Top-1-Exact Match: 95.71984435797665
Reader Top-1-F1-Score: 95.73510337987335
Reader Top-4-Accuracy (without no_answers): 72.0
Reader Top-4-Exact Match (without no_answers): 44.0
Reader Top-4-F1-Score (without no_answers): 59.71344537815126

and pipeline.eval output is:

0.48 #print(metrics["Reader"]["exact_match"])
0.6027426153741944 #print(metrics["Reader"]["f1"])

Here, we could add a sentence to explain that the "without no_answers" metrics are the ones that are expected to be similar.
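
For reference, a minimal sketch of how the two sets of scores above can be produced. It assumes the tutorial's `reader`, `pipeline`, `document_store`, and `eval_labels` objects from earlier steps; the index names and the `eval_mode` argument are assumptions, not verified against the notebook:

```python
# Minimal sketch (assumed names from earlier tutorial steps).

# Isolated reader evaluation: yields the "Reader Top-k ..." metrics shown above.
reader_metrics = reader.eval(
    document_store=document_store,
    label_index="tutorial5_labels",  # placeholder label index name
    doc_index="tutorial5_docs",      # placeholder document index name
)

# Integrated pipeline evaluation: reader metrics here depend on what the retriever returned.
eval_result = pipeline.eval(labels=eval_labels, add_isolated_node_eval=True)
metrics = eval_result.calculate_metrics()
print(metrics["Reader"]["exact_match"])
print(metrics["Reader"]["f1"])

# If the installed Haystack version supports it, isolated metrics (reader evaluated on
# the gold documents only) can be pulled from the same result for a fairer comparison:
isolated_metrics = eval_result.calculate_metrics(eval_mode="isolated")
```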
As another small improvement, we could add a sentence in the tutorial after the headline "Run experiments". We should briefly explain here that an experiment consists of several executions of pipeline.eval() (evaluation runs) so that the user knows what to expect.
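
To make the "Run experiments" suggestion concrete, here is a hedged sketch of the execute_eval_run call this PR documents. The pipeline objects, file paths, and experiment names are placeholders, and the parameter list should be checked against the Haystack version used in the tutorial:

```python
from haystack import Pipeline

# Sketch only: one call like this is a single evaluation run; an experiment consists
# of several such runs (e.g. with different parameters), all tracked under the same
# experiment name in MLflow.
eval_result = Pipeline.execute_eval_run(
    index_pipeline=index_pipeline,      # placeholder: pipeline that indexes the corpus
    query_pipeline=query_pipeline,      # placeholder: QA pipeline under evaluation
    evaluation_set_labels=eval_labels,  # placeholder: labels of the evaluation set
    corpus_file_paths=file_paths,       # placeholder: files to index as the corpus
    experiment_name="tutorial5-qa",     # placeholder experiment name
    experiment_run_name="run_1",        # placeholder run name
    experiment_tracking_tool="mlflow",
    experiment_tracking_uri="https://public-mlflow.deepset.ai",
    reuse_index=True,                   # assumed flag to skip re-indexing on later runs
)
metrics = eval_result.calculate_metrics()
```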

@julian-risch julian-risch merged commit 66c7d1a into master Jun 13, 2022
@julian-risch julian-risch deleted the execute_eval_run_nb branch June 13, 2022 07:19