Definition of the benchmarking metrics #125

mweidling · 2022-08-26T08:47:01Z

We have identified the following metrics to be relevant for benchmarking:

Bag of Words
CER/WER
Flexible CER
Reading Order
IoU
mAP
CPU time
wall time
I/O
memory usage

So that we and our users are clear about exactly what we mean by these terms, we have to define them properly and add them to the OCR-D specs.
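To illustrate the kind of definition that is needed, here is a minimal sketch of CER and WER under one common convention: Levenshtein edit distance divided by the length of the reference (in characters for CER, in whitespace-separated words for WER). This is an assumption for illustration only, not the definition from the draft; the function names `levenshtein`, `cer` and `wer` are hypothetical, and questions such as text normalization or a different denominator are exactly what the spec would have to settle.

```python
from typing import Sequence


def levenshtein(ref: Sequence, hyp: Sequence) -> int:
    """Minimum number of insertions, deletions and substitutions
    needed to turn `ref` into `hyp` (works on strings or word lists)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate, normalized by reference length (assumed convention)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return levenshtein(reference, hypothesis) / len(reference)


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate on whitespace-separated tokens (assumed tokenization)."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return levenshtein(ref_words, hypothesis.split()) / len(ref_words)
```

With this convention, `cer("kitten", "sitting")` is 3 edits over 6 reference characters. Because the denominator is the reference length, the value can exceed 1 when the hypothesis is much longer than the reference.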

Prior Art: https://pad.gwdg.de/3S_yuzyERum4WQChxV6UyQ
Link to draft: https://pad.gwdg.de/rLDBVhmYQ8CwOd67KcYHwQ#

define each metric
add them to the specs

mweidling · 2022-08-31T05:48:52Z

See https://pad.gwdg.de/rLDBVhmYQ8CwOd67KcYHwQ# for the current status.

mweidling · 2022-09-05T06:16:18Z

@kba @cneud

My first draft of the metrics is ready. Could you please have a look at them? There are still some open TODOs which indicate points that we should talk about / need to define.

I intentionally left the "scenario based layout evaluation" section empty because, as far as I understand the linked paper, this is not a metric in the narrower sense. Maybe we could talk about this as well.

cneud · 2022-09-05T15:49:58Z

Thank you @mweidling! I've added a new top level and slightly restructured the draft to make the distinction between the evaluation of text, layout and resource utilization clearer, and added some introductory remarks for those sections. Looks very good otherwise. I guess we can have one more call and then publish a first version to the spec.
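For the resource utilization part, the difference between CPU time and wall time can be sketched with the Python standard library. This is only an illustration under stated assumptions, not how the draft defines or measures these metrics: the helper name `measure` is hypothetical, and `tracemalloc` reports only memory allocated through the Python allocator, which is not the same as the memory usage of a whole process.

```python
import time
import tracemalloc


def measure(func, *args, **kwargs):
    """Run `func` and report CPU time, wall time and peak Python-level memory."""
    tracemalloc.start()
    wall_start = time.perf_counter()  # elapsed real time, includes waiting on I/O
    cpu_start = time.process_time()   # CPU time of this process, excludes sleep
    try:
        result = func(*args, **kwargs)
    finally:
        cpu_seconds = time.process_time() - cpu_start
        wall_seconds = time.perf_counter() - wall_start
        _, peak_bytes = tracemalloc.get_traced_memory()
        tracemalloc.stop()
    stats = {
        "cpu_seconds": cpu_seconds,
        "wall_seconds": wall_seconds,
        "peak_python_bytes": peak_bytes,
    }
    return result, stats
```

A call such as `result, stats = measure(sum, range(1000))` returns the function's result together with the three measurements. A step that mostly waits on disk or network would show a wall time well above its CPU time, which is one reason to define the two separately.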

mweidling · 2022-09-06T05:59:20Z

Thank you for your feedback and work, @cneud! I'll schedule a call then.

mweidling self-assigned this Aug 26, 2022

mweidling mentioned this issue Aug 31, 2022

1st draft #126
