use multi-model recognition #37

Open
bertsky opened this issue May 25, 2023 · 1 comment

bertsky commented May 25, 2023

Kraken offers "multi-script" (actually multi-model) prediction in one pass. So instead of running with a fixed model, we could load multiple models and use the annotated language and script mappings to select the model per segment (as in ocrd-tesserocr-recognize with xpath_model).

IIUC, that would entail using mm_rpred (instead of rpred), passing a dict from lang/script to model names as the first arg, and passing the lang/script of each segment in bounds['boxes'][...]['tags'] (or bounds['lines'][...]['tags'] with baseline segmentation).
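A minimal sketch of the data preparation this implies, assuming baseline segmentation. It does not call Kraken: plain dicts stand in for the segmentation result, and the model file names, the "script" tag key and the helper names (tag_lines, select_model) are all hypothetical. The exact key format mm_rpred expects for its model dict differs between Kraken versions and would need checking.

```python
# Sketch only: plain dicts stand in for Kraken's segmentation result.
# Model names, the "script" tag key and both helpers are hypothetical.

def tag_lines(lines, scripts):
    """Return copies of the line dicts with a script tag attached."""
    return [dict(line, tags={"script": script})
            for line, script in zip(lines, scripts)]


def select_model(tags, model_map, default):
    """Pick the model name for one line from its tags, with a fallback."""
    return model_map.get(tags.get("script"), default)


# Mapping from annotated script to model name (would be the first arg).
model_map = {"Latn": "latin.mlmodel", "Grek": "greek.mlmodel"}

# Three lines as a segmenter might return them, before tagging.
lines = [
    {"baseline": [(0, 10), (100, 10)]},
    {"baseline": [(0, 40), (100, 40)]},
    {"baseline": [(0, 70), (100, 70)]},
]

# Scripts would come from the page annotation; "Arab" has no model here.
bounds = {"type": "baselines",
          "lines": tag_lines(lines, ["Latn", "Grek", "Arab"])}

selected = [select_model(line["tags"], model_map, "default.mlmodel")
            for line in bounds["lines"]]
print(selected)
```

The explicit default covers segments whose annotated script has no dedicated model, which a real page will probably contain.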

bertsky commented May 29, 2024

After the Kraken v5 API changes, this would now mean passing the lang/script dict via .tags on each of the segmentation.lines.
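A rough sketch of the attribute-based variant, again without importing Kraken. Line and Seg are hypothetical stand-ins for the v5 line and segmentation containers, reduced to the fields used here. The "script" tag key and the model names are assumptions as well.

```python
from collections import defaultdict
from dataclasses import dataclass, field, replace
from typing import Dict, List, Optional


@dataclass
class Line:
    """Hypothetical stand-in for a Kraken v5 line record."""
    id: str
    tags: Optional[Dict[str, str]] = None


@dataclass
class Seg:
    """Hypothetical stand-in for a Kraken v5 segmentation."""
    lines: List[Line] = field(default_factory=list)


def with_tags(seg, script_by_line_id):
    """Return a copy of seg whose lines carry the annotated script in .tags."""
    new_lines = [
        replace(line, tags={"script": script_by_line_id[line.id]})
        if line.id in script_by_line_id else line
        for line in seg.lines
    ]
    return replace(seg, lines=new_lines)


# A defaultdict gives unknown scripts a fallback model (names hypothetical).
models = defaultdict(lambda: "default.mlmodel", {"Latn": "latin.mlmodel"})

seg = with_tags(Seg([Line("l1"), Line("l2")]), {"l1": "Latn", "l2": "Hebr"})
chosen = {line.id: models[line.tags["script"]] for line in seg.lines}
print(chosen)
```

Using dataclasses.replace keeps the original segmentation untouched, so the same result could still be fed to a single-model run.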
