Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support PDF generation/compression without hOCR files #11

Open
MerlijnWajer opened this issue Sep 18, 2021 · 0 comments
Open

Support PDF generation/compression without hOCR files #11

MerlijnWajer opened this issue Sep 18, 2021 · 0 comments

Comments

@MerlijnWajer
Copy link
Collaborator

MerlijnWajer commented Sep 18, 2021

This should be a no-brainer, but we need to deal with a few things:

  • We use hOCR files to estimate the page size based on the DPI encoded in the hOCR files (if present), otherwise we estimate it.
  • The code that generates the initial PDF with text layer obviously relies on hOCR. We could just make a PDF with empty pages of the right size as alternative when we have no hOCR.
@MerlijnWajer MerlijnWajer changed the title Support PDF generation/compression with hOCR files Support PDF generation/compression without hOCR files Sep 22, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant