Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve performance #143

Merged
merged 1 commit into from
May 28, 2024
Merged

Improve performance #143

merged 1 commit into from
May 28, 2024

Conversation

VikParuchuri
Copy link
Owner

@VikParuchuri VikParuchuri commented May 27, 2024

  • Fewer false positives (and true positives :( ) for OCR heuristics
  • Speed up OCR performance by pulling in new surya version
  • Fix pdftext bug causing heuristic false positives
  • Improve pdf extraction time marginally

@VikParuchuri VikParuchuri merged commit 7bf2e91 into master May 28, 2024
2 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators May 28, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant