Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feat] Add pdf loader #71

Merged
merged 8 commits into from
Sep 13, 2021
Merged

[feat] Add pdf loader #71

merged 8 commits into from
Sep 13, 2021

Conversation

lolipopshock
Copy link
Member

@lolipopshock lolipopshock commented Sep 12, 2021

Add support for loading pdf files in layoutparser

>>> import layoutparser as lp
>>> pdf_layout = lp.load_pdf("path/to/pdf")
>>> pdf_layout[0] # the layout for page 0
>>> pdf_layout, pdf_images = lp.load_pdf("path/to/pdf", load_images=True)
>>> lp.draw_box(pdf_images[0], pdf_layout[0])

@maswiebe
Copy link

Can I use this to extract a table from a PDF into a dataframe?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants