Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

VikParuchuri / marker Public

Notifications You must be signed in to change notification settings
Fork 936
Star 16.5k

Code
Issues 113
Pull requests 18
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: VikParuchuri/marker

Releases · VikParuchuri/marker

OCR and misc improvements; demo app

19 Aug 21:30

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

OCR and misc improvements; demo app Latest

Latest

Language no longer needs to be specified
Fix OCR memory leak
Add marker GUI demo app to test out conversion
Add progress for equation detection
Improve table recognition slightly
Add table benchmark

Assets 2

Loading

yiyibooks, omega-lua, kksasa, and JH6588 reacted with thumbs up emoji

gcgbarbosa, heldilira, cthulhu-tww, kuengroc, RalfNorthman, svmrw, Louis-htmlcss, MateoWartelle, dubsuar, nj-crossml, and 2 more reacted with hooray emoji

All reactions

👍 4 reactions
🎉 12 reactions

16 people reacted

Significant speedup

12 Jul 18:04

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Significant speedup

This release has a 15% GPU speedup, 3x CPU, 7x MPS. The speedup comes from new surya models for layout and text detection that are a lot more efficient.

This is a "best case" speedup, if you need to OCR or do equation recognition, the speedup will be lower. But it will still be a lot faster.

Assets 2

Loading

yiyibooks, ngirard, Kilowon, johnconnor-sec, Daerkle, h-arnold, jaelliot, Harvester62, lucasmelojs, omega-lua, and 2 more reacted with thumbs up emoji

mclevey, 651961, and ngirard reacted with laugh emoji

mattvr, 651961, FBruzzesi, ngirard, Blair-Johnson, yiyibooks, yasyf, dubsuar, cpursley, jrzkaminski, and Mutaz94 reacted with rocket emoji

All reactions

👍 12 reactions
😄 3 reactions
🚀 11 reactions

21 people reacted

Fix transformers bugs

30 Jun 15:20

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Fix transformers bugs

New transformers version introduces a new kwarg in donut models. Handle this case by ignoring it.
New transformers version breaks MPS compatibility by using torch .isin to do a comparison. Handle this by setting the pytorch mps fallback setting.

Assets 2

Loading

All reactions

Pagination, bug fixes

17 Jun 17:04

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Pagination, bug fixes

Add a setting to enable output pagination
Enable convert.py to use mps (but less memory efficient than cpu/cuda)
Fix bug with inference ram setting
Fix bug with pdf names with dots in them
Fix bug with images at the end of blocks

Assets 2

Loading

All reactions

Fix convert.py bug

30 May 01:55

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Fix convert.py bug

Fix model device check.

Assets 2

Loading

Harvester62, ggHydraLinn, and adarshmadrecha reacted with thumbs up emoji

All reactions

👍 3 reactions

3 people reacted

Specify page range

29 May 18:09

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Specify page range

Make it more clear MPS can't be used with convert.py
Specify page range in convert with start_page and max_pages

Assets 2

Loading

All reactions

Python 3.12 compatibility

28 May 22:36

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Python 3.12 compatibility

Remove ray to enable python 3.12 compatibility
Removing ray frees a lot of VRAM (since we can use torch shared tensors), so on average with convert.py each process takes 3GB VRAM. This enables much higher throughput (was between 4.5GB and 5GB before).

Assets 2

Loading

All reactions

OCR speedups

28 May 04:34

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

OCR speedups

Pull in new surya and pdftext versions for speedups in OCR and text extraction, respectively
Refine heuristics to reduce OCR false positives (and true positives, unfortunately)
Enable float batch multipliers

Assets 2

Loading

mrchengshunlong and 651961 reacted with laugh emoji

tuanbmstu, 651961, and Harvester62 reacted with heart emoji

All reactions

😄 2 reactions
❤️ 3 reactions

4 people reacted

Speed improvements

23 May 23:24

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Speed improvements

Enable parallel text extraction, with worker count settings
Bump surya version to pull in layout/line segmentation speed improvements, and OCR bug fix

Assets 2

Loading

pauloeli and SidneyRey reacted with thumbs up emoji

yiyibooks, mrchengshunlong, yasyf, heldilira, SidneyRey, and Jaitely-involead reacted with heart emoji

All reactions

👍 2 reactions
❤️ 6 reactions

7 people reacted

Faster OCR

18 May 04:28

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Faster OCR

OCR is now ~2.5x faster, due to improvements in surya

Assets 2

Loading

tcluri, mrchengshunlong, SebastianBodza, nischalj10, yiyibooks, xtyrrell, abhirupghosh, and ggHydraLinn reacted with rocket emoji

FBruzzesi, omega-lua, 651961, and xtyrrell reacted with eyes emoji

All reactions

🚀 8 reactions
👀 4 reactions

11 people reacted

Previous 1 2 Next

Previous Next

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.