-
Notifications
You must be signed in to change notification settings - Fork 293
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
More General OCR #50
Comments
--type format is OK. |
I tried that, it didn't work. It didn't extract chemical formula. |
The full command is the same as 'format' OCR. |
I couldn't install it on colab from source code, so I'm using via HuggingFace Pipeline. res = model.chat(tokenizer, image_file, ocr_type='format') https://huggingface.co/stepfun-ai/GOT-OCR2_0 I've also tried the HF demo - https://huggingface.co/spaces/ucaslcl/GOT_online |
Thank you very much, I understand, can you please share the 3 example images used in the More General OCR section. I tried with musical notes and geometry images but it didn't work but probably my images are too different from whats the model has been trained on. |
Hi, the benchmark.zip includes samples. |
On the example image, More General OCR at the bottom,
music notes, chemical compound, some geometrical shapes are shown.
Whats the python command to extract such things?
I've tried all the example code provided, but they are extracting as plain text only.
Thanks,
The text was updated successfully, but these errors were encountered: