-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Explore uploading models to Hugging Face #804
Comments
Most of the scripts I looked at don't support the Transformer-RNN structure that we use. Plus we'd have to support the |
The HPLT models on HF are 300 Mb in size, so they look more like a Teacher model with transformer-base architecture. We can explore how to upload the student models without conversion and whether they will be usable later somehow. Maybe having proper Python bindings can help to integrate it with HF pipelines. |
Yes, but it's ONNX. I'm sure you can implement everything that's in Marian in Pytorch. It doesn't mean we have to go this way though as it's definitely some work but it would be interesting to explore. |
We could start by uploading teacher models. |
The investigation wasn't just for ONNX, as there are links out to other converters. But yes, I agree that if we can get out to other formats it gives us much more options on how we can run these things. |
It will likely require converting them to HF format/Pytorch, similar to how it's done for the OPUS-MT and HPLT models:
https://huggingface.co/Helsinki-NLP/opus-mt-zh-en
https://huggingface.co/HPLT/translate-sw-en-v1.0-hplt_opus/tree/main
I also found this converter:
https://github.com/huggingface/transformers/blob/main/src/transformers/models/marian/convert_marian_to_pytorch.py
There might be useful code here as well:
https://github.com/hplt-project/HPLT-MT-Models/tree/main/v1.0/raw_scripts
The text was updated successfully, but these errors were encountered: