-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding translator with many generic input parameter support #782
Conversation
@tholor Please review this. |
It seems transformers pipeline for translation is not consistent. It is not correctly picking up "de" to "en" translation model. I tried multiple ways to integrate model but translation model APIs are not consistent with regards to parameters. @tholor what is view about using |
I just did a first review & testing round. I think the code design is great and fits well into the rest of Haystack. Example:
The results basically only cover a part of the input:
I thought it might be due to the If we don't find a fix here, I'd suggest we rather use the OPUS models (e.g. https://huggingface.co/Helsinki-NLP/opus-mt-en-de). They are rather small, are available for most language pairs and translation results look decent.
I would not really like to integrate an external service here. Maybe later as a second option, but our default should always be free and open source. |
Thanks I will update PR by using OPUS models |
@tholor I have done the changes you suggested. Also added End-to-end translation pipeline. Please review these changes. |
@tholor can you please look into this. I don't want to resolve merge conflicts after seeing list current PRs 🙂 |
@lalitpagaria yep, I am on it - but writing the docs always takes quite some time ;) |
Overall what is your view about pipeline changes? |
I like the idea of "wrapping" an existing pipeline. I am not sure if users will really use it rather than creating a new pipeline, but it's worth a test and ask users for feedback :) |
…t cause tensor size miss match error. If this option required by used then it can be added later.
Really liked the CI changes by @oryx1729 |
@tholor all green now |
To complete #704
Pending tasks -
Following tasks can be done in separate PR -