Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

large-v3-zh经过ctranslate转换之后识别中文音频,但是结果确实英文 #586

Open
bingogo888 opened this issue May 31, 2024 · 3 comments

Comments

@bingogo888
Copy link

image
image

large-v3-zh经过ctranslate转换之后识别中文音频,但是结果确实英文,不经过ctranslate转换没有问题。
这该怎么解决?

@shuaijiang
Copy link
Collaborator

可能是tokenizer.json有问题?可以使用这个 https://huggingface.co/BELLE-2/Belle-whisper-large-v3-zh/blob/main/tokenizer.json

@paulcx
Copy link

paulcx commented Jun 21, 2024

不仅仅是英文,而且结果非常差。包括https://huggingface.co/BELLE-2/Belle-whisper-large-v3-zh-punct

@shuaijiang
Copy link
Collaborator

怎么调用的模型呢

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants