Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support float16 on ARM CPUs with native float16 support #1153

Open
FlippFuzz opened this issue Mar 30, 2023 · 3 comments
Open

Support float16 on ARM CPUs with native float16 support #1153

FlippFuzz opened this issue Mar 30, 2023 · 3 comments
Labels
cpu Issues related to CPU execution enhancement New feature or request

Comments

@FlippFuzz
Copy link

From SYSTRAN/faster-whisper#65


Some CPUs such as ARM Neoverse-N1 (Oracle Cloud free tier) support FP16 computation.
It would be nice to have this feature because there could be up to a 2x speedup in computation speed compared to float32.

I'm just creating the enhancement request and understand that there might not be a focus on this because it only applies to a small subset of CPUs.

@ephemer
Copy link

ephemer commented Apr 14, 2023

FWIW Apple Silicon CPUs also support FP16 so that adds some more potential consumers here

@nlgtuankiet
Copy link

I would love to use faster-whisper instead of whisper.cpp but the lack of FP16 on CPU is kind of the deal breaker for me.
For now, with faster-whisper I have to choose between speed (int8) and accuracy (float32), FP16 is the missing balance point.
Consider that ARM CPUs are very popular nowadays (mobile phone, Apple M series, cloud providers)
I think this is a good idea to consider on fp16 support.
Do you have any thoughts about this @guillaumekln

@bil-ash
Copy link

bil-ash commented Dec 17, 2023

Not just arm and apple silicon, latest intel and AMD CPUs (I am using an AMD one) also support float16. Would be very nice if int8_int16 inference is supported on the CPUs which support float16.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cpu Issues related to CPU execution enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

5 participants