Slower than openai whisper with my gpu #181

Closed
tuotuoshao opened this issue Feb 23, 2024 · 2 comments

Env
device: A30
model version: large-v2
CUDA version: 11

Measurements

| audio length | openai-whisper | whisper-jax |
| ------------ | -------------- | ----------- |
| 57 min       | 450 s          | 400 s       |
| 90 s         | 20 s           | 38 s        |
| 2 s          | 1 s            | 5 s         |

All my measurements are taken on the second inference. Only when the audio is long enough is whisper-jax faster, and even then the gain is less than 10%. For short audio, whisper-jax is several times slower than openai-whisper.

Code config:
openai config:
model = whisper.load_model("large-v2")
whisper-jax config:
pipeline = FlaxWhisperPipline("openai/whisper-large-v2", dtype=jnp.float16, batch_size=16)
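
For context, a minimal sketch of how such a comparison is typically run (the audio path is a placeholder and the timing code is my own, not from the issue); it times the second call of each backend so that whisper-jax's one-off JIT compilation is excluded, matching "second inference" above:

```python
import time

import jax.numpy as jnp
import whisper
from whisper_jax import FlaxWhisperPipline

AUDIO = "audio.mp3"  # placeholder path; swap in the 2 s / 90 s / 57 min test files

# openai-whisper: load once, run twice, time only the second pass
model = whisper.load_model("large-v2")
model.transcribe(AUDIO)                       # warm-up run
start = time.perf_counter()
model.transcribe(AUDIO)                       # timed second inference
print("openai-whisper:", time.perf_counter() - start, "s")

# whisper-jax: the first call JIT-compiles the model, so it is excluded from timing
pipeline = FlaxWhisperPipline("openai/whisper-large-v2", dtype=jnp.float16, batch_size=16)
pipeline(AUDIO)                               # warm-up / compilation run
start = time.perf_counter()
pipeline(AUDIO)                               # timed second inference
print("whisper-jax:", time.perf_counter() - start, "s")
```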

ewwink commented Mar 10, 2024

Try batch_size=1, or whatever number of GPUs you have.
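
A sketch of that suggestion, assuming everything except batch_size stays as in the original config (the audio path is a placeholder):

```python
import jax.numpy as jnp
from whisper_jax import FlaxWhisperPipline

# Same checkpoint and dtype as above, but batch_size=1 avoids padding short
# audio up to a 16-way batch that can never be filled.
pipeline = FlaxWhisperPipline("openai/whisper-large-v2", dtype=jnp.float16, batch_size=1)
text = pipeline("audio.mp3")  # placeholder path
```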

@flexchar

This is normal: if your batch is small, there isn't room for a win. I wouldn't call it an issue. :)
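
To put rough numbers on that, assuming whisper-jax's default 30-second chunking (chunk_length_s=30):

```python
import math

# Back-of-the-envelope chunk counts for the three test files. With batch_size=16,
# only the 57 min file produces enough 30 s chunks to fill whole batches; the
# 2 s and 90 s files each fit in a single, mostly empty batch.
CHUNK_S = 30
BATCH = 16
for name, seconds in [("2 s", 2), ("90 s", 90), ("57 min", 57 * 60)]:
    chunks = math.ceil(seconds / CHUNK_S)
    print(f"{name}: {chunks} chunk(s) -> {math.ceil(chunks / BATCH)} batch(es) of {BATCH}")
```

That lines up with the timings above: the 57-minute file is the only case where batching has anything to work with, and the only case where whisper-jax pulls ahead.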
