Replies: 1 comment 7 replies
-
Segment size controls how large each chunk of audio data gets loaded onto the GPU. The bigger it is, the less time you spend on swapping data between RAM and VRAM, thus the faster the processing. Also the bigger the segment size, the less it needs to rely on overlapping, thus might give better results. Set it to as large as your VRAM allows. Overlap presumably controls how much the audio data chunks are overlapped. I just leave it as default and probably won't change it unless there are artifacts in the result that I suspect to be caused by insufficient context given to the model (like gaps in the audio). |
Beta Was this translation helpful? Give feedback.
7 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
How do you even configure that?
Beta Was this translation helpful? Give feedback.
All reactions