How to train model for more than 2 speakers? #19

Sonish-Maharjan-2014 · 2024-01-03T10:16:29Z

I trained the model (form NBSS) branch for 2 speakers separation using wsj0 dataset. It perfectly worked. But now I want to train the model for more than 2 speakers. What steps should I follow?

quancs · 2024-01-11T10:27:28Z

Hello, thank you for your insterests in our works. To train the model for a dataset where each utterance more than 2 speakers, you can change the number of output channels to 2N (N speakers and each has 2 numbers for the real and imginary parts of STFT coefficients) for each TF-bin.

Sonish-Maharjan-2014 · 2024-01-11T13:36:55Z

Thank you for your response.. I tried adapting the code for four speakers. I generated room impulse responses (RIR) for the four speakers and made some adjustments in the code. Unfortunately, I ran into an error towards the end of the process.

Could you help me fix the problem?

quancs · 2024-01-26T01:49:47Z

You can debug your code to check the shape of echoics, echoic_i, and the value of needed_lens

Sonish-Maharjan-2014 · 2024-02-06T03:48:53Z

Thanks, I will try to debug

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to train model for more than 2 speakers? #19

How to train model for more than 2 speakers? #19

Sonish-Maharjan-2014 commented Jan 3, 2024

quancs commented Jan 11, 2024

Sonish-Maharjan-2014 commented Jan 11, 2024

quancs commented Jan 26, 2024

Sonish-Maharjan-2014 commented Feb 6, 2024

How to train model for more than 2 speakers? #19

How to train model for more than 2 speakers? #19

Comments

Sonish-Maharjan-2014 commented Jan 3, 2024

quancs commented Jan 11, 2024

Sonish-Maharjan-2014 commented Jan 11, 2024

quancs commented Jan 26, 2024

Sonish-Maharjan-2014 commented Feb 6, 2024