dataset #10

Open
PriyankaPaud opened this issue May 22, 2023 · 18 comments
Comments

@PriyankaPaud

Where can I find the WSJ0 dataset to run the project? Can I use other datasets instead of WSJ0?

@PriyankaPaud
Author

How can I convert .wv1 files into .wav files?

@quancs
Member

quancs commented May 23, 2023

> Where can I find the WSJ0 dataset to run the project? Can I use other datasets instead of WSJ0?

You can download the WSJ0 dataset from https://catalog.ldc.upenn.edu/LDC93S6A

@quancs
Member

quancs commented May 23, 2023

> How can I convert .wv1 files into .wav files?

You can use the sph2pipe tool
sph2pipe.zip
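
For converting a whole directory tree, a small wrapper around sph2pipe can help. The following is a minimal sketch, not part of the repo: it assumes `sph2pipe` is installed and on your `PATH`, and that `sph2pipe -f wav <input> <output>` is the invocation you want. The function names and the mirrored output layout are illustrative choices.

```python
import subprocess
from pathlib import Path


def plan_conversions(src_root, dst_root, suffixes=(".wv1",)):
    """Return (source, target) pairs that mirror src_root under dst_root."""
    src_root, dst_root = Path(src_root), Path(dst_root)
    pairs = []
    for path in sorted(src_root.rglob("*")):
        # Match the suffix case-insensitively, since some copies use upper case.
        if path.is_file() and path.suffix.lower() in suffixes:
            target = dst_root / path.relative_to(src_root).with_suffix(".wav")
            pairs.append((path, target))
    return pairs


def build_command(source, target, sph2pipe="sph2pipe"):
    """Build the command line for one file: sph2pipe -f wav <in> <out>."""
    return [sph2pipe, "-f", "wav", str(source), str(target)]


def convert_all(src_root, dst_root, sph2pipe="sph2pipe"):
    """Convert every matching file; raises if sph2pipe fails on any of them."""
    for source, target in plan_conversions(src_root, dst_root):
        target.parent.mkdir(parents=True, exist_ok=True)
        subprocess.run(build_command(source, target, sph2pipe), check=True)
```

Calling `convert_all("path/to/wsj0", "path/to/wsj0_wav")` would then write one `.wav` per `.wv1` file under the output directory; both paths here are placeholders.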

@PriyankaPaud
Author

I can't find the model in the GitHub repo, and I want to test it. It seems that pushing the model is blocked by the .gitignore file. Can anyone help me with the model file?

@quancs
Member

quancs commented May 23, 2023

What do you mean by "the model"?

@PriyankaPaud
Author

I mean the actual model, the one trained by the team.

@quancs
Member

quancs commented May 23, 2023

The trained models are not uploaded in the repo.

@PriyankaPaud
Author

Can I get access to the trained model? It would be a great help.

@quancs
Member

quancs commented May 23, 2023

Which model do you want?

@quancs
Member

quancs commented May 23, 2023

Actually, you can train the models, such as NB-BLSTM and NBC2, from scratch using the scripts provided in the repo.

@PriyankaPaud
Author

Yes, we will do that once we have tested on our data. For testing purposes, we need the model first.

@quancs
Member

quancs commented May 23, 2023

The models we trained used our private code (we cleaned up the code by removing or renaming some classes and functions), so loading the weights might take some additional work. Alternatively, you can train the model from scratch so that the resulting model matches the code.

NBC2_small - epoch99_neg_si_sdr-19.0141.zip
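
If the mismatch is mostly renamed classes or attributes, the additional work often comes down to rewriting parameter-name prefixes before loading. The sketch below illustrates that idea on plain dictionaries. The key names in it are hypothetical, since the actual names in the attached checkpoint are not stated in this thread, and the helper functions are not part of the repo.

```python
def remap_keys(state_dict, renames):
    """Return a copy of state_dict with key prefixes rewritten.

    renames is a list of (old_prefix, new_prefix); the first match wins.
    """
    remapped = {}
    for key, value in state_dict.items():
        new_key = key
        for old, new in renames:
            if key.startswith(old):
                new_key = new + key[len(old):]
                break
        remapped[new_key] = value
    return remapped


def diff_keys(model_keys, checkpoint_keys):
    """List keys the model expects but lacks, and keys it does not know."""
    model_keys, checkpoint_keys = set(model_keys), set(checkpoint_keys)
    return {
        "missing": sorted(model_keys - checkpoint_keys),
        "unexpected": sorted(checkpoint_keys - model_keys),
    }


# Hypothetical names for illustration only; inspect the real files for yours.
checkpoint = {"arch.old_block.weight": 1, "arch.old_block.bias": 2}
renames = [("arch.old_block.", "arch.new_block.")]
remapped = remap_keys(checkpoint, renames)
report = diff_keys(["arch.new_block.weight", "arch.new_block.bias"], remapped)
```

With PyTorch, the dictionary would typically come from `torch.load(path, map_location="cpu")`, and `model.load_state_dict(remapped, strict=False)` reports missing and unexpected keys in a similar way. Whether the weights sit under a nested `"state_dict"` entry depends on how the checkpoint was saved, which is not confirmed here.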

@PriyankaPaud
Author

Okay, thank you so much for this.

@quancs
Member

quancs commented May 23, 2023

You're welcome.

@PriyankaPaud
Author

Do we need to include all of the data provided in WSJ0 for training, or will a subset also work?

@quancs
Member

quancs commented May 24, 2023

Not all the data in WSJ0, only the utterances used in WSJ0-2mix.
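
One way to find out which utterances those are is to read them from the mixture lists that accompany the WSJ0-2mix generation scripts. The sketch below assumes each line of such a list has the form `<path1> <snr1> <path2> <snr2>`. That format is an assumption to check against your own copy, and the paths in the example are made-up placeholders.

```python
def utterances_from_mix_list(lines):
    """Collect the unique source utterance paths named in a mix list.

    Assumed line format: "<path1> <snr1> <path2> <snr2>".
    """
    utterances = set()
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        # Paths sit at even positions, SNR values at odd positions.
        utterances.update(fields[0::2])
    return sorted(utterances)


# Placeholder lines for illustration; real lists name actual WSJ0 files.
example = [
    "wsj0/si_tr_s/spkA/utt1.wav 1.5 wsj0/si_tr_s/spkB/utt2.wav -1.5",
    "wsj0/si_tr_s/spkA/utt1.wav 0.5 wsj0/si_tr_s/spkC/utt3.wav -0.5",
]
needed = utterances_from_mix_list(example)
```

Running the function over the training, validation, and test lists would give the subset of WSJ0 files that needs to be converted and kept.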

@PriyankaPaud
Author

Can you share the unmixed dataset that can be used directly with the training script? I see that the dataset path and the dataset name are hardcoded. Can you help here? I just want a dataset that will let me run the script. Thank you. I have a dataset, but the training process gets killed after only one epoch.

@PriyankaPaud
Author

PriyankaPaud commented May 24, 2023

Do you have the list of utterances that were used during training? It's very hard to work out from wsj0-mix.
