Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How we can train languages other than Korean? #31

Closed
usamaehsan opened this issue Jul 3, 2020 · 6 comments
Closed

How we can train languages other than Korean? #31

usamaehsan opened this issue Jul 3, 2020 · 6 comments

Comments

@usamaehsan
Copy link

No description provided.

@sooftware
Copy link
Owner

If you have dataset (audio & transcript), You can train, but you have to modify a few things.

  1. First, you should make a character label like this.
  2. Second, you have to convert transcripts to ids on character labels.
  3. The code should be modified considering the structure of the dataset. (You will need to modify the kospech/data directory.)

If you have any other questions, please leave an issue at any time.

@usamaehsan
Copy link
Author

@sooftware can you please tell whats freq in aihub_labels.csv?

@sooftware
Copy link
Owner

Freq is not very important. Freq (frequency) is a count of how often it appeared in a dataset.
Freq is not used in training, so you don't have to worry.

@sooftware sooftware pinned this issue Jul 4, 2020
@usamaehsan
Copy link
Author

thanks,you are very helpful
i am new to this feild and there is no speech recognition available for my language urdu thats why i am trying to build urdu speech recognition system using this repo

@sooftware
Copy link
Owner

It's a great honor. I want to help you as much as I can, so please leave an issue or contact me at sh951011@gmail.com at any time.
During the week, I may be busy and answer late, but I will answer as soon as possible on the weekend.

@usamaehsan
Copy link
Author

thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants