A way to execute hydra_train.py for Ksponspeech #191

Open
taejin0128 opened this issue Jan 11, 2023 · 0 comments

taejin0128 commented Jan 11, 2023

❓ Questions & Help

  1. Is it possible to build an STT model that transcribes code-switched speech, i.e. a single utterance that mixes two languages? For example, transcribing speech as the text '현재 비행기가 Turbulence로 인해 흔들리고 있습니다.' ("The plane is currently shaking due to turbulence.")

    1-1) If it is possible, how can I train such a model?

  2. If I want to train on Korean data from a specific field, which of the following is the more common way to add data to a pre-trained model?
    Method 1.
    step 1) Train a model on the Ksponspeech data
    step 2) Continue training the model from step 1 on the Korean data from the specific field

    Method 2. (I think transforming the data so that it can be trained together takes too much time and effort)
    step 1) Build a combined dictionary for the Ksponspeech data and the other Korean data
    step 2) Train on all the data (Ksponspeech and the other Korean data) together

  3. To improve recognition of terminology in a specific field, is it appropriate to train the additional data with the "subword" option on a pre-trained model that was trained on Ksponspeech data in "character" mode?
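
For reference, step 1 of Method 2 (the combined dictionary) could be sketched roughly as below. This is only an illustration under my own assumptions: the transcripts are plain lists of strings, the vocabulary is character-level, and the special tokens and the helper names `build_char_vocab` and `encode` are hypothetical. It is not the dictionary format used by any particular toolkit.

```python
from collections import Counter

# Hypothetical special tokens; a real toolkit may use different ones.
SPECIAL_TOKENS = ["<pad>", "<sos>", "<eos>", "<unk>"]


def build_char_vocab(*corpora):
    """Map every character seen in any of the corpora to an integer id."""
    counts = Counter()
    for transcripts in corpora:
        for line in transcripts:
            counts.update(line.strip())
    # Most frequent characters first; ties broken by code point so ids are deterministic.
    chars = sorted(counts, key=lambda c: (-counts[c], c))
    return {token: idx for idx, token in enumerate(SPECIAL_TOKENS + chars)}


def encode(text, vocab):
    """Convert text to ids, falling back to <unk> for unseen characters."""
    unk = vocab["<unk>"]
    return [vocab.get(ch, unk) for ch in text]


# Toy stand-ins for the two transcript sets (not real Ksponspeech data).
general = ["안녕하세요", "오늘 날씨가 좋습니다"]
domain = ["현재 비행기가 Turbulence로 인해 흔들리고 있습니다."]

vocab = build_char_vocab(general, domain)
ids = encode(domain[0], vocab)
```

Because the vocabulary is built from both corpora at once, the Hangul syllables and the Latin letters of the mixed sentence from question 1 all receive ids, while a character that appears in neither corpus maps to `<unk>`.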

Details

@taejin0128 taejin0128 changed the title The way to execute hydra_train.py for Ksponspeech A way to execute hydra_train.py for Ksponspeech Jan 15, 2023