You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I see that the output can be aligned (provide per-token timestamps) if we use the CTCBeamDecoder, I wonder if I can get timestamps also if using another decoder such an lstm or transformer-based one.
The text was updated successfully, but these errors were encountered:
Currently we are not providing it. Also I know that E2E timestamps (including CTC decoder) perform relatively poorly,
How was your experience using CTCBeamDecoder?
Hey @upskyy , no worries, if I can I'll try to implement something and open a PR. I havent had much time to play with the CTCBeamDecoder yet. Will get back to you when I've tested it more
❓ Questions & Help
I see that the output can be aligned (provide per-token timestamps) if we use the CTCBeamDecoder, I wonder if I can get timestamps also if using another decoder such an lstm or transformer-based one.
The text was updated successfully, but these errors were encountered: