Replies: 1 comment
-
|
Fine-tuning Whisper on Dutch is very doable — mozilla-foundation/common_voice_17_0 has solid Dutch coverage and was used in the original training, so mixing it with domain-specific recordings is the standard approach. Hugging Face's |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I’d like to inquire whether current ASR models support fine-tuning to improve recognition accuracy for an existing low-resource language, specifically Dutch.
If such fine-tuning is feasible, could you kindly provide a general list of the datasets that have already been used for training? This would help us avoid potential data duplication during subsequent training for Dutch and mitigate issues such as overfitting or catastrophic forgetting.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions