How much time did it take for a custom STT model to train?

Question

Hey! 🙂

I've just trained a custom STT model using Azure Speech Services. However, I don't know how long the training took, as I can only see the creation date. Is there any way to check how long a particular training run took?

Thanks a lot!

Answer

Hi @Bruno Goncalves Vaz (P)

Thank you for your question.

The time and cost of training a custom speech-to-text (STT) model can indeed vary significantly depending on several factors.

Training time increases with the amount of audio and transcript data. Regions with dedicated training hardware process roughly 10 hours of audio data per day, while other regions process roughly 1 hour per day, so training is faster in a dedicated-hardware region. More complex models, or models that require more data for fine-tuning, also take longer to train.

The Speech to text FAQ entry "How long does it take to train a custom model with audio data?" doesn't specifically mention adjusting epochs. Reducing the number of epochs or other training parameters might help speed up training, but this could also affect model performance.
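To make the throughput figures above concrete, here is a rough back-of-the-envelope sketch in Python. It assumes training time scales linearly with the hours of audio data, at about 10 hours of audio per day in dedicated-hardware regions and about 1 hour per day elsewhere. The function name and the linear model are illustrative assumptions, not part of any Azure SDK, and actual training times will differ.

```python
# Approximate throughput quoted in the answer above (hours of audio per day).
# These are rough figures, not guarantees from the service.
AUDIO_HOURS_PER_DAY = {"dedicated": 10.0, "other": 1.0}


def estimate_training_days(audio_hours: float, dedicated_hardware: bool) -> float:
    """Return a rough training-time estimate in days.

    Hypothetical helper: assumes training time grows linearly with the
    amount of audio data, which is a simplification.
    """
    if audio_hours < 0:
        raise ValueError("audio_hours must be non-negative")
    rate = AUDIO_HOURS_PER_DAY["dedicated" if dedicated_hardware else "other"]
    return audio_hours / rate


if __name__ == "__main__":
    for hours in (2, 20):
        fast = estimate_training_days(hours, dedicated_hardware=True)
        slow = estimate_training_days(hours, dedicated_hardware=False)
        print(f"{hours} h of audio: ~{fast:.1f} days (dedicated), ~{slow:.1f} days (other regions)")
```

Under these assumptions, a dataset with 20 hours of audio would take on the order of 2 days in a dedicated-hardware region and around 20 days elsewhere.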

I hope this helps. Thank you.
