TTS introduction

Article
01/14/2009

TTS means "Text to speech" which is the technology to let computer speak to you.

In short, you can input the text to computer, then a computer algorithm (so called TTS engine) will analyze the text, determine the pronunciation, synthesize the sound with some mathematical models. The output of a TTS engine is usually kind of sound data like PCM wave data etc.

On Windows platform, TTS functionality can be called via Speech API. There are two Speech APIs as below:

- System.Speech: managed

- SAPI: unmanaged

To get Speech API SDK, here is the good source to start

https://blogs.msdn.com/chuckop/archive/2008/02/26/speech-content-in-the-windows-sdk.aspx

I will discuss some aspects on how to use TTS via Speech API in future posts.

The user scenerios of TTS are mainly related to human machine interaction. Together with SR (Speech Recognition), Speech technology provides the foundmentals for next generation of natual user interface.

Dear readers, do you use TTS in your application or scenerio? How do you think about it? I'd like to hear comments or story from you.

Comments

Anonymous
November 23, 2009
I did try using TTS in certain application. I found that it is really did bring some improvement for the people with deaf impairment. But, many TTS are in English version. Is there possible to have TTS with different language version? How to develop it?
Anonymous
January 16, 2010
yes, there are TTS for other languages http://blogs.msdn.com/speak/archive/2009/05/03/programming-with-latest-ms-tts-engines.aspx

Partager via

TTS introduction

Comments

Ressources supplémentaires