Speech interview on Channel 9

Some links to stuff I talk about:

The new speech API I posted about a few days ago
The app I demo when I dial 0 is running on Speech Server, and the case study I mention is here.
You may also be interested in some of our research web pages, since I mention them at one stage in the interview: Synthesis research; Speech research in Redmond; Speech research in Asia.
I also mentioned the Voice Command app for Windows Mobile.

Comments

Anonymous
June 23, 2005
Sorry about the late comment to this story. I've been falling behind on my blog reading lately.

Late in your interview with Robert Scoble, he asked you about the possibility of using speech rocognition to produce transcripts of his interviews for example. Your answer was that the results he would get would not be very good unless the the speech engine was trained to each of the spearkers voices.

I've heard about ASR engines produced by companies like Autonomy/Virage that claim to be able to do a decent job of speaker independent and unconstrained domain voice recognition for similar uses like indexing and and searching newscasts etc. Do you have experience with or an opinion about how good those engines are?

Related to this is another question I've been wondering about: Suppose you pointed an engine at a video like the interview example above, but instead of using it to produce a transcript of the interview you were only interested in finding instances of a well defined list of keywords. This would be useful in indexing and searching libraries of audio content also. Would that be an easier problem to solve for speaker independent (i.e. untrained) speech recognition?

Thanks if you find this and have the time to respond.
Anonymous
April 17, 2009
PingBack from http://blogs.msdn.com/robertbrown/archive/2005/07/14/speech-recognition-of-interviews-amp-videos.aspx