Overview of VoiceXML Support in UCMA 3.0
VoiceXML support in the Unified Communications Managed API (UCMA) version 3.0 consists of a VoiceXML runtime and the .NET namespace Microsoft.Rtc.Collaboration.AudioVideo.VoiceXml.
The namespace consists of the Browser class, which provides access to the VoiceXML runtime and to the telephony and audio functionality in UCMA 3.0. This functionality facilitates automated communication with callers.
Note
Browser derives from the Browser class in the Microsoft.Speech.VoiceXml namespace.
To run a VoiceXML application in UCMA 3.0, you must create a .NET application. Your .NET application answers calls, launches a Browser for the VoiceXML session, and responds to Browser events. See Tasks of the Host .NET Application for more information.
Also, your .NET application can track how the VoiceXML interpreter executes elements in VoiceXML documents. You can use the event-driven feedback from VoiceXML processing to identify problem areas or inefficiencies in VoiceXML documents, and to tune the performance of VoiceXML applications. See Browser Class (Microsoft.Speech) for more information about the events that a Browser instance generates during a VoiceXML session.
VoiceXML Runtime
The VoiceXML runtime includes a VoiceXML interpreter that processes the elements in VoiceXML documents. The VoiceXML interpreter processes elements that control dialog flow, including variables, events, and executable elements. Other VoiceXML elements specify parameters such as timeouts and other settings that govern resource fetching.
The VoiceXML interpreter also processes speech recognition grammars and speech synthesis prompts that are included inline or linked externally from the VoiceXML document. Grammars define which words the application will recognize in VoiceXML dialogs, while speech synthesis prompts contain the text that the application will speak. The VoiceXML interpreter may also instruct the Browser to access and play back to the caller pre-recorded audio files instead of synthesized speech.
Browser
The Browser acquires the documents for the VoiceXML runtime to process, and carries out instructions from the VoiceXML interpreter to communicate with a caller.
To communicate with a caller, the Browser connects to audio functionality in UCMA 3.0, such as speech recognition, Dual-Tone Multi-Frequency (DTMF) tone recognition, audio recording, speech synthesis, and audio playback.
While processing the elements in VoiceXML documents, the Browser generates events that inform the hosting .NET application about the status of the call.
Creating VoiceXML Applications
The Unified Communications Managed API (UCMA) 3.0 documentation does not instruct developers on creating VoiceXML applications. See the VoiceXML Tutorial on MSDN for instructions and examples that will introduce you to VoiceXML authoring.
Additional resources include the following specifications of the World Wide Web Consortium (W3C):