Introduction
In this module, we'll create a Mixed Reality application that explores the use of Azure Speech Services with the HoloLens. Azure Speech Services is the unification of speech-to-text, text-to-speech, and speech translation into a single Azure subscription.
Imagine a scenario where you're expected to build an application capable of translating your speech into other languages. You'd want a brief overview of incorporating that feature in your application, and Azure Speech Services can provide that.
Upon completion of this module, we'll be able to use your device's microphone to transcribe speech to text in real time. In addition, we'll be able to translate our speech into other languages and use the Intent-recognition feature to understand voice commands using artificial intelligence.
You can find a completed example of this tutorial here.
Learning objectives
- Learn how to integrate Azure Speech Services with a HoloLens 2 application
- Learn how to use speech recognition to transcribe text
- Learn how Azure speech recognition can be used to execute commands
- Learn how to integrate Azure speech translation
- Learn how to set up intent, entities, and utterances in the Language Studio portal
- Learn how to implement intent and natural-language understanding in our application
Prerequisites
- A Windows 10 PC configured with the correct tools.
- Windows 10 SDK 10.0.18362.0 or later.
- Unity Hub with Unity 2021.3 or later installed and the Universal Windows Platform Build Support module added.
- Set up a mixed reality project in Unity module.
- Mixed Reality Feature Tool.