Hi @clark-zh, welcome to Microsoft Q&A.
It seems you have already asked the UWP community for input.
I will answer what I can from the C# perspective.
For Windows.Media.SpeechRecognition, System.Speech.Recognition and Microsoft Speech API (SAPI), where do their default models run: CPU, GPU or NPU?
They all run on the CPU by default. Windows.Media.SpeechRecognition is designed for UWP apps and may use an NPU on newer devices.
System.Speech.Recognition is the .NET managed wrapper around the desktop SAPI engine and uses only the CPU. Microsoft Speech API (SAPI) is a legacy API and does not support GPU/NPU.
Are there any permission restrictions on the use of these APIs?
Windows.Media.SpeechRecognition (UWP):
- Requires microphone and speech permissions; the microphone is declared under "Capabilities" in the app manifest.
- Speech recognition requires user consent.
- Some features are regionally restricted (e.g. some languages may not work offline).
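As a minimal sketch of the manifest side of this, the microphone capability is declared in the app's Package.appxmanifest. Only the `Capabilities` element is shown here; the rest of the manifest is assumed to be whatever your project template generated.

```xml
<!-- Package.appxmanifest (fragment): lets the app request microphone access -->
<Capabilities>
  <DeviceCapability Name="microphone" />
</Capabilities>
```

Declaring the capability does not grant access by itself; the user is still prompted for consent the first time the app uses the microphone.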
System.Speech.Recognition (.NET Framework): No strict permission requirements, but limited to the speech recognizers installed on the system.
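To see what is actually installed on a given machine, a small console program can list the available recognition engines. This is only a sketch: it assumes a Windows desktop project that references System.Speech (the System.Speech.dll assembly on .NET Framework, or the System.Speech NuGet package on newer .NET), and the class name `ListRecognizers` is just an illustrative choice.

```csharp
using System;
using System.Speech.Recognition;

class ListRecognizers
{
    static void Main()
    {
        // Each RecognizerInfo describes one speech recognition engine
        // installed on this machine, including the culture it supports.
        foreach (RecognizerInfo info in SpeechRecognitionEngine.InstalledRecognizers())
        {
            Console.WriteLine($"{info.Name} ({info.Culture.Name})");
        }
    }
}
```

If the list is empty or lacks the language you need, recognition for that language will not work until the corresponding language pack or engine is installed.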
Microsoft Speech API (SAPI):
- No explicit permission requirements, but requires SAPI-compatible engines (recognizers or voices) to be installed.
- Some voices have license restrictions.
For more details, I recommend asking someone who specializes in these speech APIs.
Best Regards,
Jiale