Speech Recognition engine question on Windows

clark-zh 40 Reputation points
2025-02-22T06:56:27.04+00:00

Hi MSFT member,

On windows, when developing a Speech recognition app, there are about three solution APIs:

  1. Windows.Media.SpeechRecognition
  2. System.Speech Programming Guide for .NET Framework
  3. Microsoft Speech API (SAPI)

Since We want to use local offline solution, Azure speech service is not included.

Q1. if OEM want to customize a new SR engine, which solution do you recommend?

Q2. for Windows.Media.SpeechRecognition we did not find how to set SR engine, does it support customized engine?

Q3. If OEM can customize SR engine, how to make a custom SR engine? Do you a guide document to make a SR engine? Is there any restriction?

Thanks.

Universal Windows Platform (UWP)
{count} votes

Accepted answer
  1. Junjie Zhu - MSFT 20,681 Reputation points Microsoft External Staff
    2025-02-25T08:49:15.8833333+00:00

    Hello @clark-zh ,

    Since your question is tagged with UWP, I will answer it from the UWP perspective.

    define custom recognition constraints is used to filtering ASR results. but we want to custom ASR model, which API is suitable?

    The Windows.Media.SpeechRecognition API does not natively support setting a custom SR engine. If you need to use a custom SR engine, you might need to consider System.Speech API and Microsoft Speech API, which offer Engine-Level Interfaces.

    If you need help with System.Speech API, please ask a new question with "C#" tag.

    For Windows.Media.SpeechRecognition,System.Speech.Recognition and Microsoft Speech API (SAPI), where are their default model running? CPU, GPU or NPU?

    The Windows.Media.SpeechRecognition API uses the built-in speech recognition capabilities provided by Windows, which typically run on the CPU.

    Are there any permission restrictions on the use of these APIs.

    There are no specific permission restrictions on Windows.Media.SpeechRecognition API, but you must comply with Microsoft's terms of service and any relevant privacy and security guidelines. Additionally, In UWP, you may require specific permissions to access the microphone and other resources on the device.

    Thank you.


    If the answer is the right solution, please click "Accept Answer" and kindly upvote it. If you have extra questions about this answer, please click "Comment".

    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.

    1 person found this answer helpful.
    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.