Hi Yu Lan,
The WordBoundary event in the Azure Speech SDK is raised during synthesis, before the corresponding word is played back. Each event carries the word's text and its audio offset, which is the timing information you need to highlight words as they are spoken.
However, the Synthesizing event, which delivers the audio data, can be processed at a different pace than the WordBoundary events. In practice, the WordBoundary event for a word is generally fired before the audio for that word is played, so you can rely on the boundary information being available by the time you play the audio.
This means you should receive the word boundary information in time to highlight each word as the audio plays. Make sure your event handlers for WordBoundary and Synthesizing are set up correctly, and in particular that both are subscribed before synthesis starts.
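To illustrate one way of using the boundary information at playback time, here is a minimal sketch in Python. It does not call the Speech SDK itself: `WordHighlighter`, `add_boundary`, and `word_at` are hypothetical names made up for this example. The idea is to buffer each word and its audio offset as the WordBoundary events arrive, then look up the word to highlight from the current playback position. It assumes the audio offset is reported in ticks of 100 nanoseconds, and that you can obtain the playback position in milliseconds from your audio player.

```python
import bisect

# Assumption: audio offsets are in ticks of 100 ns, so 10,000 ticks = 1 ms.
TICKS_PER_MS = 10_000


class WordHighlighter:
    """Hypothetical helper: buffers word boundaries and maps a playback
    position to the word that should currently be highlighted."""

    def __init__(self):
        self._offsets_ms = []  # word start times in ms, kept sorted
        self._words = []       # word text, parallel to _offsets_ms

    def add_boundary(self, audio_offset_ticks, text):
        # Call this from your WordBoundary handler with the event's
        # audio offset and text. Insertion keeps the lists sorted even
        # if events are recorded out of order.
        offset_ms = audio_offset_ticks / TICKS_PER_MS
        i = bisect.bisect_right(self._offsets_ms, offset_ms)
        self._offsets_ms.insert(i, offset_ms)
        self._words.insert(i, text)

    def word_at(self, playback_ms):
        # Return the last word whose start time is <= the playback
        # position, or None if no word has started yet.
        i = bisect.bisect_right(self._offsets_ms, playback_ms) - 1
        return self._words[i] if i >= 0 else None


highlighter = WordHighlighter()
highlighter.add_boundary(500_000, "Hello")    # starts at 50 ms
highlighter.add_boundary(4_000_000, "world")  # starts at 400 ms

print(highlighter.word_at(0))    # None
print(highlighter.word_at(100))  # Hello
print(highlighter.word_at(450))  # world
```

Because the lookup is driven by the playback position rather than by when the events arrive, it does not matter how far ahead of playback the WordBoundary events are received, as long as each one arrives before its word is played.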
Hope this helps. Do let us know if you have any further queries.
If this answers your query, please click "Accept Answer" and "Yes" for "Was this answer helpful".
Thank You.