Azure OpenAI Whisper hallucinates source audio language
Shamus Sim
Hi,
I have been experimenting with the Whisper service via Azure OpenAI services.
There are a lot of cases where the audio sent to the API is clearly in English, but the result comes back as Malay (Bahasa Malaysia) text, even though I am using the transcription API and not the translation API.
Sample prompt and API call
```python
transcription = client_tts.audio.transcriptions.create(
    model="whisper",
    file=audio_file,
    prompt="""Transcribe the following audio file, it is taken in medical clinic setting, 
    return the original language and the transcription. Only return English, Bahasa Malaysia, and Mandarin language"""
)
```
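For comparison, here is a variant of the call that may be worth testing. As far as I understand, the Whisper `prompt` is treated as preceding context that biases vocabulary and style rather than as an instruction to follow, and the transcription endpoint accepts an optional `language` parameter (an ISO-639-1 code) to pin the source language. The sketch below assumes the language of each recording is known up front; the helper names, the allow-list, and the shortened prompt are hypothetical and only for illustration, and this is not confirmed to remove the problem.

```python
# Hypothetical allow-list for the clinic scenario (ISO-639-1 codes):
# English, Malay, Chinese.
ALLOWED_LANGUAGES = {"en", "ms", "zh"}


def build_transcription_kwargs(audio_file, language, deployment="whisper"):
    """Build keyword arguments for client.audio.transcriptions.create.

    `deployment` is the deployment name from the original call; the
    helper itself is an illustrative assumption, not part of the SDK.
    """
    if language not in ALLOWED_LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    return {
        "model": deployment,
        "file": audio_file,
        # Pin the source language instead of describing it in the prompt.
        "language": language,
        # Short context hint; assumed wording, not an instruction.
        "prompt": "Medical clinic consultation between a doctor and a patient.",
    }


def transcribe(client, audio_file, language):
    """Call the transcription endpoint with the language pinned."""
    kwargs = build_transcription_kwargs(audio_file, language)
    return client.audio.transcriptions.create(**kwargs)
```

With the client from the original snippet this would be called as `transcribe(client_tts, audio_file, "en")`. Recordings that mix languages would not fit this approach, since only one language code can be passed per request.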