AudioTranscriptionSegment Class
- java.lang.Object
  - com.azure.ai.openai.models.AudioTranscriptionSegment
Implements
JsonSerializable<AudioTranscriptionSegment>
public final class AudioTranscriptionSegment
implements JsonSerializable<AudioTranscriptionSegment>
Extended information about a single segment of transcribed audio data. Segments generally represent roughly 5-10 seconds of speech. Segment boundaries typically occur between words but not necessarily sentences.
Method Summary
Modifier and Type | Method and Description |
---|---|
static AudioTranscriptionSegment | fromJson(JsonReader jsonReader) Reads an instance of AudioTranscriptionSegment from the JsonReader. |
double | getAvgLogprob() Get the avgLogprob property: The average log probability associated with this audio segment. |
double | getCompressionRatio() Get the compressionRatio property: The compression ratio of this audio segment. |
Duration | getEnd() Get the end property: The time at which this segment ended relative to the beginning of the transcribed audio. |
int | getId() Get the id property: The 0-based index of this segment within a transcription. |
double | getNoSpeechProb() Get the noSpeechProb property: The probability of no speech detection within this audio segment. |
int | getSeek() Get the seek property: The seek position associated with the processing of this audio segment. |
Duration | getStart() Get the start property: The time at which this segment started relative to the beginning of the transcribed audio. |
double | getTemperature() Get the temperature property: The temperature score associated with this audio segment. |
String | getText() Get the text property: The transcribed text that was part of this audio segment. |
List<Integer> | getTokens() Get the tokens property: The token IDs matching the transcribed text in this audio segment. |
JsonWriter | toJson(JsonWriter jsonWriter) |
Methods inherited from java.lang.Object
Method Details
fromJson
public static AudioTranscriptionSegment fromJson(JsonReader jsonReader)
Reads an instance of AudioTranscriptionSegment from the JsonReader.
Parameters:
Returns:
Throws:
getAvgLogprob
public double getAvgLogprob()
Get the avgLogprob property: The average log probability associated with this audio segment.
Returns:
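As a rough illustration of how an average log probability can be read, the sketch below converts it to an approximate per-token probability. It assumes the value is a natural logarithm, which this page does not state; the class and method names are hypothetical and not part of the SDK.

```java
public final class LogprobSketch {
    private LogprobSketch() { }

    /**
     * Converts an average log probability to a probability in (0, 1].
     * Assumption: the log is base e. The result is the geometric mean
     * of the per-token probabilities, not their arithmetic mean.
     */
    public static double toAverageTokenProbability(double avgLogprob) {
        return Math.exp(avgLogprob);
    }

    public static void main(String[] args) {
        double avgLogprob = -0.25; // hypothetical value, e.g. from getAvgLogprob()
        System.out.printf("average token probability: %.3f%n",
            toAverageTokenProbability(avgLogprob));
    }
}
```

Values closer to 0 map to probabilities closer to 1, so a strongly negative average suggests the model was less certain about the segment's text.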
getCompressionRatio
public double getCompressionRatio()
Get the compressionRatio property: The compression ratio of this audio segment.
Returns:
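This page does not define how the compression ratio is computed. The sketch below assumes the convention used by the open-source Whisper model: the UTF-8 byte length of the segment text divided by its zlib-compressed length. Treat that as an assumption rather than the service's documented behavior; the class and method names are hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.Deflater;

public final class CompressionRatioSketch {
    private CompressionRatioSketch() { }

    /** Returns (UTF-8 byte length) / (zlib-compressed byte length) for the text. */
    public static double compressionRatio(String text) {
        byte[] input = text.getBytes(StandardCharsets.UTF_8);
        Deflater deflater = new Deflater(); // default level, zlib wrapper
        deflater.setInput(input);
        deflater.finish();

        byte[] buffer = new byte[1024];
        long compressedLength = 0;
        // Drain the compressor; only the total output size matters here.
        while (!deflater.finished()) {
            compressedLength += deflater.deflate(buffer);
        }
        deflater.end();
        return (double) input.length / compressedLength;
    }

    public static void main(String[] args) {
        StringBuilder repetitive = new StringBuilder();
        for (int i = 0; i < 200; i++) {
            repetitive.append("la ");
        }
        System.out.println(compressionRatio(repetitive.toString()));
        System.out.println(compressionRatio("The quick brown fox jumps over the lazy dog."));
    }
}
```

Under this assumption, highly repetitive text compresses well and yields a high ratio, which is why an unusually high value can hint at a looping or degenerate transcription.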
getEnd
public Duration getEnd()
Get the end property: The time at which this segment ended relative to the beginning of the transcribed audio.
Returns:
getId
public int getId()
Get the id property: The 0-based index of this segment within a transcription.
Returns:
getNoSpeechProb
public double getNoSpeechProb()
Get the noSpeechProb property: The probability of no speech detection within this audio segment.
Returns:
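One possible use of the no-speech probability is to flag segments that are probably silence or noise. The sketch below combines it with the average log probability; the thresholds 0.6 and -1.0 are illustrative assumptions modeled on defaults in the open-source Whisper implementation, not values defined by this class, and the helper names are hypothetical.

```java
public final class SilenceHeuristicSketch {
    // Illustrative thresholds (assumptions, tune for your own data).
    static final double NO_SPEECH_THRESHOLD = 0.6;
    static final double AVG_LOGPROB_THRESHOLD = -1.0;

    private SilenceHeuristicSketch() { }

    /**
     * Flags a segment as likely silence when the no-speech probability is high
     * and the model was also not confident in the text it produced.
     */
    public static boolean isLikelySilence(double noSpeechProb, double avgLogprob) {
        return noSpeechProb > NO_SPEECH_THRESHOLD && avgLogprob <= AVG_LOGPROB_THRESHOLD;
    }

    public static void main(String[] args) {
        // Hypothetical values, e.g. from getNoSpeechProb() and getAvgLogprob().
        System.out.println(isLikelySilence(0.9, -1.5)); // high no-speech, low confidence
        System.out.println(isLikelySilence(0.9, -0.2)); // high no-speech, confident text
    }
}
```

Requiring both conditions avoids discarding segments where the model is confident in its text despite a high no-speech score.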
getSeek
public int getSeek()
Get the seek property: The seek position associated with the processing of this audio segment. Seek positions are expressed as hundredths of seconds. The model may process several segments from a single seek position, so while the seek position will never represent a later time than the segment's start, the segment's start may represent a significantly later time than the segment's associated seek position.
Returns:
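Because seek positions are expressed as hundredths of seconds, they can be converted to a Duration and compared with the segment's start. The following is a minimal sketch; the class and method names are hypothetical and not part of the SDK.

```java
import java.time.Duration;

public final class SeekSketch {
    private SeekSketch() { }

    /** Converts a seek position in hundredths of a second to a Duration. */
    public static Duration seekToDuration(int seek) {
        return Duration.ofMillis(seek * 10L); // 1/100 s = 10 ms
    }

    /** Checks the documented invariant: the seek position is never later than the start. */
    public static boolean isConsistent(int seek, Duration start) {
        return seekToDuration(seek).compareTo(start) <= 0;
    }

    public static void main(String[] args) {
        int seek = 250;                           // hypothetical value from getSeek()
        Duration start = Duration.ofMillis(4200); // hypothetical value from getStart()
        System.out.println(seekToDuration(seek)); // the seek position as a Duration
        System.out.println(isConsistent(seek, start));
    }
}
```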
getStart
public Duration getStart()
Get the start property: The time at which this segment started relative to the beginning of the transcribed audio.
Returns:
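Since the start and end offsets are Duration values measured from the beginning of the audio, they map naturally onto subtitle timestamps. The sketch below formats a pair of offsets as an SRT-style cue line; the class and method names are hypothetical, and the sample offsets are made up for illustration.

```java
import java.time.Duration;
import java.util.Locale;

public final class SubtitleTimestampSketch {
    private SubtitleTimestampSketch() { }

    /** Formats a non-negative Duration as HH:MM:SS,mmm. */
    public static String format(Duration offset) {
        long ms = offset.toMillis();
        return String.format(Locale.ROOT, "%02d:%02d:%02d,%03d",
            ms / 3_600_000,    // hours
            (ms / 60_000) % 60, // minutes within the hour
            (ms / 1_000) % 60,  // seconds within the minute
            ms % 1_000);        // milliseconds within the second
    }

    /** Builds the "start --> end" line of a subtitle cue. */
    public static String cueLine(Duration start, Duration end) {
        return format(start) + " --> " + format(end);
    }

    public static void main(String[] args) {
        // Hypothetical offsets, e.g. from getStart() and getEnd().
        Duration start = Duration.ofMillis(3_725_042);
        Duration end = Duration.ofMillis(3_731_500);
        System.out.println(cueLine(start, end));
    }
}
```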
getTemperature
public double getTemperature()
Get the temperature property: The temperature score associated with this audio segment.
Returns:
getText
public String getText()
Get the text property: The transcribed text that was part of this audio segment.
Returns:
getTokens
public List<Integer> getTokens()
Get the tokens property: The token IDs matching the transcribed text in this audio segment.
Returns: