AudioChunk
data.datamodel.AudioChunk()Segment of audio, usually created by Voice Activity Detection (VAD).
Attributes
| Name | Type | Description |
|---|---|---|
| start | float | Start time of the chunk in seconds. |
| end | float | End time of the chunk in seconds. |
| text | (str, optional) | Optional text transcription for the chunk. |
| duration | (float, optional) | Duration of the chunk in seconds. |
| audio_frames | (int, optional) | Number of audio frames a chunk spans. |
| num_logits | (int, optional) | Number of model output logits for the chunk. |
| language | (str, optional) | Language code for the chunk. |
| language_prob | (float, optional) | Probability/confidence of the detected language. |
| id | (str or int, optional) | Optional unique identifier for the chunk. |