get_vad_features
data.dataset.AudioFileDataset.get_vad_features(audio_path, metadata, sr=16000)Extract features for each VAD chunk in the metadata.
The global start time of each chunk is also returned for debugging purposes. This method is used when alignment_strategy is set to chunk.
Parameters
| Name | Type | Description | Default |
|---|---|---|---|
| audio_path | str | Path to the audio file. | required |
| metadata | AudioMetadata | Metadata object. | required |
| sr | int | Sample rate. | 16000 |
Returns
| Name | Type | Description |
|---|---|---|
| list of dict | List of dictionaries containing extracted features and metadata for each chunk. |