get_vad_features

data.dataset.AudioFileDataset.get_vad_features(audio_path, metadata, sr=16000)

Extract features for each VAD chunk in the metadata.

The global start time of each chunk is also returned for debugging purposes. This method is used when alignment_strategy is set to chunk.

Parameters

Name | Type | Description | Default
audio_path | str | Path to the audio file. | required
metadata | AudioMetadata | Metadata object. | required
sr | int | Sample rate in Hz. | 16000

Returns

Name | Type | Description
- | list of dict | List of dictionaries containing extracted features and metadata for each chunk.
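
Example

The per-chunk behaviour described above can be illustrated with a minimal sketch. It is not the library's implementation. The Chunk class, the sketch_vad_features function, and the dictionary keys "features" and "start_time_global" are all hypothetical names chosen for illustration. The sketch also takes an in-memory list of samples rather than an audio path and an AudioMetadata object, and it returns the raw slice where the real method would run feature extraction.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    """Hypothetical stand-in for one VAD chunk; not the library's real type."""
    start: float  # global start time in seconds
    end: float    # global end time in seconds


def sketch_vad_features(samples, chunks, sr=16000):
    """Slice `samples` per chunk and return one dict per chunk.

    Illustrative only: the real method takes an audio path and an
    AudioMetadata object, and its output keys may differ.
    """
    results = []
    for chunk in chunks:
        # Convert chunk boundaries from seconds to sample indices.
        first = int(round(chunk.start * sr))
        last = int(round(chunk.end * sr))
        segment = samples[first:last]
        results.append({
            # Placeholder: a real pipeline would compute model features here.
            "features": segment,
            # Global start time, kept alongside the features for debugging.
            "start_time_global": chunk.start,
        })
    return results


# One second of silence at 16 kHz, with two assumed speech chunks.
audio = [0.0] * 16000
out = sketch_vad_features(audio, [Chunk(0.0, 0.25), Chunk(0.5, 1.0)])
```

Each entry in out corresponds to one chunk, in the order the chunks were given, so the global start time can be used to map results back to positions in the original file.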