Auto-Chunking Audio Files into Intonational Phrases
author: T. Mark Ellison date: 2017-05-01
tags: - Segmentation - Intonational Phrase - Silence - PRAAT
categories: - Tips
Introduction
Eri Kashima and I have found a neat way of chunking speech from the audio file, as a first step in transcription. Initial efforts using silence-detection in ELAN were not successful. Instead, we found that PRAAT’s silence detection did the job quite well once the right parameters were chosen.
We use PRAAT’s Annotate >> To TextGrid (silences)… option from the PRAAT file window. This option is available once you have loaded the .wav file. Our parameter settings are:
Minimum pitch 70Hz
Silence threshold (dB): -35
Minimum silent interval duration(s): 0.25
Minimum sounding interval duration(s): 0.1
Silent interval label: (empty)
Sounding interval label: ***
A detailed walkthrough - of chunking by PRAAT for a file normally explored in ELAN - can be seen on Eri’s blog page on the topic.