Methods for transcription and forced alignment of a legacy speech corpus
Keywords
DASS, DARLA, Digital Archive of Southern Speech, transcription, analysis
Abstract
This paper describes the transcription and forced alignment of the Digital Archive of Southern Speech (DASS), a subset of the Linguistic Atlas of the Gulf States comprising 372 hours of recordings (64 interviews) conducted across eight southern U.S. states from 1968 to 1983. This project provides a large corpus of historical, semi-spontaneous Southern speech, time-aligned to the audio for acoustic analysis. Manual orthographic transcription of full DASS interviews is carried out according to in-house guidelines that ensure consistency across files and transcribers. Separate codes are used for the interviewee, interviewer, nonspeech, overlapping and unintelligible speech. Transcriber output is converted to Praat TextGrids using scripts from LaBB-CAT, a tool for maintaining large speech corpora. TextGrids containing only the interviewee’s speech are generated, and subjected to forced alignment by DARLA, which accommodates the levels of variation and noise in the DASS files with high degrees of success. Toward acoustic analysis, four methods for vowel formant extraction are evaluated: the native output of DARLA, FAVE, a local implemen-tation of FAVE-Extract, and a Praat-based extractor that incorporates separate formant tracks for different regions of the vowel space. The workflow of transcription and analysis is presented to benefit other projects of similar size and scope.
Original Publication Citation
Rachel M. Olsen, Michael L. Olsen, Joseph A. Stanley, Margaret E. L. Renwick, & William A. Kretzschmar, Jr. 2017. “Methods for transcription and forced alignment of a legacy speech corpus.”Proceedings of Meetings on Acoustics 30, 060001; doi:http://dx.doi.org/10.1121/2.0000559.
BYU ScholarsArchive Citation
Stanley, Joseph A.; Olsen, Rachel M.; Olsen, Michael L.; Renwick, Margaret E. L.; and Kretzschmar, William, "Methods for transcription and forced alignment of a legacy speech corpus" (2017). Faculty Publications. 6133.
https://scholarsarchive.byu.edu/facpub/6133
Document Type
Peer-Reviewed Article
Publication Date
2017
Permanent URL
http://hdl.lib.byu.edu/1877/8862
Publisher
Acoustical Society of America
Language
English
College
Humanities
Department
Linguistics
Copyright Status
© 2017 Acoustical Society of America.
Copyright Use Information
https://lib.byu.edu/about/copyright/