Keywords
Cepstrum, spectral analysis, continuous speech analysis, muscle tension dysphonia, auditory-perceptual ratings, dysphonia
Abstract
The purpose of the study was to identify a sub-set of spectral/cepstral-based analysis methods that would most effectively predict dysphonia severity (as estimated via auditory-perceptual analysis) in samples of continuous speech. Acoustic estimates of dysphonia severity were used as an objective treatment outcomes measure in a set of pre- vs post-treatment speech samples. Pre- and post-treatment continuous speech samples from 104 females with primary muscle tension dysphonia (MTD) were rated by listeners using a 100 point visual analogue scale (VAS) and analysed acoustically with spectral/cepstral-based measures. Stepwise linear regression produced a three-factor model consisting of the cepstral peak prominence (CPP); the mean ratio of low-to-high frequency spectral energy; and the standard deviation of the ratio of low-to-high frequency spectral energy that was strongly correlated with perceived dysphonia severity ratings (R ¼ .85; R2 ¼ .73). Mean differences between predicted vs perceptual ratings for pre- and post-treatment speech samples were < 6 points on the 100 point VAS; mean absolute differences between predicted and perceived ratings were < 16 points on the 100 point VAS (equivalent to within one scale value on commonly used 7-point equal-appearing interval rating scales). A multi-parameter acoustic model consisting of spectral/cepstral-based measures shows considerable promise as an objective measure of dysphonia severity in continuous speech, even across the diverse voice types and severities observed in pre- and post-treatment MTD speech samples.
Original Publication Citation
Awan, S.N., Roy, N. & Dromey, C. (2009). Estimating dysphonia severity in continuous speech: Application of a multiparameter spectral/cepstral model. Clinical Linguistics and Phonetics, 23, 825-841
BYU ScholarsArchive Citation
Dromey, Christopher; Awan, Shaheen N.; and Roy, Nelson, "Estimating dysphonia severity in continuous speech: Application of a multi-parameter spectral/cepstral model" (2009). Faculty Publications. 7258.
https://scholarsarchive.byu.edu/facpub/7258
Document Type
Peer-Reviewed Article
Publication Date
2009
Publisher
Clinical Linguistics & Phonetics
Language
English
College
David O. McKay School of Education
Department
Communication Disorders
Copyright Status
© 2009 Informa UK Ltd.