  Automatic rating of hoarseness by text-based cepstral and prosodic evaluation

Haderlein, T., Moers, C., Möbius, B., & Nöth, E. (2012). Automatic rating of hoarseness by text-based cepstral and prosodic evaluation. In P. Sojka, A. Horák, I. Kopecek, & K. Pala (Eds.), Proceedings of the 15th International Conference on Text, Speech and Dialogue (TSD 2012) (pp. 573-580). Heidelberg: Springer.

Files

Haderlein_Moers_2012.pdf (Publisher version), 156KB
Name: Haderlein_Moers_2012.pdf
Description: -
OA-Status:
Visibility: Public
MIME-Type / Checksum: application/pdf / [MD5]
Technical Metadata:
Copyright Date: -
Copyright Info: -
License: -

Creators

Creators:
Haderlein, Tino (1, 2), Author
Moers, Cornelia (3, 4, 5), Author
Möbius, Bernd (6), Author
Nöth, Elmar (1), Author
Affiliations:
1. University of Erlangen-Nuremberg, Pattern Recognition Lab (Informatik 5), Erlangen, Germany
2. University of Erlangen-Nuremberg, Department of Phoniatrics and Pedaudiology, Erlangen, Germany
3. International Max Planck Research School for Language Sciences, MPI for Psycholinguistics, Max Planck Society, Nijmegen, NL
4. Psychology of Language Department, MPI for Psycholinguistics, Max Planck Society, Nijmegen, NL
5. University of Bonn, Department of Speech and Communication, Bonn, Germany
6. Saarland University, Department of Computational Linguistics and Phonetics, Saarbrücken, Germany

Content

Free keywords: -
 Abstract: The standard for the analysis of distorted voices is perceptual rating of read-out texts or spontaneous speech. Automatic voice evaluation, however, is usually done on stable sections of sustained vowels. In this paper, text-based and established vowel-based analysis are compared with respect to their ability to measure hoarseness and its subclasses. 73 hoarse patients (48.3 ± 16.8 years) uttered the vowel /e/ and read the German version of the text “The North Wind and the Sun”. Five speech therapists and physicians rated roughness, breathiness, and hoarseness according to the German RBH evaluation scheme. The best human-machine correlations were obtained for measures based on the Cepstral Peak Prominence (CPP; up to |r| = 0.73). Support Vector Regression (SVR) on CPP-based measures and prosodic features improved the results further to r ≈ 0.8 and confirmed that automatic voice evaluation should be performed on a text recording.

Details

Language(s):
 Dates: 2012
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.1007/978-3-642-32790-2_70
 Degree: -

Event

Title: the 15th International Conference on Text, Speech and Dialogue (TSD 2012)
Place of Event: Brno, Czech Republic
Start-/End Date: 2012-09-03 - 2012-09-07

Source 1

Title: Proceedings of the 15th International Conference on Text, Speech and Dialogue (TSD 2012)
Source Genre: Proceedings
 Creator(s):
Sojka, Petr, Editor
Horák, Aleš, Editor
Kopecek, Ivan, Editor
Pala, Karel, Editor
Affiliations:
-
Publ. Info: Heidelberg : Springer
Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: 573 - 580
Identifier: -

Source 2

Title: Lecture Notes in Computer Science
Source Genre: Series
 Creator(s):
Affiliations:
Publ. Info: -
Pages: -
Volume / Issue: 7499
Sequence Number: -
Start / End Page: -
Identifier: -