Sentence Modality Recognition in French based on Prosody

This paper deals with automatic sentence modality recognition in French. In this work, only prosodic features are considered. The sentences are recognized according to the three following modalities: declarative, interrogative and exclamatory sentences. This information will be used to animate a talking head for deaf and hearing-impaired children. We first statistically study a real radio corpus in order to assess the feasibility of the automatic modeling of sentence types. Then, we test two sets of prosodic features as well as two different classifiers and their combination. We further focus our attention on questions recognition, as this modality is certainly the most important one for the target application.




References:
[1] R. O. Cornett, "Cued speech," in American Annals of the Deaf, 1967,
vol. 112, pp. 3-13.
[2] P. Kral and J. Kleckova, "Speech recognition and animation of talking
head," in IWSSIP-03, Prague, Czech Republic, September 2003.
[3] H. Gezundhajt, "La prosodie," in
http://www.linguistes.com/phonetique/prosodie.html
[4] V. Aubergé, "A gestalt morphology of prosody directed by functions: the
example of a step by step model developed at ICP," in 1st Conf on
Speech Prosody, 2002, pp. 151-155.
[5] R. Kompe, Prosody in Speech Understanding Systems, Springer, July
1997.
[6] V. Strom, "Detection of accents, phrase boundaries and sentence
modality in German with prosodic features," in Eurospeech-95, Madrid,
1995.
[7] J. Kleckova and V. Matousek, "Using prosodic characteristics in Czech
dialog system," in Interact-97, 1997.
[8] K. Chongdok and Y. Hiyon, "Defining modality by terminal contours in
standard korean," in 1st International Conference on Speech Sciences,
Seoul, 2002.
[9] H. Wright, M. Poesio, and S. Isard, "Using high level dialogue
information for dialogue act recognition using prosodic features," in
ESCA Workshop on Prosody and Dialogue, Eindhoven, Holland,
September 1999.
[10] H. Wright, "Automatic utterance type detection using suprasegmental
features," in ICSLP-98, Sydney, 1998, p.x1403.
[11] http://www.recherche.gouv.fr/technolangue/
[12] A. de Cheveigne and H. Kawahara, "Comparative evaluation of F
estimation algorithms," in Eurospeech-2001, Scandinavia, 2001.