ESTONIAN ACADEMY
PUBLISHERS
eesti teaduste
akadeemia kirjastus
PUBLISHED
SINCE 1965
 
Linguistica Uralica cover
Linguistica Uralica
ISSN 1736-7506 (Electronic)
ISSN 0868-4731 (Print)

Emotions and Speech Temporal Structure; pp. 209-217

Full article in PDF format | doi:10.3176/lu.2012.3.07

Authors
Kairi Tamuri, Meelis Mihkla

Abstract

 The focus of the article is on whether emotions could be traced in the temporal structure of Estonian speech. There are two research questions, namely, (a) Do emotions affect speech rate? and (b) What detectable traces, if any, might emotions generate in word prosody? To answer question (a), the articulation rate of emotional utterances was measured and the results were compared with those on neutral speech. The difference revealed was statistically significant. To answer question (b), the relations between emotions and the temporal characteristics of words with a vowel-centered structure were investigated. Sound durations were measured, various duration ratios were computed and various combinations of the characteristics where subjected to statistical analysis. The results revealed a certain difference between the temporal characteristics of Q2 and Q3 feet, and a loss of the difference between the second and third quantity degrees in sad speech.


References

Altrov, R., Pajupuu, H. 2012, Estonian Emotional Speech Corpus: Content and options. - Variation and Change in Spoken and Written Discourse. Perspectives from Corpus Linguistis, Amsterdam [forthcoming]

Banse, R., Scherer, K. R. 1996, Acoustic Profiles in Vocal Emotion Expression. - Journal of Personality and Social Psychology 70, 614-636.
http://dx.doi.org/10.1037/0022-3514.70.3.614 PMid:8851745

Braun, A., Oba, R. 2007, Speaking Tempo in Emotional Speech - a Cross-Cultural Study Using Dubbed Speech. - ParaLing’07, 77-82.

Eek, A. 2008, Eesti keele foneetika I, [Tallinn].

Eek, A., Meister, E. 1997, Simple Perception Experiments on Estonian Word Prosody: Foot Structure vs. Segmental Quantity. - Estonian Prosody: Papers from a Symposium, Proceedings of the International Symposium on Estonian Prosody, Tallinn, 71-99.

Eek, A., Meister, E. 2003, Foneetilisi katseid kvantiteedi alalt. - KK, 815-837, 902-916.

Iida, A., Campbell, N., Higuchi, F., Yasumura, M. 2003, A Corpus-Based Speech Synthesis System with Emotikon. - Speech Communication 40, 161-187.
http://dx.doi.org/10.1016/S0167-6393(02)00081-X

Juslin, P. N., Laukka, P. 2003, Communication of Emotions in Vocal Expression and Music Performance. Different Channels, Same Code? - Psychological Bulletin 129, 770-814.
http://dx.doi.org/10.1037/0033-2909.129.5.770 PMid:12956543

Kalvik, M-L., Mihkla, M. 2010, Modelling the Temporal Structure of Estonian Speech. - Human Language Technologies - The Baltic Perspective. Proceedings of the Fourth International Conference Baltic HLT 2010, Amsterdam, 53-60.

Kalvik, M-L., Mihkla, M., Kiissel, I., Hein, I. 2010, Estonian: Some Findings for Modelling Speech Rhythmicity and Perception of Speech Rate. - Text, Speech and Dialogue, Berlin-Heidelberg, 314-321.

Krull, D. 1993, Word-Prosodic fFeatures in Estonian Conversational Speech. Some Preliminary Results. - PERILUS (Phonetic Experimental Research, Institute of Linguistics, University of Stockholm) XVII, Stockholm, 45-54.

Laver, J. 1994, Principles of Phonetics, Cambridge.
http://dx.doi.org/10.1017/CBO9781139166621

Lehiste, I. 1960, Segmental and Syllabic Quantity in Estonian. - American Studies in Uralic Linguistics 1, Bloomington, 21-28.

Lehiste, I. 1997, Search for Phonetic Correlates in Estonian Prosody. - Estonian Prosody: Papers from a Symposium, Proceedings of the International Symposium on Estonian Prosody, Tallinn, 11-35.

Liiv, G. 1961, Eesti keele kolme vältusastme kestus ja meloodiatüübid. - KK, 412-424, 480-490.

Lippus, P., Pajusalu, K., Allik, J. 2007, The Tonal Component in Perception of the Estonian Quantity. - Proceedings of the 16th International Congress of Phonetic Sciences, Saarbrücken,1049-1052.

Mihkla, M. 2007, Modelling Speech Temporal Structure for Estonian Text-to-Speech Synthesis: Feature Selection. - Trames 11, 284-298.

Murray, I. R., Arnott, J. L. 2008, Applying an Analysis of Acted Vocal Emotions to Improve the Simulation of Synthetic Speech. - Computer Speech and Language 22, 107-129.
http://dx.doi.org/10.1016/j.csl.2007.06.001

Ross, J., Lehiste, I. 2001, The temporal Structure of Estonian Runic Songs, Berlin-New York.

Sagisaka, Y. 2003, Modeling and Perception of Temporal Characteristics in Speech. - Proceedings of 15th International Congress of Phonetic Sciences, Barcelona, 1-6.

Scherer, K. R. 1986, Vocal Affect Expression. A Review and a Model for Future Research. - Psychological Bulletin 99, 143-165.
http://dx.doi.org/10.1037/0033-2909.99.2.143 PMid:3515381

Scherer, K. R. 2003, Vocal Communication of Emotion. A Review of Research Paradigms. - Speech Communication 40, 227-256.
http://dx.doi.org/10.1016/S0167-6393(02)00084-5

Tamuri, K. 2010, Kas pausid kannavad emotsiooni? - Eesti Rakenduslingvistika Ühingu Aastaraamat 6, 297-306.

Tamuri, K. 2012, Kas formandid peegeldavad emotsiooni? - Eesti Rakenduslingvistika Ühingu Aastaraamat 8, 231-243.

Yildirim, S., Bulut, M., Lee, C. M., Kazemzadeh, A., Deng, Z., Lee, S., Narayanan, S., Busso, C. 2004, An Acoustic Study of Emotions Expressed in Speech. - Proceedings of InterSpeech 2004, Jeju Island, 2193-2196.


Back to Issue