ESTONIAN ACADEMY
PUBLISHERS
PUBLISHED
SINCE 1997
 
TRAMES. A Journal of the Humanities and Social Sciences
ISSN 1736-7514 (Electronic)
ISSN 1406-0922 (Print)
Impact Factor (2022): 0.2
THE EFFECTS OF CULTURE ON VOICE LIKABILITY; pp. 239–257
https://doi.org/10.3176/tr.2019.2.08

Authors
Hille Pajupuu, Rene Altrov, Jaan Pajupuu
Abstract

This study investigated the effects of culture on voice likability assessments. A total of 32 Finns and 32 Estonians rated the likability of 40 Finnish and 40 Estonian female and male voices. The voices represented two phonogenres: poetry and interview. The results showed that Finns and Estonians liked the same voices, but listeners preferred Finnish voices to Estonian ones for poetry reading, and Estonian voices to Finnish ones in interviews. The gender and age of the speaker had only a small effect on likability ratings. An analysis of the acoustic correlates of voice likability showed that likable and unlikable voices were differentiated by a set of frequency, energy and spectral parameters, but not by tempo parameters.

References

Altrov, Rene, Hille Pajupuu, and Jaan Pajupuu (2018) “Phonogenre affecting voice likability”. Proceedings of the 9th International Conference on Speech Prosody 2018, 177–181.
https://doi.org/10.21437/SpeechProsody.2018-36

Babel, Molly, Grant McGuire, and Joseph King (2014) “Towards a more nuanced view of vocal attractiveness”. PLoS ONE 9, 2, e88616.
https://doi.org/10.1371/journal.pone.0088616

Baumann, Timo (2017) “Large-scale speaker ranking from crowdsourced pairwise listener ratings”. Proceedings of Interspeech 2017, 2262–2266.
https://doi.org/10.21437/Interspeech.2017-1697

Biadsy, Fadi, Andrew Rosenberg, Rolf Carlson, Julia Hirschberg, and Eva Strangert (2008) “A cross-cultural comparison of American, Palestinian, and Swedish perception of charismatic speech”. Proceedings of Speech Prosody 2008, 579–582.

Bruckert, Laetitia, Jean-Sylvain Liénard, André Lacroix, Michel Kreutzer, and Gérard Leboucher (2006) “Women use voice parameters to assess men’s characteristics”. Proceedings of the Royal Society of London B: Biological Sciences 273 (November 2005), 83–89.
https://doi.org/10.1098/rspb.2005.3265

Burkhardt, Felix, Björn Schuller, Benjamin Weiss, and Felix Weninger (2011) “‘Would you buy a car from me?’ – On the likability of telephone voices”. Proceedings of Interspeech 2011, 1557–1560.

Chang, Rebecca Cherng-Shiow, Hsi-Peng Lu, and Peishan Yang (2018) “Stereotypes or golden rules? Exploring likable voice traits of social robots as active aging companions for tech-savvy baby boomers in Taiwan”. Computers in Human Behavior 84, 194–210.
https://doi.org/10.1016/j.chb.2018.02.025

Coelho, Luis, Daniela Braga, and Carmen Garcia-Mateo (2008) “Voice pleasantness: on the improvement of TTS voice quality”. V Jornadas En Tecnología Del Habla, 211–214.

Coelho, Luis, Daniela Braga, Miguel Sales Dias, and Carmen Garcia-Mateo (2011) “An automatic voice pleasantness classification system based on prosodic and acoustic patterns of voice preference”. Proceedings of Interspeech 2011, 2457–2460.

Collins, Sarah A. (2000) “Men’s voices and women’s choices”. Animal Behaviour 60, 6, 773–780.
https://doi.org/10.1006/anbe.2000.1523

Dahlbäck, Nils, QianYing Wang, Clifford Nass, and Jenny Alwin (2007) “Similarity is more important than expertise: accent effects in speech interfaces”. Proceedings of CHI 2007 – Conference on Human Factors in Computing Systems, April 28–May 3, San José, CA, U.S.A., 1553–1556.
https://doi.org/10.1145/1240624.1240859

Deal, Leo V. and Herbert J. Oyer (1991) “Ratings of vocal pleasantness and the aging process”. Folia Phoniatr (Basel) 43, 44–48.
https://doi.org/10.1159/000266100

Ding, Hongwei, Rüdiger Hoffmann, and Oliver Jokisch (2017) “Prosodic correlates of voice preference in Mandarin Chinese and German: a cross-linguistic comparison”. 28. Konferenz Elektronische Sprachsignalverarbeitung 2017, Saarbrücken, 83–90.

Ding, Hongwei, Rüdiger Hoffmann, and Oliver Jokisch (2018) “Voice preferences in German: a cross-linguistic comparison of native and Chinese listeners”. Proceedings of the 29th Conference on Electronic Speech Signal Processing. ESSV2018.

Eyben, Florian, Klaus Scherer, Bjorn Schuller, Johan Sundberg, Elisabeth Andre, Carlos Busso, Laurence Devillers, Julien Epps, Petri Laukka, Shrikanth Narayanan, and Khiet Truong (2016) “The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing”. IEEE Transactions on Affective Computing 7, 2, 190–202.
https://doi.org/10.1109/TAFFC.2015.2457417

Eyben, Florian, Felix Weninger, Erik Marchi, and Björn Schuller (2013) “Likability of human voices: a feature analysis and a neural network regression approach to automatic likability estimation”. Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS) 2013, 1–4.
https://doi.org/10.1109/WIAMIS.2013.6616159

Fraccaro, Paul J., Jillian J. M. O’Connor, Daniel E. Re, Benedict C. Jones, Lisa M. DeBruine, and David R. Feinberg (2013) “Faking it: deliberately altered voice pitch and vocal attractiveness”. Animal Behaviour 85, 1, 127–136.
https://doi.org/10.1016/j.anbehav.2012.10.016

Gallardo, Laura Fernandez (2016) “Recording a high-quality German speech database for the study of speaker personality and likability”. Tagung Phonetik und Phonologie im deutschsprachigen Raum, 43–46.

Gallardo, Laura Fernandez, Rafael Zequeira Jimenez, and Sebastian Möller (2017) “Perceptual ratings of voice likability collected through in-lab listening tests vs. mobile-based crowdsourcing”. Proceedings of Interspeech 2017, 2233–2237.
https://doi.org/10.21437/Interspeech.2017-326

Gampel, Deborah and Leslie Piccolotto Ferreira (2017) “How do adolescent students perceive aging teachers’ voices?” Journal of Voice 31, 4, 512.e9–512.e16.
https://doi.org/10.1016/j.jvoice.2016.11.021

Goy, Huiwen, Kathleen M. Pichora-Fuller, and Pascal van Lieshout (2016) “Effects of age on speech and voice quality ratings”. The Journal of the Acoustical Society of America 139, 4, 1648–1659.
https://doi.org/10.1121/1.4945094

Hinterleitner, Florian, Christiana Manolaina, and Sebastian Möller (2014) “Influence of a voice on the quality of synthesized speech”. 2014 Sixth International Workshop on Quality of Multimedia Experience (QoMEX), 99–104.
https://doi.org/10.1109/QoMEX.2014.6982303

Jokisch, Oliver, Viktor Iaroshenko, Michael Maruschke, and Hongwei Ding (2018) “Influence of age, gender and sample duration on the charisma assessment of German speakers”. Proceedings of the 29th Conference on Electronic Speech Signal Processing. ESSV2018.

McAleer, Phil, Alexander Todorov, and Pascal Belin (2014) “How do you say ‘hello’? Personality impressions from brief novel voices”. PLoS ONE 9, 3, 1–10.
https://doi.org/10.1371/journal.pone.0090779

Montacié, Claude and Marie-José Caraty (2012) “Pitch and intonation contribution to speakers’ traits classification”. Proceedings of Interspeech 2012, 526–529.

Niebuhr, Oliver, Radek Skarnitzl, and Lea Tylečková (2018) “The acoustic fingerprint of a charismatic voice – initial evidence from correlations between long-term spectral features and listener ratings”. Proceedings of the 9th International Conference on Speech Prosody 2018, 359–363.
https://doi.org/10.21437/SpeechProsody.2018-73

Obuchi, Yasunari (2017) “Personalized quantification of voice attractiveness in multidimensional merit space”. Proceedings of Interspeech 2017, 2223–2227.
https://doi.org/10.21437/Interspeech.2017-130

Parker, Michelle A. and Stephanie A. Borrie (2018) “Judgments of intelligence and likability of young adult female speakers of American English: the influence of vocal fry and the surrounding acoustic-prosodic context”. Journal of Voice 32, 5, 538–545.
https://doi.org/10.1016/j.jvoice.2017.08.002

Pinto-Coelho, Luis, Daniela Braga, Miguel Sales-Dias, and Carmen Garcia-Mateo (2013) “On the development of an automatic voice pleasantness classification and intensity estimation system”. Computer Speech and Language 27, 1, 75–88.
https://doi.org/10.1016/j.csl.2012.01.006

Riding, David, Deryle Lonsdale, and Bruce Brown (2006) “The effects of average fundamental frequency and variance of fundamental frequency on male vocal attractiveness to women”. Journal of Nonverbal Behavior 30, 2, 55–61.
https://doi.org/10.1007/s10919-006-0005-3

R Core Team (2017) “R: a language and environment for statistical computing”. Available online at <https://www.R-project.org/>. Accessed on February 4, 2019.

Schuller, Björn W. and Anton M. Batliner (2014) Computational paralinguistics: emotion, affect and personality in speech and language processing. Chichester, UK: John Wiley and Sons.
https://doi.org/10.1002/9781118706664

Schuller, Björn, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob van Son, Felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi, and Benjamin Weiss (2012) “The Interspeech 2012 speaker trait challenge”. Proceedings of Interspeech 2012, 254–257.

Schuller, Björn, Stefan Steidl, Anton Batliner, Felix Burkhardt, Laurence Devillers, Christian Müller, and Shrikanth Narayanan (2013) “Paralinguistics in speech and language: state-of-the-art and the challenge”. Computer Speech and Language 27, 1, 4–39.
https://doi.org/10.1016/j.csl.2012.02.005

Schuller, Björn, Stefan Steidl, Anton Batliner, Elmar Nöth, Alessandro Vinciarelli, Felix Burkhardt, Rob van Son, Felix Weninger, Florian Eyben, Tobias Bocklet, Gelareh Mohammadi, and Benjamin Weiss (2015) “A survey on perceived speaker traits: personality, likability, pathology, and the first challenge”. Computer Speech and Language 29, 1, 100–131.
https://doi.org/10.1016/j.csl.2014.08.003

Schweitzer, Antje, Natalie Lewandowski, and Daniel Duran (2017) “Social attractiveness in dialogs”. Proceedings of Interspeech 2017, 2243–2247.
https://doi.org/10.21437/Interspeech.2017-833

Syrdal, Ann K., Alistair Conkie, and Yannis Stylianou (1998) “Exploration of acoustic correlates in speaker selection for concatenative synthesis”. Proceedings of International Conference on Spoken Language Processing (ICSLP 98), 2–5.

Trouvain, Jürgen and Frank Zimmerer (2017) “Attractiveness of French voices for German listeners: results from native and non-native read speech”. Proceedings of Interspeech 2017, 2238–2242.
https://doi.org/10.21437/Interspeech.2017-367

Ueda, Hiroshi, Yasunori Arita, and Katsumi Watanabe (2013) “Effects of different manners of speaking on voice likeability, credibility, and intentionality ratings”. Proceedings of the 2013 International Conference on Biometrics and Kansei Engineering (ICBAKE), 117–120.
https://doi.org/10.1109/ICBAKE.2013.23

Warhurst, Samantha, Catherine Madill, Patricia McCabe, Sten Ternström, Edwin Yiu, and Robert Heard (2017) “Perceptual and acoustic analyses of good voice quality in male radio performers”. Journal of Voice 31, 2, 259.e1–259.e12.
https://doi.org/10.1016/j.jvoice.2016.05.016

Weiss, Benjamin and Felix Burkhardt (2010) “Voice attributes affecting likability perception”. Proceedings of Interspeech 2010, 1934–1937.

Weiss, Benjamin and Felix Burkhardt (2012) “Is ‘not bad’ good enough? Aspects of unknown voices’ likability”. Proceedings of Interspeech 2012, 510–513.

Xu, Yi, Albert Lee, Wing Li Wu, Xuan Liu, and Peter Birkholz (2013) “Human vocal attractiveness as signaled by body size projection”. PLoS ONE 8, 4, e62397.
https://doi.org/10.1371/journal.pone.0062397

Zuta, Vivien (2007) “Phonetic criteria of attractive male voices”. Proceedings of the 17th International Congress of Phonetic Sciences, 1837–1840.

Zuta, Vivien (2009) “Voice pleasantness of female voices and the assessment of physical characteristics”. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 5641 LNAI, 116–125.
https://doi.org/10.1007/978-3-642-03320-9_12
