Repository logo
 
Publication

Visuo-auditory Multimodal Emotional Structure to Improve Human-Robot-Interaction

datacite.subject.fosEngenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática
datacite.subject.sdg03:Saúde de Qualidade
datacite.subject.sdg07:Energias Renováveis e Acessíveis
datacite.subject.sdg11:Cidades e Comunidades Sustentáveis
dc.contributor.authorPrado, José Augusto
dc.contributor.authorSimplício, Carlos
dc.contributor.authorLori, Nicolás F.
dc.contributor.authorDias, Jorge
dc.date.accessioned2026-01-15T16:25:05Z
dc.date.available2026-01-15T16:25:05Z
dc.date.issued2011-12-20
dc.description.abstractWe propose an approach to analyze and synthesize a set of human facial and vocal expressions, and then use the classified expressions to decide the robot’s response in a human-robot-interaction. During a human-tohuman conversation, a person senses the interlocutor’s face and voice, perceives her/his emotional expressions, and processes this information in order to decide which response to give. Moreover, observed emotions are taken into account and the response may be aggressive, funny (henceforth meaning humorous) or just neutral according to not only the observed emotions, but also the personality of the person. The purpose of our proposed structure is to endow robots with the capability to model human emotions, and thus several subproblems need to be solved: feature extraction, classification, decision and synthesis. In the proposed approach we integrate two classifiers for emotion recognition from audio and video, and then use a new method for fusion with the social behavior profile. To keep the person engaged in the interaction, after each iterance of analysis, the robot synthesizes human voice with both lips synchronization and facial expressions. The social behavior profile conducts the personality of the robot. The structure and work flow of the synthesis and decision are addressed, and the Bayesian networks are discussed. We also studied how to analyze and synthesize the emotion from the facial expression and vocal expression. A new probabilistic structure that enables a higher level of interaction between a human and a robot is proposed.eng
dc.description.sponsorshipThe authors gratefully acknowledge support from Institute of Systems and Robotics at University of Coimbra (ISR-UC), Portuguese Foundation for Science and Technology (FCT) [SFRH/BD/60954/2009, Ciencia2007, PTDC/SAU-BEB/100147/2008], and Polytechnical Institute of Leiria (IPL).
dc.identifier.citationPrado, J.A., Simplício, C., Lori, N.F. et al. Visuo-auditory Multimodal Emotional Structure to Improve Human-Robot-Interaction. Int J of Soc Robotics 4, 29–51 (2012). https://doi.org/10.1007/s12369-011-0134-7
dc.identifier.doi10.1007/s12369-011-0134-7
dc.identifier.issn1875-4791
dc.identifier.issn1875-4805
dc.identifier.urihttp://hdl.handle.net/10400.8/15352
dc.language.isoeng
dc.peerreviewedyes
dc.publisherSpringer
dc.relation.hasversionhttps://link.springer.com/article/10.1007/s12369-011-0134-7
dc.relation.ispartofInternational Journal of Social Robotics
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectVisual perception
dc.subjectAuditory perception
dc.subjectEmotion recognition
dc.subjectMultimodal interaction
dc.subjectSocial behavior profile
dc.subjectBayesian networks
dc.titleVisuo-auditory Multimodal Emotional Structure to Improve Human-Robot-Interactioneng
dc.typejournal article
dspace.entity.typePublication
oaire.citation.endPage51
oaire.citation.issue1
oaire.citation.startPage29
oaire.citation.titleInternational Journal of Social Robotics
oaire.citation.volume4
oaire.versionhttp://purl.org/coar/version/c_970fb48d4fbd8a85
person.familyNameSimplício
person.givenNameCarlos
person.identifier.orcid0000-0001-6281-4059
relation.isAuthorOfPublication2e15bd33-f7a7-466f-8013-1fdd73b75e7c
relation.isAuthorOfPublication.latestForDiscovery2e15bd33-f7a7-466f-8013-1fdd73b75e7c

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
s12369-011-0134-7.pdf
Size:
2.89 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.32 KB
Format:
Item-specific license agreed upon to submission
Description: