计算机科学
标记语言
韵律
聊天机器人
人机交互
语音识别
语音合成
语音技术
本体论
多媒体
自然语言处理
万维网
XML
认识论
哲学
作者
Sandra Pauletto,Bruce Balentine,Chris Pidcock,Kevin Jones,Leonardo Bottaci,Maria Aretoulaki,Jez Wells,Darren Mundy,James Balentine
标识
DOI:10.3109/14015439.2013.810303
摘要
Emotion in audio-voice signals, as synthesized by text-to-speech (TTS) technologies, was investigated to formulate a theory of expression for user interface design. Emotional parameters were specified with markup tags, and the resulting audio was further modulated with post-processing techniques. Software was then developed to link a selected TTS synthesizer with an automatic speech recognition (ASR) engine, producing a chatbot that could speak and listen. Using these two artificial voice subsystems, investigators explored both artistic and psychological implications of artificial speech emotion. Goals of the investigation were interdisciplinary, with interest in musical composition, augmentative and alternative communication (AAC), commercial voice announcement applications, human-computer interaction (HCI), and artificial intelligence (AI). The work-in-progress points towards an emerging interdisciplinary ontology for artificial voices. As one study output, HCI tools are proposed for future collaboration.
科研通智能强力驱动
Strongly Powered by AbleSci AI