Singing for the Missing: Bringing the Body Back to AI Voice and Speech Technologies
Paper in proceeding, 2024

Technological advancements in deep learning for speech and voice have contributed to a recent expansion in applications for voice cloning, synthesis and generation. Invisibilised stakeholders in this expansion are numerous absent bodies, whose voices and voice data have been integral to the development and refinement of these speech technologies. This position paper probes current working practices for voice and speech in machine learning and AI, in which the bodies of voices are “invisibilised”. We examine the facts and concerns about the voice-Body in applications of AI-voice technology. We do this by probing the wider connections between voice data and Schaefferian listening; speculating on the consequences of missing Bodies in AI-Voice; and examining how vocalists and artists working with synthetic Bodies and AI-voices are ‘bringing the Body back’ in their own practices. We contribute a series of considerations for how practitioners and researchers may help to ‘bring the Body back’ into AI-voice technologies.

body

musical AI

voice

STS

artificial intelligence

AI

Author

Kelsey Cotton

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Katja de Vries

Uppsala University

Kivanc Tatar

Chalmers, Computer Science and Engineering (Chalmers), Interaction Design and Software Engineering

9th International Conference on Movement and Computing


979-8-4007-0994-4 (ISBN)

Movement and Computing (MOCO)
Utrecht, Netherlands

Subject Categories

Language Technology (Computational Linguistics)

Arts

Law

Media and Communications

Music

Computer Science

DOI

10.1145/3658852.365906

More information

Latest update

6/28/2024