« Latent Terrain » : Dissecting the Latent Space of Neural Audio Autoencoders


Information

Type
Conference series, symposium, congress
Venue
Ircam, Salle Igor-Stravinsky (Paris)
Date
28 March 2025

We present Latent Terrain, an algorithmic approach to dissecting the latent space of a neural audio autoencoder into a two-dimensional plane. Latent Terrain questions the conventional paradigms of dimensionality reduction in creative interactive systems, in which the projection from high- to low-dimensional spaces is done by modelling similar objects with nearby points. Instead, the terrain generated by our approach is a mountainous, steep surface, and this steepness affords greater spectral complexity when navigating an audio autoencoder's latent space.
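The abstract does not spell out how a terrain is parameterised, so the following is only a hypothetical sketch of the idea: a fixed coordinate-based map from a 2-D plane position to a latent vector, where sinusoidal features make the surface deliberately "steep", so that a small step on the plane can move far in latent space. All names, weights, and dimensions below are invented for illustration.

```python
import numpy as np

def make_terrain(latent_dim=8, n_features=16, seed=0):
    """Hypothetical terrain: a fixed random coordinate-based map
    from a 2-D plane position (x, y) to a latent vector.
    High random frequencies make the surface 'mountainous':
    nearby plane points can map to distant latent points."""
    rng = np.random.default_rng(seed)
    freqs = rng.normal(scale=4.0, size=(n_features, 2))    # random 2-D frequencies
    phases = rng.uniform(0, 2 * np.pi, size=n_features)
    proj = rng.normal(size=(n_features, latent_dim))       # features -> latent space

    def terrain(x, y):
        feats = np.sin(freqs @ np.array([x, y]) + phases)  # (n_features,)
        return feats @ proj                                # (latent_dim,)
    return terrain

terrain = make_terrain()
z_a = terrain(0.50, 0.50)
z_b = terrain(0.51, 0.50)          # a tiny step on the plane...
print(z_a.shape)                   # (8,)
print(np.linalg.norm(z_b - z_a))   # ...can still move noticeably in latent space
```

This contrasts with a similarity-preserving projection (e.g. UMAP or t-SNE), where nearby plane points are chosen to decode to similar sounds by construction.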

Building on this, we present Latent Terrain Synthesis, a sound-synthesis method in which a waveform is generated by tracing a path across a terrain surface. Latent Terrain Synthesis aims to help musicians create tailorable, flexible materials for musical expression, leveraging the sonic capabilities of neural audio autoencoders such as RAVE.
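Schematically, the synthesis loop samples a path on the plane, looks up one latent vector per step on the terrain, and decodes the resulting latent trajectory into audio. The sketch below uses stand-in `toy_terrain` and `toy_decode` functions (both invented here); a real system would call a pre-trained autoencoder's decoder, e.g. a RAVE model, in place of `toy_decode`.

```python
import numpy as np

def synthesize_along_path(terrain, path, decode):
    """Latent terrain synthesis, schematically: each (x, y) point on
    the path is looked up on the terrain, and the resulting latent
    trajectory is decoded frame by frame into a waveform."""
    latents = np.stack([terrain(x, y) for x, y in path])  # (n_steps, latent_dim)
    return decode(latents)

# Toy stand-ins for illustration only.
def toy_terrain(x, y):
    return np.array([np.sin(3 * x + y), np.cos(2 * y - x)])

def toy_decode(latents, frame_len=64):
    # One short sine 'frame' per latent vector; amplitude and pitch
    # are driven by the two latent coordinates.
    t = np.linspace(0, 1, frame_len, endpoint=False)
    frames = [z[0] * np.sin(2 * np.pi * (4 + 2 * z[1]) * t) for z in latents]
    return np.concatenate(frames)

# An arc across the plane becomes a 50-step latent trajectory.
path = [(np.cos(a), np.sin(a)) for a in np.linspace(0, np.pi, 50)]
audio = synthesize_along_path(toy_terrain, path, toy_decode)
print(audio.shape)  # (3200,) = 50 steps x 64 samples per frame
```

The musical interface is then the path itself: drawing a different trajectory over the same terrain yields a different waveform from the same model.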

We provide nn_terrain, a set of Max/MSP externals that work together with nn~ to generate latent terrains for pre-trained RAVE models and let users navigate the terrain in real time.

In this talk, I will first present the technical details behind Latent Terrain, its workflow, its integration with RAVE, and a demo interface using a tablet and stylus. I will also report on a recent user-study workshop, conducted with co-authors Anna Xambó Sedó and Nick Bryan-Kinns at the Centre for Digital Music, Queen Mary University of London, in which 18 musicians from various backgrounds explored the musical affordances of latent terrains and derived sonic materials for musical expression.

Acknowledgment: This work is supported by the UKRI Centre for Doctoral Training in Artificial Intelligence and Music, supported by UK Research and Innovation [grant number EP/S022694/1].



