3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant-Vowel Production from 2D Real Time MRI.

Fiche publication


Date publication

août 2022

Journal

Journal of imaging

Auteurs

Membres identifiés du Cancéropôle Est :
Pr FELBLINGER Jacques, Dr VUISSOZ Pierre-André


Tous les auteurs :
Douros IK, Xie Y, Dourou C, Isaieva K, Vuissoz PA, Felblinger J, Laprie Y

Résumé

In this work, we address the problem of creating a 3D dynamic atlas of the vocal tract that captures the dynamics of the articulators in all three dimensions in order to create a global speaker model independent of speaker-specific characteristics. The core steps of the proposed method are the temporal alignment of the real-time MR images acquired in several sagittal planes and their combination with adaptive kernel regression. As a preprocessing step, a reference space was created to be used in order to remove anatomical information of the speakers and keep only the variability in speech production for the construction of the atlas. The adaptive kernel regression makes the choice of atlas time points independently of the time points of the frames that are used as an input for the construction. The evaluation of this atlas construction method was made by mapping two new speakers to the atlas and by checking how similar the resulting mapped images are. The use of the atlas helps in reducing subject variability. The results show that the use of the proposed atlas can capture the dynamic behavior of the articulators and is able to generalize the speech production process by creating a universal-speaker reference space.

Mots clés

adaptive gaussian kernel, generic speaker model, spatiotemporal atlas

Référence

J Imaging. 2022 08 25;8(9):