Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers.

Fiche publication


Date publication

octobre 2021

Journal

Scientific data

Auteurs

Membres identifiés du Cancéropôle Est :
Pr FELBLINGER Jacques, Dr VUISSOZ Pierre-André


Tous les auteurs :
Isaieva K, Laprie Y, Leclère J, Douros IK, Felblinger J, Vuissoz PA

Résumé

The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus consisting of synthetic sentences was used to ensure a good coverage of the French phonetic context. A real-time MRI technology with temporal resolution of 20 ms was used to acquire vocal tract images of the participants speaking. The sound was recorded simultaneously with MRI, denoised and temporally aligned with the images. The speech was transcribed to obtain phoneme-wise segmentation of sound. We also acquired static 3D MR images for a wide list of French phonemes. In addition, we include annotations of spontaneous swallowing.

Référence

Sci Data. 2021 Oct 1;8(1):258