1. Automatic Vocal Tractlandmark Tracking in Rtmri Using Fully Convolutional Networks and Kalman Filter
- Author
-
Sasan Asadiabadi, Engin Erzin, Erzin, Engin (ORCID 0000-0002-2715-2368 & YÖK ID 34503), Asadiabadi, Sasan, College of Engineering, Graduate School of Sciences and Engineering, and Department of Electrical and Electronics Engineering
- Subjects
Heating systems ,Convolution ,Signal processing algorithms ,Speech production ,business.industry ,Computer science ,Pattern recognition ,02 engineering and technology ,Real-time MRI ,Kalman filter ,Tracking (particle physics) ,01 natural sciences ,0103 physical sciences ,Principal component analysis ,0202 electrical engineering, electronic engineering, information engineering ,Vocal tract dynamics ,Fully convolutional networks ,Heatmap regression ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,010301 acoustics ,Vocal tract - Abstract
Vocal tract (VT) contour detection in real time MRI is a pre-stage to many speech production related applications such as articulatory analysis and synthesis. In this work, we present an algorithm for robust detection of keypoints on the vocal tract in rtMRI sequences using fully convolutional networks (FCN) via a heatmap regression approach. We as well introduce a spatio-temporal stabilization scheme based on a combination of Principal Component Analysis (PCA) and Kalman filter (KF) to extract stable landmarks in space and time. The proposed VT landmark detection algorithm generalizes well across subjects and demonstrates significant improvement over the state of the art baselines, in terms of spatial and temporal errors., NA
- Published
- 2020
- Full Text
- View/download PDF