Lin, Qi, Zhou, Yuli, Shi, Siyuan, Zhang, Yujuan, Yin, Shaoli, Liu, Xuye, Peng, Qihui, Huang, Shaoting, Jiang, Yitao, Cui, Chen, She, Ruilian, Xu, Jinfeng, and Dong, Fajin
• A novel Fetus framework to auto-select standard images from fetal ultrasound screening is proposed. • Fetus Framework outperforms human experts with 1-, 3- and 5-year ultrasound training in the standard/ non-standard images classification task. • A novel 'divide-and-conquer' principle to improve key structure detection of fetus head is applied to Fetus framework. • Superior generalization capacity of Fetus Framework to classic CNN models has been confirmed by the external test. To investigate if artificial intelligence can identify fetus intracranial structures in pregnancy week 11–14; to provide an automated method of standard and non-standard sagittal view classification in obstetric ultrasound examination We proposed a newly designed scheme based on deep learning (DL) – Fetus Framework to identify nine fetus intracranial structures: thalami, midbrain, palate, 4th ventricle, cisterna magna, nuchal translucency (NT), nasal tip, nasal skin, and nasal bone. Fetus Framework was trained and tested on a dataset of 1528 2D sagittal-view ultrasound images from 1519 females collected from Shenzhen People's Hospital. Results from Fetus Framework were further used for standard/non-standard (S-NS) plane classification, a key step for NT measurement and Down Syndrome assessment. S-NS classification was also tested with 156 images from the Longhua branch of Shenzhen People's Hospital. Sensitivity, specificity, and area under the curve (AUC) were evaluated for comparison among Fetus Framework, three classic DL models, and human experts with 1-, 3- and 5-year ultrasound training. Furthermore, 4 physicians with more than 5 years of experience conducted a reader study of diagnosing fetal malformation on a dataset of 316 standard images confirmed by the Fetus framework and another dataset of 316 standard images selected by physicians. Accuracy, sensitivity, specificity, precision, and F1-Score of physicians' diagnosis on both sets are compared. Nine intracranial structures identified by Fetus Framework in validation are all consistent with that of senior radiologists. For S-NS sagittal view identification, Fetus Framework achieved an AUC of 0.996 (95%CI: 0.987, 1.000) in internal test, at par with classic DL models. In external test, FF reaches an AUC of 0.974 (95%CI: 0.952, 0.995), while ResNet-50 arrives at AUC∼0.883, 95% CI 0.828–0.939, Xception AUC∼0.890, 95% CI 0.834–0.946, and DenseNet-121 AUC∼0.894, 95% CI 0.839–0.949. For the internal test set, the sensitivity and specificity of the proposed framework are (0.905, 1), while the first-, third-, and fifth-year clinicians are (0.619, 0.986), (0.690, 0.958), and (0.798, 0.986), respectively. For the external test set, the sensitivity and specificity of FF is (0.989, 0.797), and first-, third-, and fifth-year clinicians are (0.533, 0.875), (0.609, 0.844), and (0.663, 0.781), respectively.On the fetal malformation classification task, all physicians achieved higher accuracy and F1-Score on Fetus selected standard images with statistical significance (p < 0.01). We proposed a new deep learning-based Fetus Framework for identifying key fetus intracranial structures. The framework was tested on data from two different medical centers. The results show consistency and improvement from classic models and human experts in standard and non-standard sagittal view classification during pregnancy week 11–13+6. With further refinement in larger population, the proposed model can improve the efficiency and accuracy of early pregnancy test using ultrasound examination. [ABSTRACT FROM AUTHOR]