Back to Search Start Over

Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine

Authors :
Dogan, RI
Kim, S
Chatr-aryamontri, A
Wei, C-H
Comeau, DC
Antunes, R
Matos, S
Chen, Q
Elangovan, A
Panyam, NC
Verspoor, K
Liu, H
Wang, Y
Liu, Z
Altinel, B
Husunbeyi, ZM
Ozgur, A
Fergadis, A
Wang, C-K
Dai, H-J
Tran, T
Kavuluru, R
Luo, L
Steppi, A
Zhang, J
Qu, J
Lu, Z
Dogan, RI
Kim, S
Chatr-aryamontri, A
Wei, C-H
Comeau, DC
Antunes, R
Matos, S
Chen, Q
Elangovan, A
Panyam, NC
Verspoor, K
Liu, H
Wang, Y
Liu, Z
Altinel, B
Husunbeyi, ZM
Ozgur, A
Fergadis, A
Wang, C-K
Dai, H-J
Tran, T
Kavuluru, R
Luo, L
Steppi, A
Zhang, J
Qu, J
Lu, Z
Publication Year :
2019

Abstract

The Precision Medicine Initiative is a multicenter effort aiming at formulating personalized treatments leveraging on individual patient data (clinical, genome sequence and functional genomic data) together with the information in large knowledge bases (KBs) that integrate genome annotation, disease association studies, electronic health records and other data types. The biomedical literature provides a rich foundation for populating these KBs, reporting genetic and molecular interactions that provide the scaffold for the cellular regulatory systems and detailing the influence of genetic variants in these interactions. The goal of BioCreative VI Precision Medicine Track was to extract this particular type of information and was organized in two tasks: (i) document triage task, focused on identifying scientific literature containing experimentally verified protein–protein interactions (PPIs) affected by genetic mutations and (ii) relation extraction task, focused on extracting the affected interactions (protein pairs). To assist system developers and task participants, a large-scale corpus of PubMed documents was manually annotated for this task. Ten teams worldwide contributed 22 distinct text-mining models for the document triage task, and six teams worldwide contributed 14 different text-mining systems for the relation extraction task. When comparing the text-mining system predictions with human annotations, for the triage task, the best F-score was 69.06%, the best precision was 62.89%, the best recall was 98.0% and the best average precision was 72.5%. For the relation extraction task, when taking homologous genes into account, the best F-score was 37.73%, the best precision was 46.5% and the best recall was 54.1%. Submitted systems explored a wide range of methods, from traditional rule-based, statistical and machine learning systems to state-of-the-art deep learning methods. Given the level of participation and the individual team results we find the precision

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1315673613
Document Type :
Electronic Resource