Back to Search Start Over

Schema Matching and Data Integration with Consistent Naming on Protein Crystallization Screens.

Authors :
Shrestha, Midusha
Tran, Truong X.
Bhattarai, Bidhan
Pusey, Marc L.
Aygun, Ramazan S.
Source :
IEEE/ACM Transactions on Computational Biology & Bioinformatics; Nov2020, Vol. 17 Issue 6, p2074-2085, 12p
Publication Year :
2020

Abstract

The data representation as well as naming conventions used in commercial screen files by different companies make the automated analysis of crystallization experiments difficult and time-consuming. In order to reduce the human effort required to deal with this problem, we present an approach for computationally matching elements of two schemas using linguistic schema matching methods and then transform the input screen format to another format with naming defined by the user. This approach is tested on a number of commercial screens from different companies and the results of the experiments showed an overall accuracy of 97 percent on schema matching which is significantly better than the other two matchers we tested. Our tool enables mapping a screen file in one format to another format preferred by the expert using their preferred chemical names. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15455963
Volume :
17
Issue :
6
Database :
Complementary Index
Journal :
IEEE/ACM Transactions on Computational Biology & Bioinformatics
Publication Type :
Academic Journal
Accession number :
147575117
Full Text :
https://doi.org/10.1109/TCBB.2019.2913368