Back to Search Start Over

Clinic expert information extraction based on domain model and block importance model.

Authors :
Zhang Y
Wang L
Qian D
Geng X
Yao D
Dong J
Source :
Computers in biology and medicine [Comput Biol Med] 2015 Nov 01; Vol. 66, pp. 337-42. Date of Electronic Publication: 2015 Jul 18.
Publication Year :
2015

Abstract

To extract expert clinic information from the Deep Web, there are two challenges to face. The first one is to make a judgment on forms. A novel method based on a domain model, which is a tree structure constructed by the attributes of query interfaces is proposed. With this model, query interfaces can be classified to a domain and filled in with domain keywords. Another challenge is to extract information from response Web pages indexed by query interfaces. To filter the noisy information on a Web page, a block importance model is proposed, both content and spatial features are taken into account in this model. The experimental results indicate that the domain model yields a precision 4.89% higher than that of the rule-based method, whereas the block importance model yields an F1 measure 10.5% higher than that of the XPath method.<br /> (Copyright © 2015 Elsevier Ltd. All rights reserved.)

Details

Language :
English
ISSN :
1879-0534
Volume :
66
Database :
MEDLINE
Journal :
Computers in biology and medicine
Publication Type :
Academic Journal
Accession number :
26231612
Full Text :
https://doi.org/10.1016/j.compbiomed.2015.07.009