Back to Search Start Over

Exploiting Textual Source Information for Epidemiosurveillance

Authors :
Arsevska, E.
Mathieu Roche
Lancelot, R.
Hendrikx, P.
Dufour, B.
Contrôle des maladies animales exotiques et émergentes (UMR CMAEE)
Institut National de la Recherche Agronomique (INRA)-Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)
Territoires, Environnement, Télédétection et Information Spatiale (UMR TETIS)
Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-AgroParisTech-Institut national de recherche en sciences et technologies pour l'environnement et l'agriculture (IRSTEA)
ADVanced Analytics for data SciencE (ADVANSE)
Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM)
Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)
Agence nationale de sécurité sanitaire de l'alimentation, de l'environnement et du travail (ANSES)
Laboratoire Chrono-environnement - UFC (UMR 6249) (LCE)
Université Bourgogne Franche-Comté [COMUE] (UBFC)-Centre National de la Recherche Scientifique (CNRS)-Université de Franche-Comté (UFC)
Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-Institut National de la Recherche Agronomique (INRA)
Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)
Laboratoire Chrono-environnement - CNRS - UBFC (UMR 6249) (LCE)
Centre National de la Recherche Scientifique (CNRS)-Université de Franche-Comté (UFC)
Université Bourgogne Franche-Comté [COMUE] (UBFC)-Université Bourgogne Franche-Comté [COMUE] (UBFC)
Université de Franche-Comté (UFC)
Université Bourgogne Franche-Comté [COMUE] (UBFC)-Université Bourgogne Franche-Comté [COMUE] (UBFC)-Centre National de la Recherche Scientifique (CNRS)
Contrôle des maladies animales exotiques et émergentes [Montpellier] ( CMAEE )
Institut National de la Recherche Agronomique ( INRA ) -Centre de coopération internationale en recherche agronomique pour le développement [CIRAD] : UMR15
Territoires, Environnement, Télédétection et Information Spatiale ( UMR TETIS )
Centre de Coopération Internationale en Recherche Agronomique pour le Développement ( CIRAD ) -AgroParisTech-Institut national de recherche en sciences et technologies pour l'environnement et l'agriculture ( IRSTEA )
ADVanced Analytics for data SciencE ( ADVANSE )
Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier ( LIRMM )
Université de Montpellier ( UM ) -Centre National de la Recherche Scientifique ( CNRS ) -Université de Montpellier ( UM ) -Centre National de la Recherche Scientifique ( CNRS )
French Agency for Food, Environmental and Occupational Health Safety [Maisons-Alfort] ( ANSES )
ANSES
Laboratoire Chrono-environnement ( LCE )
Université Bourgogne Franche-Comté ( UBFC ) -Centre National de la Recherche Scientifique ( CNRS ) -Université de Franche-Comté ( UFC )
Source :
Web of Science, Metadata and semantics research : 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings, MTSR: Metadata and Semantics Research, MTSR: Metadata and Semantics Research, Nov 2014, Karlsruhe, Germany. pp.359-361, Scopus-Elsevier, MTSR: Metadata and Semantics Research, Nov 2014, Karlsruhe, Germany. Springer, 478, pp.359-361, 2014, Communications in Computer and Information Science

Abstract

International audience; In recent years as a complement to the traditional surveillance reporting systems there is a great interest in developing methodologies for early detection of potential health threats from unstructured text present on the Internet. In this context, we examined the relevance of the combination of expert knowledge and automatic term extraction in the creation of appropriate Internet search queries for the acquisition of disease outbreak news. We propose a measure that is the number of relevant disease outbreak news detected in function of the terms automatically extracted from a set of example Google and PubMED corpora. Due to the recent emergence we have used the African swine fever as a disease example. The new and exotic infectious diseases are an incising threat to countries due to globalization, movement of passengers, and international trade. With the traditional reporting schemes, often there are miss, delays or underreporting of disease outbreaks; leading to unawareness of countries about potential disease threats. As the Internet is a source of numerous and dynamic information, services need tools that could refine the search and detect the information of interest. Two important systems of the state-of-the-art, MediSys (Mantero et al. 2011) and Healthmap (Collier 2012) are based on a series of automatic steps to detect and acquire disease related news. The algorithms rely upon predefined templates, such keywords or patterns. Internet search queries have been proposed as inexpensive method to detect signals of diseases (ex. avian influenza) (Polgreen et al. 2008). In the face of many diseases and even more symptoms, the analysts face another challenge: How to identify appropriate queries for Internet disease surveillance? One option is to use the terms from existing thesaurus (e.g., MeSH). In this paper we present a new combined approach of selection of terms automatically extracted from relevant scientific and non-scientific corpora in order to identify most appropriate search queries for the detection of disease outbreak news on the Internet. As it is a recently emerging disease we use African swine fever (ASF) as a disease example.

Details

Database :
OpenAIRE
Journal :
Web of Science, Metadata and semantics research : 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings, MTSR: Metadata and Semantics Research, MTSR: Metadata and Semantics Research, Nov 2014, Karlsruhe, Germany. pp.359-361, Scopus-Elsevier, MTSR: Metadata and Semantics Research, Nov 2014, Karlsruhe, Germany. Springer, 478, pp.359-361, 2014, Communications in Computer and Information Science
Accession number :
edsair.dedup.wf.001..67a4b569106c783c63548d5683f4fffe