Back to Search
Start Over
Mining Information from Collections of Papers: Illustrative Analysis of Groundwater and Disease.
- Source :
-
Ground Water . Nov/Dec2018, Vol. 56 Issue 6, p993-1001. 9p. 3 Diagrams, 2 Charts, 2 Graphs. - Publication Year :
- 2018
-
Abstract
- The academic world is driven by scholarly research and publications. Yet, for many fields, the volume of published research and the associated knowledge base have been expanding exponentially for decades. The result is that scientists are literally drowning in data and information. There are strategies and approaches that could help with this problem. The goal of this paper is to demonstrate the power of computer‐based approaches such as data mining and machine learning to evaluate large collections of papers. The objective is to conduct a systematic analysis of research related to the emerging area of groundwater‐related diseases. More specifically, the analysis of information from the database of papers will examine systematics in the research topics, the inter‐relationships among multiple diseases, contaminants, and groundwater, and discover styles of research associated with groundwater and disease. The analysis uses 426 papers (1971 to 2017) retrieved from a MEDLINE bibliographic database, PubMed, given the search terms "groundwater" and "disease." We developed tools that take care of necessary text processing steps, which lead naturally to clustering and visualization techniques that demonstrate published research. The resulting 2D article map shows how the collection of papers is subdivided into 11 article clusters. The cluster topics were determined by analyzing keywords or common words contained in the articles' titles, abstracts, and key words. We found that research on water‐related disease in groundwater primarily focuses on two types of contaminants—chemical compounds and pathogens. Cancer and diarrhea are two major diseases associated with groundwater contamination. According to the systematic analysis, the study of this area is still growing. Article impact statement: Using data mining and machine learning techniques to study the research strands of groundwater‐related disease from the database. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 0017467X
- Volume :
- 56
- Issue :
- 6
- Database :
- Academic Search Index
- Journal :
- Ground Water
- Publication Type :
- Academic Journal
- Accession number :
- 132851856
- Full Text :
- https://doi.org/10.1111/gwat.12804