Author: "Lixia Yao" / Publisher: biomed central - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Lixia Yao"' showing total 4 results

1. Evaluating global and local sequence alignment methods for comparing patient medical records

Author: Nilay Shah, Lixia Yao, and Ming Huang
Subjects: Local sequence alignment, Cross-Cultural Comparison, Dynamic time warping, 020205 medical informatics, Computer science, Electronic health record, Health Informatics, Sequence alignment, Needleman–Wunsch algorithm, 02 engineering and technology, Therapeutics, Needleman-Wunsch algorithm, lcsh:Computer applications to medicine. Medical informatics, Temporal sequence, 03 medical and health sciences, Similarity (network science), Diagnosis, 0202 electrical engineering, electronic engineering, information engineering, Electronic Health Records, Humans, Patient similarity, 030304 developmental biology, Sequence (medicine), Smith-Waterman algorithm, Smith–Waterman algorithm, 0303 health sciences, Disease trajectory, business.industry, Health Policy, Research, Pattern recognition, Prognosis, Computer Science Applications, lcsh:R858-859.7, Artificial intelligence, business, Algorithms
Abstract: Background: Sequence alignment is a way of arranging sequences (e.g., DNA, RNA, protein, natural language, financial data, or medical events) to identify the relatedness between two or more sequences and regions of similarity. For Electronic Health Records (EHR) data, sequence alignment helps to identify patients with similar disease trajectories for more relevant and precise prognosis, diagnosis, and treatment. Methods: We tested two cutting-edge global sequence alignment methods, namely dynamic time warping (DTW) and the Needleman-Wunsch algorithm (NWA), together with their local modifications, DTW for Local alignment (DTWL) and the Smith-Waterman algorithm (SWA), for aligning patient medical records. We also used 4 sets of synthetic patient medical records generated from a large real-world EHR database as gold standard data to objectively evaluate these sequence alignment algorithms. Results: For global sequence alignments, 47 out of 80 DTW alignments and 11 out of 80 NWA alignments had higher similarity scores than the reference alignments, while the remaining 33 DTW alignments and 69 NWA alignments had the same similarity scores as the reference alignments. Forty-six out of 80 DTW alignments had better similarity scores than NWA alignments, with the remaining 34 cases having equal similarity scores from both algorithms. For local sequence alignments, 70 out of 80 DTWL alignments and 68 out of 80 SWA alignments had larger coverage and higher similarity scores than the reference alignments, while the remaining DTWL and SWA alignments received the same coverage and similarity scores as the reference alignments. Six out of 80 DTWL alignments showed larger coverage and higher similarity scores than SWA alignments. Thirty DTWL alignments had equal coverage but better similarity scores than SWA. DTWL and SWA received equal coverage and similarity scores in the remaining 44 cases. Conclusions: DTW, NWA, DTWL, and SWA outperformed the reference alignments. DTW (or DTWL) seems to align better than NWA (or SWA) by inserting new daily events and identifying more similarities between patient medical records. The evaluation results could provide valuable information on the strengths and weaknesses of these sequence alignment methods for future development of sequence alignment methods and patient similarity-based studies.
Published: 2019

2. Developing a healthcare dataset information resource (DIR) based on Semantic Web

Author: Yaorong Ge, Jingyi Shi, Lixia Yao, and Mingna Zheng
Subjects: 0301 basic medicine, Health informatics, lcsh:Internal medicine, Informatics, lcsh:QH426-470, Knowledge representation and reasoning, Databases, Factual, Computer science, 02 engineering and technology, 03 medical and health sciences, User-Computer Interface, Knowledge extraction, Knowledge integration, 0202 electrical engineering, electronic engineering, information engineering, Genetics, Question answering, Humans, Knowledge retrieval, lcsh:RC31-1245, Semantic Web, Genetics (clinical), Semantic query, Internet, Information retrieval, Research, Metadata, lcsh:Genetics, 030104 developmental biology, Knowledge representation, Dataset information resource, 020201 artificial intelligence & image processing, Delivery of Health Care, Semantic web
Abstract: Background: The right dataset is essential to obtain the right insights in data science; therefore, it is important for data scientists to have a good understanding of the availability of relevant datasets as well as the content, structure, and existing analyses of these datasets. While a number of efforts are underway to integrate the large amount and variety of datasets, the lack of an information resource that focuses on the specific needs of target users of datasets has been a problem for years. To address this gap, we have developed a Dataset Information Resource (DIR), using a user-oriented approach, which gathers relevant dataset knowledge for specific user types. In the present version, we specifically address the challenges of entry-level data scientists in learning to identify, understand, and analyze major datasets in healthcare. We emphasize that the DIR does not contain actual data from the datasets but aims to provide comprehensive knowledge about the datasets and their analyses. Methods: The DIR leverages Semantic Web technologies and the W3C Dataset Description Profile as the standard for knowledge integration and representation. To extract tailored knowledge for target users, we have developed methods for manual extraction from dataset documentation as well as semi-automatic extraction from related publications, using natural language processing (NLP)-based approaches. A semantic query component is available for knowledge retrieval, and a parameterized question-answering functionality is provided to make searching easier. Results: The DIR prototype is composed of four major components: dataset metadata and related knowledge, search modules, question answering for frequently asked questions, and blogs. The current implementation includes information on 12 commonly used large and complex healthcare datasets. The initial usage evaluation based on health informatics novices indicates that the DIR is helpful and beginner-friendly. Conclusions: We have developed a novel user-oriented DIR that provides dataset knowledge specialized for target user groups. Knowledge about datasets is effectively represented in the Semantic Web. At this initial stage, the DIR has already been able to provide sophisticated and relevant knowledge of 12 datasets to help entry-level health informaticians learn healthcare data analysis using suitable datasets. Further development of both content and function is underway.
Published: 2018

3. Evaluation of the informatician perspective: determining types of research papers preferred by clinicians.

Author: Boshu Ru, Xiaoyan Wang, and Lixia Yao
Subjects: RESEARCH papers (Students), EVIDENCE-based medicine, MEDICAL subject headings, MEDICAL informatics, RECOMMENDER systems, CONSUMER preferences, BIBLIOGRAPHICAL citations, BIBLIOGRAPHY, DECISION making, MEDICAL research, MEDLINE, RESEARCH funding, SUBJECT headings
Abstract: Background: To deliver evidence-based medicine, clinicians often reference resources that are useful to their respective medical practices. Owing to their busy schedules, however, clinicians typically find it challenging to locate these relevant resources among the rapidly growing number of journals and articles currently being published. A literature-recommender system may provide a possible solution to this issue if the individual needs of clinicians can be identified and applied. Methods: We thus collected from the CiteULike website a sample of 96 clinicians and 6,221 scientific articles that they read. We examined the journal distributions, publication types, reading times, and geographic locations. We then compared the distributions of MeSH terms associated with these articles with those of randomly sampled MEDLINE articles using a two-sample Z-test and multiple comparison correction, in order to identify the important topics relevant to clinicians. Results: We determined that the sampled clinicians followed the latest literature in a timely manner and read papers that are considered landmarks in medical research history. They preferred to read scientific discoveries from human experiments instead of molecular-, cellular-, or animal-model-based experiments. Furthermore, the country of publication may impact reading preferences, particularly for clinicians from Egypt, India, Norway, Senegal, and South Africa. Conclusion: These findings provide useful guidance for developing personalized literature-recommender systems for clinicians.
Published: 2017

4. Quantitative systems-level determinants of drug targets.

Author: Lixia Yao and Andrey Rzhetsky
Subjects: *GENOMES, *PROTEINS, *MOLECULES, *ALGORITHMS, *NUCLEOTIDES
Abstract: Background: Modern drug discovery tends to understand disease processes at the molecular level and then determine optimal molecular targets for drug intervention. Inferences have been made from all available drug targets, such as how many drug targets there are, how many novel drug targets could potentially be found in the human genome, to what functional families these proteins belong, and what structural properties make them bind to small molecules tightly and specifically. But all of these are very intuitive and qualitative. The key question of which gene or protein in a disease process could be a successful drug target remains unanswered. Results: We analyzed specific systems-level properties of human genes and proteins targeted by 919 FDA-approved drugs and identified a number of quantitative measures that distinguish them from other genes and proteins at a highly significant level. Compared to an average gene and its encoded protein(s), successful drug targets are more highly connected in a molecular interaction network, but are far from being the most highly connected; they have higher betweenness values, lower entropies of tissue expression, and lower ratios of non-synonymous to synonymous single-nucleotide polymorphisms (see Figure 1). We also tested the performance of different classification algorithms (see Figure 2). Furthermore, we have identified human tissues significantly over- or under-targeted relative to the full spectrum of genes active in each tissue. We also built a machine-learning model to demonstrate the usefulness of these quantitative descriptors for predicting drug targets. With the increasing availability of experimental data, we foresee that screening the whole human genome for potential novel drug targets could be feasible in the near future. Conclusion: We found that genes associated with successful FDA-approved drugs have a number of properties at the network, sequence, and tissue-expression levels that significantly distinguish them from other human genes. Although the drug-target-selection guidelines that we suggest cannot replace expensive experiments, they can help pharmaceutical researchers narrow the prospective set of drug targets at the earliest stage of a drug development project. Specifically, when a pharmaceutical company must decide which target to pursue among pathologic pathways that are not fully understood, connectivity, betweenness, Cratio, and entropy might be useful quantitative estimates of each prospective target's expected success rate.
Published: 2008
