31 results on '"Spasic, I."'
Search Results
2. An evaluation of TRAK physiotherapy self management intervention development and delivery for knee conditions
- Author
-
Button, K., primary, Nicholas, K., additional, Busse, M., additional, Collins, M., additional, and Spasic, I., additional
- Published
- 2018
- Full Text
- View/download PDF
3. External validation of the UKPDS risk engine in incident type 2 diabetes: a need for new type 2 diabetes-specific risk equations
- Author
-
Bannister, C.A., Poole, C.D., Jenkins-Jones, S., Morgan, C.L., Elwyn, G., Spasic, I., Currie, C.J., Bannister, C.A., Poole, C.D., Jenkins-Jones, S., Morgan, C.L., Elwyn, G., Spasic, I., and Currie, C.J.
- Abstract
Item does not contain fulltext, OBJECTIVE: To evaluate the performance of the UK Prospective Diabetes Study Risk Engine (UKPDS-RE) for predicting the 10-year risk of cardiovascular disease end points in an independent cohort of U.K. patients newly diagnosed with type 2 diabetes. RESEARCH DESIGN AND METHODS: This was a retrospective cohort study using routine health care data collected between April 1998 and October 2011 from approximately 350 U.K. primary care practices contributing to the Clinical Practice Research Datalink (CPRD). Participants comprised 79,966 patients aged between 35 and 85 years (388,269 person-years) with 4,984 cardiovascular events. Four outcomes were evaluated: first diagnosis of coronary heart disease (CHD), stroke, fatal CHD, and fatal stroke. RESULTS: Accounting for censoring, the observed versus predicted 10-year event rates were as follows: CHD 6.1 vs. 16.5%, fatal CHD 1.9 vs. 10.1%, stroke 7.0 vs. 10.1%, and fatal stroke 1.7 vs. 1.6%, respectively. The UKPDS-RE showed moderate discrimination for all four outcomes, with the concordance index values ranging from 0.65 to 0.78. CONCLUSIONS: The UKPDS stroke equations showed calibration ranging from poor to moderate; however, the CHD equations showed poor calibration and considerably overestimated CHD risk. There is a need for revised risk equations in type 2 diabetes.
- Published
- 2014
4. All-Wales licensed premises intervention (AWLPI): a randomised controlled trial to reduce alcohol-related violence
- Author
-
Moore, SC, O'Brien, C, Alam, MF, Cohen, D, Hood, K, Huang, C, Moore, L, Murphy, S, Playle, R, Sivarajasingam, V, Spasic, I, Williams, A, Shepherd, J, Moore, SC, O'Brien, C, Alam, MF, Cohen, D, Hood, K, Huang, C, Moore, L, Murphy, S, Playle, R, Sivarajasingam, V, Spasic, I, Williams, A, and Shepherd, J
- Abstract
BACKGROUND: Alcohol-related violence in and in the vicinity of licensed premises continues to place a considerable burden on the United Kingdom's (UK) health services. Robust interventions targeted at licensed premises are therefore required to reduce the costs of alcohol-related harm. Previous evaluations of interventions in licensed premises have a number of methodological limitations and none have been conducted in the UK. The aim of the trial was to determine the effectiveness of the Safety Management in Licensed Environments intervention designed to reduce alcohol-related violence in licensed premises, delivered by Environmental Health Officers, under their statutory authority to intervene in cases of violence in the workplace. METHODS/DESIGN: A national randomised controlled trial, with licensed premises as the unit of allocation. Premises were identified from all 22 Local Authorities in Wales. Eligible premises were those with identifiable violent incidents on premises, using police recorded violence data. Premises were allocated to intervention or control by optimally balancing by Environmental Health Officer capacity in each Local Authority, number of violent incidents in the 12 months leading up to the start of the project and opening hours. The primary outcome measure is the difference in frequency of violence between intervention and control premises over a 12 month follow-up period, based on a recurrent event model. The trial incorporates an embedded process evaluation to assess intervention implementation, fidelity, reach and reception, and to interpret outcome effects, as well as investigate its economic impact. DISCUSSION: The results of the trial will be applicable to all statutory authorities directly involved with managing violence in the night time economy and will provide the first formal test of Health and Safety policy in this environment. If successful, opportunities for replication and generalisation will be considered. TRIAL REGISTRATION: U
- Published
- 2014
5. Automatic development of clinical prediction models with genetic programming: A case study in cardiovascular disease
- Author
-
Bannister, C.A., primary, Currie, C.J., additional, Preece, A., additional, and Spasic, I., additional
- Published
- 2014
- Full Text
- View/download PDF
6. FlexiTerm: a flexible term recognition method
- Author
-
Spasic, I., Greenwood, M., Preece, A., Francis, N., Elwyn, G., Spasic, I., Greenwood, M., Preece, A., Francis, N., and Elwyn, G.
- Abstract
Contains fulltext : 125465.pdf (publisher's version ) (Open Access), BACKGROUND: The increasing amount of textual information in biomedicine requires effective term recognition methods to identify textual representations of domain-specific concepts as the first step toward automating its semantic interpretation. The dictionary look-up approaches may not always be suitable for dynamic domains such as biomedicine or the newly emerging types of media such as patient blogs, the main obstacles being the use of non-standardised terminology and high degree of term variation. RESULTS: In this paper, we describe FlexiTerm, a method for automatic term recognition from a domain-specific corpus, and evaluate its performance against five manually annotated corpora. FlexiTerm performs term recognition in two steps: linguistic filtering is used to select term candidates followed by calculation of termhood, a frequency-based measure used as evidence to qualify a candidate as a term. In order to improve the quality of termhood calculation, which may be affected by the term variation phenomena, FlexiTerm uses a range of methods to neutralise the main sources of variation in biomedical terms. It manages syntactic variation by processing candidates using a bag-of-words approach. Orthographic and morphological variations are dealt with using stemming in combination with lexical and phonetic similarity measures. The method was evaluated on five biomedical corpora. The highest values for precision (94.56%), recall (71.31%) and F-measure (81.31%) were achieved on a corpus of clinical notes. CONCLUSIONS: FlexiTerm is an open-source software tool for automatic term recognition. It incorporates a simple term variant normalisation method. The method proved to be more robust than the baseline against less formally structured texts, such as those found in patient blogs or medical notes. The software can be downloaded freely at http://www.cs.cf.ac.uk/flexiterm.
- Published
- 2013
7. Systematic integration of experimental data and models in systems biology.
- Author
-
Li, P., Dada, J.O., Jameson, D., Spasic, I., Swainston, N., Carroll, K., Dunn, W., Khan, F., Messiha, E., Simeonides, E., Weichart, D., Winder, C., Broomhead, D.S., Goble, C.A., Gaskell, S.J., Kell, D.B., Westerhoff, H.V., Mendes, P., Paton, N.W., Li, P., Dada, J.O., Jameson, D., Spasic, I., Swainston, N., Carroll, K., Dunn, W., Khan, F., Messiha, E., Simeonides, E., Weichart, D., Winder, C., Broomhead, D.S., Goble, C.A., Gaskell, S.J., Kell, D.B., Westerhoff, H.V., Mendes, P., and Paton, N.W.
- Abstract
Background: The behaviour of biological systems can be deduced from their mathematical models. However, multiple sources of data in diverse forms are required in the construction of a model in order to define its components and their biochemical reactions, and corresponding parameters. Automating the assembly and use of systems biology models is dependent upon data integration processes involving the interoperation of data and analytical resources.Results: Taverna workflows have been developed for the automated assembly of quantitative parameterised metabolic networks in the Systems Biology Markup Language (SBML). A SBML model is built in a systematic fashion by the workflows which starts with the construction of a qualitative network using data from a MIRIAM-compliant genome-scale model of yeast metabolism. This is followed by parameterisation of the SBML model with experimental data from two repositories, the SABIO-RK enzyme kinetics database and a database of quantitative experimental results. The models are then calibrated and simulated in workflows that call out to COPASIWS, the web service interface to the COPASI software application for analysing biochemical networks. These systems biology workflows were evaluated for their ability to construct a parameterised model of yeast glycolysis.Conclusions: Distributed information about metabolic reactions that have been described to MIRIAM standards enables the automated assembly of quantitative systems biology models of metabolic networks based on user-defined criteria. Such data integration processes can be implemented as Taverna workflows to provide a rapid overview of the components and their relationships within a biochemical system. © 2010 Li et al; licensee BioMed Central Ltd.
- Published
- 2010
- Full Text
- View/download PDF
8. Serum metabolomics reveals many novel metabolic markers of heart failure, including pseudouridine and 2-oxoglutarate
- Author
-
Dunn, W. B. And Broadhurst, Deepak, S. M., Buch, M. H., McDowell, G., Spasic, I., Ellis, D. I., Brooks, N., Kell, D. B., Neyses, Ludwig, Dunn, W. B. And Broadhurst, Deepak, S. M., Buch, M. H., McDowell, G., Spasic, I., Ellis, D. I., Brooks, N., Kell, D. B., and Neyses, Ludwig
- Abstract
There is intense interest in the identification of novel biomarkers which improve the diagnosis of heart failure. Serum samples from 52 patients with systolic heart failure (EF< 40% plus signs and symptoms of failure) and 57 controls were analyzed by gas chromatography - time of flight - mass spectrometry and the raw data reduced to 272 statistically robust metabolite peaks. 38 peaks showed a significant difference between case and control (p <5 × 10-5). Two such metabolites were pseudouridine, a modified nucleotide present in t- and rRNA and a marker of cell turnover, as well as the tricarboxylic acid cycle intermediate 2-oxoglutarate. Furthermore, 3 further new compounds were also excellent discriminators between patients and controls: 2-hydroxy, 2-methylpropanoic acid, erythritol and 2,4,6-trihydroxypyrimidine. Although renal disease may be associated with heart failure, and metabolites associated with renal disease and other markers were also elevated (e.g. urea, creatinine and uric acid), there was no correlation within the patient group between these metabolites and our heart failure biomarkers, indicating that these were indeed biomarkers of heart failure and not renal disease per se. These findings demonstrate the power of data-driven metabolomics approaches to identify such markers of disease. © Springer Science+Business Media, LLC 2007.
- Published
- 2007
9. A Text Mining Approach to the Prediction of Disease Status from Clinical Discharge Summaries
- Author
-
Yang, H., primary, Spasic, I., additional, Keane, J. A., additional, and Nenadic, G., additional
- Published
- 2009
- Full Text
- View/download PDF
10. Mass spectrometry tools and metabolite-specific databases for molecular identification in metabolomics
- Author
-
Brown, M., primary, Dunn, W. B., additional, Dobson, P., additional, Patel, Y., additional, Winder, C. L., additional, Francis-McIntyre, S., additional, Begley, P., additional, Carroll, K., additional, Broadhurst, D., additional, Tseng, A., additional, Swainston, N., additional, Spasic, I., additional, Goodacre, R., additional, and Kell, D. B., additional
- Published
- 2009
- Full Text
- View/download PDF
11. A GC-TOF-MS study of the stability of serum and urine metabolomes during the UK Biobank sample collection and preparation protocols
- Author
-
Dunn, W. B, primary, Broadhurst, D., additional, Ellis, D. I, additional, Brown, M., additional, Halsall, A., additional, O'Hagan, S., additional, Spasic, I., additional, Tseng, A., additional, and Kell, D. B, additional
- Published
- 2008
- Full Text
- View/download PDF
12. MaSTerClass: a case-based reasoning system for the classification of biomedical terms
- Author
-
Spasic, I., primary, Ananiadou, S., additional, and Tsujii, J., additional
- Published
- 2005
- Full Text
- View/download PDF
13. Text mining and ontologies in biomedicine: Making sense of raw text
- Author
-
Spasic, I., primary, Ananiadou, S., additional, McNaught, J., additional, and Kumar, A., additional
- Published
- 2005
- Full Text
- View/download PDF
14. PRM115 - Automatic development of clinical prediction models with genetic programming: A case study in cardiovascular disease
- Author
-
Bannister, C.A., Currie, C.J., Preece, A., and Spasic, I.
- Published
- 2014
- Full Text
- View/download PDF
15. All-Wales Licensed Premises Intervention (AWLPI): a randomised controlled trial of an intervention to reduce alcohol-related violence
- Author
-
Sc, Moore, Mf, Alam, Cohen D, Hood K, Huang C, Murphy S, Playle R, Moore L, Jonathan Shepherd, Sivarajasingam V, Spasic I, Stanton H, and Williams A
16. PRM115 Automatic development of clinical prediction models with genetic programming: A case study in cardiovascular disease
- Author
-
Bannister, C.A., Currie, C.J., Preece, A., and Spasic, I.
- Full Text
- View/download PDF
17. The differential diagnostic capacity of serum amyloid A protein between infectious and non-infectious febrile episodes of neutropenic patients with acute leukemia
- Author
-
Cast, M. T., Rogina, B., Glojnaric-Spasic, I., and Minigo, H.
- Published
- 1994
- Full Text
- View/download PDF
18. Associations Between Dog Breed and Clinical Features of Mammary Epithelial Neoplasia in Bitches: an Epidemiological Study of Submissions to a Single Diagnostic Pathology Centre Between 2008-2021.
- Author
-
Edmunds G, Beck S, Kale KU, Spasic I, O'Neill D, Brodbelt D, and Smalley MJ
- Subjects
- Female, Dogs, Humans, Animals, Epidemiologic Studies, Breeding, Mammary Neoplasms, Animal diagnosis, Mammary Neoplasms, Animal epidemiology, Mammary Neoplasms, Animal pathology, Carcinoma pathology, Breast Neoplasms
- Abstract
Mammary cancer is one of the most common neoplasms of dogs, primarily bitches. While studies have been carried out identifying differing risk of mammary neoplasia in different dog breeds, few studies have reported associations between dog breeds and clinical features such as number of neoplastic lesions found in an individual case or the likelihood of lesions being benign or malignant. Such epidemiological studies are essential as a foundation for exploring potential genetic drivers of mammary tumour behaviour. Here, we have examined associations between breed, age and neuter status and the odds of a diagnosis of a mammary epithelial-origin neoplastic lesion (as opposed to any other histopathological diagnosis from a biopsied lesion) as well as the odds of a bitch presenting with either a single mammary lesion or multiple lesions, and the odds that those lesions are benign or malignant. The study population consisted of 129,258 samples from bitches, including 13,401 mammary epithelial neoplasms, submitted for histological assessment to a single histopathology laboratory between 2008 and 2021.In multivariable analysis, breed, age and neuter status were all significantly associated with the odds of a diagnosis of a mammary epithelial-origin neoplastic lesion. Smaller breeds were more likely to receive such a diagnosis. In cases diagnosed with a mammary epithelial neoplasm, these three factors were also significantly associated with the odds of diagnosis with a malignant lesion and of diagnosis with multiple lesions. Notably, while neutered animals were less likely to have a mammary epithelial neoplasm diagnosed, and were less likely to have multiple neoplasms, they were more likely to have malignant disease. Exploration of the patterns of risk of developing malignant disease, or multiple lesions, across individual breeds showed no breed with increased odds of both outcomes. Breeds with altered odds compared to the Crossbreed baseline were either at increased risk of malignant disease and decreased risk of multiple lesions, or vice versa, or they were at significantly altered odds of one outcome with no change in the other outcome. Our analysis supports the hypothesis that age, neuter status and intrinsic biological and genetic factors all combine to influence the biological heterogeneity of canine mammary neoplasia., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
19. Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach.
- Author
-
Chopard D, Treder MS, Corcoran P, Ahmed N, Johnson C, Busse M, and Spasic I
- Abstract
Background: Pharmacovigilance and safety reporting, which involve processes for monitoring the use of medicines in clinical trials, play a critical role in the identification of previously unrecognized adverse events or changes in the patterns of adverse events., Objective: This study aims to demonstrate the feasibility of automating the coding of adverse events described in the narrative section of the serious adverse event report forms to enable statistical analysis of the aforementioned patterns., Methods: We used the Unified Medical Language System (UMLS) as the coding scheme, which integrates 217 source vocabularies, thus enabling coding against other relevant terminologies such as the International Classification of Diseases-10th Revision, Medical Dictionary for Regulatory Activities, and Systematized Nomenclature of Medicine). We used MetaMap, a highly configurable dictionary lookup software, to identify the mentions of the UMLS concepts. We trained a binary classifier using Bidirectional Encoder Representations from Transformers (BERT), a transformer-based language model that captures contextual relationships, to differentiate between mentions of the UMLS concepts that represented adverse events and those that did not., Results: The model achieved a high F1 score of 0.8080, despite the class imbalance. This is 10.15 percent points lower than human-like performance but also 17.45 percent points higher than that of the baseline approach., Conclusions: These results confirmed that automated coding of adverse events described in the narrative section of serious adverse event reports is feasible. Once coded, adverse events can be statistically analyzed so that any correlations with the trialed medicines can be estimated in a timely fashion., (©Daphne Chopard, Matthias S Treder, Padraig Corcoran, Nagheen Ahmed, Claire Johnson, Monica Busse, Irena Spasic. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 24.12.2021.)
- Published
- 2021
- Full Text
- View/download PDF
20. Feasibility randomised controlled trial comparing TRAK-ACL digital rehabilitation intervention plus treatment as usual versus treatment as usual for patients following anterior cruciate ligament reconstruction.
- Author
-
Dunphy E, Button K, Hamilton F, Williams J, Spasic I, and Murray E
- Abstract
Objectives: To evaluate the feasibility of trialling taxonomy for the rehabilitation of knee conditions-ACL (TRAK-ACL), a digital health intervention that provides health information, personalised exercise plans and remote clinical support combined with treatment as usual (TAU), for people following ACL reconstruction., Methods: The study design was a two-arm parallel randomised controlled trial (RCT). Eligible participants were English-speaking adults who had undergone ACL reconstruction within the last 12 weeks, had access to the internet and could provide informed consent. Recruitment took place at three sites in the UK. TRAK-ACL intervention was an interactive website informed by behaviour change technique combined with TAU. The comparator was TAU. Outcomes were: recruitment and retention; completeness of outcome measures at follow-up; fidelity of intervention delivery and engagement with the intervention. Individuals were randomised using a computer-generated random number sequence. Blinded assessors allocated groups and collected outcome measures., Results: Fifty-nine people were assessed for eligibility at two of the participating sites, and 51 were randomised; 26 were allocated to TRAK-ACL and 25 to TAU. Follow-up data were collected on 44 and 40 participants at 3 and 6 months, respectively. All outcome measures were completed fully at 6 months except the Client Service Receipt Inventory. Two patients in each arm did not receive the treatment they were randomised to. Engagement with TRAK-ACL intervention was a median of 5 logins (IQR 3-13 logins), over 18 weeks (SD 12.2 weeks)., Conclusion: TRAK-ACL would be suitable for evaluation of effectiveness in a fully powered RCT., Competing Interests: Competing interests: None declared., (© Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY. Published by BMJ.)
- Published
- 2021
- Full Text
- View/download PDF
21. Patient Triage by Topic Modeling of Referral Letters: Feasibility Study.
- Author
-
Spasic I and Button K
- Abstract
Background: Musculoskeletal conditions are managed within primary care, but patients can be referred to secondary care if a specialist opinion is required. The ever-increasing demand for health care resources emphasizes the need to streamline care pathways with the ultimate aim of ensuring that patients receive timely and optimal care. Information contained in referral letters underpins the referral decision-making process but is yet to be explored systematically for the purposes of treatment prioritization for musculoskeletal conditions., Objective: This study aims to explore the feasibility of using natural language processing and machine learning to automate the triage of patients with musculoskeletal conditions by analyzing information from referral letters. Specifically, we aim to determine whether referral letters can be automatically assorted into latent topics that are clinically relevant, that is, considered relevant when prescribing treatments. Here, clinical relevance is assessed by posing 2 research questions. Can latent topics be used to automatically predict treatment? Can clinicians interpret latent topics as cohorts of patients who share common characteristics or experiences such as medical history, demographics, and possible treatments?, Methods: We used latent Dirichlet allocation to model each referral letter as a finite mixture over an underlying set of topics and model each topic as an infinite mixture over an underlying set of topic probabilities. The topic model was evaluated in the context of automating patient triage. Given a set of treatment outcomes, a binary classifier was trained for each outcome using previously extracted topics as the input features of the machine learning algorithm. In addition, a qualitative evaluation was performed to assess the human interpretability of topics., Results: The prediction accuracy of binary classifiers outperformed the stratified random classifier by a large margin, indicating that topic modeling could be used to predict the treatment, thus effectively supporting patient triage. The qualitative evaluation confirmed the high clinical interpretability of the topic model., Conclusions: The results established the feasibility of using natural language processing and machine learning to automate triage of patients with knee or hip pain by analyzing information from their referral letters., (©Irena Spasic, Kate Button. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 06.11.2020.)
- Published
- 2020
- Full Text
- View/download PDF
22. Clinical Text Data in Machine Learning: Systematic Review.
- Author
-
Spasic I and Nenadic G
- Abstract
Background: Clinical narratives represent the main form of communication within health care, providing a personalized account of patient history and assessments, and offering rich information for clinical decision making. Natural language processing (NLP) has repeatedly demonstrated its feasibility to unlock evidence buried in clinical narratives. Machine learning can facilitate rapid development of NLP tools by leveraging large amounts of text data., Objective: The main aim of this study was to provide systematic evidence on the properties of text data used to train machine learning approaches to clinical NLP. We also investigated the types of NLP tasks that have been supported by machine learning and how they can be applied in clinical practice., Methods: Our methodology was based on the guidelines for performing systematic reviews. In August 2018, we used PubMed, a multifaceted interface, to perform a literature search against MEDLINE. We identified 110 relevant studies and extracted information about text data used to support machine learning, NLP tasks supported, and their clinical applications. The data properties considered included their size, provenance, collection methods, annotation, and any relevant statistics., Results: The majority of datasets used to train machine learning models included only hundreds or thousands of documents. Only 10 studies used tens of thousands of documents, with a handful of studies utilizing more. Relatively small datasets were utilized for training even when much larger datasets were available. The main reason for such poor data utilization is the annotation bottleneck faced by supervised machine learning algorithms. Active learning was explored to iteratively sample a subset of data for manual annotation as a strategy for minimizing the annotation effort while maximizing the predictive performance of the model. Supervised learning was successfully used where clinical codes integrated with free-text notes into electronic health records were utilized as class labels. Similarly, distant supervision was used to utilize an existing knowledge base to automatically annotate raw text. Where manual annotation was unavoidable, crowdsourcing was explored, but it remains unsuitable because of the sensitive nature of data considered. Besides the small volume, training data were typically sourced from a small number of institutions, thus offering no hard evidence about the transferability of machine learning models. The majority of studies focused on text classification. Most commonly, the classification results were used to support phenotyping, prognosis, care improvement, resource management, and surveillance., Conclusions: We identified the data annotation bottleneck as one of the key obstacles to machine learning approaches in clinical NLP. Active learning and distant supervision were explored as a way of saving the annotation efforts. Future research in this field would benefit from alternatives such as data augmentation and transfer learning, or unsupervised learning, which do not require data annotation., (©Irena Spasic, Goran Nenadic. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 31.03.2020.)
- Published
- 2020
- Full Text
- View/download PDF
23. Sentiment Analysis in Health and Well-Being: Systematic Review.
- Author
-
Zunic A, Corcoran P, and Spasic I
- Abstract
Background: Sentiment analysis (SA) is a subfield of natural language processing whose aim is to automatically classify the sentiment expressed in a free text. It has found practical applications across a wide range of societal contexts including marketing, economy, and politics. This review focuses specifically on applications related to health, which is defined as "a state of complete physical, mental, and social well-being and not merely the absence of disease or infirmity.", Objective: This study aimed to establish the state of the art in SA related to health and well-being by conducting a systematic review of the recent literature. To capture the perspective of those individuals whose health and well-being are affected, we focused specifically on spontaneously generated content and not necessarily that of health care professionals., Methods: Our methodology is based on the guidelines for performing systematic reviews. In January 2019, we used PubMed, a multifaceted interface, to perform a literature search against MEDLINE. We identified a total of 86 relevant studies and extracted data about the datasets analyzed, discourse topics, data creators, downstream applications, algorithms used, and their evaluation., Results: The majority of data were collected from social networking and Web-based retailing platforms. The primary purpose of online conversations is to exchange information and provide social support online. These communities tend to form around health conditions with high severity and chronicity rates. Different treatments and services discussed include medications, vaccination, surgery, orthodontic services, individual physicians, and health care services in general. We identified 5 roles with respect to health and well-being among the authors of the types of spontaneously generated narratives considered in this review: a sufferer, an addict, a patient, a carer, and a suicide victim. Out of 86 studies considered, only 4 reported the demographic characteristics. A wide range of methods were used to perform SA. Most common choices included support vector machines, naïve Bayesian learning, decision trees, logistic regression, and adaptive boosting. In contrast with general trends in SA research, only 1 study used deep learning. The performance lags behind the state of the art achieved in other domains when measured by F-score, which was found to be below 60% on average. In the context of SA, the domain of health and well-being was found to be resource poor: few domain-specific corpora and lexica are shared publicly for research purposes., Conclusions: SA results in the area of health and well-being lag behind those in other domains. It is yet unclear if this is because of the intrinsic differences between the domains and their respective sublanguages, the size of training datasets, the lack of domain-specific sentiment lexica, or the choice of algorithms., (©Anastazia Zunic, Padraig Corcoran, Irena Spasic. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 28.01.2020.)
- Published
- 2020
- Full Text
- View/download PDF
24. Cohort Selection for Clinical Trials From Longitudinal Patient Records: Text Mining Approach.
- Author
-
Spasic I, Krzeminski D, Corcoran P, and Balinsky A
- Abstract
Background: Clinical trials are an important step in introducing new interventions into clinical practice by generating data on their safety and efficacy. Clinical trials need to ensure that participants are similar so that the findings can be attributed to the interventions studied and not to some other factors. Therefore, each clinical trial defines eligibility criteria, which describe characteristics that must be shared by the participants. Unfortunately, the complexities of eligibility criteria may not allow them to be translated directly into readily executable database queries. Instead, they may require careful analysis of the narrative sections of medical records. Manual screening of medical records is time consuming, thus negatively affecting the timeliness of the recruitment process., Objective: Track 1 of the 2018 National Natural Language Processing Clinical Challenge focused on the task of cohort selection for clinical trials, aiming to answer the following question: Can natural language processing be applied to narrative medical records to identify patients who meet eligibility criteria for clinical trials? The task required the participating systems to analyze longitudinal patient records to determine if the corresponding patients met the given eligibility criteria. We aimed to describe a system developed to address this task., Methods: Our system consisted of 13 classifiers, one for each eligibility criterion. All classifiers used a bag-of-words document representation model. To prevent the loss of relevant contextual information associated with such representation, a pattern-matching approach was used to extract context-sensitive features. They were embedded back into the text as lexically distinguishable tokens, which were consequently featured in the bag-of-words representation. Supervised machine learning was chosen wherever a sufficient number of both positive and negative instances was available to learn from. A rule-based approach focusing on a small set of relevant features was chosen for the remaining criteria., Results: The system was evaluated using microaveraged F measure. Overall, 4 machine algorithms, including support vector machine, logistic regression, naïve Bayesian classifier, and gradient tree boosting (GTB), were evaluated on the training data using 10-fold cross-validation. Overall, GTB demonstrated the most consistent performance. Its performance peaked when oversampling was used to balance the training data. The final evaluation was performed on previously unseen test data. On average, the F measure of 89.04% was comparable to 3 of the top ranked performances in the shared task (91.11%, 90.28%, and 90.21%). With an F measure of 88.14%, we significantly outperformed these systems (81.03%, 78.50%, and 70.81%) in identifying patients with advanced coronary artery disease., Conclusions: The holdout evaluation provides evidence that our system was able to identify eligible patients for the given clinical trial with high accuracy. Our approach demonstrates how rule-based knowledge infusion can improve the performance of machine learning algorithms even when trained on a relatively small dataset., (©Irena Spasic, Dominik Krzeminski, Padraig Corcoran, Alexander Balinsky. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 31.10.2019.)
- Published
- 2019
- Full Text
- View/download PDF
25. External validation of the UKPDS risk engine in incident type 2 diabetes: a need for new type 2 diabetes-specific risk equations.
- Author
-
Bannister CA, Poole CD, Jenkins-Jones S, Morgan CL, Elwyn G, Spasic I, and Currie CJ
- Subjects
- Adult, Aged, Female, Humans, Male, Middle Aged, Risk Factors, United Kingdom epidemiology, Diabetes Mellitus, Type 2 epidemiology
- Abstract
Objective: To evaluate the performance of the UK Prospective Diabetes Study Risk Engine (UKPDS-RE) for predicting the 10-year risk of cardiovascular disease end points in an independent cohort of U.K. patients newly diagnosed with type 2 diabetes., Research Design and Methods: This was a retrospective cohort study using routine health care data collected between April 1998 and October 2011 from ∼350 U.K. primary care practices contributing to the Clinical Practice Research Datalink (CPRD). Participants comprised 79,966 patients aged between 35 and 85 years (388,269 person-years) with 4,984 cardiovascular events. Four outcomes were evaluated: first diagnosis of coronary heart disease (CHD), stroke, fatal CHD, and fatal stroke., Results: Accounting for censoring, the observed versus predicted 10-year event rates were as follows: CHD 6.1 vs. 16.5%, fatal CHD 1.9 vs. 10.1%, stroke 7.0 vs. 10.1%, and fatal stroke 1.7 vs. 1.6%, respectively. The UKPDS-RE showed moderate discrimination for all four outcomes, with the concordance index values ranging from 0.65 to 0.78., Conclusions: The UKPDS stroke equations showed calibration ranging from poor to moderate; however, the CHD equations showed poor calibration and considerably overestimated CHD risk. There is a need for revised risk equations in type 2 diabetes.
- Published
- 2014
- Full Text
- View/download PDF
26. All-Wales licensed premises intervention (AWLPI): a randomised controlled trial to reduce alcohol-related violence.
- Author
-
Moore SC, O'Brien C, Alam MF, Cohen D, Hood K, Huang C, Moore L, Murphy S, Playle R, Sivarajasingam V, Spasic I, Williams A, and Shepherd J
- Subjects
- Alcohol Drinking psychology, Humans, Licensure, Police, Restaurants legislation & jurisprudence, Wales, Alcohol Drinking adverse effects, Health Promotion, Violence prevention & control
- Abstract
Background: Alcohol-related violence in and in the vicinity of licensed premises continues to place a considerable burden on the United Kingdom's (UK) health services. Robust interventions targeted at licensed premises are therefore required to reduce the costs of alcohol-related harm. Previous evaluations of interventions in licensed premises have a number of methodological limitations and none have been conducted in the UK. The aim of the trial was to determine the effectiveness of the Safety Management in Licensed Environments intervention designed to reduce alcohol-related violence in licensed premises, delivered by Environmental Health Officers, under their statutory authority to intervene in cases of violence in the workplace., Methods/design: A national randomised controlled trial, with licensed premises as the unit of allocation. Premises were identified from all 22 Local Authorities in Wales. Eligible premises were those with identifiable violent incidents on premises, using police recorded violence data. Premises were allocated to intervention or control by optimally balancing by Environmental Health Officer capacity in each Local Authority, number of violent incidents in the 12 months leading up to the start of the project and opening hours. The primary outcome measure is the difference in frequency of violence between intervention and control premises over a 12 month follow-up period, based on a recurrent event model. The trial incorporates an embedded process evaluation to assess intervention implementation, fidelity, reach and reception, and to interpret outcome effects, as well as investigate its economic impact., Discussion: The results of the trial will be applicable to all statutory authorities directly involved with managing violence in the night time economy and will provide the first formal test of Health and Safety policy in this environment. If successful, opportunities for replication and generalisation will be considered., Trial Registration: UKCRN 14077; ISRCTN78924818.
- Published
- 2014
- Full Text
- View/download PDF
27. Systematic integration of experimental data and models in systems biology.
- Author
-
Li P, Dada JO, Jameson D, Spasic I, Swainston N, Carroll K, Dunn W, Khan F, Malys N, Messiha HL, Simeonidis E, Weichart D, Winder C, Wishart J, Broomhead DS, Goble CA, Gaskell SJ, Kell DB, Westerhoff HV, Mendes P, and Paton NW
- Subjects
- Databases, Factual, Models, Biological, Metabolic Networks and Pathways, Systems Biology methods
- Abstract
Background: The behaviour of biological systems can be deduced from their mathematical models. However, multiple sources of data in diverse forms are required in the construction of a model in order to define its components and their biochemical reactions, and corresponding parameters. Automating the assembly and use of systems biology models is dependent upon data integration processes involving the interoperation of data and analytical resources., Results: Taverna workflows have been developed for the automated assembly of quantitative parameterised metabolic networks in the Systems Biology Markup Language (SBML). A SBML model is built in a systematic fashion by the workflows which starts with the construction of a qualitative network using data from a MIRIAM-compliant genome-scale model of yeast metabolism. This is followed by parameterisation of the SBML model with experimental data from two repositories, the SABIO-RK enzyme kinetics database and a database of quantitative experimental results. The models are then calibrated and simulated in workflows that call out to COPASIWS, the web service interface to the COPASI software application for analysing biochemical networks. These systems biology workflows were evaluated for their ability to construct a parameterised model of yeast glycolysis., Conclusions: Distributed information about metabolic reactions that have been described to MIRIAM standards enables the automated assembly of quantitative systems biology models of metabolic networks based on user-defined criteria. Such data integration processes can be implemented as Taverna workflows to provide a rapid overview of the components and their relationships within a biochemical system.
- Published
- 2010
- Full Text
- View/download PDF
28. Medication information extraction with linguistic pattern matching and semantic rules.
- Author
-
Spasic I, Sarafraz F, Keane JA, and Nenadic G
- Subjects
- Artificial Intelligence, Humans, Linguistics, Semantics, Electronic Health Records, Information Storage and Retrieval methods, Natural Language Processing, Pharmaceutical Preparations
- Abstract
Objective: This study presents a system developed for the 2009 i2b2 Challenge in Natural Language Processing for Clinical Data, whose aim was to automatically extract certain information about medications used by a patient from his/her medical report. The aim was to extract the following information for each medication: name, dosage, mode/route, frequency, duration and reason., Design: The system implements a rule-based methodology, which exploits typical morphological, lexical, syntactic and semantic features of the targeted information. These features were acquired from the training dataset and public resources such as the UMLS and relevant web pages. Information extracted by pattern matching was combined together using context-sensitive heuristic rules., Measurements: The system was applied to a set of 547 previously unseen discharge summaries, and the extracted information was evaluated against a manually prepared gold standard consisting of 251 documents. The overall ranking of the participating teams was obtained using the micro-averaged F-measure as the primary evaluation metric., Results: The implemented method achieved the micro-averaged F-measure of 81% (with 86% precision and 77% recall), which ranked this system third in the challenge. The significance tests revealed the system's performance to be not significantly different from that of the second ranked system. Relative to other systems, this system achieved the best F-measure for the extraction of duration (53%) and reason (46%)., Conclusion: Based on the F-measure, the performance achieved (81%) was in line with the initial agreement between human annotators (82%), indicating that such a system may greatly facilitate the process of extracting relevant information from medical records by providing a solid basis for a manual review process.
- Published
- 2010
- Full Text
- View/download PDF
29. KiPar, a tool for systematic information retrieval regarding parameters for kinetic modelling of yeast metabolic pathways.
- Author
-
Spasic I, Simeonidis E, Messiha HL, Paton NW, and Kell DB
- Subjects
- Metabolic Networks and Pathways, Systems Biology, Computational Biology methods, Information Systems standards, Saccharomyces cerevisiae metabolism, Software
- Abstract
Motivation: Most experimental evidence on kinetic parameters is buried in the literature, whose manual searching is complex, time consuming and partial. These shortcomings become particularly acute in systems biology, where these parameters need to be integrated into detailed, genome-scale, metabolic models. These problems are addressed by KiPar, a dedicated information retrieval system designed to facilitate access to the literature relevant for kinetic modelling of a given metabolic pathway in yeast. Searching for kinetic data in the context of an individual pathway offers modularity as a way of tackling the complexity of developing a full metabolic model. It is also suitable for large-scale mining, since multiple reactions and their kinetic parameters can be specified in a single search request, rather than one reaction at a time, which is unsuitable given the size of genome-scale models., Results: We developed an integrative approach, combining public data and software resources for the rapid development of large-scale text mining tools targeting complex biological information. The user supplies input in the form of identifiers used in relevant data resources to refer to the concepts of interest, e.g. EC numbers, GO and SBO identifiers. By doing so, the user is freed from providing any other knowledge or terminology concerned with these concepts and their relations, since they are retrieved from these and cross-referenced resources automatically. The terminology acquired is used to index the literature by mapping concepts to their synonyms, and then to textual documents mentioning them. The indexing results and the previously acquired knowledge about relations between concepts are used to formulate complex search queries aiming at documents relevant to the user's information needs. The conceptual approach is demonstrated in the implementation of KiPar. Evaluation reveals that KiPar performs better than a Boolean search. The precision achieved for abstracts (60%) and full-text articles (48%) is considerably better than the baseline precision (44% and 24%, respectively). The baseline recall is improved by 36% for abstracts and by 100% for full text. It appears that full-text articles are a much richer source of information on kinetic data than are their abstracts. Finally, the combined results for abstracts and full text compared with the curated literature provide high values for relative recall (88%) and novelty ratio (92%), suggesting that the system is able to retrieve a high proportion of new documents., Availability: Source code and documentation are available at: (http://www.mcisb.org/resources/kipar/).
- Published
- 2009
- Full Text
- View/download PDF
30. Metabolic footprinting and systems biology: the medium is the message.
- Author
-
Kell DB, Brown M, Davey HM, Dunn WB, Spasic I, and Oliver SG
- Subjects
- Animals, Bacteria genetics, Bacteria metabolism, Chromatography, Gas, DNA Footprinting, DNA, Bacterial genetics, DNA, Fungal genetics, Fungi genetics, Fungi metabolism, Humans, Mass Spectrometry, Prokaryotic Cells chemistry, Proteomics, Genomics methods, Prokaryotic Cells metabolism, Systems Biology methods
- Abstract
One element of classical systems analysis treats a system as a black or grey box, the inner structure and behaviour of which can be analysed and modelled by varying an internal or external condition, probing it from outside and studying the effect of the variation on the external observables. The result is an understanding of the inner make-up and workings of the system. The equivalent of this in biology is to observe what a cell or system excretes under controlled conditions - the 'metabolic footprint' or exometabolome - as this is readily and accurately measurable. Here, we review the principles, experimental approaches and scientific outcomes that have been obtained with this useful and convenient strategy.
- Published
- 2005
- Full Text
- View/download PDF
31. Terminology-driven mining of biomedical literature.
- Author
-
Nenadic G, Spasic I, and Ananiadou S
- Subjects
- Abbreviations as Topic, Artificial Intelligence, Natural Language Processing, Pattern Recognition, Automated, Vocabulary, Controlled, Algorithms, Database Management Systems, Databases, Bibliographic, Information Storage and Retrieval methods, Terminology as Topic
- Abstract
Motivation: With an overwhelming amount of textual information in molecular biology and biomedicine, there is a need for effective literature mining techniques that can help biologists to gather and make use of the knowledge encoded in text documents. Although the knowledge is organized around sets of domain-specific terms, few literature mining systems incorporate deep and dynamic terminology processing., Results: In this paper, we present an overview of an integrated framework for terminology-driven mining from biomedical literature. The framework integrates the following components: automatic term recognition, term variation handling, acronym acquisition, automatic discovery of term similarities and term clustering. The term variant recognition is incorporated into terminology recognition process by taking into account orthographical, morphological, syntactic, lexico-semantic and pragmatic term variations. In particular, we address acronyms as a common way of introducing term variants in biomedical papers. Term clustering is based on the automatic discovery of term similarities. We use a hybrid similarity measure, where terms are compared by using both internal and external evidence. The measure combines lexical, syntactical and contextual similarity. Experiments on terminology recognition and clustering performed on a corpus of MEDLINE abstracts recorded the precision of 98 and 71% respectively., Availability: software for the terminology management is available upon request.
- Published
- 2003
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.