38 results on '"Ivan Olier"'
Search Results
2. Relationship between remnant cholesterol and risk of heart failure in participants with diabetes mellitus
- Author
-
Ruoting Wang, Hertzel C Gerstein, Harriette G C Van Spall, Gregory Y H Lip, Ivan Olier, Sandra Ortega-Martorell, Lehana Thabane, Zebing Ye, and Guowei Li
- Subjects
Health Policy ,Cardiology and Cardiovascular Medicine - Abstract
Background Evidence about the association between calculated remnant cholesterol (RC) and risk of heart failure (HF) in participants with diabetes mellitus (DM) remains sparse and limited. Methods We included a total of 22 230 participants with DM from the UK Biobank for analyses. Participants were categorized into three groups based on their baseline RC measures: low (with a mean RC of 0.41 mmol/L), moderate (0.66 mmol/L), and high (1.04 mmol/L). Cox proportional hazards models were used to evaluate the relationship between RC groups and HF risk. We performed discordance analysis to evaluate whether RC was associated with HF risk independently of low-density lipoprotein cholesterol (LDL-C). Results During a mean follow-up period of 11.5 years, there were a total of 2 232 HF events observed. The moderate RC group was significantly related with a 15% increased risk of HF when compared with low RC group (hazard ratio [HR] = 1.15, 95% confidence interval [CI]: 1.01—1.32), while the high RC group with a 23% higher HF risk (HR = 1.23, 95% CI: 1.05—1.43). There was significant relationship between RC as a continuous measure and the increased HF risk (P Conclusions Elevated RC was significantly associated with risk of HF in patients with DM. Moreover, RC was significantly related to HF risk independent of LDL-C measures. These findings may highlight the importance of RC management to HF risk in patients with DM.
- Published
- 2023
- Full Text
- View/download PDF
3. Association between metabolically healthy obesity and risk of atrial fibrillation: taking physical activity into consideration
- Author
-
Ruoting Wang, Ivan Olier, Sandra Ortega-Martorell, Yingxin Liu, Zebing Ye, Gregory YH Lip, and Guowei Li
- Subjects
Metabolic Syndrome ,Obesity, Metabolically Benign ,Physical activity ,Endocrinology, Diabetes and Metabolism ,Metabolic status ,Obesity, Metabolically Benign/diagnosis ,Atrial fibrillation ,Body Mass Index ,Risk Factors ,Atrial Fibrillation ,Atrial Fibrillation/diagnosis ,Humans ,Obesity ,Obesity/complications ,Cardiology and Cardiovascular Medicine ,Exercise - Abstract
The modification of physical activity (PA) on the metabolic status in relation to atrial fibrillation (AF) in obesity remains unknown. We aimed to investigate the independent and joint associations of metabolic status and PA with the risk of AF in obese population. Based on the data from UK Biobank study, we used Cox proportional hazards models for analyses. Metabolic status was categorized into metabolically healthy obesity (MHO) and metabolically unhealthy obesity (MUO). PA was categorized into four groups according to the level of moderate-to-vigorous PA (MVPA): none, low, medium, and high. A total of 119,424 obese participants were included for analyses. MHO was significantly associated with a 35% reduced AF risk compared with MUO (HR = 0.65, 95% CI: 0.57–0.73). No significant modification of PA on AF risk among individuals with MHO was found. Among the MUO participants, individuals with medium and high PA had significantly lower AF risk compared with no MVPA (HR = 0.84, 95% CI: 0.74–0.95, and HR = 0.87, 95% CI: 0.78–0.96 for medium and high PA, respectively). As the severity of MUO increased, the modification of PA on AF risk was elevated accordingly. To conclude, MHO was significantly associated with a reduced risk of AF when compared with MUO in obese participants. PA could significantly modify the relationship between metabolic status and risk of AF among MUO participants, with particular benefits of PA associated with the reduced AF risk as the MUO severity elevated.
- Published
- 2022
- Full Text
- View/download PDF
4. The Athlete's Heart and Machine Learning: A Review of Current Implementations and Gaps for Future Research
- Author
-
Ryan A. A. Bellfield, Sandra Ortega-Martorell, Gregory Y. H. Lip, David Oxborough, and Ivan Olier
- Subjects
Pharmacology (medical) ,General Pharmacology, Toxicology and Pharmaceutics - Abstract
Background: Intense training exercise regimes cause physiological changes within the heart to help cope with the increased stress, known as the “athlete’s heart”. These changes can mask pathological changes, making them harder to diagnose and increasing the risk of an adverse cardiac outcome. Aim: This paper reviews which machine learning techniques (ML) are being used within athlete’s heart research and how they are being implemented, as well as assesses the uptake of these techniques within this area of research. Methods: Searches were carried out on the Scopus and PubMed online datasets and a scoping review was conducted on the studies which were identified. Results: Twenty-eight studies were included within the review, with ML being directly referenced within 16 (57%). A total of 12 different techniques were used, with the most popular being artificial neural networks and the most common implementation being to perform classification tasks. The review also highlighted the subgroups of interest: predictive modelling, reviews, and wearables, with most of the studies being attributed to the predictive modelling subgroup. The most common type of data used was the electrocardiogram (ECG), with echocardiograms being used the second most often. Conclusion: The results show that over the last 11 years, there has been a growing desire of leveraging ML techniques to help further the understanding of the athlete’s heart, whether it be by expanding the knowledge of the physiological changes or by improving the accuracies of models to help improve the treatments and disease management.
- Published
- 2022
5. Renin-angiotensin system inhibitors effect before and during hospitalization in COVID-19 outcomes: Final analysis of the international HOPE COVID-19 (Health Outcome Predictive Evaluation for COVID-19) registry
- Author
-
Martino Pepe, Rodolfo Romero, Francisco Marín, Álvaro López-Masjuan, Juan García-Prieto, Marcos García-Aguado, Javier Elola, Antonio Fernández-Ortiz, Hope Covid Investigators, Carolina Espejo, Sergio Raposeiras-Roubín, Víctor Manuel Becerra-Muñoz, Christoph Liebetrau, Ivan Olier, Carlos Macaya, Jorge Luis Jativa Mendez, Charbel Maroun-Eid, Elvira Bondia, Alex F Castro-Mejía, Enrico Cerrato, Adelina Gonzalez, Javier López-Pais, Inmaculada Fernández-Rozas, Aitor Uribarri, Fabrizio Ugo, Harish Ramakrishna, Iván J. Núñez-Gil, María C Viana-Llamas, Mohammad Abumayyaleh, Miguel Corbí-Pascual, Gisela Feltes, Emilio Alfonso-Rodríguez, and Vicente Estrada
- Subjects
QA75 ,Male ,medicine.medical_specialty ,Heart disease ,Infecciones por coronavirus ,medicine.medical_treatment ,Inhibidores enzimáticos ,Population ,Clinical Investigations ,Enfermedad transmisible ,Angiotensin-Converting Enzyme Inhibitors ,Sistema renina-angiotensina ,Comorbidity ,030204 cardiovascular system & hematology ,Severity of Illness Index ,03 medical and health sciences ,0302 clinical medicine ,Risk Factors ,Outcome Assessment, Health Care ,Severity of illness ,medicine ,Humans ,Registries ,030212 general & internal medicine ,Medical prescription ,education ,Heart Failure ,Mechanical ventilation ,education.field_of_study ,SARS-CoV-2 ,business.industry ,COVID-19 ,Middle Aged ,Prognosis ,medicine.disease ,R1 ,Respiration, Artificial ,Hospitalization ,Italy ,Spain ,Heart failure ,Emergency medicine ,Female ,Observational study ,Cardiology and Cardiovascular Medicine ,business - Abstract
Background The use of Renin-Angiotensin system inhibitors (RASi) in patients with coronavirus disease 2019 (COVID-19) has been questioned because both share a target receptor site. Methods HOPE-COVID-19 (NCT04334291) is an international investigator-initiated registry. Patients are eligible when discharged after an in-hospital stay with COVID-19, dead or alive. Here, we analyze the impact of previous and continued in-hospital treatment with RASi in all-cause mortality and the development of in-stay complications. Results We included 6503 patients, over 18 years, from Spain and Italy with data on their RASi status. Of those, 36.8% were receiving any RASi before admission. RASi patients were older, more frequently male, with more comorbidities and frailer. Their probability of death and ICU admission was higher. However, after adjustment, these differences disappeared. Regarding RASi in-hospital use, those who continued the treatment were younger, with balanced comorbidities but with less severe COVID19. Raw mortality and secondary events were less frequent in RASi. After adjustment, patients receiving RASi still presented significantly better outcomes, with less mortality, ICU admissions, respiratory insufficiency, need for mechanical ventilation or prone, sepsis, SIRS and renal failure (p
- Published
- 2021
- Full Text
- View/download PDF
6. How machine learning is impacting research in atrial fibrillation: implications for risk prediction and future management
- Author
-
Ivan Olier, Gregory Y.H. Lip, Mark Pieroni, and Sandra Ortega-Martorell
- Subjects
QA75 ,Risk analysis ,Blood pressure control ,Artificial intelligence ,Databases, Factual ,Physiology ,Computer science ,Spotlight Reviews ,Clinical Decision-Making ,Atrial fibrillation ,030204 cardiovascular system & hematology ,Machine learning ,computer.software_genre ,Risk Assessment ,Decision Support Techniques ,Machine Learning ,Translational Research, Biomedical ,Stroke risk ,Electrocardiography ,03 medical and health sciences ,0302 clinical medicine ,Predictive Value of Tests ,Risk Factors ,Physiology (medical) ,Atrial Fibrillation ,medicine ,Animals ,Data Mining ,Humans ,AcademicSubjects/MED00200 ,Diagnosis, Computer-Assisted ,mHealth ,Modalities ,Modality (human–computer interaction) ,Wearables ,business.industry ,Prognosis ,medicine.disease ,R1 ,Therapy, Computer-Assisted ,Deep neural networks ,Cardiology and Cardiovascular Medicine ,business ,computer ,030217 neurology & neurosurgery - Abstract
There has been an exponential growth of artificial intelligence (AI) and machine learning (ML) publications aimed at advancing our understanding of atrial fibrillation (AF), which has been mainly driven by the confluence of two factors: the advances in deep neural networks (DeepNNs) and the availability of large, open access databases. It is observed that most of the attention has centred on applying ML for dvsetecting AF, particularly using electrocardiograms (ECGs) as the main data modality. Nearly a third of them used DeepNNs to minimize or eliminate the need for transforming the ECGs to extract features prior to ML modelling; however, we did not observe a significant advantage in following this approach. We also found a fraction of studies using other data modalities, and others centred in aims, such as risk prediction, AF management, and others. From the clinical perspective, AI/ML can help expand the utility of AF detection and risk prediction, especially for patients with additional comorbidities. The use of AI/ML for detection and risk prediction into applications and smart mobile health (mHealth) technology would enable ‘real time’ dynamic assessments. AI/ML could also adapt to treatment changes over time, as well as incident risk factors. Incorporation of a dynamic AI/ML model into mHealth technology would facilitate ‘real time’ assessment of stroke risk, facilitating mitigation of modifiable risk factors (e.g. blood pressure control). Overall, this would lead to an improvement in clinical care for patients with AF., Graphical Abstract
- Published
- 2021
- Full Text
- View/download PDF
7. Breast cancer patient characterisation and visualisation using deep learning and fisher information networks
- Author
-
Sandra Ortega-Martorell, Patrick Riley, Ivan Olier, Renata G. Raidou, Raul Casana-Eslava, Marc Rea, Li Shen, Paulo J. G. Lisboa, and Carlo Palmieri
- Subjects
QA75 ,RC0254 ,Information Services ,Multidisciplinary ,Deep Learning ,Humans ,Breast Neoplasms ,Female ,Breast ,RA ,Mammography - Abstract
Breast cancer is the most commonly diagnosed female malignancy globally, with better survival rates if diagnosed early. Mammography is the gold standard in screening programmes for breast cancer, but despite technological advances, high error rates are still reported. Machine learning techniques, and in particular deep learning (DL), have been successfully used for breast cancer detection and classification. However, the added complexity that makes DL models so successful reduces their ability to explain which features are relevant to the model, or whether the model is biased. The main aim of this study is to propose a novel visualisation to help characterise breast cancer patients using Fisher Information Networks on features extracted from mammograms using a DL model. In the proposed visualisation, patients are mapped out according to their similarities and can be used to study new patients as a ‘patient-like-me’ approach. When applied to the CBIS-DDSM dataset, it was shown that it is a competitive methodology that can (i) facilitate the analysis and decision-making process in breast cancer diagnosis with the assistance of the FIN visualisations and ‘patient-like-me’ analysis, and (ii) help improve diagnostic accuracy and reduce overdiagnosis by identifying the most likely diagnosis based on clinical similarities with neighbouring patients.
- Published
- 2022
8. How to Open a Black Box Classifier for Tabular Data
- Author
-
Bradley Walters, Sandra Ortega-Martorell, Ivan Olier, and Paulo J. G. Lisboa
- Subjects
ANOVA ,Shapley values ,self-explaining neural networks ,generalised additive models ,interpretability ,Computational Mathematics ,Numerical Analysis ,Computational Theory and Mathematics ,Theoretical Computer Science - Abstract
A lack of transparency in machine learning models can limit their application. We show that analysis of variance (ANOVA) methods extract interpretable predictive models from them. This is possible because ANOVA decompositions represent multivariate functions as sums of functions of fewer variables. Retaining the terms in the ANOVA summation involving functions of only one or two variables provides an efficient method to open black box classifiers. The proposed method builds generalised additive models (GAMs) by application of L1 regularised logistic regression to the component terms retained from the ANOVA decomposition of the logit function. The resulting GAMs are derived using two alternative measures, Dirac and Lebesgue. Both measures produce functions that are smooth and consistent. The term partial responses in structured models (PRiSM) describes the family of models that are derived from black box classifiers by application of ANOVA decompositions. We demonstrate their interpretability and performance for the multilayer perceptron, support vector machines and gradient-boosting machines applied to synthetic data and several real-world data sets, namely Pima Diabetes, German Credit Card, and Statlog Shuttle from the UCI repository. The GAMs are shown to be compliant with the basic principles of a formal framework for interpretability.
- Published
- 2023
- Full Text
- View/download PDF
9. Externally validated models for first diagnosis and risk of progression of knee osteoarthritis
- Author
-
Philippa Grace McCabe, Paulo Lisboa, Bill Baltzopoulos, and Ivan Olier
- Subjects
Radiography ,Multidisciplinary ,Knee Joint ,Risk Factors ,Disease Progression ,Humans ,Osteoarthritis, Knee ,QA ,QP - Abstract
Objective We develop and externally validate two models for use with radiological knee osteoarthritis. They consist of a diagnostic model for KOA and a prognostic model of time to onset of KOA. Model development and optimisation used data from the Osteoarthritis initiative (OAI) and external validation for both models was by application to data from the Multicenter Osteoarthritis Study (MOST). Materials and methods The diagnostic model at first presentation comprises subjects in the OAI with and without KOA (n = 2006), modelling with multivariate logistic regression. The prognostic sample involves 5-year follow-up of subjects presenting without clinical KOA (n = 1155), with modelling with Cox regression. In both instances the models used training data sets of n = 1353 and 1002 subjects and optimisation used test data sets of n = 1354 and 1003. The external validation data sets for the diagnostic and prognostic models comprised n = 2006 and n = 1155 subjects respectively. Results The classification performance of the diagnostic model on the test data has an AUC of 0.748 (0.721–0.774) and 0.670 (0.631–0.708) in external validation. The survival model has concordance scores for the OAI test set of 0.74 (0.7325–0.7439) and in external validation 0.72 (0.7190–0.7373). The survival approach stratified the population into two risk cohorts. The separation between the cohorts remains when the model is applied to the validation data. Discussion The models produced are interpretable with app interfaces that implement nomograms. The apps may be used for stratification and for patient education over the impact of modifiable risk factors. The externally validated results, by application to data from a substantial prospective observational study, show the robustness of models for likelihood of presenting with KOA at an initial assessment based on risk factors identified by the OAI protocol and stratification of risk for developing KOA in the next five years. Conclusion Modelling clinical KOA from OAI data validates well for the MOST data set. Both risk models identified key factors for differentiation of the target population from commonly available variables. With this analysis there is potential to improve clinical management of patients.
- Published
- 2022
- Full Text
- View/download PDF
10. Associations of Hepatosteatosis With Cardiovascular Disease in HIV-Positive and HIV-Negative Patients: The Liverpool HIV-Heart Project
- Author
-
Gregory Y.H. Lip, Ivan Olier, Scott W. Murray, T Heseltine, Saye Khoo, and Sandra Ortega-Martorell
- Subjects
Adult ,Male ,medicine.medical_specialty ,Population ,HIV Infections ,Disease ,Logistic regression ,Gastroenterology ,RA0421 ,Risk Factors ,Internal medicine ,HIV Seronegativity ,medicine ,Humans ,Pharmacology (medical) ,education ,education.field_of_study ,Receiver operating characteristic ,business.industry ,Odds ratio ,Middle Aged ,medicine.disease ,Confidence interval ,Fatty Liver ,Infectious Diseases ,Cardiovascular Diseases ,Female ,Metabolic syndrome ,business ,Tomography, X-Ray Computed ,Dyslipidemia - Abstract
BACKGROUND Hepatosteatosis (HS) has been associated with cardiovascular disorders in the general population. We sought to investigate whether HS is a marker of cardiovascular disease (CVD) risk in HIV-positive individuals, given that metabolic syndrome is implicated in the increasing CVD burden in this population. AIMS To investigate the association of HS with CVD in HIV-positive and HIV-negative individuals. METHODS AND RESULTS We analyzed computed tomography (CT) images of 1306 subjects of whom 209 (16%) were HIV-positive and 1097 (84%) HIV-negative. CVD was quantified by the presence of coronary calcification from both dedicated cardiac CT and nondedicated thorax CT. HS was diagnosed from CT data sets in those with noncontrast dedicated cardiac CT and those with venous phase liver CT using previously validated techniques. Previous liver ultrasound was also assessed for the presence of HS. The HIV-positive group had lower mean age (P < 0.005), higher proportions of male sex (P < 0.005), and more current smokers (P < 0.005). The HIV-negative group had higher proportions of hypertension (P < 0.005), type II diabetes (P = 0.032), dyslipidemia (P < 0.005), statin use (P = 0.008), and HS (P = 0.018). The prevalence of coronary calcification was not significantly different between the groups. Logistic regression (LR) demonstrated that in the HIV-positive group, increasing age [odds ratio (OR): 1.15, P < 0.005], male sex (OR 3.37, P = 0.022), and HS (OR 3.13, P = 0.005) were independently associated with CVD. In the HIV-negative group, increasing age (OR: 1.11, P < 0.005), male sex (OR 2.97, P < 0.005), current smoking (OR 1.96, P < 0.005), and dyslipidemia (OR 1.66, P = 0.03) were independently associated with CVD. Using a machine learning random forest algorithm to assess the variables of importance, the top 3 variables of importance in the HIV-positive group were age, HS, and male sex. In the HIV-negative group, the top 3 variables were age, hypertension and male sex. The LR models predicted CVD well, with the mean area under the receiver operator curve (AUC) for the HIV-positive and HIV-negative cohorts being 0.831 [95% confidence interval (CI): 0.713 to 0.928] and 0.786 (95% CI: 0.735 to 0.836), respectively. The random forest models outperformed LR models, with a mean AUC in HIV-positive and HIV-negative populations of 0.877 (95% CI: 0.775 to 0.959) and 0.828 (95% CI: 0.780 to 0.873) respectively, with differences between both methods being statistically significant. CONCLUSION In contrast to the general population, HS is a strong and independent predictor of CVD in HIV-positive individuals. This suggests that metabolic dysfunction may be attributable to the excess CVD risk seen with these patient groups. Assessment of HS may help accurate quantification of CVD risk in HIV-positive patients.
- Published
- 2021
11. Chitosan nanoparticles for enhancing drugs and cosmetic components penetration through the skin
- Author
-
Nicola Browning, Quynh Ta, Kehinde Ross, Raida Al-Kassas, Ivan Olier, Jessica Ting, Alan Simm, and Sophie Harwood
- Subjects
RM ,Swine ,Pharmaceutical Science ,Nanoparticle ,02 engineering and technology ,030226 pharmacology & pharmacy ,Chitosan ,Dermal fibroblast ,03 medical and health sciences ,chemistry.chemical_compound ,0302 clinical medicine ,Drug Delivery Systems ,mental disorders ,Zeta potential ,Animals ,MTT assay ,Particle Size ,Drug Carriers ,technology, industry, and agriculture ,Penetration (firestop) ,021001 nanoscience & nanotechnology ,chemistry ,Pharmaceutical Preparations ,Drug delivery ,Nanoparticles ,Particle size ,0210 nano-technology ,Nuclear chemistry - Abstract
Chitosan nanoparticles (CT NPs) have attractive biomedical applications due to their unique properties. This present research aimed at development of chitosan nanoparticles to be used as skin delivery systems for cosmetic components and drugs and to track their penetration behaviour through pig skin. CT NPs were prepared by ionic gelation technique using sodium tripolyphosphate (TPP) and Acacia as crosslinkers. The particle sizes of NPs appeared to be dependent on the molecular weight of chitosan and concentration of both chitosan and crosslinkers. CT NPs were positively charged as demonstrated by their Zeta potential values. The formation of the nanoparticles was confirmed by FTIR and DSC. Both SEM and TEM micrographs showed that both CT-Acacia and CT:TPP NPs were smooth, spherical in shape and are distributed uniformly with a size range of 200nm to 300 nm. The CT:TPP NPs retained an average of 98% of the added water over a 48-hour period. CT-Acacia NPs showed high moisture absorption but lower moisture retention capacity, which indicates their competency to entrap polar actives in cosmetics and release the encapsulated actives in low polarity skin conditions. The cytotoxicity studies using MTT assay showed that CT NPs made using TPP or Acacia crosslinkers were similarly non-toxic to the human dermal fibroblast cells. Cellular uptake study of NPs observed using live-cell imaging microscopy, proving the great cellular internalisation of CT:TPP NPs and CT-Acacia NPs. Confocal laser scanning microscopy revealed that CT NPs of particle size 530nm containing fluorescein sodium salt as a marker were able to penetrate through the pig skin and gather in the dermis layer. These results show that CT NPs have the ability to deliver the actives and cosmetic components through the skin and to be used as cosmetics and dermal drug delivery system.
- Published
- 2020
12. Using MLP partial responses to explain in-hospital mortality in ICU
- Author
-
Ivan Olier, Sandra Ortega-Martorell, Paulo J. G. Lisboa, and Annabel Sansom
- Subjects
QA75 ,Noise ,Variable (computer science) ,In hospital mortality ,Intensive care ,Multilayer perceptron ,Statistics ,Linear model ,Logistic regression ,RA ,QA76 ,Data modeling ,Mathematics - Abstract
In this paper we propose to use partial responses derived from an initial multilayer perceptron (MLP) to build an explanatory risk prediction model of in-hospital mortality in intensive care units (ICU). Traditionally, MLPs deliver higher performance than linear models such as multivariate logistic regression (MLR). However, MLPs interlink input variables in such a complex way that is not straightforward to explain how the outcome is influenced by inputs and/or input interactions. In this paper, we hypothesized that in some scenarios, such as when the data noise is significant or when the data is just marginally non-linear, we could find slightly more complex associations by obtaining MLP partial responses. That is, by letting change one variable at the time, while keeping constant the rest. Overall, we found that, although the MLR and MLP in-hospital mortality model performances were equivalent, the MLP could explain non-linear associations that otherwise the MLR had considered non-significant. We considered that, although deeming higher-other interactions as disposable noise could be a strong assumption, building explanatory models based on the MLP partial responses could still be more informative than on MLR.
- Published
- 2020
- Full Text
- View/download PDF
13. Multi-task learning with a natural metric for quantitative structure activity relationship learning
- Author
-
Jérémy Besnard, Joaquin Vanschoren, Crina Grosan, Jan N. van Rijn, Ross D. King, Noureddin Sadawi, Ivan Olier, Larisa N. Soldatova, G. Richard J. Bickerton, Data Mining, Soldatova, Larisa [0000-0001-6489-3029], and Apollo - University of Cambridge Repository
- Subjects
Quantitative structure–activity relationship ,Computer science ,media_common.quotation_subject ,education ,multi-task learning ,Multi-task learning ,Sequence-based similarity ,quantitative structure activity relationship ,02 engineering and technology ,Library and Information Sciences ,Machine learning ,computer.software_genre ,sequence-based similarity ,lcsh:Chemistry ,03 medical and health sciences ,0202 electrical engineering, electronic engineering, information engineering ,Feature (machine learning) ,Physical and Theoretical Chemistry ,QA ,Function (engineering) ,030304 developmental biology ,media_common ,0303 health sciences ,lcsh:T58.5-58.64 ,lcsh:Information technology ,business.industry ,chEMBL ,Computer Graphics and Computer-Aided Design ,Computer Science Applications ,Random forest ,Drug activity ,lcsh:QD1-999 ,Quantitative structure activity relationship ,Metric (mathematics) ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,random forest ,psychological phenomena and processes ,Research Article - Abstract
© The Author(s) 2019. The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays. We used datasets containing curated records about the activity of specific compounds on drug targets provided by ChEMBL. Totally, 1091 assays have been analysed. As a baseline, a single task learning approach that trains random forest to predict drug activity for each drug target individually was considered. We then carried out feature-based and instance-based MTL to predict drug activities. We introduced a natural metric of evolutionary distance between drug targets as a measure of tasks relatedness. Instance-based MTL significantly outperformed both, feature-based MTL and the base learner, on 741 drug targets out of 1091. Feature-based MTL won on 179 occasions and the base learner performed best on 171 drug targets. We conclude that MTL QSAR is improved by incorporating the evolutionary distance between targets. These results indicate that QSAR learning can be performed effectively, even if little data is available for specific drug targets, by leveraging what is known about similar drug targets. This research was funded by the Engineering and Physical Sciences Research Council (EPSRC) grant EP/K030469/1. NS would like to thank the EU PhenoM-eNal project (Horizon 2020, 654241)
- Published
- 2020
- Full Text
- View/download PDF
14. Impact of Access Site Practice on Clinical Outcomes in Patients Undergoing Percutaneous Coronary Intervention Following Thrombolysis for ST-Segment Elevation Myocardial Infarction in the United Kingdom
- Author
-
Mamas A. Mamas, Peter Ludman, Chun Shing Kwok, Claire A Rushton, James Nolan, Tim Kinnaird, Mark A. de Belder, Muhammad Rashid, Evangelos Kontopantelis, and Ivan Olier
- Subjects
medicine.medical_specialty ,business.industry ,medicine.medical_treatment ,Percutaneous coronary intervention ,Odds ratio ,Thrombolysis ,030204 cardiovascular system & hematology ,medicine.disease ,03 medical and health sciences ,0302 clinical medicine ,Internal medicine ,Conventional PCI ,Propensity score matching ,Cohort ,Cardiology ,Medicine ,cardiovascular diseases ,030212 general & internal medicine ,Myocardial infarction ,Cardiology and Cardiovascular Medicine ,business ,Mace - Abstract
Objectives This study sought to examine the relationship between access site practice and clinical outcomes in patients requiring percutaneous coronary intervention (PCI) following thrombolysis for ST-segment elevation myocardial infarction (STEMI). Background Transradial access (TRA) is associated with better outcomes in patients requiring PCI for STEMI. A significant proportion of STEMI patients may receive thrombolysis before undergoing PCI in many countries across the world. There are limited data around access site practice and its associated outcomes in this cohort of patients. Methods The author used the British Cardiovascular Intervention Society dataset to investigate the outcomes of patients undergoing PCI following thrombolysis between 2007 and 2014. Patients were divided into TRA and transfemoral access groups depending on the access site used. Multiple logistic regression and propensity score matching were used to study the association of access site with in-hospital and long-term mortality, major bleeding, and access site–related complications. Results A total of 10,209 patients received thrombolysis and PCI during the study time. TRA was used in 48% (n = 4,959) of patients; 3.3% (n = 336) patients died in hospital, 1.6% (n = 165) of patients experienced major bleeding, 4.2% (n = 437) experienced major adverse cardiac events (MACE), and 4.6% (n = 468) experienced 30-day mortality. After multivariate adjustment, TRA was associated with significantly reduced odds of in-hospital mortality (odds ratio [OR]: 0.59; 95% confidence interval [CI]: 0.42 to 0.83; p = 0.002), major bleeding (OR: 0.45; 95% CI: 0.31 to 0.66; p Conclusions TRA is associated with decreased odds of bleeding complications, mortality, and MACE in patients undergoing PCI following thrombolysis and should be preferred access site choice in this cohort of patients.
- Published
- 2017
- Full Text
- View/download PDF
15. Efficient Estimation of General Additive Neural Networks: A Case Study for CTG Data
- Author
-
Ivan Olier, Paulo J. G. Lisboa, M. Jayabalan, and Sandra Ortega-Martorell
- Subjects
Estimation ,Artificial neural network ,Computer science ,business.industry ,Artificial intelligence ,Machine learning ,computer.software_genre ,business ,computer ,Generalised additive model ,Interpretability - Abstract
This paper discusses the concepts of interpretability and explainability and outlines desiderata for robust interpretability. It then describes a neural network model that meets all criteria, with the addition of global faithfulness.
- Published
- 2020
- Full Text
- View/download PDF
16. Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
- Author
-
Ivan Olier, Paulo J. G. Lisboa, and Sandra Ortega-Martorell
- Subjects
Cervical cancer ,Artificial neural network ,business.industry ,Computer science ,Model selection ,02 engineering and technology ,Missing data ,Perceptron ,Machine learning ,computer.software_genre ,medicine.disease ,Data set ,020204 information systems ,Covariate ,0202 electrical engineering, electronic engineering, information engineering ,medicine ,020201 artificial intelligence & image processing ,Artificial intelligence ,Representation (mathematics) ,business ,computer - Abstract
Neural networks are frequently applied to medical data. We describe how complex and imbalanced data can be modelled with simple but accurate neural networks that are transparent to the user. In the case of a data set on cervical cancer with 753 observations excluding, missing values, and 32 covariates, with a prevalence of 73 cases (9.69%), we explain how model selection can be applied to the Multi-Layer Perceptron (MLP) by deriving a representation using a General Additive Neural Network.
- Published
- 2020
- Full Text
- View/download PDF
17. Benchmarking multi-task learning in predictive models for drug discovery
- Author
-
Ivan Olier, Philippa Grace McCabe, and Sandra Ortega-Martorell
- Subjects
0303 health sciences ,Quantitative structure–activity relationship ,Protein family ,Computer science ,business.industry ,Drug discovery ,Drug target ,A protein ,Multi-task learning ,02 engineering and technology ,Benchmarking ,Machine learning ,computer.software_genre ,Task (project management) ,Random forest ,03 medical and health sciences ,Drug development ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,030304 developmental biology - Abstract
Being able to predict the activity of chemical compounds against a drug target (e.g. a protein) in the preliminary stages of drug development is critical. In drug discovery, this is known as Quantitative Structure Activity Relationships (QSARs). Datasets for QSARs are often ill-posed for traditional machine learning to provide meaningful insights (e.g. very high dimensionality). Here, we propose a multi-task learning (MTL) approach to enrich the original QSAR datasets with the hope of improving overall QSAR performance. The proposed approach, henceforth named MTL-AT, increases the size of the useable data by the use of an assistant task: a supplementary dataset formed by compounds automatically extracted from other possibly related tasks. The main novelty in our MTL-AT approach is the addition of control for data leakage. We tested MTL-AT in two drug discovery scenarios: 1) using 100 unrelated QSAR datasets, and 2) using 20 QSAR datasets that are related to the same protein family. Results were compared against equivalent single-task approach (STL). MTL-AT outperformed STL in 45 tasks of scenario 1, and in 12 tasks of scenario 2. The best overall method appears to be MTL-AT on both scenarios, with the small datasets yielded the best performance improvement from using multi-task learning. These results show that implementing multi-task learning with QSAR data has promise, but more investigation is required to test its applicability to certain features in datasets to make it suitable for widespread use in the drug discovery area. To the best of our knowledge, this is the first study that benchmarks the use of MTL on a large number of small datasets, which represents a more realistic scenario in drug development.
- Published
- 2019
- Full Text
- View/download PDF
18. Classifying and Grouping Mammography Images into Communities Using Fisher Information Networks to Assist the Diagnosis of Breast Cancer
- Author
-
Patrick Riley, Meenal Srivastava, Sandra Ortega-Martorell, Paulo J. G. Lisboa, and Ivan Olier
- Subjects
Similarity (geometry) ,Computer science ,computer.software_genre ,QA76 ,030218 nuclear medicine & medical imaging ,Visualization ,03 medical and health sciences ,symbols.namesake ,0302 clinical medicine ,RA0421 ,030220 oncology & carcinogenesis ,Metric (mathematics) ,symbols ,Probability distribution ,Information geometry ,Data mining ,Fisher information ,computer ,Fisher information metric ,Interpretability - Abstract
© 2020, Springer Nature Switzerland AG. The aim of this paper is to build a computer based clinical decision support tool using a semi-supervised framework, the Fisher Information Network (FIN), for visualization of a set of mammographic images. The FIN organizes the images into a similarity network from which, for any new image, reference images that are closely related can be identified. This enables clinicians to review not just the reference images but also ancillary information e.g. about response to therapy. The Fisher information metric defines a Riemannian space where distances reflect similarity with respect to a given probability distribution. This metric is informed about generative properties of data, and hence assesses the importance of directions in space of parameters. It automatically performs feature relevance detection. This approach focusses on the interpretability of the model from the standpoint of the clinical user. Model predictions were validated using the prevalence of classes in each of the clusters identified by the FIN.
- Published
- 2019
- Full Text
- View/download PDF
19. MRSI-based molecular imaging of therapy response to temozolomide in preclinical glioblastoma using source analysis
- Author
-
Ana Paula Candiota, Francisco V. Fernández, Martí Pumarola, Ivan Olier, Carles Arús, Teresa Delgado-Goñi, Paulo J. G. Lisboa, Margarida Julià-Sapé, Sandra Ortega-Martorell, and Magdalena Ciezka
- Subjects
Pathology ,medicine.medical_specialty ,Temozolomide ,medicine.diagnostic_test ,business.industry ,Dacarbazine ,Magnetic resonance spectroscopic imaging ,Magnetic resonance imaging ,medicine.disease ,03 medical and health sciences ,0302 clinical medicine ,Therapy response ,030220 oncology & carcinogenesis ,medicine ,Molecular Medicine ,Radiology, Nuclear Medicine and imaging ,Histopathology ,Molecular imaging ,business ,030217 neurology & neurosurgery ,Spectroscopy ,medicine.drug ,Glioblastoma - Abstract
Characterization of glioblastoma (GB) response to treatment is a key factor for improving patients' survival and prognosis. MRI and magnetic resonance spectroscopic imaging (MRSI) provide morphologic and metabolic profiles of GB but usually fail to produce unequivocal biomarkers of response. The purpose of this work is to provide proof of concept of the ability of a semi-supervised signal source extraction methodology to produce images with robust recognition of response to temozolomide (TMZ) in a preclinical GB model. A total of 38 female C57BL/6 mice were used in this study. The semi-supervised methodology extracted the required sources from a training set consisting of MRSI grids from eight GL261 GBs treated with TMZ, and six control untreated GBs. Three different sources (normal brain parenchyma, actively proliferating GB and GB responding to treatment) were extracted and used for calculating nosologic maps representing the spatial response to treatment. These results were validated with an independent test set (7 control and 17 treated cases) and correlated with histopathology. Major differences between the responder and non-responder sources were mainly related to the resonances of mobile lipids (MLs) and polyunsaturated fatty acids in MLs (0.9, 1.3 and 2.8 ppm). Responding tumors showed significantly lower mitotic (3.3 ± 2.9 versus 14.1 ± 4.2 mitoses/field) and proliferation rates (29.8 ± 10.3 versus 57.8 ± 5.4%) than control untreated cases. The methodology described in this work is able to produce nosological images of response to TMZ in GL261 preclinical GBs and suitably correlates with the histopathological analysis of tumors. A similar strategy could be devised for monitoring response to treatment in patients. Copyright © 2016 John Wiley & Sons, Ltd.
- Published
- 2016
- Full Text
- View/download PDF
20. A Novel Methodology for Prediction Urban Water Demand by Wavelet Denoising and Adaptive Neuro-Fuzzy Inference System Approach
- Author
-
Salah L. Zubaidi, Nabeel Saleem Saad Al-Bdairi, Khalid S. Hashim, Ivan Olier, Patryk Kot, Sadik Kamel Gharghan, Hussein Al-Bugharbee, and Sandra Ortega-Martorell
- Subjects
Discrete wavelet transform ,lcsh:Hydraulic engineering ,Computer science ,Geography, Planning and Development ,crow search algorithm ,Aquatic Science ,computer.software_genre ,Biochemistry ,wavelet denoising ,Engineering optimization ,lcsh:Water supply for domestic and industrial purposes ,lcsh:TC1-978 ,Range (statistics) ,ANFIS ,QA ,Water Science and Technology ,lcsh:TD201-500 ,Adaptive neuro fuzzy inference system ,Artificial neural network ,municipal water demand ,TA ,Metric (mathematics) ,Data pre-processing ,Data mining ,TD ,Nash–Sutcliffe model efficiency coefficient ,computer - Abstract
Accurate and reliable urban water demand prediction is imperative for providing the basis to design, operate, and manage water system, especially under the scarcity of the natural water resources. A new methodology combining discrete wavelet transform (DWT) with an adaptive neuro-fuzzy inference system (ANFIS) is proposed to predict monthly urban water demand based on several intervals of historical water consumption. This ANFIS model is evaluated against a hybrid crow search algorithm and artificial neural network (CSA-ANN), since these methods have been successfully used recently to tackle a range of engineering optimization problems. The study outcomes reveal that 1) data preprocessing is essential for denoising raw time series and choosing the model inputs to render the highest model performance, 2) both methodologies, ANFIS and CSA-ANN, are statistically equivalent and capable of accurately predicting monthly urban water demand with high accuracy based on several statistical metric measures such as coefficient of efficiency (0.974, 0.971, respectively). This study could help policymakers to manage extensions of urban water system in response to the increasing demand with low risk related to a decision.
- Published
- 2020
- Full Text
- View/download PDF
21. Comparative Analysis for Computer-Based Decision Support: Case Study of Knee Osteoarthritis
- Author
-
Ian H. Jarman, Paulo J. G. Lisboa, Philippa Grace McCabe, Ivan Olier, Sandra Ortega-Martorell, and Vasilios Baltzopoulos
- Subjects
050101 languages & linguistics ,Decision support system ,Artificial neural network ,business.industry ,05 social sciences ,Decision tree ,02 engineering and technology ,Logistic regression ,Machine learning ,computer.software_genre ,CHAID ,Regression ,Multilayer perceptron ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,0501 psychology and cognitive sciences ,Artificial intelligence ,business ,computer ,Interpretability - Abstract
This case study benchmarks a range of statistical and machine learning methods relevant to computer-based decision support in clinical medicine, focusing on the diagnosis of knee osteoarthritis at first presentation. The methods, comprising logistic regression, Multilayer Perceptron (MLP), Chi-square Automatic Interaction Detector (CHAID) and Classification and Regression Trees (CART), are applied to a public domain database, the Osteoarthritis Initiative (OAI), a 10 year longitudinal study starting in 2002 (n = 4,796). In this real-world application, it is shown that logistic regression is comparable with the neural networks and decision trees for discrimination of positive diagnosis on this data set. This is likely because of weak non-linearities among high levels of noise. After comparing the explanations provided by the different methods, it is concluded that the interpretability of the risk score index provided by logistic regression is expressed in a form that most naturally integrates with clinical reasoning. The reason for this is that it gives a statistical assessment of the weight of evidence for making the diagnosis, so providing a direction for future research to improve explanation of generic non-linear models.
- Published
- 2019
- Full Text
- View/download PDF
22. A Genomewide Screen for Tolerance to Cationic Drugs Reveals Genes Important for Potassium Homeostasis in Saccharomyces cerevisiae
- Author
-
Ivan Olier, Clara Navarrete, Rito Herrera, Lydie Maresova, Joaquín Ariño, Jesús Giraldo, José Ramos, Silvia Petrezsélyová, Lina Barreto, Lynne Yenush, Jorge Pérez-Valle, David Canadell, and Hana Sychrová
- Subjects
Saccharomyces cerevisiae Proteins ,Quaternary ammonium derivative ,TRK1 protein, S cerevisiae ,Physiology ,Potassium ,Mutant ,Saccharomyces cerevisiae ,chemistry.chemical_element ,Microbiology ,Article ,Membrane Potentials ,chemistry.chemical_compound ,Saccharomyces cerevisiae protein ,Genetics ,BIOQUIMICA Y BIOLOGIA MOLECULAR ,Homeostasis ,Transport at the cellular level ,Electrochemical gradient ,Cation Transport Proteins ,Molecular Biology ,Gene ,biology ,Tetramethylammonium ,Kinase ,Cell membrane potential ,Biological Transport ,Articles ,General Medicine ,PMA1 protein, S cerevisiae ,biology.organism_classification ,Cation transport protein ,Quaternary Ammonium Compounds ,Proton-Translocating ATPases ,Metabolism ,Phenotype ,Proton transporting adenosine triphosphatase ,chemistry ,Biochemistry ,Mutation ,Spermine ,Hygromycin B ,Intracellular - Abstract
[EN] Potassium homeostasis is crucial for living cells. In the yeast Saccharomyces cerevisiae, the uptake of potassium is driven by the electrochemical gradient generated by the Pma1 H +-ATPase, and this process represents a major consumer of the gradient. We considered that any mutation resulting in an alteration of the electrochemical gradient could give rise to anomalous sensitivity to any cationic drug independently of its toxicity mechanism. Here, we describe a genomewide screen for mutants that present altered tolerance to hygromycin B, spermine, and tetramethylammonium. Two hundred twenty-six mutant strains displayed altered tolerance to all three drugs (202 hypersensitive and 24 hypertolerant), and more than 50% presented a strong or moderate growth defect at a limiting potassium concentration (1 mM). Functional groups such as protein kinases and phosphatases, intracellular trafficking, transcription, or cell cycle and DNA processing were enriched. Essentially, our screen has identified a substantial number of genes that were not previously described to play a direct or indirect role in potassium homeostasis. A subset of 27 representative mutants were selected and subjected to diverse biochemical tests that, in some cases, allowed us to postulate the basis for the observed phenotypes. © 2011, American Society for Microbiology. All Rights Reserved., This work was supported by grants BFU2008-04188-C03-01, GEN2006-27748-C2-1-E/SYS (SysMo ERA-NET), and EUI200904147 (SysMo2 ERA-NET) to J.A.; GEN2006-27748-C2-2-E/SYS (SysMo ERA-NET) and BFU2008-04188-C03-03 to J.R.; and BFU2008-04188-C03-03 to L.Y. (Ministry of Science and Innovation, Spain, and FEDER). Work in the laboratory of the Institute of Physiology in Prague was supported by grants MSMT LC531, GA AS CR IAA500110801, and AV0Z 50110509. J.A. was the recipient of an Ajut 2009SGR-1091 and an ICREA academia award (Generalitat de Catalunya).
- Published
- 2011
- Full Text
- View/download PDF
23. A variational Bayesian approach for the robust analysis of the cortical silent period from EMG recordings of brain stroke patients
- Author
-
Julií Amengual, Ivan Olier, and Alfredo Vellido
- Subjects
medicine.diagnostic_test ,Computer science ,Cognitive Neuroscience ,Speech recognition ,medicine.medical_treatment ,Bayesian probability ,Electromyography ,Computer Science Applications ,Transcranial magnetic stimulation ,Noise ,medicine.anatomical_structure ,Artificial Intelligence ,Duration (music) ,medicine ,Silent period ,Hidden Markov model ,Neuroscience ,Motor cortex - Abstract
Transcranial magnetic stimulation (TMS) is a powerful tool for the calculation of parameters related to the intracortical excitability and inhibition of the motor cortex. The cortical silent period (CSP) is one such parameter that corresponds to the suppression of muscle activity for a short period after a muscle response to TMS. The duration of the CSP is known to be correlated with the prognosis of brain stroke patients' motor ability. Current methods for the estimation of the CSP duration are very sensitive to the presence of noise. A variational Bayesian formulation of a manifold-constrained hidden Markov model is applied in this paper to the segmentation of a set of multivariate time series (MTS) of electromyographic recordings corresponding to stroke patients and control subjects. A novel index of variability associated to this model is defined and applied to the detection of the silent period interval of the signal and to the estimation of its duration. This model and its associated index are shown to behave robustly in the presence of noise and provide more reliable estimations than the current standard in clinical practice.
- Published
- 2011
- Full Text
- View/download PDF
24. Multiproject–multicenter evaluation of automatic brain tumor classification by magnetic resonance spectroscopy
- Author
-
Salvador Tortajada, Geert Postma, Àngel Moreno-Torres, Ana Paula Candiota, Lutgarde M. C. Buydens, Daniel Monleon, Margarida Julià-Sapé, Jesús Pujol, Elies Fuster-Garcia, Sabine Van Huffel, Johan A. K. Suykens, Bernardo Celda, Juan M. García-Gómez, Carles Arús, Willem J. Melssen, P.W.T. Krooshof, Jan Luts, Javier Vicente Robledo, Ivan Olier, M. Carmen Martínez-Bisbal, and Montserrat Robles
- Subjects
Multicenter evaluation study ,Decision support system ,Computer science ,Biophysics ,Brain tumor ,Decision support systems ,Machine learning ,computer.software_genre ,Sensitivity and Specificity ,Brain tumors ,Health informatics ,Analytical Chemistry ,Pattern Recognition, Automated ,Artificial Intelligence ,Magnetic resonance spectroscopy ,Biomarkers, Tumor ,CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL ,medicine ,Humans ,Radiology, Nuclear Medicine and imaging ,Diagnosis, Computer-Assisted ,Radiological and Ultrasound Technology ,Brain Neoplasms ,business.industry ,Reproducibility of Results ,Pattern classification ,medicine.disease ,R1 ,Europe ,Radiology Nuclear Medicine and imaging ,FISICA APLICADA ,Artificial intelligence ,business ,computer ,Algorithms ,Research Article - Abstract
[EN] Automatic brain tumor classification by MRS has been under development for more than a decade. Nonetheless, to our knowledge, there are no published evaluations of predictive models with unseen cases that are subsequently acquired in different centers. The multicenter eTUMOUR project (2004-2009), which builds upon previous expertise from the INTERPRET project (2000-2002) has allowed such an evaluation to take place. A total of 253 pairwise classifiers for glioblastoma, meningioma, metastasis, and low-grade glial diagnosis were inferred based on 211 SV short TE INTERPRET MR spectra obtained at 1.5 T (PRESS or STEAM, 20-32 ms) and automatically pre-processed. Afterwards, the classifiers were tested with 97 spectra, which were subsequently compiled during eTUMOUR. In our results based on subsequently acquired spectra, accuracies of around 90% were achieved for most of the pairwise discrimination problems. The exception was for the glioblastoma versus metastasis discrimination, which was below 78%. A more clear definition of metastases may be obtained by other approaches, such as MRSI + MRI. The prediction of the tumor type of in-vivo MRS is possible using classifiers developed from previously acquired data, in different hospitals with different instrumentation under the same acquisition protocols. This methodology may find application for assisting in the diagnosis of new brain tumor cases and for the quality control of multicenter MRS databases., We would like to thank the INTERPRET and eTUMOUR partners for providing data, particularly, Carles Majos (IDI-Bellvitge), John Griffiths and Franklyn Howe (SGUL), Arend Heerschap (RU), Witold Gajewicz (MUL), Jorge Calvar (FLENI), and Antoni Capdevila (H. de Sant Joan de Deu). This work was partially funded by the European Commission: eTUMOUR (contract no. FP62002-LIFESCIHEALTH 503094), the HEALTHAGENTS EC project (HEALTHAGENTS) (contract no. FP6-2005-IST 027213), BIOPATTERN (contract no. FP6-2002-IST 508803). The authors appreciate the suggestions from the reviewers that have improved the discussion presented in this work. We also thank the following for their contributions: Programa de Apoyo a la Investigacion y Desarrollo, PAID-00-06 UPV; Research Council KUL: GOA-AMBioRICS, Centers-of-excellence optimisation; Belgian Federal Government: DWTC, IUAPV P6/04 (DYSCO 2007-2011); the following participants acknowledge the following: JVR acknowledges to Programa Torres Quevedo from Ministerio de Educacion y Ciencia, co-founded by the European Social Fund (PTQ05-02-03386). JL is a PhD student supported by an IWT grant. DM is supported by the Ministerio de Educacion y Ciencia del Gobierno de Espa a for a Ramon y Cajal 2006 Contract. BC and CA gratefully acknowledge the Ministerio de Educacion y Ciencia del Gobierno de Espa a (BC: SAF2004-06297 and SAF2007-6547; CA: SAF2005-03650). CIBER-BBN is an initiative of the "Instituto de Salud Carlos III" (ISCIII), Spain
- Published
- 2009
- Full Text
- View/download PDF
25. Variational Bayesian Generative Topographic Mapping
- Author
-
Alfredo Vellido, Ivan Olier, Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics, and Universitat Politècnica de Catalunya. SOCO - Soft Computing
- Subjects
Multivariate statistics ,Generalization ,Visualització de la informació ,Bayesian probability ,Overfitting ,Regularization (mathematics) ,Clustering ,symbols.namesake ,Data visualization ,Information visualization ,Variational methods ,Cluster analysis ,Gaussian process ,Data minig ,Mathematics ,business.industry ,Applied Mathematics ,Pattern recognition ,Adaptive regularization ,Modeling and Simulation ,symbols ,Informàtica::Intel·ligència artificial [Àrees temàtiques de la UPC] ,Mineria de dades ,Artificial intelligence ,business ,Generative topographic mapping - Abstract
General finite mixture models are powerful tools for the density-based grouping of multivariate i.i.d. data, but they lack data visualization capabilities, which reduces their practical applicability to real-world problems. Generative topographic mapping (GTM) was originally formulated as a constrained mixture of distributions in order to provide simultaneous visualization and clustering of multivariate data. In its inception, the adaptive parameters were determined by maximum likelihood (ML), using the expectation-maximization (EM) algorithm. The original GTM is, therefore, prone to data overfitting unless a regularization mechanism is included. In this paper, we define an alternative variational formulation of GTM that provides a full Bayesian treatment to a Gaussian process (GP)-based variation of the model. The generalization capabilities of the proposed Variational Bayesian GTM are assessed in some detail and compared with those of alternative GTM regularization approaches in terms of test log-likelihood, using several artificial and real datasets.
- Published
- 2008
- Full Text
- View/download PDF
26. Inequalities in physical comorbidity: a longitudinal comparative cohort study of people with severe mental illness in the UK
- Author
-
Tim Doran, Darren M. Ashcroft, Ivan Olier, Evangelos Kontopantelis, Linda Gask, David Reeves, Siobhan Reilly, and Claire Planner
- Subjects
Gerontology ,Adult ,Male ,medicine.medical_specialty ,Databases, Factual ,Population ,education ,Prevalence ,Comorbidity ,Q1 ,Severity of Illness Index ,Hypothyroidism ,Neoplasms ,severe mental illness ,Epidemiology ,Severity of illness ,mental disorders ,medicine ,Humans ,EPIDEMIOLOGY ,Longitudinal Studies ,Renal Insufficiency, Chronic ,Aged ,Retrospective Studies ,education.field_of_study ,Primary Health Care ,business.industry ,Mental Disorders ,Research ,fungi ,Retrospective cohort study ,General Medicine ,Health Status Disparities ,Middle Aged ,medicine.disease ,Mental health ,United Kingdom ,Mental Health ,Female ,business ,physical health ,Demography ,Cohort study - Abstract
Objectives Little is known about the prevalence of comorbidity rates in people with severe mental illness (SMI) in UK primary care. We calculated the prevalence of SMI by UK country, English region and deprivation quintile, antipsychotic and antidepressant medication prescription rates for people with SMI, and prevalence rates of common comorbidities in people with SMI compared with people without SMI.\ud \ud Design Retrospective cohort study from 2000 to 2012.\ud \ud Setting 627 general practices contributing to the Clinical Practice Research Datalink, a UK primary care database.\ud \ud Participants Each identified case (346 551) was matched for age, sex and general practice with 5 randomly selected control cases (1 732 755) with no diagnosis of SMI in each yearly time point.\ud \ud Outcome measures Prevalence rates were calculated for 16 conditions.\ud \ud Results SMI rates were highest in Scotland and in more deprived areas. Rates increased in England, Wales and Northern Ireland over time, with the largest increase in Northern Ireland (0.48% in 2000/2001 to 0.69% in 2011/2012). Annual prevalence rates of all conditions were higher in people with SMI compared with those without SMI. The discrepancy between the prevalence of those with and without SMI increased over time for most conditions. A greater increase in the mean number of additional conditions was observed in the SMI population over the study period (0.6 in 2000/2001 to 1.0 in 2011/2012) compared with those without SMI (0.5 in 2000/2001 to 0.6 in 2011/2012). For both groups, most conditions were more prevalent in more deprived areas, whereas for the SMI group conditions such as hypothyroidism, chronic kidney disease and cancer were more prevalent in more affluent areas.\ud \ud Conclusions Our findings highlight the health inequalities faced by people with SMI. The provision of appropriate timely health prevention, promotion and monitoring activities to reduce these health inequalities are needed, especially in deprived areas.
- Published
- 2015
27. Semi-supervised source extraction methodology for the nosological imaging of glioblastoma response to therapy
- Author
-
Margarida Julià-Sapé, Paulo J. G. Lisboa, Sandra Ortega-Martorell, Ivan Olier, Carles Arús, Magdalena Ciezka, and Teresa Delgado-Goñi
- Subjects
QA75 ,Oncology ,medicine.medical_specialty ,Medical treatment ,Response to therapy ,business.industry ,medicine.disease ,Tumor response ,Response to treatment ,RC0254 ,Internal medicine ,medicine ,business ,Glioblastoma - Abstract
Glioblastomas are one the most aggressive brain tumors. Their usual bad prognosis is due to the heterogeneity of their response to treatment and the lack of early and robust biomarkers to decide whether the tumor is responding to therapy. In this work, we propose the use of a semi-supervised methodology for source extraction to identify the sources representing tumor response to therapy, untreated/unresponsive tumor, and normal brain; and create nosological images of the response to therapy based on those sources. Fourteen mice were used to calculate the sources, and an independent test set of eight mice was used to further evaluate the proposed approach. The preliminary results obtained indicate that was possible to discriminate response and untreated/unresponsive areas of the tumor, and that the color-coded images allowed convenient tracking of response, especially throughout the course of therapy.
- Published
- 2014
- Full Text
- View/download PDF
28. A switching multi-scale dynamical network model of EEG/MEG
- Author
-
Nelson J. Trujillo-Barreto, Wael El-Deredy, and Ivan Olier
- Subjects
Mathematical optimization ,Cognitive Neuroscience ,Gaussian ,Models, Neurological ,Action Potentials ,Upper and lower bounds ,Pattern Recognition, Automated ,symbols.namesake ,Bayes' theorem ,Humans ,Computer Simulation ,Statistical physics ,Cluster analysis ,Mathematics ,Neurons ,Brain Mapping ,Models, Statistical ,Brain ,Signal Processing, Computer-Assisted ,Inverse problem ,Generative model ,Neurology ,symbols ,Piecewise ,A priori and a posteriori ,Nerve Net ,Algorithms - Abstract
We introduce a new generative model of the Encephalography (EEG/MEG) data, the inversion of which allows for inferring the locations and temporal evolution of the underlying sources as well as their dynamical interactions. The proposed Switching Mesostate Space Model (SMSM) builds on the multi-scale generative model for EEG/MEG by Daunizeau and Friston (2007). SMSM inherits the assumptions that (1) bioelectromagnetic activity is generated by a set of distributed sources, (2) the dynamics of these sources can be modelled as random fluctuations about a small number of mesostates, and (3) the number of mesostates engaged by a cognitive task is small. Additionally, four generalising assumptions are now included: (4) the mesostates interact according to a full Dynamical Causal Network (DCN) that can be estimated; (5) the dynamics of the mesostates can switch between multiple approximately linear operating regimes; (6) each operating regime remains stable over finite periods of time (temporal clusters); and (7) the total number of times the mesostates' dynamics can switch is small. The proposed model adds, therefore, a level of flexibility by accommodating complex brain processes that cannot be characterised by purely linear and stationary Gaussian dynamics. Importantly, the SMSM furnishes a new interpretation of the EEG/MEG data in which the source activity may have multiple discrete modes of behaviour, each with approximately linear dynamics. This is modelled by assuming that the connection strengths of the underlying mesoscopic DCN are time-dependent but piecewise constant, i.e. they can undergo discrete changes over time. A Variational Bayes inversion scheme is derived to estimate all the parameters of the model by maximising a (Negative Free Energy) lower bound on the model evidence. This bound is used to select among different model choices that are defined by the number of mesostates as well as by the number of stationary linear regimes. The full model is compared to a simplified version that uses no dynamical assumptions as well as to a standard EEG inversion technique. The comparison is carried out using an extensive set of simulations, and the application of SMSM to a real data set is also demonstrated. Our results show that for experimental situations in which we have some a priori belief that there are multiple approximately linear dynamical regimes, the proposed SMSM provides a natural modelling tool.
- Published
- 2012
29. A variational formulation for GTM through time
- Author
-
Alfredo Vellido, Ivan Olier, Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics, and Universitat Politècnica de Catalunya. SOCO - Soft Computing
- Subjects
Multivariate statistics ,Time series ,Computer science ,Maximum likelihood ,Visualització de la informació ,Bayesian statistical decision theory ,Overfitting ,Machine learning ,computer.software_genre ,Data modeling ,symbols.namesake ,Data visualization ,Information visualization ,Hidden Markov models ,Cluster analysis ,Latent variable model ,Hidden Markov model ,Gaussian process ,Variational techniques ,business.industry ,Bayes methods ,Manifold ,Estadística bayesiana ,symbols ,Informàtica::Intel·ligència artificial [Àrees temàtiques de la UPC] ,Artificial intelligence ,business ,computer ,Data visualisation - Abstract
Generative Topographic Mapping (GTM) is a latent variable model that, in its original version, was conceived to provide clustering and visualization of multivariate, realvalued, i.i.d. data. It was also extended to deal with noni-i.i.d. data such as multivariate time series in a variant called GTM Through Time (GTM-TT), defined as a constrained Hidden Markov Model (HMM). In this paper, we provide the theoretical foundations of the reformulation of GTM-TT within the Variational Bayesian framework and provide an illustrative example of its application. This approach handles the presence of noise in the time series, helping to avert the problem of data overfitting.
- Published
- 2008
- Full Text
- View/download PDF
30. Advances in clustering and visualization of time series using GTM through time
- Author
-
Ivan Olier, Alfredo Vellido, Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics, and Universitat Politècnica de Catalunya. SOCO - Soft Computing
- Subjects
Multivariate statistics ,Time Factors ,Computer science ,Cognitive Neuroscience ,Change point detection ,Machine learning ,computer.software_genre ,Models, Biological ,Clustering ,Artificial Intelligence ,Cluster Analysis ,Humans ,Relevance (information retrieval) ,Cluster analysis ,Multivariate time series ,Interpretability ,Visualization ,Data minig ,Multidimensional analysis ,Series (mathematics) ,business.industry ,Unsupervised relevance determination ,Visualització ,ComputingMethodologies_PATTERNRECOGNITION ,Nonlinear Dynamics ,Data Interpretation, Statistical ,Unsupervised learning ,Informàtica::Intel·ligència artificial [Àrees temàtiques de la UPC] ,Artificial intelligence ,Mineria de dades ,business ,computer ,Generative topographic mapping ,Change detection - Abstract
Most of the existing research on multivariate time series concerns supervised forecasting problems. In comparison, little research has been devoted to their exploration through unsupervised clustering and visualization. In this paper, the capabilities of Generative Topographic Mapping Through Time, a model with foundations in probability theory, that performs simultaneous time series clustering and visualization, are assessed in detail. Focus is placed on the visualization of the evolution of signal regimes and the exploration of sudden transitions, for which a novel identification index is defined. The interpretability of time series clustering results may become extremely difficult, even in exploratory visualization, for high dimensional datasets. Here, we define and test an unsupervised time series relevance determination method, fully integrated in the Generative Topographic Mapping Through Time model, that can be used as a basis for time series selection. This method should ease the interpretation of time series clustering results.
- Published
- 2006
31. Capturing the Dynamics of Multivariate Time Series Through Visualization Using Generative Topographic Mapping Through Time
- Author
-
Alfredo Vellido, Ivan Olier, Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics, and Universitat Politècnica de Catalunya. SOCO - Soft Computing
- Subjects
Multivariate statistics ,Series (mathematics) ,Data visualization ,business.industry ,Computer science ,Pattern recognition ,Multivariate time series analysis ,Clustering ,Visualization ,Identification (information) ,Informàtica::Intel·ligència artificial [Àrees temàtiques de la UPC] ,Artificial intelligence ,Time series ,business ,Focus (optics) ,Cluster analysis ,Generative topographic mapping ,Topology-constrained hidden Markov models - Abstract
Most of the existing research on time series concerns supervised forecasting problems. In comparison, little research has been devoted to unsupervised methods for the visual exploration of multivariate time series. In this paper, the capabilities of the Generative Topographic Mapping Through Time, a model with solid foundations in probability theory that performs simultaneous time series data clustering and visualization, are assessed in detail in several experiments. The focus is placed on the detection of atypical data, the visualization of the evolution of signal regimes, and the exploration of sudden transitions, for which a novel identification index is defined.
- Published
- 2006
- Full Text
- View/download PDF
32. Time Series Relevance Determination Through a Topology-Constrained Hidden Markov Model
- Author
-
Ivan Olier and Alfredo Vellido
- Subjects
Series (mathematics) ,business.industry ,Computer science ,Feature selection ,Machine learning ,computer.software_genre ,Visualization ,ComputingMethodologies_PATTERNRECOGNITION ,Data visualization ,Artificial intelligence ,Time series ,Cluster analysis ,business ,Hidden Markov model ,computer ,Interpretability - Abstract
Most of the existing research on multivariate time series concerns supervised forecasting problems. In comparison, little research has been devoted to unsupervised methods for the visual exploration of this type of data. The interpretability of time series clustering results may be difficult, even in exploratory visualization, for high dimensional datasets. In this paper, we define and test an unsupervised time series relevance determination method for Generative Topographic Mapping Through Time, a topology-constrained Hidden Markov Model that performs simultaneous time series data clustering and visualization. This relevance determination method can be used as a basis for time series selection, and should ease the interpretation of the time series clustering results.
- Published
- 2006
- Full Text
- View/download PDF
33. Comparative Assessment of the Robustness of Missing Data Imputation Through Generative Topographic Mapping
- Author
-
Ivan Olier and Alfredo Vellido
- Subjects
Multivariate statistics ,Computer science ,Density estimation ,computer.software_genre ,Missing data ,Information extraction ,Robustness (computer science) ,Data_GENERAL ,Generative topographic mapping ,Statistics::Methodology ,Data mining ,Imputation (statistics) ,Latent variable model ,computer - Abstract
The incompleteness of data is a most common source of uncertainty in real-world Data Mining applications. The management of this uncertainty is, therefore, a task of paramount importance for the data analyst. Many methods have been developed for missing data imputation, but few of them approach the problem of imputation as part of a general data density estimation scheme. Amongst the latter, a method for imputing and visualizing multivariate missing data using Generative Topographic Mapping was recently presented. This model and some of its extensions are tested under different experimental conditions. Its performance is compared to that of other missing data imputation techniques, thus assessing its relative capabilities and limitations.
- Published
- 2005
- Full Text
- View/download PDF
34. ClinicalCodes: An Online Clinical Codes Repository to Improve the Validity and Reproducibility of Research Using Electronic Medical Records
- Author
-
Rosa Parisi, Ivan Olier, David A. Springate, David Reeves, Edmore Chamapiwa, Darren M. Ashcroft, and Evangelos Kontopantelis
- Subjects
Research Report ,Disease informatics ,Research Validity ,Databases, Factual ,Epidemiology ,Download ,lcsh:Medicine ,030204 cardiovascular system & hematology ,Bioinformatics ,Disease Informatics ,Database and Informatics Methods ,Upload ,0302 clinical medicine ,Medicine and Health Sciences ,Electronic Health Records ,Medicine ,030212 general & internal medicine ,lcsh:Science ,Multidisciplinary ,Medical record ,RC660 ,Publications ,Research Assessment ,Reproducibility ,3. Good health ,Research Design ,Observational Studies ,Epidemiological Methods and Statistics ,The Internet ,Research Article ,Primary research ,Health Informatics ,Research and Analysis Methods ,Open Access ,03 medical and health sciences ,Humans ,Internet ,Information retrieval ,Primary Health Care ,business.industry ,Research ,Pharmacoepidemiology ,lcsh:R ,Clinical Coding ,Reproducibility of Results ,R1 ,Metadata ,lcsh:Q ,business ,Publication Practices ,Coding (social sciences) - Abstract
Lists of clinical codes are the foundation for research undertaken using electronic medical records (EMRs). If clinical code lists are not available, reviewers are unable to determine the validity of research, full study replication is impossible, researchers are unable to make effective comparisons between studies, and the construction of new code lists is subject to much duplication of effort. Despite this, the publication of clinical codes is rarely if ever a requirement for obtaining grants, validating protocols, or publishing research. In a representative sample of 450 EMR primary research articles indexed on PubMed, we found that only 19 (5.1%) were accompanied by a full set of published clinical codes and 32 (8.6%) stated that code lists were available on request. To help address these problems, we have built an online repository where researchers using EMRs can upload and download lists of clinical codes. The repository will enable clinical researchers to better validate EMR studies, build on previous code lists and compare disease definitions across studies. It will also assist health informaticians in replicating database studies, tracking changes in disease definitions or clinical coding practice through time and sharing clinical code information across platforms and data sources as research objects.
- Published
- 2014
- Full Text
- View/download PDF
35. A Bayesian model of EEG/MEG source dynamics and effective connectivity
- Author
-
Wael El-Deredy, Nelson J. Trujillo-Barreto, and Ivan Olier
- Subjects
Neuropsychology and Physiological Psychology ,business.industry ,Dynamics (music) ,Computer science ,Physiology (medical) ,General Neuroscience ,Pattern recognition ,Artificial intelligence ,business ,Bayesian inference - Published
- 2012
- Full Text
- View/download PDF
36. Complementing kernel-based visualization of protein sequences with their phylogenetic tree
- Author
-
Jesús Giraldo, Xavier Rovira, Alfredo Vellido, Martha Ivón Cárdenas, Ivan Olier, Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics, and Universitat Politècnica de Catalunya. SOCO - Soft Computing
- Subjects
Genetics ,Informàtica::Aplicacions de la informàtica::Bioinformàtica [Àrees temàtiques de la UPC] ,Artificial intelligence ,Phylogenetic tree ,business.industry ,Bioinformatics ,Intel·ligència artificial ,education ,Nonlinear dimensionality reduction ,Genomics ,Pattern recognition systems ,Computational biology ,Biology ,Visualization ,Data visualization ,Protein sequencing ,Kernel method ,Bioinformàtica ,Kernel (statistics) ,Reconeixement de formes (Informàtica) ,business - Abstract
The world of pharmacology is becoming increasingly dependent on the advances in the fields of genomics and proteomics. This dependency brings about the challenge of finding robust methods to analyze the complex data they generate. In this brief paper, we focus on the analysis of a specific type of proteins, the G protein-couple receptors, which are the target for over 15% of current drugs. We describe a kernel method of the manifold learning family for the analysis and intuitive visualization of their protein amino acid symbolic sequences. This method is shown to reveal the grouping structure of the sequences in a way that closely resembles the corresponding phylogenetic trees.
37. Primary care consultation rates among people with and without severe mental illness: a UK cohort study using the Clinical Practice Research Database
- Author
-
Evangelos Kontopantelis, Claire Planner, Ivan Olier, David Reeves, Darren Ashcroft, Doran, S., and Reilly, S.
38. Probability ridges and distortion flows: Visualizing multivariate time series using a variational Bayesian manifold learning method
- Author
-
Ivan Olier, Alessandra Tosi, Alfredo Vellido, Universitat Politècnica de Catalunya. Departament de Ciències de la Computació, and Universitat Politècnica de Catalunya. SOCO - Soft Computing
- Subjects
Nonlinear dimensionality reduction ,Bayesian probability ,Visualització de la informació ,Bayesian statistical decision theory ,Machine learning ,computer.software_genre ,Variational Bayesian methods ,Data visualization ,Information visualization ,Magnification factors ,Hidden Markov model ,Multivariate time series ,Interpretability ,Mathematics ,Visualization ,business.industry ,Probabilistic logic ,Mapping distortion ,Estadística bayesiana ,Informàtica::Intel·ligència artificial [Àrees temàtiques de la UPC] ,Artificial intelligence ,business ,computer ,Algorithm ,Generative topographic mapping - Abstract
Time-dependent natural phenomena and artificial processes can often be quantitatively expressed as multivariate time series (MTS). As in any other process of knowledge extraction from data, the analyst can benefit from the exploration of the characteristics of MTS through data visualization. This visualization often becomes difficult to interpret when MTS are modelled using nonlinear techniques. Despite their flexibility, nonlinear models can be rendered useless if such interpretability is lacking. In this brief paper, we model MTS using Variational Bayesian Generative Topographic Mapping Through Time (VB-GTM-TT), a variational Bayesian variant of a constrained hidden Markov model of the manifold learning family defined for MTS visualization. We aim to increase its interpretability by taking advantage of two results of the probabilistic definition of the model: the explicit estimation of probabilities of transition between states described in the visualization space and the quantification of the nonlinear mapping distortion.
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.