Descriptor: "biostatistic" / Publisher: bmc - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"biostatistic"' showing total 2 results

Start Over Descriptor "biostatistic" Publisher bmc

2 results on '"biostatistic"'

1. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

Author: Davide Chicco, Giuseppe Jurman, Chicco, D, and Jurman, G
Subjects: Dataset imbalance, lcsh:QH426-470, lcsh:Biotechnology, Biostatistic, Confusion matrice, Binary number, 02 engineering and technology, Biology, Biostatistics, Machine learning, computer.software_genre, Measure (mathematics), 03 medical and health sciences, Confusion matrices, F1 score, lcsh:TP248.13-248.65, 0202 electrical engineering, electronic engineering, information engineering, Genetics, False positive paradox, score, Use case, Correlation of Data, Binary classification, Accuracy, 030304 developmental biology, 0303 health sciences, business.industry, INF/01 - INFORMATICA, Confusion matrix, Computational Biology, Genomics, Matthews correlation coefficient, lcsh:Genetics, Data Interpretation, Statistical, Genomic, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Algorithms, Biotechnology, Research Article
Abstract: Background To evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, accordingly to the goal of the experiment they are investigating. Despite being a crucial issue in machine learning, no widespread consensus has been reached on a unified elective chosen measure yet. Accuracy and F1 score computed on confusion matrices have been (and still are) among the most popular adopted metrics in binary classification tasks. However, these statistical measures can dangerously show overoptimistic inflated results, especially on imbalanced datasets. Results The Matthews correlation coefficient (MCC), instead, is a more reliable statistical rate which produces a high score only if the prediction obtained good results in all of the four confusion matrix categories (true positives, false negatives, true negatives, and false positives), proportionally both to the size of positive elements and the size of negative elements in the dataset. Conclusions In this article, we show how MCC produces a more informative and truthful score in evaluating binary classifications than accuracy and F1 score, by first explaining the mathematical properties, and then the asset of MCC in six synthetic use cases and in a real genomics scenario. We believe that the Matthews correlation coefficient should be preferred to accuracy and F1 score in evaluating binary classification tasks by all scientific communities.
Published: 2020

2. Graphical comparisons of relative disease burden across multiple risk factors

Author: Salim Yusuf, Fabrizio Maturo, John Ferguson, Neil O'Leary, Martin O'Donnell, Ferguson, John, O’Leary, Neil, Maturo, Fabrizio, Yusuf, Salim, and O’Donnell, Martin
Subjects: Male, Alcohol Drinking, Epidemiology, Population, Prevalence, Health Informatics, Logistic regression, 01 natural sciences, 010104 statistics & probability, 03 medical and health sciences, 0302 clinical medicine, Risk Factors, Statistics, Odds Ratio, Medicine, Humans, graphical statistics, 030212 general & internal medicine, 0101 mathematics, Risk factor, education, Disease burden, Apolipoproteins B, disease, education.field_of_study, lcsh:R5-920, Apolipoprotein A-I, business.industry, Smoking, PAF, Odds ratio, Models, Theoretical, Stroke, Logistic Models, Relative risk, Case-Control Studies, Attributable risk, Hypertension, Female, biostatistic, business, lcsh:Medicine (General), Algorithms, Research Article
Abstract: Background Population attributable fractions (PAF) measure the proportion of disease prevalence that would be avoided in a hypothetical population, similar to the population of interest, but where a particular risk factor is eliminated. They are extensively used in epidemiology to quantify and compare disease burden due to various risk factors, and directly influence public policy regarding possible health interventions. In contrast to individual specific metrics such as relative risks and odds ratios, attributable fractions depend jointly on both risk factor prevalence and relative risk. The relative contributions of these two components is important, and usually needs to be presented in summary tables that are presented together with the attributable fraction calculation. However, representing PAF in an accessible graphical format, that captures both prevalence and relative risk, may assist interpretation. Methods Taylor-series approximations to PAF in terms of risk factor prevalence and log-odds ratio are derived that facilitate simultaneous representation of PAF, risk factor prevalence and risk-factor/disease log-odds ratios on a single co-ordinate axis. Methods are developed for binary, multi-category and continuous exposure variables. Results The methods are demonstrated using INTERSTROKE, a large international case control dataset focused on risk factors for stroke. Conclusions The described methods could be used as a complement to tables summarizing prevalence, odds ratios and PAF, and may convey the same information in a more intuitive and visually appealing manner. The suggested nomogram can also be used to visually estimate the effects of health interventions which only partially reduce risk factor prevalence. Finally, in the binary risk factor case, the approximations can also be used to quickly convert logistic regression coefficients for a risk factor into approximate PAFs.
Published: 2019

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"biostatistic"'

1. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

2. Graphical comparisons of relative disease burden across multiple risk factors

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

2 results on '"biostatistic"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources