144 results on '"Tiulpin A"'
Search Results
2. Kellgren-Lawrence grading of knee osteoarthritis using deep learning: Diagnostic performance with external dataset and comparison with four readers
- Author
-
Elias Vaattovaara, Egor Panfilov, Aleksei Tiulpin, Tuukka Niinimäki, Jaakko Niinimäki, Simo Saarakkala, and Mika T. Nevalainen
- Subjects
Osteoarthritis ,Deep learning ,Kellgren-Lawrence ,Diseases of the musculoskeletal system ,RC925-935 - Abstract
Objective: To evaluate the performance of a deep learning (DL) model in an external dataset to assess radiographic knee osteoarthritis using Kellgren-Lawrence (KL) grades against versatile human readers. Materials and methods: Two-hundred-eight knee anteroposterior conventional radiographs (CRs) were included in this retrospective study. Four readers (three radiologists, one orthopedic surgeon) assessed the KL grades and consensus grade was derived as the mean of these. The DL model was trained using all the CRs from Multicenter Osteoarthritis Study (MOST) and validated on Osteoarthritis Initiative (OAI) dataset and then tested on our external dataset. To assess the agreement between the graders, Cohen's quadratic kappa (k) with 95 % confidence intervals were used. Diagnostic performance was measured using confusion matrices and receiver operating characteristic (ROC) analyses. Results: The multiclass (KL grades from 0 to 4) diagnostic performance of the DL model was multifaceted: sensitivities were between 0.372 and 1.000, specificities 0.691–0.974, PPVs 0.227–0.879, NPVs 0.622–1.000, and AUCs 0.786–0.983. The overall balanced accuracy was 0.693, AUC 0.886, and kappa 0.820. If only dichotomous KL grading (i.e. KL0-1 vs. KL2-4) was utilized, superior metrics were seen with an overall balanced accuracy of 0.902 and AUC of 0.967. A substantial agreement between each reader and DL model was found: the inter-rater agreement was 0.737 [0.685–0.790] for the radiology resident, 0.761 [0.707–0.816] for the musculoskeletal radiology fellow, 0.802 [0.761–0.843] for the senior musculoskeletal radiologist, and 0.818 [0.775–0.860] for the orthopedic surgeon. Conclusion: In an external dataset, our DL model can grade knee osteoarthritis with diagnostic accuracy comparable to highly experienced human readers.
- Published
- 2025
- Full Text
- View/download PDF
3. Metrics reloaded: recommendations for image analysis validation
- Author
-
Maier-Hein, Lena, Reinke, Annika, Godau, Patrick, Tizabi, Minu D., Buettner, Florian, Christodoulou, Evangelia, Glocker, Ben, Isensee, Fabian, Kleesiek, Jens, Kozubek, Michal, Reyes, Mauricio, Riegler, Michael A., Wiesenfarth, Manuel, Kavur, A. Emre, Sudre, Carole H., Baumgartner, Michael, Eisenmann, Matthias, Heckmann-Nötzel, Doreen, Rädsch, Tim, Acion, Laura, Antonelli, Michela, Arbel, Tal, Bakas, Spyridon, Benis, Arriel, Blaschko, Matthew B., Cardoso, M. Jorge, Cheplygina, Veronika, Cimini, Beth A., Collins, Gary S., Farahani, Keyvan, Ferrer, Luciana, Galdran, Adrian, van Ginneken, Bram, Haase, Robert, Hashimoto, Daniel A., Hoffman, Michael M., Huisman, Merel, Jannin, Pierre, Kahn, Charles E., Kainmueller, Dagmar, Kainz, Bernhard, Karargyris, Alexandros, Karthikesalingam, Alan, Kofler, Florian, Kopp-Schneider, Annette, Kreshuk, Anna, Kurc, Tahsin, Landman, Bennett A., Litjens, Geert, Madani, Amin, Maier-Hein, Klaus, Martel, Anne L., Mattson, Peter, Meijering, Erik, Menze, Bjoern, Moons, Karel G. M., Müller, Henning, Nichyporuk, Brennan, Nickel, Felix, Petersen, Jens, Rajpoot, Nasir, Rieke, Nicola, Saez-Rodriguez, Julio, Sánchez, Clara I., Shetty, Shravya, van Smeden, Maarten, Summers, Ronald M., Taha, Abdel A., Tiulpin, Aleksei, Tsaftaris, Sotirios A., Van Calster, Ben, Varoquaux, Gaël, and Jäger, Paul F.
- Published
- 2024
- Full Text
- View/download PDF
4. Understanding metric-related pitfalls in image analysis validation
- Author
-
Reinke, Annika, Tizabi, Minu D., Baumgartner, Michael, Eisenmann, Matthias, Heckmann-Nötzel, Doreen, Kavur, A. Emre, Rädsch, Tim, Sudre, Carole H., Acion, Laura, Antonelli, Michela, Arbel, Tal, Bakas, Spyridon, Benis, Arriel, Buettner, Florian, Cardoso, M. Jorge, Cheplygina, Veronika, Chen, Jianxu, Christodoulou, Evangelia, Cimini, Beth A., Farahani, Keyvan, Ferrer, Luciana, Galdran, Adrian, van Ginneken, Bram, Glocker, Ben, Godau, Patrick, Hashimoto, Daniel A., Hoffman, Michael M., Huisman, Merel, Isensee, Fabian, Jannin, Pierre, Kahn, Charles E., Kainmueller, Dagmar, Kainz, Bernhard, Karargyris, Alexandros, Kleesiek, Jens, Kofler, Florian, Kooi, Thijs, Kopp-Schneider, Annette, Kozubek, Michal, Kreshuk, Anna, Kurc, Tahsin, Landman, Bennett A., Litjens, Geert, Madani, Amin, Maier-Hein, Klaus, Martel, Anne L., Meijering, Erik, Menze, Bjoern, Moons, Karel G. M., Müller, Henning, Nichyporuk, Brennan, Nickel, Felix, Petersen, Jens, Rafelski, Susanne M., Rajpoot, Nasir, Reyes, Mauricio, Riegler, Michael A., Rieke, Nicola, Saez-Rodriguez, Julio, Sánchez, Clara I., Shetty, Shravya, Summers, Ronald M., Taha, Abdel A., Tiulpin, Aleksei, Tsaftaris, Sotirios A., Van Calster, Ben, Varoquaux, Gaël, Yaniv, Ziv R., Jäger, Paul F., and Maier-Hein, Lena
- Published
- 2024
- Full Text
- View/download PDF
5. Quantification of Upper Limb Movements in Patients with Hereditary or Idiopathic Ataxia
- Author
-
Lipponen, Joonas, Tiulpin, Aleksei, Majamaa, Kari, and Rusanen, Harri
- Published
- 2023
- Full Text
- View/download PDF
6. The KNee OsteoArthritis Prediction (KNOAP2020) challenge: An image analysis challenge to predict incident symptomatic radiographic knee osteoarthritis from MRI and X-ray images
- Author
-
Hirvasniemi, J., Runhaar, J., van der Heijden, R.A., Zokaeinikoo, M., Yang, M., Li, X., Tan, J., Rajamohan, H.R., Zhou, Y., Deniz, C.M., Caliva, F., Iriondo, C., Lee, J.J., Liu, F., Martinez, A.M., Namiri, N., Pedoia, V., Panfilov, E., Bayramoglu, N., Nguyen, H.H., Nieminen, M.T., Saarakkala, S., Tiulpin, A., Lin, E., Li, A., Li, V., Dam, E.B., Chaudhari, A.S., Kijowski, R., Bierma-Zeinstra, S., Oei, E.H.G., and Klein, S.
- Published
- 2023
- Full Text
- View/download PDF
7. Predicting total knee arthroplasty from ultrasonography using machine learning
- Author
-
Tiulpin, Aleksei, Saarakkala, Simo, Mathiessen, Alexander, Hammer, Hilde Berner, Furnes, Ove, Nordsletten, Lars, Englund, Martin, and Magnusson, Karin
- Published
- 2022
- Full Text
- View/download PDF
8. Outcome and biomarker supervised deep learning for survival prediction in two multicenter breast cancer series
- Author
-
Bychkov, Dmitrii, Joensuu, Heikki, Nordling, Stig, Tiulpin, Aleksei, Kücükel, Hakan, Lundin, Mikael, Sihto, Harri, Isola, Jorma, Lehtimäki, Tiina, Kellokumpu-Lehtinen, Pirkko-Liisa, von Smitten, Karl, Lundin, Johan, and Linder, Nina
- Published
- 2022
- Full Text
- View/download PDF
9. Acoustic emissions and kinematic instability of the osteoarthritic knee joint: comparison with radiographic findings
- Author
-
Mika T. Nevalainen, Olli Veikkola, Jerome Thevenot, Aleksei Tiulpin, Jukka Hirvasniemi, Jaakko Niinimäki, and Simo S. Saarakkala
- Subjects
Medicine ,Science - Abstract
Abstract To evaluate the acoustic emissions (AE) and kinematic instability (KI) of the osteoarthritic (OA) knee joints, and to compare these signals to radiographic findings. Sixty-six female and 43 male participants aged 44–67 were recruited. On radiography, joint-space narrowing, osteophytes and Kellgren–Lawrence (KL) grade were evaluated. Based on radiography, 54 subjects (the study group) were diagnosed with radiographic OA (KL-grade ≥ 2) while the remaining 55 subjects (KL-grade
- Published
- 2021
- Full Text
- View/download PDF
10. Automating three-dimensional osteoarthritis histopathological grading of human osteochondral tissue using machine learning on contrast-enhanced micro-computed tomography
- Author
-
Rytky, S.J.O., Tiulpin, A., Frondelius, T., Finnilä, M.A.J., Karhula, S.S., Leino, J., Pritzker, K.P.H., Valkealahti, M., Lehenkari, P., Joukainen, A., Kröger, H., Nieminen, H.J., and Saarakkala, S.
- Published
- 2020
- Full Text
- View/download PDF
11. Development of osteoarthritis in patients with degenerative meniscal tears treated with exercise therapy or surgery: a randomized controlled trial
- Author
-
Berg, B., Roos, E.M., Englund, M., Kise, N.J., Tiulpin, A., Saarakkala, S., Engebretsen, L., Eftang, C.N., Holm, I., and Risberg, M.A.
- Published
- 2020
- Full Text
- View/download PDF
12. Adaptive segmentation of knee radiographs for selecting the optimal ROI in texture analysis
- Author
-
Bayramoglu, N., Tiulpin, A., Hirvasniemi, J., Nieminen, M.T., and Saarakkala, S.
- Published
- 2020
- Full Text
- View/download PDF
13. Critical evaluation of deep neural networks for wrist fracture detection
- Author
-
Abu Mohammed Raisuddin, Elias Vaattovaara, Mika Nevalainen, Marko Nikki, Elina Järvenpää, Kaisa Makkonen, Pekka Pinola, Tuula Palsio, Arttu Niemensivu, Osmo Tervonen, and Aleksei Tiulpin
- Subjects
Medicine ,Science - Abstract
Abstract Wrist Fracture is the most common type of fracture with a high incidence rate. Conventional radiography (i.e. X-ray imaging) is used for wrist fracture detection routinely, but occasionally fracture delineation poses issues and an additional confirmation by computed tomography (CT) is needed for diagnosis. Recent advances in the field of Deep Learning (DL), a subfield of Artificial Intelligence (AI), have shown that wrist fracture detection can be automated using Convolutional Neural Networks. However, previous studies did not pay close attention to the difficult cases which can only be confirmed via CT imaging. In this study, we have developed and analyzed a state-of-the-art DL-based pipeline for wrist (distal radius) fracture detection—DeepWrist, and evaluated it against one general population test set, and one challenging test set comprising only cases requiring confirmation by CT. Our results reveal that a typical state-of-the-art approach, such as DeepWrist, while having a near-perfect performance on the general independent test set, has a substantially lower performance on the challenging test set—average precision of 0.99 (0.99–0.99) versus 0.64 (0.46–0.83), respectively. Similarly, the area under the ROC curve was of 0.99 (0.98–0.99) versus 0.84 (0.72–0.93), respectively. Our findings highlight the importance of a meticulous analysis of DL-based models before clinical use, and unearth the need for more challenging settings for testing medical AI systems.
- Published
- 2021
- Full Text
- View/download PDF
14. Deep learning identifies morphological features in breast cancer predictive of cancer ERBB2 status and trastuzumab treatment efficacy
- Author
-
Dmitrii Bychkov, Nina Linder, Aleksei Tiulpin, Hakan Kücükel, Mikael Lundin, Stig Nordling, Harri Sihto, Jorma Isola, Tiina Lehtimäki, Pirkko-Liisa Kellokumpu-Lehtinen, Karl von Smitten, Heikki Joensuu, and Johan Lundin
- Subjects
Medicine ,Science - Abstract
Abstract The treatment of patients with ERBB2 (HER2)-positive breast cancer with anti-ERBB2 therapy is based on the detection of ERBB2 gene amplification or protein overexpression. Machine learning (ML) algorithms can predict the amplification of ERBB2 based on tumor morphological features, but it is not known whether ML-derived features can predict survival and efficacy of anti-ERBB2 treatment. In this study, we trained a deep learning model with digital images of hematoxylin–eosin (H&E)-stained formalin-fixed primary breast tumor tissue sections, weakly supervised by ERBB2 gene amplification status. The gene amplification was determined by chromogenic in situ hybridization (CISH). The training data comprised digitized tissue microarray (TMA) samples from 1,047 patients. The correlation between the deep learning–predicted ERBB2 status, which we call H&E-ERBB2 score, and distant disease-free survival (DDFS) was investigated on a fully independent test set, which included whole-slide tumor images from 712 patients with trastuzumab treatment status available. The area under the receiver operating characteristic curve (AUC) in predicting gene amplification in the test sets was 0.70 (95% CI, 0.63–0.77) on 354 TMA samples and 0.67 (95% CI, 0.62–0.71) on 712 whole-slide images. Among patients with ERBB2-positive cancer treated with trastuzumab, those with a higher than the median morphology–based H&E-ERBB2 score derived from machine learning had more favorable DDFS than those with a lower score (hazard ratio [HR] 0.37; 95% CI, 0.15–0.93; P = 0.034). A high H&E-ERBB2 score was associated with unfavorable survival in patients with ERBB2-negative cancer as determined by CISH. ERBB2-associated morphology correlated with the efficacy of adjuvant anti-ERBB2 treatment and can contribute to treatment-predictive information in breast cancer.
- Published
- 2021
- Full Text
- View/download PDF
15. Gray Matter Age Prediction as a Biomarker for Risk of Dementia
- Author
-
Wang, Johnny, Knol, Maria J., Tiulpin, Aleksei, Dubost, Florian, de Bruijne, Marleen, Vernooij, Meike W., Adams, Hieab H. H., Ikram, M. Arfan, Niessen, Wiro J., and Roshchupkin, Gennady V.
- Published
- 2019
16. Semiautomatic Assessment of Facet Tropism From Lumbar Spine MRI Using Deep Learning: A Northern Finland Birth Cohort Study.
- Author
-
Kowlagi, Narasimharao, Kemppainen, Antti, Panfilov, Egor, McSweeney, Terence, Saarakkala, Simo, Nevalainen, Mika, Niinimäki, Jaakko, Karppinen, Jaro, and Tiulpin, Aleksei
- Published
- 2024
- Full Text
- View/download PDF
17. Automatic Grading of Individual Knee Osteoarthritis Features in Plain Radiographs Using Deep Convolutional Neural Networks
- Author
-
Aleksei Tiulpin and Simo Saarakkala
- Subjects
multi-task learning ,deep learning ,transfer learning ,knee osteoarthritis ,OARSI grading atlas ,Medicine (General) ,R5-920 - Abstract
Knee osteoarthritis (OA) is the most common musculoskeletal disease in the world. In primary healthcare, knee OA is diagnosed using clinical examination and radiographic assessment. Osteoarthritis Research Society International (OARSI) atlas of OA radiographic features allows performing independent assessment of knee osteophytes, joint space narrowing and other knee features. This provides a fine-grained OA severity assessment of the knee, compared to the gold standard and most commonly used Kellgren–Lawrence (KL) composite score. In this study, we developed an automatic method to predict KL and OARSI grades from knee radiographs. Our method is based on Deep Learning and leverages an ensemble of residual networks with 50 layers. We used transfer learning from ImageNet with a fine-tuning on the Osteoarthritis Initiative (OAI) dataset. An independent testing of our model was performed on the Multicenter Osteoarthritis Study (MOST) dataset. Our method yielded Cohen’s kappa coefficients of 0.82 for KL-grade and 0.79, 0.84, 0.94, 0.83, 0.84 and 0.90 for femoral osteophytes, tibial osteophytes and joint space narrowing for lateral and medial compartments, respectively. Furthermore, our method yielded area under the ROC curve of 0.98 and average precision of 0.98 for detecting the presence of radiographic OA, which is better than the current state-of-the-art.
- Published
- 2020
- Full Text
- View/download PDF
18. Understanding metric-related pitfalls in image analysis validation
- Author
-
Reinke, Annika, Tizabi, Minu D., Baumgartner, Michael, Eisenmann, Matthias, Heckmann-Nötzel, Doreen, Kavur, A. Emre, Rädsch, Tim, Sudre, Carole H., Acion, Laura, Antonelli, Michela, Arbel, Tal, Bakas, Spyridon, Benis, Arriel, Blaschko, Matthew, Büttner, Florian, Cardoso, M. Jorge, Cheplygina, Veronika, Chen, Jianxu, Christodoulou, Evangelia, Cimini, Beth A., Collins, Gary S., Farahani, Keyvan, Ferrer, Luciana, Galdran, Adrian, van Ginneken, Bram, Glocker, Ben, Godau, Patrick, Haase, Robert, Hashimoto, Daniel A., Hoffman, Michael M., Huisman, Merel, Isensee, Fabian, Jannin, Pierre, Kahn, Charles E., Kainmueller, Dagmar, Kainz, Bernhard, Karargyris, Alexandros, Karthikesalingam, Alan, Kenngott, Hannes, Kleesiek, Jens, Kofler, Florian, Kooi, Thijs, Kopp-Schneider, Annette, Kozubek, Michal, Kreshuk, Anna, Kurc, Tahsin, Landman, Bennett A., Litjens, Geert, Madani, Amin, Maier-Hein, Klaus, Martel, Anne L., Mattson, Peter, Meijering, Erik, Menze, Bjoern, Moons, Karel G. M., Müller, Henning, Nichyporuk, Brennan, Nickel, Felix, Petersen, Jens, Rafelski, Susanne M., Rajpoot, Nasir, Reyes, Mauricio, Riegler, Michael A., Rieke, Nicola, Saez-Rodriguez, Julio, Sánchez, Clara I., Shetty, Shravya, van Smeden, Maarten, Summers, Ronald M., Taha, Abdel A., Tiulpin, Aleksei, Tsaftaris, Sotirios A., Van Calster, Ben, Varoquaux, Gaël, Wiesenfarth, Manuel, Yaniv, Ziv R., Jäger, Paul F., and Maier-Hein, Lena
- Subjects
FOS: Computer and information sciences ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibility of metric-related knowledge: While taking into account the individual strengths, weaknesses, and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multi-stage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides the first reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Focusing on biomedical image analysis but with the potential of transfer to other fields, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. To facilitate comprehension, illustrations and specific examples accompany each pitfall. As a structured body of information accessible to researchers of all levels of expertise, this work enhances global comprehension of a key topic in image analysis validation.
- Published
- 2023
19. A Stronger Baseline For Automatic Pfirrmann Grading Of Lumbar Spine MRI Using Deep Learning
- Author
-
Kowlagi, Narasimharao, Nguyen, Huy Hoang, McSweeney, Terence, Saarakkala, Simo, määttä, Juhani, Karppinen, Jaro, and Tiulpin, Aleksei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
This paper addresses the challenge of grading visual features in lumbar spine MRI using Deep Learning. Such a method is essential for the automatic quantification of structural changes in the spine, which is valuable for understanding low back pain. Multiple recent studies investigated different architecture designs, and the most recent success has been attributed to the use of transformer architectures. In this work, we argue that with a well-tuned three-stage pipeline comprising semantic segmentation, localization, and classification, convolutional networks outperform the state-of-the-art approaches. We conducted an ablation study of the existing methods in a population cohort, and report performance generalization across various subgroups. Our code is publicly available to advance research on disc degeneration and low back pain., Comment: 5 pages, under review
- Published
- 2022
20. On confidence intervals for precision matrices and the eigendecomposition of covariance matrices
- Author
-
Popordanoska, Teodora, Tiulpin, Aleksei, Bounliphone, Wacha, and Blaschko, Matthew B.
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,FOS: Mathematics ,Mathematics - Statistics Theory ,Statistics Theory (math.ST) ,Machine Learning (cs.LG) - Abstract
The eigendecomposition of a matrix is the central procedure in probabilistic models based on matrix factorization, for instance principal component analysis and topic models. Quantifying the uncertainty of such a decomposition based on a finite sample estimate is essential to reasoning under uncertainty when employing such models. This paper tackles the challenge of computing confidence bounds on the individual entries of eigenvectors of a covariance matrix of fixed dimension. Moreover, we derive a method to bound the entries of the inverse covariance matrix, the so-called precision matrix. The assumptions behind our method are minimal and require that the covariance matrix exists, and its empirical estimator converges to the true covariance. We make use of the theory of U-statistics to bound the $L_2$ perturbation of the empirical covariance matrix. From this result, we obtain bounds on the eigenvectors using Weyl's theorem and the eigenvalue-eigenvector identity and we derive confidence intervals on the entries of the precision matrix using matrix inversion perturbation bounds. As an application of these results, we demonstrate a new statistical test, which allows us to test for non-zero values of the precision matrix. We compare this test to the well-known Fisher-z test for partial correlations, and demonstrate the soundness and scalability of the proposed statistical test, as well as its application to real-world data from medical and physics domains., arXiv admin note: text overlap with arXiv:1604.01733
- Published
- 2022
21. Metrics reloaded: Recommendations for image analysis validation
- Author
-
Maier-Hein, Lena, Reinke, Annika, Godau, Patrick, Tizabi, Minu D., Büttner, Florian, Christodoulou, Evangelia, Glocker, Ben, Isensee, Fabian, Kleesiek, Jens, Kozubek, Michal, Reyes, Mauricio, Riegler, Michael A., Wiesenfarth, Manuel, Kavur, A. Emre, Sudre, Carole H., Baumgartner, Michael, Eisenmann, Matthias, Heckmann-Nötzel, Doreen, Rädsch, A. Tim, Acion, Laura, Antonelli, Michela, Arbel, Tal, Bakas, Spyridon, Benis, Arriel, Blaschko, Matthew, Cardoso, M. Jorge, Cheplygina, Veronika, Cimini, Beth A., Collins, Gary S., Farahani, Keyvan, Ferrer, Luciana, Galdran, Adrian, van Ginneken, Bram, Haase, Robert, Hashimoto, Daniel A., Hoffman, Michael M., Huisman, Merel, Jannin, Pierre, Kahn, Charles E., Kainmueller, Dagmar, Kainz, Bernhard, Karargyris, Alexandros, Karthikesalingam, Alan, Kenngott, Hannes, Kofler, Florian, Kopp-Schneider, Annette, Kreshuk, Anna, Kurc, Tahsin, Landman, Bennett A., Litjens, Geert, Madani, Amin, Maier-Hein, Klaus, Martel, Anne L., Mattson, Peter, Meijering, Erik, Menze, Bjoern, Moons, Karel G. M., Müller, Henning, Nichyporuk, Brennan, Nickel, Felix, Petersen, Jens, Rajpoot, Nasir, Rieke, Nicola, Saez-Rodriguez, Julio, Sánchez, Clara I., Shetty, Shravya, van Smeden, Maarten, Summers, Ronald M., Taha, Abdel A., Tiulpin, Aleksei, Tsaftaris, Sotirios A., Van Calster, Ben, Varoquaux, Gaël, and Jäger, Paul F.
- Subjects
FOS: Computer and information sciences ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international expert consortium created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. The framework was developed in a multi-stage Delphi process and is based on the novel concept of a problem fingerprint - a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), data set and algorithm output. Based on the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as a classification task at image, object or pixel level, namely image-level classification, object detection, semantic segmentation, and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool, which also provides a point of access to explore weaknesses, strengths and specific recommendations for the most common validation metrics. The broad applicability of our framework across domains is demonstrated by an instantiation for various biological and medical image analysis use cases., Shared first authors: Lena Maier-Hein, Annika Reinke
- Published
- 2022
22. AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching
- Author
-
Nguyen, Khanh, Nguyen, Huy Hoang, and Tiulpin, Aleksei
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Vision and Pattern Recognition (cs.CV) ,Image and Video Processing (eess.IV) ,FOS: Electrical engineering, electronic engineering, information engineering ,Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing ,Machine Learning (cs.LG) - Abstract
This paper tackles the challenge of forensic medical image matching (FMIM) using deep neural networks (DNNs). FMIM is a particular case of content-based image retrieval (CBIR). The main challenge in FMIM compared to the general case of CBIR, is that the subject to whom a query image belongs may be affected by aging and progressive degenerative disorders, making it difficult to match data on a subject level. CBIR with DNNs is generally solved by minimizing a ranking loss, such as Triplet loss (TL), computed on image representations extracted by a DNN from the original data. TL, in particular, operates on triplets: anchor, positive (similar to anchor) and negative (dissimilar to anchor). Although TL has been shown to perform well in many CBIR tasks, it still has limitations, which we identify and analyze in this work. In this paper, we introduce (i) the AdaTriplet loss -- an extension of TL whose gradients adapt to different difficulty levels of negative samples, and (ii) the AutoMargin method -- a technique to adjust hyperparameters of margin-based losses such as TL and our proposed loss dynamically. Our results are evaluated on two large-scale benchmarks for FMIM based on the Osteoarthritis Initiative and Chest X-ray-14 datasets. The codes allowing replication of this study have been made publicly available at \url{https://github.com/Oulu-IMEDS/AdaTriplet}., 15 pages, 6 figures, accepted as a conference paper at MICCAI 2022
- Published
- 2022
23. Deep semi-supervised active learning for knee osteoarthritis severity grading
- Author
-
Raisuddin, A. M. (Abu Mohammed), Nguyen, H. H. (Huy Hoang), and Tiulpin, A. (Aleksei)
- Subjects
Knee Osteoarthritis ,Consistency Regularization ,Epistemic Uncertainty ,Deep Active Learning ,Monte-Carlo Dropout - Abstract
This paper tackles the problem of developing active learning (AL) methods in the context of knee osteoarthritis (OA) diagnosis from X-ray images. OA is known to be a huge burden for society, and its associated costs are constantly rising. Automatic diagnostic methods can potentially reduce these costs, and Deep Learning (DL) methodology may be its key enabler. To date, there have been numerous studies on knee OA severity grading using DL, and all but one of them assume a large annotated dataset available for model development. In contrast, our study shows one can develop a knee OA severity grading model using AL from as little as 50 samples randomly chosen from a pool of unlabeled data. The main insight of this work is that the performance of AL improves when the model developer leverages the consistency regularization technique, commonly applied in semi-supervised learning.
- Published
- 2022
24. CLIMAT:clinically-inspired multi-agent transformers for knee osteoarthritis trajectory forecasting
- Author
-
Nguyen, H. H. (Huy Hoang), Saarakkala, S. (Simo), Blaschko, M. B. (Matthew B.), and Tiulpin, A. (Aleksei)
- Subjects
osteoarthritis ,transformer ,deep learning ,prognosis ,trajectory forecasting - Abstract
In medical applications, deep learning methods are designed to automate diagnostic tasks. However, a clinically relevant question that practitioners usually face, is how to predict the future trajectory of a disease (prognosis). Current methods for such a problem often require domain knowledge, and are complicated to apply. In this paper, we formulate the prognosis prediction problem as a one-to-many forecasting problem from multimodal data. Inspired by a clinical decision-making process with two agents — a radiologist and a general practitioner, we model a prognosis prediction problem with two transformer-based components that share information between each other. The first block in this model aims to analyze the imaging data, and the second block leverages the internal representations of the first one as inputs, also fusing them with auxiliary patient data. We show the effectiveness of our method in predicting the development of structural knee osteoarthritis changes over time. Our results show that the proposed method outperforms the state-of-the-art baselines in terms of various performance metrics. In addition, we empirically show that the existence of the multi-agent transformers with depths of 2 is sufficient to achieve good performances. Our code is publicly available at https://github.com/MIPT-Oulu/CLIMAT.
- Published
- 2022
25. Predicting total knee arthroplasty from ultrasonography using machine learning
- Author
-
Aleksei Tiulpin, Simo Saarakkala, Alexander Mathiessen, Hilde Berner Hammer, Ove Furnes, Lars Nordsletten, Martin Englund, and Karin Magnusson
- Subjects
Total knee replacement ,Multivariate predictive modeling ,Machine learning ,General Medicine ,Ultrasonography - Abstract
Objective: To investigate the value of ultrasonographic data in predicting total knee replacement (TKR). Design: Data from the Musculoskeletal Pain in Ullensaker study (MUST) was linked to the Norwegian Arthroplasty Register to form a 5–7 year prospective cohort study of 630 persons (69% women, mean (SD) age 64 (8.7) years). We examined the predictive power of ultrasound (US) features, i.e. osteophytes, meniscal extrusion, synovitis in the suprapatellar recess, femoral cartilage thickness, and quality for future knee osteoarthritis (OA) surgery. We investigated 4 main settings for multivariate predictive modeling: 1) clinical predictors (age, sex, body mass index, knee injury, familial OA and workload), 2) radiographic data (assessed by the Kellgren Lawrence grade, KL) with clinical predictors, 3) US features and clinical predictors. Finally, we also considered an ensemble of models 2) and 3) and used it as our fifth model. All models were compared using the Average Precision (AP) and the Area Under Receiver Operating Characteristic Curve (AUC) metrics. Results: Clinical predictors yielded AP of 0.11 (95% confidence interval [CI] 0.05–0.23) and AUC of 0.69 (0.58–0.79). Clinical predictors with KL grade yielded AP of 0.20 (0.12–0.33) and AUC of 0.81 (0.67–0.90). The clinical variables with ultrasound yielded AP of 0.17 (0.08–0.30) and AUC of 0.79 (0.69–0.86). Conclusions: Ultrasonographic examination of the knee may provide added value to basic clinical and demographic descriptors when predicting TKR. While it does not achieve the same predictive performance as radiography, it can provide additional value to the radiographic examination.
- Published
- 2022
26. Automatic Segmentation Algorithms Improve the Efficiency and Inter-Observer Agreement of Gross Target Volumes Contours
- Author
-
Czeizler, Elena, Pajula, Juha, Pölönen, Harri, Antila, Kari, Lehtio, Kaisa, Lehto, Joonas, Tiulpin, Aleksei, Maslowski, Alexander, Hakala, Mikko, Kuusela, Esa, and Bush, K.
- Published
- 2022
27. Predicting knee osteoarthritis progression from structural MRI using deep learning
- Author
-
Panfilov, Egor, Saarakkala, Simo, Nieminen, Miika T., and Tiulpin, Aleksei
- Subjects
FOS: Computer and information sciences ,Knee Osteoarthritis ,Transformer ,End-to-End ,Computer Vision and Pattern Recognition (cs.CV) ,Image and Video Processing (eess.IV) ,FOS: Electrical engineering, electronic engineering, information engineering ,Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing ,Progression Predic- tion ,MRI - Abstract
Accurate prediction of knee osteoarthritis (KOA) progression from structural MRI has a potential to enhance disease understanding and support clinical trials. Prior art focused on manually designed imaging biomarkers, which may not fully exploit all disease-related information present in MRI scan. In contrast, our method learns relevant representations from raw data end-to-end using Deep Learning, and uses them for progression prediction. The method employs a 2D CNN to process the data slice-wise and aggregate the extracted features using a Transformer. Evaluated on a large cohort (n=4,866), the proposed method outperforms conventional 2D and 3D CNN-based models and achieves average precision of $0.58\pm0.03$ and ROC AUC of $0.78\pm0.01$. This paper sets a baseline on end-to-end KOA progression prediction from structural MRI. Our code is publicly available at https://github.com/MIPT-Oulu/OAProgressionMR., $\copyright$ 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
- Published
- 2022
28. External Validation of SpineNet, an Open-Source Deep Learning Model for Grading Lumbar Disk Degeneration MRI Features, Using the Northern Finland Birth Cohort 1966.
- Author
-
McSweeney, Terence P., Tiulpin, Aleksei, Saarakkala, Simo, Niinimäki, Jaakko, Windsor, Rhydian, Jamaludin, Amir, Kadir, Timor, Karppinen, Jaro, and Määttä, Juhani
- Subjects
- *
DEEP learning , *LUMBAR pain , *COHORT analysis , *IMAGE recognition (Computer vision) , *MAGNETIC resonance imaging - Abstract
Study Design.: This is a retrospective observational study to externally validate a deep learning image classification model. Objective.: Deep learning models such as SpineNet offer the possibility of automating the process of disk degeneration (DD) classification from magnetic resonance imaging (MRI). External validation is an essential step to their development. The aim of this study was to externally validate SpineNet predictions for DD using Pfirrmann classification and Modic changes (MCs) on data from the Northern Finland Birth Cohort 1966 (NFBC1966). Summary of Data.: We validated SpineNet using data from 1331 NFBC1966 participants for whom both lumbar spine MRI data and consensus DD gradings were available. Materials and Methods.: SpineNet returned Pfirrmann grade and MC presence from T2-weighted sagittal lumbar MRI sequences from NFBC1966, a data set geographically and temporally separated from its training data set. A range of agreement and reliability metrics were used to compare predictions with expert radiologists. Subsets of data that match SpineNet training data more closely were also tested. Results.: Balanced accuracy for DD was 78% (77%–79%) and for MC 86% (85%–86%). Interrater reliability for Pfirrmann grading was Lin concordance correlation coefficient=0.86 (0.85–0.87) and Cohen κ=0.68 (0.67–0.69). In a low back pain subset, these reliability metrics remained largely unchanged. In total, 20.83% of disks were rated differently by SpineNet compared with the human raters, but only 0.85% of disks had a grade difference >1. Interrater reliability for MC detection was κ=0.74 (0.72–0.75). In the low back pain subset, this metric was almost unchanged at κ=0.76 (0.73–0.79). Conclusions.: In this study, SpineNet has been benchmarked against expert human raters in the research setting. It has matched human reliability and demonstrates robust performance despite the multiple challenges facing model generalizability. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. 497 - Automatic grading of radiographic knee chondrocalcinosis using a convolutional neural network
- Author
-
Yu, Jing, Tang, Jinchi, WU, Yahong, Kang, Wenjie, Boer, Justin, Voortman, Trudy, Agricola, Rintje, Tiulpin, Aleksei, Oei, Edwin, Bierma-Zeinstra, Sita, Roshchupkin, Gennady, Hirvasniemi, Jukka, Meurs, Joyce v., and Boer, Cindy
- Published
- 2024
- Full Text
- View/download PDF
30. 096 - MEASURING LUMBAR FACET TROPISM USING T2-WEIGHTED MRI – A DEEP LEARNING APPROACH
- Author
-
Kowlagi, Narasimharao, Kemppainen, Antti, Panfilov, Egor, McSweeney, Terence, Saarakkala, Simo, Nevalainen, Mika, Niinimäki, Jaakko, Karppinen, Jaro, and Tiulpin, Aleksei
- Published
- 2024
- Full Text
- View/download PDF
31. 005 - Patient-level Active Surveillance of Knee Osteoarthritis with Reinforcement Learning
- Author
-
Nguyen, Khanh, Nguyen, Huy Hoang, Panfilov, Egor, and Tiulpin, Aleksei
- Published
- 2024
- Full Text
- View/download PDF
32. Targeted Active Learning for Bayesian Decision-Making
- Author
-
Filstroff, Louis, Sundin, Iiris, Mikkola, Petrus, Tiulpin, Aleksei, Kylm��oja, Juuso, and Kaski, Samuel
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Artificial Intelligence (cs.AI) ,Statistics - Machine Learning ,Computer Science - Artificial Intelligence ,Machine Learning (stat.ML) ,Machine Learning (cs.LG) - Abstract
Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the accuracy is not the end goal when the results are used for decision-making, for example in personalized medicine or economics. We argue that when acquiring samples sequentially, separating learning and decision-making is sub-optimal, and we introduce an active learning strategy which takes the down-the-line decision problem into account. Specifically, we introduce a novel active learning criterion which maximizes the expected information gain on the posterior distribution of the optimal decision. We compare our targeted active learning strategy to existing alternatives on both simulated and real data, and show improved performance in decision-making accuracy.
- Published
- 2021
33. Deep learning identifies morphological features in breast cancer predictive of cancer ERBB2 status and trastuzumab treatment efficacy
- Author
-
Bychkov, D. (Dmitrii), Linder, N. (Nina), Tiulpin, A. (Aleksei), Kücükel, H. (Hakan), Lundin, M. (Mikael), Nordling, S. (Stig), Sihto, H. (Harri), Isola, J. (Jorma), Lehtimäki, T. (Tiina), Kellokumpu-Lehtinen, P.-L. (Pirkko-Liisa), von Smitten, K. (Karl), Joensuu, H. (Heikki), Lundin, J. (Johan), Helsinki Institute for Information Technology, Institute for Molecular Medicine Finland, University of Helsinki, Digital Precision Cancer Medicine (iCAN), Medicum, Johan Edvard Lundin / Principal Investigator, Department of Pathology, Department of Diagnostics and Therapeutics, Clinicum, Department of Surgery, Research Programs Unit, Heikki Joensuu / Principal Investigator, HUS Comprehensive Cancer Center, Department of Oncology, Helsinki University Hospital Area, HUS Medical Imaging Center, Tampere University, BioMediTech, and Clinical Medicine
- Subjects
Adult ,Receptor, ErbB-2 ,Science ,3122 Cancers ,Breast Neoplasms ,Biomarkers, Pharmacological ,Disease-Free Survival ,Article ,Cohort Studies ,Tumour biomarkers ,Deep Learning ,Breast cancer ,Humans ,skin and connective tissue diseases ,neoplasms ,Finland ,In Situ Hybridization ,Proportional Hazards Models ,Cancer och onkologi ,Gene Amplification ,Middle Aged ,Trastuzumab ,Prognosis ,Treatment Outcome ,ROC Curve ,Cancer and Oncology ,Medicine ,Female ,Biomarkers - Abstract
The treatment of patients with ERBB2 (HER2)-positive breast cancer with anti-ERBB2 therapy is based on the detection of ERBB2 gene amplification or protein overexpression. Machine learning (ML) algorithms can predict the amplification of ERBB2 based on tumor morphological features, but it is not known whether ML-derived features can predict survival and efficacy of anti-ERBB2 treatment. In this study, we trained a deep learning model with digital images of hematoxylin–eosin (H&E)-stained formalin-fixed primary breast tumor tissue sections, weakly supervised by ERBB2 gene amplification status. The gene amplification was determined by chromogenic in situ hybridization (CISH). The training data comprised digitized tissue microarray (TMA) samples from 1,047 patients. The correlation between the deep learning–predicted ERBB2 status, which we call H&E-ERBB2 score, and distant disease-free survival (DDFS) was investigated on a fully independent test set, which included whole-slide tumor images from 712 patients with trastuzumab treatment status available. The area under the receiver operating characteristic curve (AUC) in predicting gene amplification in the test sets was 0.70 (95% CI, 0.63–0.77) on 354 TMA samples and 0.67 (95% CI, 0.62–0.71) on 712 whole-slide images. Among patients with ERBB2-positive cancer treated with trastuzumab, those with a higher than the median morphology–based H&E-ERBB2 score derived from machine learning had more favorable DDFS than those with a lower score (hazard ratio [HR] 0.37; 95% CI, 0.15–0.93; P = 0.034). A high H&E-ERBB2 score was associated with unfavorable survival in patients with ERBB2-negative cancer as determined by CISH. ERBB2-associated morphology correlated with the efficacy of adjuvant anti-ERBB2 treatment and can contribute to treatment-predictive information in breast cancer. publishedVersion
- Published
- 2021
34. Critical evaluation of deep neural networks for wrist fracture detection
- Author
-
Raisuddin, Abu Mohammed, Vaattovaara, Elias, Nevalainen, Mika, Nikki, Marko, Järvenpää, Elina, Makkonen, Kaisa, Pinola, Pekka, Palsio, Tuula, Niemensivu, Arttu, Tervonen, Osmo, and Tiulpin, Aleksei
- Subjects
FOS: Computer and information sciences ,Male ,Computer Science - Machine Learning ,Databases, Factual ,Computer Vision and Pattern Recognition (cs.CV) ,Science ,Computer Science - Computer Vision and Pattern Recognition ,Quantitative Biology - Quantitative Methods ,Article ,Machine Learning (cs.LG) ,Fractures, Bone ,Machine learning ,FOS: Electrical engineering, electronic engineering, information engineering ,Humans ,Quantitative Methods (q-bio.QM) ,Retrospective Studies ,Image and Video Processing (eess.IV) ,Electrical Engineering and Systems Science - Image and Video Processing ,Wrist ,Wrist Injuries ,Radiography ,FOS: Biological sciences ,Medicine ,Female ,Neural Networks, Computer ,Tomography, X-Ray Computed - Abstract
Wrist Fracture is the most common type of fracture with a high incidence rate. Conventional radiography (i.e. X-ray imaging) is used for wrist fracture detection routinely, but occasionally fracture delineation poses issues and an additional confirmation by computed tomography (CT) is needed for diagnosis. Recent advances in the field of Deep Learning (DL), a subfield of Artificial Intelligence (AI), have shown that wrist fracture detection can be automated using Convolutional Neural Networks. However, previous studies did not pay close attention to the difficult cases which can only be confirmed via CT imaging. In this study, we have developed and analyzed a state-of-the-art DL-based pipeline for wrist (distal radius) fracture detection—DeepWrist, and evaluated it against one general population test set, and one challenging test set comprising only cases requiring confirmation by CT. Our results reveal that a typical state-of-the-art approach, such as DeepWrist, while having a near-perfect performance on the general independent test set, has a substantially lower performance on the challenging test set—average precision of 0.99 (0.99–0.99) versus 0.64 (0.46–0.83), respectively. Similarly, the area under the ROC curve was of 0.99 (0.98–0.99) versus 0.84 (0.72–0.93), respectively. Our findings highlight the importance of a meticulous analysis of DL-based models before clinical use, and unearth the need for more challenging settings for testing medical AI systems.
- Published
- 2021
35. Acoustic emissions and kinematic instability of the osteoarthritic knee joint:comparison with radiographic findings
- Author
-
Nevalainen, M. T. (Mika T.), Veikkola, O. (Olli), Thevenot, J. (Jerome), Tiulpin, A. (Aleksei), Hirvasniemi, J. (Jukka), Niinimäki, J. (Jaakko), and Saarakkala, S. S. (Simo S.)
- Subjects
Musculoskeletal system ,Osteoarthritis ,Diagnostic markers ,Imaging techniques ,Predictive markers ,Biomedical engineering - Abstract
To evaluate the acoustic emissions (AE) and kinematic instability (KI) of the osteoarthritic (OA) knee joints, and to compare these signals to radiographic findings. Sixty-six female and 43 male participants aged 44–67 were recruited. On radiography, joint-space narrowing, osteophytes and Kellgren–Lawrence (KL) grade were evaluated. Based on radiography, 54 subjects (the study group) were diagnosed with radiographic OA (KL-grade ≥ 2) while the remaining 55 subjects (KL-grade
- Published
- 2021
36. Knee osteoarthritis development five years following arthroscopic partial meniscectomy or exercise therapy for degenerative meniscal tears: the odense-oslo meniscectomy versus exercise trial
- Author
-
Inger Holm, Nina Jullum Kise, May Arna Risberg, Lars Engebretsen, Ewa M. Roos, Aleksei Tiulpin, Martin Englund, Cathrine N. Eftang, Simo Saarakkala, and Bjørnar Berg
- Subjects
medicine.medical_specialty ,Rheumatology ,business.industry ,Biomedical Engineering ,medicine ,Physical therapy ,Meniscal tears ,Orthopedics and Sports Medicine ,Exercise therapy ,Osteoarthritis ,medicine.disease ,business - Published
- 2020
- Full Text
- View/download PDF
37. Learning A Model Of Lumbar Disc Degeneration Progression From A Cross-Sectional Population Cohort
- Author
-
McSweeney, T.P., Tiulpin, A., Kowlagi, N., Karppinen, J., and Saarakkala, S.
- Published
- 2023
- Full Text
- View/download PDF
38. Deep learning‐based segmentation of knee MRI for fully automatic subregional morphological assessment of cartilage tissues: Data from the Osteoarthritis Initiative.
- Author
-
Panfilov, Egor, Tiulpin, Aleksei, Nieminen, Miika T., Saarakkala, Simo, and Casula, Victor
- Subjects
- *
CARTILAGE , *ARTICULAR cartilage , *MAGNETIC resonance imaging , *OSTEOARTHRITIS , *KNEE - Abstract
Morphological changes in knee cartilage subregions are valuable imaging‐based biomarkers for understanding progression of osteoarthritis, and they are typically detected from magnetic resonance imaging (MRI). So far, accurate segmentation of cartilage has been done manually. Deep learning approaches show high promise in automating the task; however, they lack clinically relevant evaluation. We introduce a fully automatic method for segmentation and subregional assessment of articular cartilage, and evaluate its predictive power in context of radiographic osteoarthritis progression. Two data sets of 3D double‐echo steady‐state (DESS) MRI derived from the Osteoarthritis Initiative were used: first, n = 88; second, n = 600, 0‐/12‐/24‐month visits. Our method performed deep learning‐based segmentation of knee cartilage tissues, their subregional division via multi‐atlas registration, and extraction of subregional volume and thickness. The segmentation model was developed and assessed on the first data set. Subsequently, on the second data set, the morphological measurements from our and the prior methods were analyzed in correlation and agreement, and, eventually, by their discriminative power of radiographic osteoarthritis progression over 12 and 24 months, retrospectively. The segmentation model showed very high correlation (r > 0.934) and agreement (mean difference < 116 mm3) in volumetric measurements with the reference segmentations. Comparison of our and manual segmentation methods yielded r = 0.845–0.973 and mean differences = 262–501 mm3 for weight‐bearing cartilage volume, and r = 0.770–0.962 and mean differences = 0.513–1.138 mm for subregional cartilage thickness. With regard to osteoarthritis progression, our method found most of the significant associations identified using the manual segmentation method, for both 12‐ and 24‐month subregional cartilage changes. The method may be effectively applied in osteoarthritis progression studies to extract cartilage‐related imaging biomarkers. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
39. Semixup: In- and Out-of-Manifold Regularization for Deep Semi-Supervised Knee Osteoarthritis Severity Grading from Plain Radiographs
- Author
-
Matthew B. Blaschko, Simo Saarakkala, Aleksei Tiulpin, and Huy Hoang Nguyen
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Radiography ,Computer Science - Computer Vision and Pattern Recognition ,Severity grading ,Osteoarthritis ,Machine Learning (cs.LG) ,030218 nuclear medicine & medical imaging ,03 medical and health sciences ,0302 clinical medicine ,Musculoskeletal disorder ,FOS: Electrical engineering, electronic engineering, information engineering ,medicine ,Humans ,Electrical and Electronic Engineering ,Observer Variation ,Radiological and Ultrasound Technology ,Manifold regularization ,business.industry ,Image and Video Processing (eess.IV) ,Pattern recognition ,Osteoarthritis, Knee ,Electrical Engineering and Systems Science - Image and Video Processing ,medicine.disease ,Computer Science Applications ,Plain radiographs ,Artificial intelligence ,Supervised Machine Learning ,business ,Software ,Algorithms - Abstract
Knee osteoarthritis (OA) is one of the highest disability factors in the world. This musculoskeletal disorder is assessed from clinical symptoms, and typically confirmed via radiographic assessment. This visual assessment done by a radiologist requires experience, and suffers from moderate to high inter-observer variability. The recent literature has shown that deep learning methods can reliably perform the OA severity assessment according to the gold standard Kellgren-Lawrence (KL) grading system. However, these methods require large amounts of labeled data, which are costly to obtain. In this study, we propose the Semixup algorithm, a semi-supervised learning (SSL) approach to leverage unlabeled data. Semixup relies on consistency regularization using in- and out-of-manifold samples, together with interpolated consistency. On an independent test set, our method significantly outperformed other state-of-the-art SSL methods in most cases. Finally, when compared to a well-tuned fully supervised baseline that yielded a balanced accuracy (BA) of $70.9\pm0.8%$ on the test set, Semixup had comparable performance -- BA of $71\pm0.8%$ $(p=0.368)$ while requiring $6$ times less labeled data. These results show that our proposed SSL method allows building fully automatic OA severity assessment tools with datasets that are available outside research settings., 11 main, 03 supplementary pages. The manuscript was accepted to IEEE Transactions on Medical Imaging in August 2020
- Published
- 2020
40. Deep learning for knee osteoarthritis diagnosis and progression prediction from plain radiographs and clinical data
- Author
-
Tiulpin, A. (Aleksei) and Saarakkala, S. (Simo)
- Subjects
osteoarthritis ,nivelrikko ,machine learning ,koneoppiminen ,polvet ,deep learning ,knee ,syväoppiminen ,konenäkö ,computer vision - Abstract
Osteoarthritis (OA) is the most common musculoskeletal disorder in the world, affecting hand, hip, and knee joints. At the final stage, OA leads to joint replacement, causing an immense burden at the individual and societal levels. Multiple risk factors that can lead to OA are known; however, the etiology of OA and the underlying mechanisms of OA progression are not currently known. OA is currently diagnosed by a clinical examination and, when necessary, confirmed by imaging — a radiographic evaluation. However, these conventional tools are not sensitive to detect the early stages of OA, which makes the development of preventive measures for further disease progression difficult. Therefore, there is a need for other methods that could allow for the early diagnosis of OA. As such, computer vision-based techniques provide quantitative biomarkers that allow for an automatic and systematic assessment of OA severity from images. In recent years, the rapid development of computer vision and machine learning methods have merged into a new field — deep learning (DL). DL allows for one to formulate the problems of computer vision and other fields in a machine learning fashion. In the medical field, DL has made a tremendous impact and allowed to approach for human-level decision-making accuracy in diagnostic and prognostic tasks compared with the traditional computer vision-based methods. The focus of this thesis is on the development of DL-based methods for fully automatic knee OA severity diagnosis and the prediction of its progression. Multiple new methods for localizing the region of interest, landmark localization, knee OA severity assessment, and OA progression prediction are proposed. The results exceeded the state-of-the-art or formed completely new benchmarks for the evaluation of diagnostic and predictive model performance in OA. The main conclusion is that DL yields excellent performance in the diagnostics of OA and in the prediction of its progression. All the source codes of all the developed methods and the annotations for some of the datasets have been made publicly available. Tiivistelmä Nivelrikko on maailman yleisin käden, lonkan ja polven niveliin vaikuttava liikuntaelinsairaus. Viimekädessä nivelrikko johtaa tekonivelleikkauksiin, aiheuttaen merkittävää rasitetta niin yksilö- kuin yhteiskunnallisella tasolla. Monia nivelrikolle altistavia tekijöitä on jo tunnistettu, mutta kaikkia nivelrikon syitä ja vaikutusmekanismeja nivelrikon etenemisessä ei tunneta. Nivelrikko diagnosoidaan kliinisellä tutkimuksella ja vahvistetaan/varmistetaan tarvittaessa tehtävällä kuvantamistutkimuksella — tekemällä radiografinen arviointi. Nämä perinteiset työkalut eivät kuitenkaan ole riittävän herkkiä nivelrikon varhaisten vaiheiden havaitsemiseen, ja tämä hankaloittaa sairauden kehittymistä ehkäisevien toimenpiteiden kehittämistä. Näistä syistä johtuen tarvitaan muita menetelmiä, jotka mahdollistavat nivelrikon varhaisen diagnosoinnin. Konenäkömenetelmät sellaisenaan tuottavat kvantitatiivisia biologisia indikaattoreita jotka mahdollistavat automaattisen ja järjestelmällisen nivelrikon vakavuusarvion tekemisen kuvamateriaalista. Viime vuosina konenäkö- ja koneoppimismenetelmien nopea kehitys on synnyttänyt uuden syväoppimisen haaran. Syväoppiminen mahdollistaa konenäkö- ja muiden ongelmien määrittelyn koneoppimisongelman tavoin. Verrattuna perinteisiin lääketieteessä käytettyihin tietokonenäkömenetelmiin, syväoppiminen on mahdollistanut ihmisen suorituskykyä lähestyvät toteutukset lääketieteen diagnostisissa ja prognostisissa tehtävissä ja niiden vaikutus alan kehitykselle on ollut merkittävä. Tämän väitöskirja keskittyy kehittämään syväoppimismenetelmiä täysautomaattiseen polven nivelrikon vakavuuden diagnosointiin ja taudin kehittymisen ennustamiseen. Työssä ehdotetaan/esitetään useita uusia menetelmiä kohdealueen paikallistamiseen, maamerkkien paikallistamiseen, polven nivelrikon vakavuuden arviointiin ja nivelrikon etenemisen ennustamiseen. Työn tulokset ylittävät viimeisintä tekniikkaa edustavat ratkaisut tai muodostavat täysin uuden mittarin diagnostisten ja ennustavien menetelmien suorituskyvyn evaluoinnille nivelrikon kontekstissa. Työn keskeisimpänä johtopäätöksenä esitetään, että syväoppimisella on mahdollista saavuttaa erittäin hyvä suorituskyky nivelrikon diagnosoinnissa ja sen etenemisen ennustamisessa. Kaikki työssä kehitetyt menetelmät lähdekoodeineen sekä annotoinnit osalle tutkimuksessa käytetyistä aineistoista on saatettu avoimesti saataville.
- Published
- 2020
41. Semixup:in- and out-of-manifold regularization for deep semi-supervised knee osteoarthritis severity grading from plain radiographs
- Author
-
Nguyen, H. H. (Huy Hoang), Saarakkala, S. (Simo), Blaschko, M. B. (Matthew B.), and Tiulpin, A. (Aleksei)
- Abstract
Knee osteoarthritis (OA) is one of the highest disability factors in the world. This musculoskeletal disorder is assessed from clinical symptoms, and typically confirmed via radiographic assessment. This visual assessment done by a radiologist requires experience, and suffers from moderate to high inter-observer variability. The recent literature has shown that deep learning methods can reliably perform the OA severity assessment according to the gold standard Kellgren-Lawrence (KL) grading system. However, these methods require large amounts of labeled data, which are costly to obtain. In this study, we propose the Semixup algorithm, a semi-supervised learning (SSL) approach to leverage unlabeled data. Semixup relies on consistency regularization using in- and out-of-manifold samples, together with interpolated consistency. On an independent test set, our method significantly outperformed other state-of-the-art SSL methods in most cases. Finally, when compared to a well-tuned fully supervised baseline that yielded a balanced accuracy (BA) of 70.9 ± 0.8% on the test set, Semixup had comparable performance - BA of 71 ± 0.8% (p = 0.368) while requiring 6 times less labeled data. These results show that our proposed SSL method allows building fully automatic OA severity assessment tools with datasets that are available outside research settings.
- Published
- 2020
42. Detection of experimental cartilage damage with acoustic emissions technique:an in vitro equine study
- Author
-
Shakya, B. R. (B. R.), Tiulpin, A. (A.), Saarakkala, S. (S.), Turunen, S. (S.), and Thevenot, J. (J.)
- Subjects
osteoarthritis ,extension ,flexion ,fetlock ,acoustic emission ,horse - Abstract
Background: In horses, osteoarthritis (OA) mostly affects metacarpophalangeal and metatarsophalangeal (fetlock) joints. The current modalities used for diagnosis of equine limb disorders lack ability to detect early OA. Here, we propose a new alternative approach to assess experimental cartilage damage in fetlock joint using Acoustic Emissions (AE). Objectives: To evaluate the potential of AE technique in diagnosing OA and see how AE signals changes with increasing severity of OA. Study design: An in vitro experimental study. Methods: A total of 16 distal limbs (8 forelimbs and 8 hindlimbs) from six Finn horses were collected from an abattoir and fitted in a custom‐made frame allowing fetlock joint bending. Eight fetlock joints were opened, and cartilage surface was progressively damaged mechanically three times using sandpaper to mimic mild, moderate and severe OA. The remaining eight fetlock joints were opened and closed without any mechanical procedure, serving as controls. Before cartilage alteration, synovial fluid was aspirated, mixed with phosphate‐buffered saline solution, and then reinjected before suturing for constant joint lubrication. For each simulated condition of OA severity, a force was applied to the frame and then released to mimic joint flexion and extension. AE signals were acquired using air microphones. Results: A strong association was found between the joint condition and the power of AE signals analysed in 1.5–6 kHz range. The signal from both forelimb and hindlimb joints followed a similar pattern for increased cartilage damage. There were statistically significant differences between each joint condition progressively (generalised linear mixed model, P
- Published
- 2020
43. Automating Three-dimensional Osteoarthritis Histopathological Grading of Human Osteochondral Tissue using Machine Learning on Contrast-Enhanced Micro-Computed Tomography
- Author
-
Santeri J. O. Rytky, Janina Leino, Petri Lehenkari, Heikki Kröger, T. Frondelius, Maarit Valkealahti, Kenneth P.H. Pritzker, Antti Joukainen, Mikko A. J. Finnilä, Heikki J. Nieminen, Sakari S. Karhula, Aleksei Tiulpin, Simo Saarakkala, University of Oulu, University of Toronto, University of Eastern Finland, Department of Neuroscience and Biomedical Engineering, Aalto-yliopisto, and Aalto University
- Subjects
Cartilage, Articular ,0301 basic medicine ,Contrast-enhanced micro-computed tomography ,Mean squared error ,Histopathological grading ,Total knee arthroplasty ,3D histopathological grading ,Biomedical Engineering ,Contrast Media ,Osteoarthritis ,Machine learning ,computer.software_genre ,Severity of Illness Index ,030218 nuclear medicine & medical imaging ,Machine Learning ,03 medical and health sciences ,Imaging, Three-Dimensional ,0302 clinical medicine ,Rheumatology ,Linear regression ,medicine ,Humans ,Orthopedics and Sports Medicine ,Femur ,Arthroplasty, Replacement, Knee ,Grading (tumors) ,Mathematics ,030203 arthritis & rheumatology ,Tibia ,Receiver operating characteristic ,business.industry ,Micro computed tomography ,X-Ray Microtomography ,Osteoarthritis, Knee ,medicine.disease ,Confidence interval ,Textural analysis ,Cartilage ,030104 developmental biology ,Test set ,Tomography ,Artificial intelligence ,business ,computer - Abstract
ObjectiveTo develop and validate a machine learning (ML) approach for automatic three-dimensional (3D) histopathological grading of osteochondral samples imaged with contrast-enhanced micro-computed tomography (CEμCT).DesignOsteochondral cores from 24 total knee arthroplasty patients and 2 asymptomatic cadavers (n = 34, Ø = 2 mm; n = 45, Ø = 4 mm) were imaged using CEμCT with phosphotungstic acid-staining. Volumes-of-interest (VOI) in surface (SZ), deep (DZ) and calcified (CZ) zones were extracted depthwise and subjected to dimensionally reduced Local Binary Pattern-textural feature analysis. Regularized Ridge and Logistic regression (LR) models were trained zone-wise against the manually assessed semi-quantitative histopathological CEμCT grades (Ø = 2 mm samples). Models were validated using nested leave-one-out cross-validation and an independent test set (Ø = 4 mm samples). The performance was assessed using Spearman’s correlation, Average Precision (AP) and Area under the Receiver Operating Characteristic Curve (AUC).ResultsHighest performance on cross-validation was observed for SZ, both on Ridge regression (ρ = 0.68, p < 0.0001) and LR (AP = 0.89, AUC = 0.92). The test set evaluations yielded decreased Spearman’s correlations on all zones. For LR, performance was almost similar in SZ (AP = 0.89, AUC = 0.86), decreased in CZ (AP = 0.71→0.62, AUC = 0.77→0.63) and increased in DZ (AP = 0.50→0.83, AUC = 0.72→0.72).ConclusionWe showed that the ML-based automatic 3D histopathological grading of osteochondral samples is feasible from CEμCT. The developed method can be directly applied by OA researchers since the grading software and all source codes are publicly available.
- Published
- 2019
- Full Text
- View/download PDF
44. Acoustic emissions and kinematic instability of the osteoarthritic knee joint: comparison with radiographic findings.
- Author
-
Nevalainen, Mika T., Veikkola, Olli, Thevenot, Jerome, Tiulpin, Aleksei, Hirvasniemi, Jukka, Niinimäki, Jaakko, and Saarakkala, Simo S.
- Subjects
KNEE osteoarthritis ,ACOUSTIC emission ,KNEE ,LOGISTIC regression analysis ,PREDICTION models ,REGRESSION analysis ,RADIOGRAPHY ,JOINT instability - Abstract
To evaluate the acoustic emissions (AE) and kinematic instability (KI) of the osteoarthritic (OA) knee joints, and to compare these signals to radiographic findings. Sixty-six female and 43 male participants aged 44–67 were recruited. On radiography, joint-space narrowing, osteophytes and Kellgren–Lawrence (KL) grade were evaluated. Based on radiography, 54 subjects (the study group) were diagnosed with radiographic OA (KL-grade ≥ 2) while the remaining 55 subjects (KL-grade < 2) formed the control group. AE and KI were recorded with a custom-made prototype and compared with radiographic findings using area-under-curve (AUC) and independent T-test. Predictive logistic regression models were constructed using leave-one-out cross validation. In females, the parameters reflecting consistency of the AE patterns during specific tasks, KI, BMI and age had a significant statistical difference between the OA and control groups (p = 0.001–0.036). The selected AE signals, KI, age and BMI were used to construct a predictive model for radiographic OA with AUC of 90.3% (95% CI 83.5–97.2%) which showed a statistical improvement of the reference model based on age and BMI, with AUC of 84.2% (95% CI 74.8–93.6%). In males, the predictive model failed to improve the reference model. AE and KI provide complementary information to detect radiographic knee OA in females. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
45. Grey Matter Age Prediction as a Biomarker for Risk of Dementia: A Population-based Study
- Author
-
Wiro J. Niessen, Gennady V. Roshchupkin, Meike W. Vernooij, M. Arfan Ikram, Aleksei Tiulpin, Florian Dubost, Hieab H.H. Adams, Marleen de Bruijne, Johnny Wang, and Maria J. Knol
- Subjects
education.field_of_study ,business.industry ,Proportional hazards model ,Intraclass correlation ,Population ,Hazard ratio ,Odds ratio ,medicine.disease ,Confidence interval ,Rotterdam Study ,Medicine ,Dementia ,business ,education ,Demography - Abstract
Key PointsQuestionIs the gap between brain age predicted from MRI and chronological age associated with incident dementia in a general population of Dutch adults?FindingsBrain age was predicted using a deep learning model, using MRI-derived grey matter density maps. In a population based study including 5496 participants, the observed gap was significantly associated with the risk of dementia.MeaningThe gap between MRI-brain predicted and chronological age is potentially a biomarker for dementia risk screening.AbstractImportanceThe gap between predicted brain age using magnetic resonance imaging (MRI) and chronological age may serve as biomarker for early-stage neurodegeneration and potentially as a risk indicator for dementia. However, owing to the lack of large longitudinal studies, it has been challenging to validate this link.ObjectiveWe aimed to investigate the utility of such a gap as a risk biomarker for incident dementia in a general Dutch population, using a deep learning approach for predicting brain age based on MRI-derived grey matter maps.DesignData was collected from participants of the cohort-based Rotterdam Study who underwent brain magnetic resonance imaging between 2006 and 2015. This study was performed in a longitudinal setting and all participant were followed up for incident dementia until 2016.SettingThe Rotterdam Study is a prospective population-based study, initiated in 1990 in the suburb Ommoord of in Rotterdam, the Netherlands.ParticipantsAt baseline, 5496 dementia- and stroke-free participants (mean age 64.67±9.82, 54.73% women) were scanned and screened for incident dementia. During 6.66±2.46 years of follow-up, 159 people developed dementia.Main outcomes and measuresWe built a convolutional neural network (CNN) model to predict brain age based on its MRI. Model prediction performance was measured in mean absolute error (MAE). Reproducibility of prediction was tested using the intraclass correlation coefficient (ICC) computed on a subset of 80 subjects. Logistic regressions and Cox proportional hazards were used to assess the association of the age gap with incident dementia, adjusted for years of education, ApoEε4 allele carriership, grey matter volume and intracranial volume. Additionally, we computed the attention maps of CNN, which shows which brain regions are important for age prediction.ResultsMAE of brain age prediction was 4.45±3.59 years and ICC was 0.97 (95% confidence interval CI=0.96-0.98). Logistic regression and Cox proportional hazards models showed that the age gap was significantly related to incident dementia (odds ratio OR=1.11 and 95% confidence intervals CI=1.05-1.16; hazard ratio HR=1.11 and 95% CI=1.06-1.15, respectively). Attention maps indicated that grey matter density around the amygdalae and hippocampi primarily drive the age estimation.Conclusion and relevanceWe show that the gap between predicted and chronological brain age is a biomarker associated with risk of dementia development. This suggests that it can be used as a biomarker, complimentary to those that are known, for dementia risk screening.
- Published
- 2019
- Full Text
- View/download PDF
46. Breast tumor cellularity assessment using deep neural networks
- Author
-
Rakhlin, A. (Alexander), Tiulpin, A. (Aleksei), Shvets, A. A. (Alexey A.), Kalinin, A. A. (Alexandr A.), Iglovikov, V. I. (Vladimir I.), and Nikolenko, S. (Sergey)
- Subjects
Deep Neural Networks ,Breast cancer ,Cellularity ,Deep Learning ,Diagnostic - Abstract
Breast cancer is one of the main causes of death worldwide. Histopathological cellularity assessment of residual tumors in post-surgical tissues is used to analyze a tumor’s response to a therapy. Correct cellularity assessment increases the chances of getting an appropriate treatment and facilitates the patient’s survival. In current clinical practice, tumor cellularity is manually estimated by pathologists; this process is tedious and prone to errors or low agreement rates between assessors. In this work, we evaluated three strong novel Deep Learning-based approaches for automatic assessment of tumor cellularity from post-treated breast surgical specimens stained with hematoxylin and eosin. We validated the proposed methods on the BreastPathQ SPIE challenge dataset that consisted of 2395 image patches selected from whole slide images acquired from 64 patients. Compared to expert pathologist scoring, our best performing method yielded the Cohen’s kappa coefficient of 0.69 (vs. 0.42 previously known in literature) and the intra-class correlation coefficient of 0.89 (vs. 0.83). Our results suggest that Deep Learning-based methods have a significant potential to alleviate the burden on pathologists, enhance the diagnostic workflow, and, thereby, facilitate better clinical outcomes in breast cancer treatment.
- Published
- 2019
47. Improving robustness of deep learning based knee MRI segmentation:mixup and adversarial domain adaptation
- Author
-
Panfilov, E. (Egor), Tiulpin, A. (Aleksei), Klein, S. (Stefan), Nieminen, M. T. (Miika T.), and Saarakkala, S. (Simo)
- Subjects
osteoarthritis ,domain adaptation ,knee cartilage ,robust segmentation ,magnetic resonance imaging - Abstract
Degeneration of articular cartilage (AC) is actively studied in knee osteoarthritis (OA) research via magnetic resonance imaging (MRI). Segmentation of AC tissues from MRI data is an essential step in quantification of their damage. Deep learning (DL) based methods have shown potential in this realm and are the current state-of-the-art, however, their robustness to heterogeneity of MRI acquisition settings remains an open problem. In this study, we investigated two modern regularization techniques — mixup and adversarial unsupervised domain adaptation (UDA) — to improve the robustness of DL-based knee cartilage segmentation to new MRI acquisition settings. Our validation setup included two datasets produced by different MRI scanners and using distinct data acquisition protocols. We assessed the robustness of automatic segmentation by comparing mixup and UDA approaches to a strong baseline method at different OA severity stages and, additionally, in relation to anatomical locations. Our results showed that for moderate changes in knee MRI data acquisition settings both approaches may provide notable improvements in the robustness, which are consistent for all stages of the disease and affect the clinically important areas of the knee joint. However, mixup may be considered as a recommended approach, since it is more computationally efficient and does not require additional data from the target acquisition setup.
- Published
- 2019
48. Kneel:knee anatomical landmark localization using hourglass networks
- Author
-
Tiulpin, A. (Aleksei), Melekhov, I. (Iaroslav), and Saarakkala, S. (Simo)
- Subjects
Deep Learning ,Anatomical Landmark Localization ,Osteoarthritis ,Knee - Abstract
This paper addresses the challenge of localization of anatomical landmarks in knee X-ray images at different stages of osteoarthritis (OA). Landmark localization can be viewed as regression problem, where the landmark position is directly predicted by using the region of interest or even full-size images leading to large memory footprint, especially in case of high resolution medical images. In this work, we propose an efficient deep neural networks framework with an hourglass architecture utilizing a soft-argmax layer to directly predict normalized coordinates of the landmark points. We provide an extensive evaluation of different regularization techniques and various loss functions to understand their influence on the localization performance. Furthermore, we introduce the concept of transfer learning from low-budget annotations, and experimentally demonstrate that such approach is improving the accuracy of landmark localization. Compared to the prior methods, we validate our model on two datasets that are independent from the train data and assess the performance of the method for different stages of OA severity. The proposed approach demonstrates better generalization performance compared to the current state-of-the-art.
- Published
- 2019
49. EXTERNAL VALIDATION OF SPINENET, A DEEP LEARNING MODEL FOR AUTOMATED GRADING OF LUMBAR DISC DEGENERATION MRI FEATURES, USING THE NORTHERN FINLAND BIRTH COHORT
- Author
-
McSweeney, T.P., Saarakkala, S., Tiulpin, A., Jamaludin, A., Kadir, T., Niinimäki, J., Karppinen, J., and Määttä, J.
- Published
- 2022
- Full Text
- View/download PDF
50. RADIOGRAPHIC OSTEOARTHRITIS PROGRESSION PREDICTION VIA MULTI-MODAL IMAGING DATA AND DEEP LEARNING
- Author
-
Panfilov, E., Tiulpin, A., Nieminen, M.T., and Saarakkala, S.
- Published
- 2022
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.