34 results on '"Hassanpour, Saeed"'
Search Results
2. HistoPerm: A permutation-based view generation approach for improving histopathologic feature representation learnin.
- Author
-
DiPalma, Joseph, Torresani, Lorenzo, and Hassanpour, Saeed
- Subjects
RENAL cell carcinoma ,IMAGE recognition (Computer vision) ,CELIAC disease ,PERMUTATIONS ,DEEP learning ,IMAGE analysis - Abstract
Deep learning has been effective for histology image analysis in digital pathology. However, many current deep learning approaches require large, strongly- or weakly labeled images and regions of interest, which can be time-consuming and resource-intensive to obtain. To address this challenge, we present HistoPerm, a view generation method for representation learning using joint embedding architectures that enhances representation learning for histology images. HistoPerm permutes augmented views of patches extracted from whole-slide histology images to improve classification performance. We evaluated the effectiveness of HistoPerm on 2 histology image datasets for Celiac disease and Renal Cell Carcinoma, using 3 widely used joint embedding architecture-based representation learning methods: BYOL, SimCLR, and VICReg. Our results show that HistoPerm consistently improves patch- and slide-level classification performance in terms of accuracy, F1-score, and AUC. Specifically, for patch-level classification accuracy on the Celiac disease dataset, HistoPerm boosts BYOL and VICReg by 8% and SimCLR by 3%. On the Renal Cell Carcinoma dataset, patch-level classification accuracy is increased by 2% for BYOL and VICReg, and by 1% for SimCLR. In addition, on the Celiac disease dataset, models with HistoPerm outperform the fully supervised baseline model by 6%, 5%, and 2% for BYOL, SimCLR, and VICReg, respectively. For the Renal Cell Carcinoma dataset, HistoPerm lowers the classification accuracy gap for the models up to 10% relative to the fully supervised baseline. These findings suggest that HistoPerm can be a valuable tool for improving representation learning of histopathology features when access to labeled data is limited and can lead to whole-slide classification results that are comparable to or superior to fully supervised methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
3. Association between fat‐infiltrated axillary lymph nodes on screening mammography and cardiometabolic disease.
- Author
-
Song, Qingyuan, diFlorio‐Alexander, Roberta M., Patel, Sohum D., Sieberg, Ryan T., Margron, Michael J., Ansari, Saif M., Karagas, Margaret R., Mackenzie, Todd A., and Hassanpour, Saeed
- Subjects
HEART metabolism disorders ,MEDICAL screening ,LYMPH nodes ,DIGITAL mammography ,NON-alcoholic fatty liver disease ,FATTY liver ,DYSLIPIDEMIA - Abstract
Objective: Ectopic fat deposition within and around organs is a stronger predictor of cardiometabolic disease status than body mass index (BMI). Fat deposition within the lymphatic system is poorly understood. This study examined the association between the prevalence of cardiometabolic disease and ectopic fat deposition within axillary lymph nodes (LNs) visualized on screening mammograms. Methods: A cross‐sectional study was conducted on 834 women presenting for full‐field digital screening mammography. The status of fat‐infiltrated LNs was assessed based on the size and morphology of axillary LNs from screening mammograms. The prevalence of cardiometabolic disease was retrieved from the electronic medical records, including type 2 diabetes mellitus (T2DM), hypertension, dyslipidemia, high blood glucose, cardiovascular disease, stroke, and non‐alcoholic fatty liver disease. Results: Fat‐infiltrated axillary LNs were associated with a high prevalence of T2DM among all women (adjusted odds ratio: 3.92, 95% CI: [2.40, 6.60], p‐value < 0.001) and in subgroups of women with and without obesity. Utilizing the status of fatty LNs improved the classification of T2DM status in addition to age and BMI (1.4% improvement in the area under the receiver operating characteristic curve). Conclusion: Fat‐infiltrated axillary LNs visualized on screening mammograms were associated with the prevalence of T2DM. If further validated, fat‐infiltrated axillary LNs may represent a novel imaging biomarker of T2DM in women undergoing screening mammography. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
4. Enhancing research data infrastructure to address the opioid epidemic: the Opioid Overdose Network (O2-Net).
- Author
-
Lenert, Leslie A., Zhu, Vivienne, Jennings, Lindsey, McCauley, Jenna L., Obeid, Jihad S., Ward, Ralph, Hassanpour, Saeed, Marsch, Lisa A., Hogarth, Michael, Shipman, Perry, Harris, Daniel R., and Talbert, Jeffery C.
- Published
- 2022
- Full Text
- View/download PDF
5. The Feasibility and Utility of Harnessing Digital Health to Understand Clinical Trajectories in Medication Treatment for Opioid Use Disorder: D-TECT Study Design and Methodological Considerations.
- Author
-
Marsch, Lisa A., Chen, Ching-Hua, Adams, Sara R., Asyyed, Asma, Does, Monique B., Hassanpour, Saeed, Hichborn, Emily, Jackson-Morris, Melanie, Jacobson, Nicholas C., Jones, Heather K., Kotz, David, Lambert-Harris, Chantal A., Li, Zhiguo, McLeman, Bethany, Mishra, Varun, Stanger, Catherine, Subramaniam, Geetha, Wu, Weiyi, and Campbell, Cynthia I.
- Subjects
OPIOID abuse ,DIGITAL health ,ECOLOGICAL momentary assessments (Clinical psychology) ,OPIOIDS ,DRUG abuse treatment ,NEONATAL abstinence syndrome - Abstract
Introduction: Across the U.S., the prevalence of opioid use disorder (OUD) and the rates of opioid overdoses have risen precipitously in recent years. Several effective medications for OUD (MOUD) exist and have been shown to be life-saving. A large volume of research has identified a confluence of factors that predict attrition and continued substance use during substance use disorder treatment. However, much of this literature has examined a small set of potential moderators or mediators of outcomes in MOUD treatment and may lead to over-simplified accounts of treatment non-adherence. Digital health methodologies offer great promise for capturing intensive, longitudinal ecologically-valid data from individuals in MOUD treatment to extend our understanding of factors that impact treatment engagement and outcomes. Methods: This paper describes the protocol (including the study design and methodological considerations) from a novel study supported by the National Drug Abuse Treatment Clinical Trials Network at the National Institute on Drug Abuse (NIDA). This study (D-TECT) primarily seeks to evaluate the feasibility of collecting ecological momentary assessment (EMA), smartphone and smartwatch sensor data, and social media data among patients in outpatient MOUD treatment. It secondarily seeks to examine the utility of EMA, digital sensing, and social media data (separately and compared to one another) in predicting MOUD treatment retention, opioid use events, and medication adherence [as captured in electronic health records (EHR) and EMA data]. To our knowledge, this is the first project to include all three sources of digitally derived data (EMA, digital sensing, and social media) in understanding the clinical trajectories of patients in MOUD treatment. These multiple data streams will allow us to understand the relative and combined utility of collecting digital data from these diverse data sources. The inclusion of EHR data allows us to focus on the utility of digital health data in predicting objectively measured clinical outcomes. Discussion: Results may be useful in elucidating novel relations between digital data sources and OUD treatment outcomes. It may also inform approaches to enhancing outcomes measurement in clinical trials by allowing for the assessment of dynamic interactions between individuals' daily lives and their MOUD treatment response. Clinical Trial Registration: Identifier: NCT04535583. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
6. Response to commentary on "Automated classification of fat-infiltrated axillary lymph nodes on screening mammograms".
- Author
-
Song, Qingyuan, diFlorio-Alexander, Roberta M, Sieberg, Ryan T, Dwan, Dennis, Boyce, William, Stumetz, Kyle, Patel, Sohum D, Karagas, Margaret R, Mackenzie, Todd A, and Hassanpour, Saeed
- Subjects
MAMMOGRAMS ,AXILLA ,LYMPH nodes - Abstract
This document is a response to a commentary on a study published in the British Journal of Radiology. The authors address concerns raised about the potential overestimation of their model's accuracy due to data imbalance in their internal test set. They clarify the distribution of the internal test set and explain that they used various metrics, including precision, sensitivity, specificity, and the area under the receiver operating characteristic curve, to evaluate their model. They also mention that the model was evaluated on an independent external dataset for additional validation. The authors hope that their response clarifies their approach and evaluation methodology. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
7. Predicting prognosis and IDH mutation status for patients with lower-grade gliomas using whole slide images.
- Author
-
Jiang, Shuai, Zanazzi, George J., and Hassanpour, Saeed
- Subjects
ISOCITRATE dehydrogenase ,GENETIC mutation ,GLIOMAS ,BIOMARKERS ,DEEP learning - Abstract
We developed end-to-end deep learning models using whole slide images of adults diagnosed with diffusely infiltrating, World Health Organization (WHO) grade 2 gliomas to predict prognosis and the mutation status of a somatic biomarker, isocitrate dehydrogenase (IDH) 1/2. The models, which utilize ResNet-18 as a backbone, were developed and validated on 296 patients from The Cancer Genome Atlas (TCGA) database. To account for the small sample size, repeated random train/test splits were performed for hyperparameter tuning, and the out-of-sample predictions were pooled for evaluation. Our models achieved a concordance- (C-) index of 0.715 (95% CI: 0.569, 0.830) for predicting prognosis and an area under the curve (AUC) of 0.667 (0.532, 0.784) for predicting IDH mutations. When combined with additional clinical information, the performance metrics increased to 0.784 (95% CI: 0.655, 0.880) and 0.739 (95% CI: 0.613, 0.856), respectively. When evaluated on the WHO grade 3 gliomas from the TCGA dataset, which were not used for training, our models predicted survival with a C-index of 0.654 (95% CI: 0.537, 0.768) and IDH mutations with an AUC of 0.814 (95% CI: 0.721, 0.897). If validated in a prospective study, our method could potentially assist clinicians in managing and treating patients with diffusely infiltrating gliomas. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
8. Fat-enlarged axillary lymph nodes are associated with node-positive breast cancer in obese patients.
- Author
-
diFlorio-Alexander, Roberta M., Song, Qingyuan, Dwan, Dennis, Austin-Strohbehn, Judith A., Muller, Kristen E., Kinlaw, William B., MacKenzie, Todd A., Karagas, Margaret R., and Hassanpour, Saeed
- Abstract
Purpose: Obesity associated fat infiltration of organ systems is accompanied by organ dysfunction and poor cancer outcomes. Obese women demonstrate variable degrees of fat infiltration of axillary lymph nodes (LNs), and they are at increased risk for node-positive breast cancer. However, the relationship between enlarged axillary nodes and axillary metastases has not been investigated. The purpose of this study is to evaluate the association between axillary metastases and fat-enlarged axillary nodes visualized on mammograms and breast MRI in obese women with a diagnosis of invasive breast cancer. Methods: This retrospective case–control study included 431 patients with histologically confirmed invasive breast cancer. The primary analysis of this study included 306 patients with pre-treatment and pre-operative breast MRI and body mass index (BMI) > 30 (201 node-positive cases and 105 randomly selected node-negative controls) diagnosed with invasive breast cancer between April 1, 2011, and March 1, 2020. The largest visible LN was measured in the axilla contralateral to the known breast cancer on breast MRI. Multivariate logistic regression models were used to assess the association between node-positive status and LN size adjusting for age, BMI, tumor size, tumor grade, tumor subtype, and lymphovascular invasion. Results: A strong likelihood of node-positive breast cancer was observed among obese women with fat-expanded lymph nodes (adjusted OR for the 4th vs. 1st quartile for contralateral LN size on MRI: 9.70; 95% CI 4.26, 23.50; p < 0.001). The receiver operating characteristic curve for size of fat-enlarged nodes in the contralateral axilla identified on breast MRI had an area under the curve of 0.72 for predicting axillary metastasis, and this increased to 0.77 when combined with patient and tumor characteristics. Conclusion: Fat expansion of axillary lymph nodes was associated with a high likelihood of axillary metastases in obese women with invasive breast cancer independent of BMI and tumor characteristics. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
9. Natural language processing for automated annotation of medication mentions in primary care visit conversations.
- Author
-
Ganoe, Craig H., Weiyi Wu, Barr, Paul J., Haslett, William, Dannenberg, Michelle D., Bonasia, Kyra L., Finora, James C., Schoonmaker, Jesse A., Onsando, Wambui M., Ryan, James, Elwyn, Glyn, Bruce, Martha L., Das, Amar K., and Hassanpour, Saeed
- Published
- 2021
- Full Text
- View/download PDF
10. Development and evaluation of a deep neural network for histologic classification of renal cell carcinoma on biopsy and surgical resection slides.
- Author
-
Zhu, Mengdan, Ren, Bing, Richards, Ryland, Suriawinata, Matthew, Tomita, Naofumi, and Hassanpour, Saeed
- Subjects
RENAL cell carcinoma ,ARTIFICIAL neural networks ,RENAL biopsy ,CANCER histopathology ,PATHOLOGISTS - Abstract
Renal cell carcinoma (RCC) is the most common renal cancer in adults. The histopathologic classification of RCC is essential for diagnosis, prognosis, and management of patients. Reorganization and classification of complex histologic patterns of RCC on biopsy and surgical resection slides under a microscope remains a heavily specialized, error-prone, and time-consuming task for pathologists. In this study, we developed a deep neural network model that can accurately classify digitized surgical resection slides and biopsy slides into five related classes: clear cell RCC, papillary RCC, chromophobe RCC, renal oncocytoma, and normal. In addition to the whole-slide classification pipeline, we visualized the identified indicative regions and features on slides for classification by reprocessing patch-level classification results to ensure the explainability of our diagnostic model. We evaluated our model on independent test sets of 78 surgical resection whole slides and 79 biopsy slides from our tertiary medical institution, and 917 surgical resection slides from The Cancer Genome Atlas (TCGA) database. The average area under the curve (AUC) of our classifier on the internal resection slides, internal biopsy slides, and external TCGA slides is 0.98 (95% confidence interval (CI): 0.97–1.00), 0.98 (95% CI: 0.96–1.00) and 0.97 (95% CI: 0.96–0.98), respectively. Our results suggest that the high generalizability of our approach across different data sources and specimen types. More importantly, our model has the potential to assist pathologists by (1) automatically pre-screening slides to reduce false-negative cases, (2) highlighting regions of importance on digitized slides to accelerate diagnosis, and (3) providing objective and accurate diagnosis as the second opinion. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
11. Automated Detection of Celiac Disease on Duodenal Biopsy Slides: A Deep Learning Approach.
- Author
-
Wei, Jason W., Wei, Jerry W., Jackson, Christopher R., Bing Ren, Suriawinata, Arief A., and Hassanpour, Saeed
- Subjects
DEEP learning ,CELIAC disease ,DUODENAL diseases ,CONVOLUTIONAL neural networks ,RECEIVER operating characteristic curves ,BIOPSY - Abstract
Context: Celiac disease (CD) prevalence and diagnosis have increased substantially in recent years. The current gold standard for CD confirmation is visual examination of duodenal mucosal biopsies. An accurate computer-aided biopsy analysis system using deep learning can help pathologists diagnose CD more efficiently. Subjects and Methods: In this study, we trained a deep learning model to detect CD on duodenal biopsy images. Our model uses a state-of-the-art residual convolutional neural network to evaluate patches of duodenal tissue and then aggregates those predictions for whole-slide classification. We tested the model on an independent set of 212 images and evaluated its classification results against reference standards established by pathologists. Results: Our model identified CD, normal tissue, and nonspecific duodenitis with accuracies of 95.3%, 91.0%, and 89.2%, respectively. The area under the receiver operating characteristic curve was >0.95 for all classes. Conclusions: We have developed an automated biopsy analysis system that achieves high performance in detecting CD on biopsy slides. Our system can highlight areas of interest and provide preliminary classification of duodenal biopsies before review by pathologists. This technology has great potential for improving the accuracy and efficiency of CD diagnosis. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
12. Prediction of Atypical Ductal Hyperplasia Upgrades Through a Machine Learning Approach to Reduce Unnecessary Surgical Excisions.
- Author
-
Harrington, Lia, diFlorio-Alexander, Roberta, Trinh, Katherine, MacKenzie, Todd, Suriawinata, Arief, and Hassanpour, Saeed
- Subjects
HYPERPLASIA ,MACHINE learning ,SURGICAL complications ,MEDICAL technology ,K-nearest neighbor classification - Abstract
Purpose: Surgical excision is currently recommended for all occurrences of atypical ductal hyperplasia (ADH) found on core needle biopsies for malignancy diagnoses and treatment of lesions. The excision of all ADH lesions may lead to overtreatment, which results in invasive surgeries for benign lesions in many women. A machine learning method to predict ADH upgrade may help clinicians and patients decide whether combined active surveillance and hormonal therapy is a reasonable alternative to surgical excision. Methods: The following six machine learning models were developed to predict ADH upgrade from core needle biopsy: gradient-boosting trees, random forest, radial support vector machine (SVM), weighted K-nearest neighbors (KNN), logistic elastic net, and logistic regression. The study cohort consisted of 128 lesions from 124 women at a tertiary academic care center in New Hampshire who had ADH on core needle biopsy and who underwent an associated surgical excision from 2011 to 2017. Results: The best-performing models were gradient-boosting trees (area under the curve [AUC], 68%; accuracy, 78%) and random forest (AUC, 67%; accuracy, 77%). The top five most important features that determined ADH upgrade were age at biopsy, lesion size, number of biopsies, needle gauge, and personal and family history of breast cancer. Using the random forest model, 98% of all malignancies would have been diagnosed through surgical biopsies, whereas 16% of unnecessary surgeries on benign lesions could have been avoided (ie, 87% sensitivity at 45% specificity). Conclusion: These results add to the growing body of support for machine learning models as useful aids for clinicians and patients in decisions about the clinical management of ADH. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
13. Automated classification of fat‐infiltrated axillary lymph nodes on screening mammograms.
- Author
-
Song, Qingyuan, diFlorio-Alexander, Roberta M., Sieberg, Ryan T., Dwan, Dennis, Boyce, William, Stumetz, Kyle, Patel, Sohum D., Karagas, Margaret R., MacKenzie, Todd A., and Hassanpour, Saeed
- Subjects
MAMMOGRAMS ,LYMPH nodes ,FAT ,DEEP learning ,RAPID tooling ,DATABASES - Abstract
Objective: Fat‐infiltrated axillary lymph nodes (LNs) are unique sites for ectopic fat deposition. Early studies showed a strong correlation between fatty LNs and obesity‐related diseases. Confirming this correlation requires large‐scale studies, hindered by scarce labeled data. With the long‐term goal of developing a rapid and generalizable tool to aid data labeling, we developed an automated deep learning (DL)‐based pipeline to classify the status of fatty LNs on screening mammograms. Methods: Our internal data set included 886 mammograms from a tertiary academic medical institution, with a binary status of the fat‐infiltrated LNs based on the size and morphology of the largest visible axillary LN. A two‐stage DL model training and fine‐tuning pipeline was developed to classify the fat‐infiltrated LN status using the internal training and development data set. The model was evaluated on a held‐out internal test set and a subset of the Digital Database for Screening Mammography. Results: Our model achieved 0.97 (95% CI: 0.94–0.99) accuracy and 1.00 (95% CI: 1.00–1.00) area under the receiver operator characteristic curve on 264 internal testing mammograms, and 0.82 (95% CI: 0.77–0.86) accuracy and 0.87 (95% CI: 0.82–0.91) area under the receiver operator characteristic curve on 70 external testing mammograms. Conclusion: This study confirmed the feasibility of using a DL model for fat‐infiltrated LN classification. The model provides a practical tool to identify fatty LNs on mammograms and to allow for future large‐scale studies to evaluate the role of fatty LNs as an imaging biomarker of obesity‐associated pathologies. Advances in knowledge: Our study is the first to classify fatty LNs using an automated DL approach. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. Monitoring of Technology Adoption Using Web Content Mining of Location Information and Geographic Information Systems: A Case Study of Digital Breast Tomosynthesis.
- Author
-
Onega, Tracy, Kamra, Dharmanshu, Alford-Teaster, Jennifer, and Hassanpour, Saeed
- Subjects
TOMOSYNTHESIS ,GEOGRAPHIC information systems ,MACHINE learning ,SPATIOTEMPORAL processes - Abstract
Purpose: To our knowledge, integration of Web content mining of publicly available addresses with a geographic information system (GIS) has not been applied to the timely monitoring of medical technology adoption. Here, we explore the diffusion of a new breast imaging technology, digital breast tomosynthesis (DBT). Methods: We used natural language processing and machine learning to extract DBT facility location information using a set of potential sites for the New England region of the United States via a Google search application program interface. We assessed the accuracy of the algorithm using a validated set of publicly available addresses of locations that provide DBT from the DBT technology vendor, Hologic. We quantified precision, recall, and F1 score, aiming for an F1 score of ≥ 95% as the desirable performance. By reverse geocoding on the basis of the results of the Google Maps application program interface, we derived a spatial data set for use in an ArcGIS environment. Within the GIS, a host of spatiotemporal analyses and geovisualization techniques are possible. Results: We developed a semiautomated system that integrated DBT location information into a GIS that was feasible and of reasonable quality. Initial accuracy of the algorithm was poor using only a search term list for information retrieval (precision, 35%; recall, 44%; F1 score, 39%), but performance dramatically improved by leveraging natural language processing and simple machine learning techniques to isolate single, valid instances of DBT location information (precision, 92%; recall, 96%; F1 score, 94%). Reverse geocoding yielded reliable geographic coordinates for easy implementation into a GIS for mapping and planned monitoring. Conclusion: Our novel approach can be applicable to technologies beyond DBT, which may inform equitable access over time and space. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
15. Using Natural Language Processing of Free-Text Radiology Reports to Identify Type 1 Modic Endplate Changes.
- Author
-
Huhdanpaa, Hannu T., Tan, W. Katherine, Rundell, Sean D., Suri, Pradeep, Chokshi, Falgun H., Comstock, Bryan A., Heagerty, Patrick J., James, Kathryn T., Avins, Andrew L., Nedeljkovic, Srdjan S., Nerenz, David R., Kallmes, David F., Luetmer, Patrick H., Sherman, Karen J., Organ, Nancy L., Griffith, Brent, Langlotz, Curtis P., Carrell, David, Hassanpour, Saeed, and Jarvik, Jeffrey G.
- Subjects
CONFIDENCE intervals ,INFORMATION storage & retrieval systems ,MEDICAL databases ,LUMBAR vertebrae ,SPINE diseases ,MEDICAL records ,NATURAL language processing ,STATISTICS - Abstract
Electronic medical record (EMR) systems provide easy access to radiology reports and offer great potential to support quality improvement efforts and clinical research. Harnessing the full potential of the EMR requires scalable approaches such as natural language processing (NLP) to convert text into variables used for evaluation or analysis. Our goal was to determine the feasibility of using NLP to identify patients with Type 1 Modic endplate changes using clinical reports of magnetic resonance (MR) imaging examinations of the spine. Identifying patients with Type 1 Modic change who may be eligible for clinical trials is important as these findings may be important targets for intervention. Four annotators identified all reports that contained Type 1 Modic change, using
N = 458 randomly selected lumbar spine MR reports. We then implemented a rule-based NLP algorithm in Java using regular expressions. The prevalence of Type 1 Modic change in the annotated dataset was 10%. Results were recall (sensitivity) 35/50 = 0.70 (95% confidence interval (C.I.) 0.52–0.82), specificity 404/408 = 0.99 (0.97–1.0), precision (positive predictive value) 35/39 = 0.90 (0.75–0.97), negative predictive value 404/419 = 0.96 (0.94–0.98), and F1-score 0.79 (0.43–1.0). Our evaluation shows the efficacy of rule-based NLP approach for identifying patients with Type 1 Modic change if the emphasis is on identifying only relevant cases with low concern regarding false negatives. As expected, our results show that specificity is higher than recall. This is due to the inherent difficulty of eliciting all possible keywords given the enormous variability of lumbar spine reporting, which decreases recall, while availability of good negation algorithms improves specificity. [ABSTRACT FROM AUTHOR]- Published
- 2018
- Full Text
- View/download PDF
16. Deep Learning for Classification of Colorectal Polyps on Whole‑slide Images.
- Author
-
Korbar, Bruno, Olofson, Andrea M., Miraflor, Allen P., Nicka, Catherine M., Suriawinata, Matthew A., Torresani, Lorenzo, Suriawinata, Arief A., and Hassanpour, Saeed
- Subjects
COLON polyps ,HISTOPATHOLOGY ,DEEP learning ,DIAGNOSIS - Abstract
Context: Histopathological characterization of colorectal polyps is critical for determining the risk of colorectal cancer and future rates of surveillance for patients. However, this characterization is a challenging task and suffers from significant inter‑ and intra‑observer variability. Aims: We built an automatic image analysis method that can accurately classify different types of colorectal polyps on whole‑slide images to help pathologists with this characterization and diagnosis. Setting and Design: Our method is based on deep‑learning techniques, which rely on numerous levels of abstraction for data representation and have shown state‑of‑the‑art results for various image analysis tasks. Subjects and Methods: Our method covers five common types of polyps (i.e., hyperplastic, sessile serrated, traditional serrated, tubular, and tubulovillous/villous) that are included in the US Multisociety Task Force guidelines for colorectal cancer risk assessment and surveillance. We developed multiple deep‑learning approaches by leveraging a dataset of 2074 crop images, which were annotated by multiple domain expert pathologists as reference standards. Statistical Analysis: We evaluated our method on an independent test set of 239 whole‑slide images and measured standard machine‑learning evaluation metrics of accuracy, precision, recall, and F1 score and their 95% confidence intervals. Results: Our evaluation shows that our method with residual network architecture achieves the best performance for classification of colorectal polyps on whole‑slide images (overall accuracy: 93.0%, 95% confidence interval: 89.0%–95.9%). Conclusions: Our method can reduce the cognitive burden on pathologists and improve their efficacy in histopathological characterization of colorectal polyps and in subsequent risk assessment and follow‑up recommendations. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
17. Characterization of Change and Significance for Clinical Findings in Radiology Reports Through Natural Language Processing.
- Author
-
Hassanpour, Saeed, Bay, Graham, and Langlotz, Curtis
- Subjects
MEDICAL informatics ,DECISION support systems ,HOSPITAL radiological services ,INFORMATION storage & retrieval systems ,MEDICAL databases ,NATURAL language processing ,PUBLIC health surveillance ,REPORT writing ,SEMANTICS ,EVALUATION - Abstract
We built a natural language processing (NLP) method to automatically extract clinical findings in radiology reports and characterize their level of change and significance according to a radiology-specific information model. We utilized a combination of machine learning and rule-based approaches for this purpose. Our method is unique in capturing different features and levels of abstractions at surface, entity, and discourse levels in text analysis. This combination has enabled us to recognize the underlying semantics of radiology report narratives for this task. We evaluated our method on radiology reports from four major healthcare organizations. Our evaluation showed the efficacy of our method in highlighting important changes (accuracy 99.2%, precision 96.3%, recall 93.5%, and F1 score 94.7%) and identifying significant observations (accuracy 75.8%, precision 75.2%, recall 75.7%, and F1 score 75.3%) to characterize radiology reports. This method can help clinicians quickly understand the key observations in radiology reports and facilitate clinical decision support, review prioritization, and disease surveillance. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
18. Deep Learning for Classification of Colorectal Polyps on Whole-slide Images.
- Author
-
Korbar, Bruno, Olofson, Andrea M., Miraflor, Allen P., Nicka, Catherine M., Suriawinata, Matthew A., Torresani, Lorenzo, Suriawinata, Arief A., and Hassanpour, Saeed
- Subjects
COLON polyps ,DEEP learning ,COLON cancer risk factors - Abstract
Context: Histopathological characterization of colorectal polyps is critical for determining the risk of colorectal cancer and future rates of surveillance for patients. However, this characterization is a challenging task and suffers from significant inter- and intra-observer variability. Aims: We built an automatic image analysis method that can accurately classify different types of colorectal polyps on whole-slide images to help pathologists with this characterization and diagnosis. Setting and Design: Our method is based on deep-learning techniques, which rely on numerous levels of abstraction for data representation and have shown state-of-the-art results for various image analysis tasks. Subjects and Methods: Our method covers five common types of polyps (i.e., hyperplastic, sessile serrated, traditional serrated, tubular, and tubulovillous/villous) that are included in the US Multisociety Task Force guidelines for colorectal cancer risk assessment and surveillance. We developed multiple deep-learning approaches by leveraging a dataset of 2074 crop images, which were annotated by multiple domain expert pathologists as reference standards. Statistical Analysis: We evaluated our method on an independent test set of 239 whole-slide images and measured standard machine-learning evaluation metrics of accuracy, precision, recall, and F1 score and their 95% confidence intervals. Results: Our evaluation shows that our method with residual network architecture achieves the best performance for classification of colorectal polyps on whole-slide images (overall accuracy: 93.0%, 95% confidence interval: 89.0%-95.9%). Conclusions: Our method can reduce the cognitive burden on pathologists and improve their efficacy in histopathological characterization of colorectal polyps and in subsequent risk assessment and follow-up recommendations. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
19. Unsupervised Topic Modeling in a Large Free Text Radiology Report Repository.
- Author
-
Hassanpour, Saeed and Langlotz, Curtis
- Subjects
DATABASES ,INFORMATION retrieval ,NATURAL language processing ,RADIOGRAPHY ,REPORT writing ,ACCESS to information - Abstract
Radiology report narrative contains a large amount of information about the patient's health and the radiologist's interpretation of medical findings. Most of this critical information is entered in free text format, even when structured radiology report templates are used. The radiology report narrative varies in use of terminology and language among different radiologists and organizations. The free text format and the subtlety and variations of natural language hinder the extraction of reusable information from radiology reports for decision support, quality improvement, and biomedical research. Therefore, as the first step to organize and extract the information content in a large multi-institutional free text radiology report repository, we have designed and developed an unsupervised machine learning approach to capture the main concepts in a radiology report repository and partition the reports based on their main foci. In this approach, radiology reports are modeled in a vector space and compared to each other through a cosine similarity measure. This similarity is used to cluster radiology reports and identify the repository's underlying topics. We applied our approach on a repository of 1,899,482 radiology reports from three major healthcare organizations. Our method identified 19 major radiology report topics in the repository and clustered the reports accordingly to these topics. Our results are verified by a domain expert radiologist and successfully explain the repository's primary topics and extract the corresponding reports. The results of our system provide a target-based corpus and framework for information extraction and retrieval systems for radiology reports. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
20. A Software Tool for Visualizing, Managing and Eliciting SWRL Rules.
- Author
-
Hassanpour, Saeed, O΄Connor, Martin J., and Das, Amar K.
- Abstract
SWRL rule are increasingly being used to represent knowledge on the Semantic Web. As these SWRL rule bases grows larger, managing the resulting complexity can become a challenge. Developers and end-users need rule management tools to tackle this complexity. We developed a rule management tool called Axiomé that aims to address this challenge. Axiomé support the paraphrasing of SWRL into simple English, the visualization of the structure both of individual rules and of rule bases, and supports the categorization of rules based on an analysis of their syntactic structure. It also supports the automatic generation of rule acquisition templates to facilitate rule elicitation. Axiomé is available as a plugin to the Protégé-OWL ontology development environment. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
21. Visualizing Logical Dependencies in SWRL Rule Bases.
- Author
-
Hassanpour, Saeed, O΄Connor, Martin J., and Das, Amar K.
- Abstract
Rule bases are common in many business rule applications, clinical decision support programs, and other types of intelligent systems. As the size of the rule bases grows and the interrelationships between rules become more complex, understanding dependencies among rules can be quite difficult. To address this challenge, we propose a novel approach for modeling logical dependencies among rules and for discovering patterns based on these dependencies. Our method uses rules bases written in the Semantic Web Rule Language (SWRL); we exploit SWRL΄s logical relationship with OWL to incorporate these semantics in our analysis. We couple this analysis with visualization techniques that create a rule dependency graph. We group nodes into layers based on their dependencies and cluster nodes within a layer if they have similar dependencies. We have evaluated our approach by applying it to two independently developed, publicly available ontologies containing SWRL rules. We show how our analysis and visualization approach can allow users to quickly examine patterns of logical relationships in such rule bases. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
22. Exploration of SWRL Rule Bases through Visualization, Paraphrasing, and Categorization of Rules.
- Author
-
Hassanpour, Saeed, O΄Connor, Martin J., and Das, Amar K.
- Abstract
Rule bases are increasingly being used as repositories of knowledge content on the Semantic Web. As the size and complexity of these rule bases increases, developers and end users need methods of rule abstraction to facilitate rule management. In this paper, we describe a rule abstraction method for Semantic Web Rule Language (SWRL) rules that is based on lexical analysis and a set of heuristics. Our method results in a tree data structure that we exploit in creating techniques to visualize, paraphrase, and categorize SWRL rules. We evaluate our approach by applying it to several biomedical ontologies that contain SWRL rules, and show how the results reveal rule patterns within the rule base. We have implemented our method as a plug-in tool for Protégé-OWL, the most widely used ontology modeling software for the Semantic Web. Our tool can allow users to rapidly explore content and patterns in SWRL rule bases, enabling their acquisition and management. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
23. A semantic-based method for extracting concept definitions from scientific publications: evaluation in the autism phenotype domain.
- Author
-
Hassanpour, Saeed, O'Connor, Martin J., and Das, Amar K.
- Subjects
INFORMATION science ,AUTISM research ,SCIENCE periodicals ,PERIODICAL publishing ,DATA mining - Abstract
Background: A variety of informatics approaches have been developed that use information retrieval, NLP and text-mining techniques to identify biomedical concepts and relations within scientific publications or their sentences. These approaches have not typically addressed the challenge of extracting more complex knowledge such as biomedical definitions. In our efforts to facilitate knowledge acquisition of rule-based definitions of autism phenotypes, we have developed a novel semantic-based text-mining approach that can automatically identify such definitions within text. Results: Using an existing knowledge base of 156 autism phenotype definitions and an annotated corpus of 26 source articles containing such definitions, we evaluated and compared the average rank of correctly identified rule definition or corresponding rule template using both our semantic-based approach and a standard term-based approach. We examined three separate scenarios: (1) the snippet of text contained a definition already in the knowledge base; (2) the snippet contained an alternative definition for a concept in the knowledge base; and (3) the snippet contained a definition not in the knowledge base. Our semantic-based approach had a higher average rank than the term-based approach for each of the three scenarios (scenario 1: 3.8 vs. 5.0; scenario 2: 2.8 vs. 4.9; and scenario 3: 4.5 vs. 6.2), with each comparison significant at the p-value of 0.05 using the Wilcoxon signed-rank test. Conclusions: Our work shows that leveraging existing domain knowledge in the information extraction of biomedical definitions significantly improves the correct identification of such knowledge within sentences. Our method can thus help researchers rapidly acquire knowledge about biomedical definitions that are specified and evolving within an ever-growing corpus of scientific publications [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
24. Evaluation of an Artificial Intelligence–Augmented Digital System for Histologic Classification of Colorectal Polyps.
- Author
-
Nasir-Moin, Mustafa, Suriawinata, Arief A., Ren, Bing, Liu, Xiaoying, Robertson, Douglas J., Bagchi, Srishti, Tomita, Naofumi, Wei, Jason W., MacKenzie, Todd A., Rees, Judy R., and Hassanpour, Saeed
- Published
- 2021
- Full Text
- View/download PDF
25. Understanding Urgency in Radiology Reporting: Identifying Associations Between Clinical Findings in Radiology Reports and Their Prompt Communication to Referring Physicians.
- Author
-
Xing Meng, Heinz, Michael V., Ganoe, Craig H., Sieberg, Ryan T., Cheung, Yvonne Y., and Hassanpour, Saeed
- Abstract
In this study, we aim to develop an automatic pipeline to identify clinical findings in the unstructured text of radiology reports that necessitate communications between radiologists and referring physicians. Our approach identified 20 distinct clinical concepts and highlighted statistically significant concepts with strong associations to cases that require prompt communication. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
26. Understanding Urgency in Radiology Reporting: Identifying Associations Between Clinical Findings in Radiology Reports and Their Prompt Communication to Referring Physicians.
- Author
-
Xing Meng, Heinz, Michael V., Ganoe, Craig H., Sieberg, Ryan T., Cheung, Yvonne Y., and Hassanpour, Saeed
- Abstract
In this study, we aim to develop an automatic pipeline to identify clinical findings in the unstructured text of radiology reports that necessitate communications between radiologists and referring physicians. Our approach identified 20 distinct clinical concepts and highlighted statistically significant concepts with strong associations to cases that require prompt communication. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
27. Evaluation of a Deep Neural Network for Automated Classification of Colorectal Polyps on Histopathologic Slides.
- Author
-
Wei, Jason W., Suriawinata, Arief A., Vaickus, Louis J., Ren, Bing, Liu, Xiaoying, Lisovsky, Mikhail, Tomita, Naofumi, Abdollahi, Behnaz, Kim, Adam S., Snover, Dale C., Baron, John A., Barry, Elizabeth L., and Hassanpour, Saeed
- Published
- 2020
- Full Text
- View/download PDF
28. Attention-Based Deep Neural Networks for Detection of Cancerous and Precancerous Esophagus Tissue on Histopathological Slides.
- Author
-
Tomita, Naofumi, Abdollahi, Behnaz, Wei, Jason, Ren, Bing, Suriawinata, Arief, and Hassanpour, Saeed
- Published
- 2019
- Full Text
- View/download PDF
29. Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source.
- Author
-
Palmer, Ellen L., Higgins, John, Hassanpour, Saeed, Sargent, James, Robinson, Christina M., Doherty, Jennifer A., and Onega, Tracy
- Abstract
Background: Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smoking behaviors from clinical notes, but none of these approaches were assessed with external data to report on anticipated clinical performance.Methods: Previously, we developed an informatics pipeline that extracts smoking status, pack year history, and cessation date from clinical notes. Here we report on the clinical implementation performance of our pipeline using 1,504 clinical notes matched to an external questionnaire.Results: We found that 73% of available notes contained no smoking behavior information. The weighted Cohen's kappa between the external questionnaire and EHR smoking status was 0.62 (95% CI 0.56-0.69) for the clinical notes we were able to extract information from. The correlation between pack years reported by our pipeline and the external questionnaire was 0.39 on the 81 notes for which this information was present in both. We also assessed for lung cancer screening eligibility using notes from individuals identified as never smokers or smokers with pack year history extracted by our pipeline (n = 196). We found a positive predictive value of 85.4%, a negative predictive value of 83.8%, sensitivity of 63.1%, and specificity of 94.7%.Conclusions: We have demonstrated that our pipeline can extract smoking behaviors from unannotated EHR notes when the information is present. This information is reliable enough to identify patients most likely to be eligible for smoking related services. Ensuring capture of smoking information during clinical encounters should continue to be a high priority. [ABSTRACT FROM AUTHOR]- Published
- 2019
- Full Text
- View/download PDF
30. Building a tobacco user registry by extracting multiple smoking behaviors from clinical notes.
- Author
-
Palmer, Ellen L., Hassanpour, Saeed, Higgins, John, Doherty, Jennifer A., and Onega, Tracy
- Subjects
CLINICAL trial registries ,ELECTRONIC health records ,MEDICAL protocols ,TOBACCO ,MEDICAL informatics ,LOCAL history - Abstract
Background: Usage of structured fields in Electronic Health Records (EHRs) to ascertain smoking history is important but fails in capturing the nuances of smoking behaviors. Knowledge of smoking behaviors, such as pack year history and most recent cessation date, allows care providers to select the best care plan for patients at risk of smoking attributable diseases.Methods: We developed and evaluated a health informatics pipeline for identifying complete smoking history from clinical notes in EHRs. We utilized 758 patient-visit notes (from visits between 03/28/2016 and 04/04/2016) from our local EHR in addition to a public dataset of 502 clinical notes from the 2006 i2b2 Challenge to assess the performance of this pipeline. We used a machine-learning classifier to extract smoking status and a comprehensive set of text processing regular expressions to extract pack years and cessation date information from these clinical notes.Results: We identified smoking status with an F1 score of 0.90 on both the i2b2 and local data sets. Regular expression identification of pack year history in the local test set was 91.7% sensitive and 95.2% specific, but due to variable context the pack year extraction was incomplete in 25% of cases, extracting packs per day or years smoked only. Regular expression identification of cessation date was 63.2% sensitive and 94.6% specific.Conclusions: Our work indicates that the development of an EHR-based Smokers' Registry containing information relating to smoking behaviors, not just status, from free-text clinical notes using an informatics pipeline is feasible. This pipeline is capable of functioning in external EHRs, reducing the amount of time and money needed at the institute-level to create a Smokers' Registry for improved identification of patient risk and eligibility for preventative and early detection services. [ABSTRACT FROM AUTHOR]- Published
- 2019
- Full Text
- View/download PDF
31. Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks.
- Author
-
Wei, Jason W., Tafe, Laura J., Linnik, Yevgeniy A., Vaickus, Louis J., Tomita, Naofumi, and Hassanpour, Saeed
- Abstract
Classification of histologic patterns in lung adenocarcinoma is critical for determining tumor grade and treatment for patients. However, this task is often challenging due to the heterogeneous nature of lung adenocarcinoma and the subjective criteria for evaluation. In this study, we propose a deep learning model that automatically classifies the histologic patterns of lung adenocarcinoma on surgical resection slides. Our model uses a convolutional neural network to identify regions of neoplastic cells, then aggregates those classifications to infer predominant and minor histologic patterns for any given whole-slide image. We evaluated our model on an independent set of 143 whole-slide images. It achieved a kappa score of 0.525 and an agreement of 66.6% with three pathologists for classifying the predominant patterns, slightly higher than the inter-pathologist kappa score of 0.485 and agreement of 62.7% on this test set. All evaluation metrics for our model and the three pathologists were within 95% confidence intervals of agreement. If confirmed in clinical practice, our model can assist pathologists in improving classification of lung adenocarcinoma patterns by automatically pre-screening and highlighting cancerous regions prior to review. Our approach can be generalized to any whole-slide image classification task, and code is made publicly available at https://github.com/BMIRDS/deepslide. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
32. Automated detection of nonmelanoma skin cancer using digital images: a systematic review.
- Author
-
Marka, Arthur, Carter, Joi B., Toto, Ermal, and Hassanpour, Saeed
- Subjects
SKIN cancer diagnosis ,DIGITAL image processing ,MACHINE learning ,BASAL cell carcinoma ,HISTOPATHOLOGY - Abstract
Background: Computer-aided diagnosis of skin lesions is a growing area of research, but its application to nonmelanoma skin cancer (NMSC) is relatively under-studied. The purpose of this review is to synthesize the research that has been conducted on automated detection of NMSC using digital images and to assess the quality of evidence for the diagnostic accuracy of these technologies. Methods: Eight databases (PubMed, Google Scholar, Embase, IEEE Xplore, Web of Science, SpringerLink, ScienceDirect, and the ACM Digital Library) were searched to identify diagnostic studies of NMSC using image-based machine learning models. Two reviewers independently screened eligible articles. The level of evidence of each study was evaluated using a five tier rating system, and the applicability and risk of bias of each study was assessed using the Quality Assessment of Diagnostic Accuracy Studies tool. Results: Thirty-nine studies were reviewed. Twenty-four models were designed to detect basal cell carcinoma, two were designed to detect squamous cell carcinoma, and thirteen were designed to detect both. All studies were conducted in silico. The overall diagnostic accuracy of the classifiers, defined as concordance with histopathologic diagnosis, was high, with reported accuracies ranging from 72 to 100% and areas under the receiver operating characteristic curve ranging from 0.832 to 1. Most studies had substantial methodological limitations, but several were robustly designed and presented a high level of evidence. Conclusion: Most studies of image-based NMSC classifiers report performance greater than or equal to the reported diagnostic accuracy of the average dermatologist, but relatively few studies have presented a high level of evidence. Clinical studies are needed to assess whether these technologies can feasibly be implemented as a real-time aid for clinical diagnosis of NMSC. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
33. Automated detection of celiac disease on duodenal biopsy slides: A deep learning approach.
- Author
-
Wei, Jason, Wei, Jerry, Jackson, Christopher, Ren, Bing, Suriawinata, Arief, and Hassanpour, Saeed
- Subjects
DEEP learning ,CELIAC disease ,DUODENAL diseases ,RECEIVER operating characteristic curves ,BIOPSY - Abstract
Context: Celiac disease (CD) prevalence and diagnosis have increased substantially in recent years. The current gold standard for CD confirmation is visual examination of duodenal mucosal biopsies. An accurate computer-aided biopsy analysis system using deep learning can help pathologists diagnose CD more efficiently. Subjects and Methods: In this study, we trained a deep learning model to detect CD on duodenal biopsy images. Our model uses a state-of-the-art residual convolutional neural network to evaluate patches of duodenal tissue and then aggregates those predictions for whole-slide classification. We tested the model on an independent set of 212 images and evaluated its classification results against reference standards established by pathologists. Results: Our model identified CD, normal tissue, and nonspecific duodenitis with accuracies of 95.3%, 91.0%, and 89.2%, respectively. The area under the receiver operating characteristic curve was >0.95 for all classes. Conclusions: We have developed an automated biopsy analysis system that achieves high performance in detecting CD on biopsy slides. Our system can highlight areas of interest and provide preliminary classification of duodenal biopsies before review by pathologists. This technology has great potential for improving the accuracy and efficiency of CD diagnosis. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
34. An intelligent interface for rule elicitation.
- Author
-
Hassanpour, Saeed and Das, Amar K.
- Published
- 2011
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.