1,832 results on '"Unsupervised Clustering"'
Search Results
2. Utilizing structural MRI and unsupervised clustering to differentiate schizophrenia and Alzheimer's disease in late-onset psychosis
- Author
-
Hojjati, Seyed Hani, Chen, Kewei, Chiang, Gloria C., Kuceyeski, Amy, Wang, Xiuyuan H., Razlighi, Qolamreza R., Pahlajani, Silky, Glodzik, Lidia, Tanzi, Emily B., Reinhardt, Michael, and Butler, Tracy A.
- Published
- 2025
- Full Text
- View/download PDF
3. Just a little human intelligence feedback! Unsupervised learning assisted supervised learning data poisoning based backdoor removal
- Author
-
Luo, Ting, Peng, Huaibing, Fu, Anmin, Yang, Wei, Pang, Lihui, Al-Sarawi, Said F., Abbott, Derek, and Gao, Yansong
- Published
- 2025
- Full Text
- View/download PDF
4. Prediction of submarine soil dredging difficulty scale in cutter suction dredger construction with clustering-based deep learning
- Author
-
Chen, Yong, Ren, Qiubing, Li, Mingchao, Tian, Huijing, Qin, Liang, and Wu, Dianchun
- Published
- 2025
- Full Text
- View/download PDF
5. A novel online sensing approach for monitoring micro-defect and damage mode during the plastic deformation of metal matrix composites: Experiment and crystal plasticity analysis
- Author
-
Tang, Xuefeng, He, Chuanyue, Wang, Xinyun, Hu, Feifei, Deng, Lei, Xie, Jianxin, and Fu, M.W.
- Published
- 2025
- Full Text
- View/download PDF
6. Latent personality profiles of analog astronauts: An unsupervised clustering method analysis
- Author
-
Gonzalez-Torre, Sara, Rasero, Adrian, Diaz-Artiles, Ana, Ramallo, Miguel A., and de la Torre, Gabriel G.
- Published
- 2024
- Full Text
- View/download PDF
7. Identification of adipocyte infiltration-related gene subtypes for predicting colorectal cancer prognosis and responses of immunotherapy/chemotherapy
- Author
-
Fu, Daan, Zhang, Tianhao, Liu, Jia, Chang, Bingcheng, Zhang, Qingqing, Tan, Yuyan, Chen, Xiangdong, and Tan, Lulu
- Published
- 2024
- Full Text
- View/download PDF
8. Identification of clear cell renal cell carcinoma subtypes by integrating radiomics and transcriptomics
- Author
-
Gao, Ruizhi, Pang, Jinshu, Lin, Peng, Wen, Rong, Wen, Dongyue, Liang, Yiqiong, Ma, Zhen, Liang, Li, He, Yun, and Yang, Hong
- Published
- 2024
- Full Text
- View/download PDF
9. An adaptive mixture prior in Bayesian convolutional autoencoder for early detecting anomalous degradation behaviors in lithium-ion batteries
- Author
-
Chae, Sun Geu and Bae, Suk Joo
- Published
- 2025
- Full Text
- View/download PDF
10. Identification of circadian rhythm-related gene classification patterns and immune infiltration analysis in heart failure based on machine learning
- Author
-
Wang, Xuefu, Rao, Jin, Zhang, Li, Liu, Xuwen, and Zhang, Yufeng
- Published
- 2024
- Full Text
- View/download PDF
11. Melt pool instability in surface polishing by laser remelting: preliminary analysis and online monitoring with K-means clustering
- Author
-
Cvijanovic, Srdjan, Bordatchev, Evgueni V., and Tutunea-Fatan, O. Remus
- Published
- 2024
- Full Text
- View/download PDF
12. Developing an integrated POC spectrophotometric device for discrimination and determination of opioids based on gold nanoparticles
- Author
-
Farahani, Ali, Azimi, Shamim, and Azimi, Minoo
- Published
- 2022
- Full Text
- View/download PDF
13. Multimodal Imaging Unveils the Impact of Nanotopography on Cellular Metabolic Activities
- Author
-
Li, Zhi, Sarikhani, Einollah, Prayotamornkul, Sirasit, Meganathan, Dhivya Pushpa, Jahed, Zeinab, and Shi, Lingyan
- Subjects
Biochemistry and Cell Biology ,Biological Sciences ,Biotechnology ,Regenerative Medicine ,Biomedical Imaging ,Bioengineering ,2.1 Biological and endogenous factors ,1.1 Normal biological development and functioning ,Generic health relevance ,Nanotopography ,Nanopillar ,Cell metabolism ,Metabolic dynamics ,Multimodal imaging ,Multivariateanalysis ,Unsupervised clustering - Abstract
Nanoscale surface topography is an effective approach in modulating cell-material interactions, significantly impacting cellular and nuclear morphologies, as well as their functionality. However, the adaptive changes in cellular metabolism induced by the mechanical and geometrical microenvironment of the nanotopography remain poorly understood. In this study, we investigated the metabolic activities in cells cultured on engineered nanopillar substrates by using a label-free multimodal optical imaging platform. This multimodal imaging platform, integrating two photon fluorescence (TPF) and stimulated Raman scattering (SRS) microscopy, allowed us to directly visualize and quantify metabolic activities of cells in 3D at the subcellular scale. We discovered that the nanopillar structure significantly reduced the cell spreading area and circularity compared to flat surfaces. Nanopillar-induced mechanical cues significantly modulate cellular metabolic activities with variations in nanopillar geometry further influencing these metabolic processes. Cells cultured on nanopillars exhibited reduced oxidative stress, decreased protein and lipid synthesis, and lower lipid unsaturation in comparison to those on flat substrates. Hierarchical clustering also revealed that pitch differences in the nanopillar had a more significant impact on cell metabolic activity than diameter variations. These insights improve our understanding of how engineered nanotopographies can be used to control cellular metabolism, offering possibilities for designing advanced cell culture platforms which can modulate cell behaviors and mimic natural cellular environment and optimize cell-based applications. By leveraging the unique metabolic effects of nanopillar arrays, one can develop more effective strategies for directing the fate of cells, enhancing the performance of cell-based therapies, and creating regenerative medicine applications.
- Published
- 2024
14. Comprehensive evaluation and practical guideline of gating methods for high-dimensional cytometry data: manual gating, unsupervised clustering, and auto-gating.
- Author
-
Liu, Peng, Pan, Yuchen, Chang, Hung-Ching, Wang, Wenjia, Fang, Yusi, Xue, Xiangning, Zou, Jian, Toothaker, Jessica, Olaloye, Oluwabunmi, Santiago, Eduardo, McCourt, Black, Mitsialis, Vanessa, Presicce, Pietro, Kallapur, Suhas, Snapper, Scott, Liu, Jia-Jun, Tseng, George, Konnikova, Liza, and Liu, Silvia
- Subjects
auto-gating ,cytometry ,manual gating ,unsupervised clustering ,Flow Cytometry ,Humans ,Cluster Analysis ,Algorithms ,Single-Cell Analysis ,Computational Biology ,Animals - Abstract
Cytometry is an advanced technique for simultaneously identifying and quantifying many cell surface and intracellular proteins at a single-cell resolution. Analyzing high-dimensional cytometry data involves identifying and quantifying cell populations based on their marker expressions. This study provided a quantitative review and comparison of various ways to phenotype cellular populations within the cytometry data, including manual gating, unsupervised clustering, and supervised auto-gating. Six datasets from diverse species and sample types were included in the study, and manual gating with two hierarchical layers was used as the truth for evaluation. For manual gating, results from five researchers were compared to illustrate the gating consistency among different raters. For unsupervised clustering, 23 tools were quantitatively compared in terms of accuracy with the truth and computing cost. While no method outperformed all others, several tools, including PAC-MAN, CCAST, FlowSOM, flowClust, and DEPECHE, generally demonstrated strong performance. For supervised auto-gating methods, four algorithms were evaluated, where DeepCyTOF and CyTOF Linear Classifier performed the best. We further provided practical recommendations on prioritizing gating methods based on different application scenarios. This study offers comprehensive insights for biologists to understand diverse gating methods and choose the best-suited ones for their applications.
- Published
- 2024
15. Decoding Heterogeneity in Quadruple-Negative Breast Cancer: A Data-driven Clustering Approach
- Author
-
Sahoo, Bikram, Jinna, Nikita, Rida, Padmashree, Pinnix, Zandra, Zelikovsky, Alex, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Bansal, Mukul S., editor, Chen, Wei, editor, Khudyakov, Yury, editor, Măndoiu, Ion I., editor, Moussa, Marmar R., editor, Patterson, Murray, editor, Rajasekaran, Sanguthevar, editor, Skums, Pavel, editor, Thankachan, Sharma V., editor, and Zelikovsky, Alexander, editor
- Published
- 2025
- Full Text
- View/download PDF
16. Splicing Localization in Digital Images Through Agglomerative Clustering on Optimized Feature Sets with Zero Training Data Dependency
- Author
-
Das, Debjit, Naskar, Ruchira, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
17. Hajj Crowd Control for Better Management of Pilgrims
- Author
-
Amit, Prince, Sinha, Ambika, Kumar, Himanshu, di Prisco, Marco, Series Editor, Chen, Sheng-Hong, Series Editor, Vayas, Ioannis, Series Editor, Kumar Shukla, Sanjay, Series Editor, Sharma, Anuj, Series Editor, Kumar, Nagesh, Series Editor, Wang, Chien Ming, Series Editor, Cui, Zhen-Dong, Series Editor, Lu, Xinzheng, Series Editor, Verma, Amit Kumar, editor, Singh, T. N., editor, Mohamad, Edy Tonnizam, editor, Mishra, A. K., editor, Gamage, Ranjith Pathegama, editor, Bhatawdekar, Ramesh, editor, and Wilkinson, Stephen, editor
- Published
- 2025
- Full Text
- View/download PDF
18. Clustering Performance of an Evolutionary K-Means Algorithm
- Author
-
Nigro, Libero, Cicirelli, Franco, Pupo, Francesco, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Yang, Xin-She, editor, Sherratt, R. Simon, editor, Dey, Nilanjan, editor, and Joshi, Amit, editor
- Published
- 2025
- Full Text
- View/download PDF
19. Deep learning powered single-cell clustering framework with enhanced accuracy and stability.
- Author
-
Zhang, Yi, Feng, Xi, Wang, Yin, and Shi, Kai
- Subjects
- *
REPRESENTATIONS of graphs , *RNA sequencing , *SOURCE code , *CELL populations , *SCALABILITY - Abstract
Single-cell RNA sequencing (scRNA-seq) has revolutionized the field of cellular diversity research. Unsupervised clustering, a key technique in this exploration, allows for the identification of distinct cell types within a population. Graph-based deep clustering methods have shown promise in preserving the structural relationships between cells (nodes) within the data. However, these methods often neglect the inherent distribution of nodes in the graph, leading to incomplete representations of cell populations. Additionally, conventional graph convolutional networks (GCNs) can suffer from oversmoothing, a phenomenon where the network loses the ability to differentiate between samples with similar expression profiles. To address these limitations, we proposed scG-cluster, an innovative deep structural clustering method. This method incorporates two key innovations: (1) Dual-topology adjacency graph: scG-cluster integrates information about node distribution into the traditional adjacency graph used by GCNs. This enriches the graph representation by capturing the spatial relationships between cells in addition to their pairwise similarities. (2) Dual-topology adaptive graph convolutional network (TAGCN): The framework employs a TAGCN architecture with residual concatenation. This network utilizes an attention mechanism to dynamically weight features within the graph, focusing on the most informative aspects for clustering. Additionally, residual connections are implemented to combat oversmoothing, ensuring the network retains the ability to distinguish between subtle differences in cell expression profiles. Furthermore, scG-cluster iteratively refines the clustering centers, leading to enhanced stability and accuracy in the final cluster assignments. Extensive evaluations on six diverse scRNA-seq datasets demonstrate that scG-cluster consistently outperforms existing state-of-the-art methods in terms of both clustering accuracy and scalability. Ablation studies are also conducted to validate the significant contributions of both the residual connections and the attention mechanism to the overall performance of the model. The source code for scG-cluster is publicly available at https://github.com/xixi-wq/scG-cluster. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
20. High-parameter immunophenotyping reveals distinct immune cell profiles in pruritic dogs and cats.
- Author
-
McDonald, Erin, Kehoe, Eric, Deines, Darcy, McCarthy, Mary, Wright, Brie, and Huse, Susan
- Subjects
MACHINE learning ,CELL populations ,STATISTICAL learning ,ARTIFICIAL intelligence ,FLOW cytometry - Abstract
Introduction: Immunophenotyping is a powerful tool for grading disease severity, aiding in diagnosis, predicting clinical response, and guiding the development of novel therapeutics. Methods: This pilot study employs high parameter immunophenotyping panels (15 markers for dog, 12 for cat) and leverages unsupervised clustering to identify immune cell populations. Our analysis uses machine learning and statistical algorithms to perform unsupervised clustering, multiple visualizations, and statistical analysis of high parameter flow cytometry data. This method reduces user bias and precisely identifies cell populations, demonstrating its potential to detect variations and differentiate populations effectively. To enhance our understanding of cat and dog biology and test the unsupervised clustering approach on real-world samples, we performed in-depth profiling of immune cell populations in blood collected from client-owned and laboratory animals [dogs (n = 55) and cats (n = 68)]. These animals were categorized based on pruritic behavior or routine check-ups (non-pruritic controls). Results: Unsupervised clustering revealed various immune cell populations, including T-cell subsets distinguished by CD62L expression and distinct monocyte subsets. Notably, there were significant differences in monocyte subsets between pruritic and non-pruritic animals. Pruritic dogs and cats showed significant shifts in CD62LHi T-cell subsets compared to non-pruritic controls, with opposite trends observed between pruritic cats and dogs. Discussion: These findings underscore the importance of advancing veterinary immunophenotyping, expanding our knowledge about marker expression on circulating immune cells and driving progress in understanding veterinary-specific biology and uncovering new insights into various conditions and diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
21. Unsupervised clustering reveals noncanonical myeloid cell subsets in the brain tumor microenvironment.
- Author
-
Hermelo, Ismaïl, Virtanen, Tuomo, Salonen, Iida, Nätkin, Reetta, Keitaanniemi, Sofia, Tiihonen, Aliisa M., Lehtipuro, Suvi, Kummola, Laura, Raulamo, Ella, Nordfors, Kristiina, Haapasalo, Hannu, Rauhala, Minna, Kesseli, Juha, Nykter, Matti, Haapasalo, Joonas, and Rautajoki, Kirsi
- Subjects
- *
MYELOID cells , *LYMPHOCYTE subsets , *MEDICAL sciences , *MOSAICISM , *T cells - Abstract
The tumor immune microenvironment (TiME) of human central nervous system (CNS) tumors remains to be comprehensively deciphered. Here, we employed flow cytometry and RNA sequencing analysis for a deep data-driven dissection of a diverse TiME and to uncover noncanonical immune cell types in human CNS tumors by using seven tumors from five patients. Myeloid subsets comprised classical microglia, monocyte-derived macrophages, neutrophils, and two noncanonical myeloid subsets: CD3+ myeloids and CD19+ myeloids. T lymphocyte subsets included double-negative (CD4− CD8−) T cells (DNTs). Noncanonical myeloids and DNTs were explored on independent datasets, suggesting that our DNT phenotype represents γδ T cells. Noncanonical myeloids were validated using orthogonal methods across 73 patients from three independent datasets. While the proportions of classical myeloids agreed with reported malignancy type-associated TiMEs, unexpectedly high lymphocyte frequencies were detected in gliosarcoma, which also showed a unique expression pattern of immune-related genes. Our findings highlight the potential of data-driven approaches in resolving CNS TiME to reveal the mosaic of immune cell types constituting TiME, warranting the need for future studies on the nonclassical immune cell subsets. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
22. Self Organizing Map based Land Cover Clustering for Decision-Level Jaccard Index and Block Activity based Pan-Sharpened Images.
- Author
-
Jayashree, S., Maya, Karki V., Indira, K., and Dinesh, P. A.
- Abstract
Pan-sharpening is very often employed in remote sensing to transform low-resolution multispectral (LMS) images into equivalent high-resolution multispectral images (HMS). Images resulting from pan-sharpening are sharper and more detailed that is resulted by improving spatial features of the multispectral image. One such approach of jointly processing LMS and Panchromatic images is discussed in the present study. The decision-level fusion suggested here involves choosing or combining details from numerous sources by taking decisions while analyzing features recovered from the input images. The proposed methodology is an amalgamation of principal component analysis used for separating spatial and spectral details of LMS, non-subsampled contourlet transform for feature analysis, and Jaccard similarity index and block activity measurement for localized decision-based fusion. The algorithm tries to provide an adaptive approach to address the trade-off between spectral and spatial resolution. Self-Organizing Maps based clustering technique is employed with the intension of grouping the image pixels into three categories soil, water and vegetation. The paper highlights the performance comparison of proposed method with various pixel-level fusion techniques ranging from techniques from Intensity, Hue and Saturation (IHS) transform to Neural Networks based pan-sharpening methods. This comparison is implemented using various reference and non-reference indicators along with Kolmogorov–Smirnov test. Additional analysis using Kolmogorov–Smirnov test is done to statistically analyze spectral degradation. The comparative analysis provides enough evidence that the suggested method yields fused images with enhanced edge details without forgoing the spectral features which was also evident from the mutual information obtained from clustered images. The resulting sharpened images tend to possess good spatial and spectral details that would simplify the automatic image analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
23. Identifying time patterns in Huntington's disease trajectories using dynamic time warping-based clustering on multi-modal data.
- Author
-
Giannoula, Alexia, De Paepe, Audrey E., Sanz, Ferran, Furlong, Laura I., and Camara, Estela
- Subjects
- *
HUNTINGTON disease , *INFORMATION storage & retrieval systems , *ELECTRONIC health records , *ARTIFICIAL intelligence , *INDIVIDUALIZED medicine - Abstract
One of the principal goals of Precision Medicine is to stratify patients by accounting for individual variability. However, extracting meaningful information from Real-World Data, such as Electronic Health Records, still remains challenging due to methodological and computational issues. A Dynamic Time Warping-based unsupervised-clustering methodology is presented in this paper for the clustering of patient trajectories of multi-modal health data on the basis of shared temporal characteristics. Building on an earlier methodology, a new dimension of time-varying clinical and imaging features is incorporated, through an adapted cost-minimization algorithm for clustering on different, possibly overlapping, feature subsets. The model disease chosen is Huntington's disease (HD), characterized by progressive neurodegeneration. From a wide range of examined user-defined parameters, four case examples are highlighted to demonstrate the identified temporal patterns in multi-modal HD trajectories and to study how these differ due to the combined effects of feature weights and granularity threshold. For each identified cluster, polynomial fits that describe the time behavior of the assessed features are provided for an informative comparison, together with their averaged values. The proposed data-mining methodology permits the stratification of distinct time patterns of multi-modal health data in individuals that share a diagnosis, by employing user-customized criteria beyond the current clinical practice. Overall, this work bears implications for better analysis of individual variability in disease progression, opening doors to personalized preventative, diagnostic and therapeutic strategies. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
24. The influence of social interactions in mitigating psychological distress during the COVID−19 pandemic: a study in Sri Lanka.
- Author
-
Thilakasiri, Isuru, Fonseka, Tharaka, Mapa, Isuri, Godaliyadda, Roshan, Herath, Vijitha, Thowfeek, Ramila, Rathnayake, Anuruddhika, Ekanayake, Parakrama, and Ekanayake, Janaka
- Subjects
MENTAL health policy ,SOCIAL interaction ,COVID-19 pandemic ,SOCIAL influence ,SOCIAL groups ,LONELINESS - Abstract
Massive changes in many aspects related to social groups of different socioeconomic backgrounds were caused by the COVID-19 pandemic and as a result, the overall state of mental health was severely affected globally. This study examined how the pandemic affected Sri Lankan citizens representing a range of socioeconomic backgrounds in terms of their mental health. The data used in this research was gathered from 3,020 households using a nationwide face-to-face survey, from which a processed dataset of 921 responses was considered for the final analysis. Four distinct factors were identified by factor analysis (FA) that was conducted and subsequently, the population was clustered using unsupervised clustering to determine which population subgroups were affected similarly. Two such subgroups were identified where the respective relationships to the retrieved principal factors and their demographics were thoroughly examined and interpreted. This resulted in the identification of contrasting perspectives between the two groups toward the maintenance and the state of social relationships during the pandemic, which revealed that one group was more "socially connected" in nature resulting in their mental state being comparatively better in coping with the pandemic. The other group was seen to be more "socially reserved" showing an opposite reaction toward social connections while their mental well-being declined showing symptoms such as loneliness, and emptiness in response to the pandemic. The study examined the role of social media, and it was observed that social media was perceived as a substitute for the lack of social connections or primarily used as a coping mechanism in response to the challenges of the pandemic and results show that maintaining social connections physically or via online rather than the use of social media has helped one group over the other in decreasing their symptoms such as emptiness, loneliness and fear of death. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
25. Renewable Energy Community Sizing Based on Stochastic Optimization and Unsupervised Clustering.
- Author
-
Budin, Luka and Delimar, Marko
- Abstract
Renewable Energy Communities (RECs) are emerging as significant in the global paradigm shift towards a smart and sustainable energy environment. By empowering energy consumers to actively participate in local energy generation, and sharing, using renewable energy sources, energy storage, and flexible loads, REC participants can reduce costs, and also contribute to low-carbon objectives, providing the flexibility needed to address modern smart grid challenges. This article presents a mixed integer linear programming model for optimal sizing of the solar PVs and battery energy storage systems (BESS) of REC participants who engage in P2P energy exchange. The model is formulated using a two-stage stochastic optimization to address load and PV uncertainty, and unsupervised clustering to structure the data for the stochastic optimization process. The model enables sizing solar PVs for different rooftop geometries and the objective function includes comprehensively defined electricity, operational, and scaled investment costs for solar PV and BESS, where economic fairness constraints are analyzed and implemented. The model is validated on real solar and atmospheric measured data from Zagreb, Croatia, and publicly available household consumption data from Northern Germany. The article also analyzes how tariff models, and electricity prices affect PV and BESS sizes, cost reductions, and P2P energy exchange for different REC participants with varying consumption and production profiles. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
26. A meta-analysis of data-driven cognitive subgroups in bipolar disorder.
- Author
-
Bora, E
- Subjects
- *
EXECUTIVE function , *COGNITIVE processing speed , *COGNITION , *VISUAL memory , *VERBAL memory - Abstract
• There is consistent evidence for the existence of 3 cognitive subgroups in BD • Good-performance subgroup is characterized by better executive functions and normal functioning in all other domains. • Severe-impairment subgroup is associated with more severe course of illness and higher ratio of antipsychotic use. • Severe-impairment subgroup is also chaarctereized by lower educational attainment and later-onset of illness. • The characteristics of the moderate-impairment subgroup was lying between the other two subgroups for most of the measures. The delineation of cognitive subgroups of bipolar disorder (BD) might be helpful for identifying biologically valid subtypes of this disorder. This meta-analysis identified peer-reviewed literature on studies investigating cognitive subgroups of BD with data-driven clustering methods. Relevant studies were searched in PubMed, Scopus, and Web of Science. Random-effects meta-analysis was performed using R software. A total of 14 cross-sectional studies including euthymic or mildly symptomatic patients with BD were included in the current meta-analysis. The available studies have consistently supported a 3-cluster solution. The pooled prevalence of the severe-impairment, moderate-impairment, and major good-functioning groups were 23.1 % (95%CI, 18.5 %–27.7 %), 42.5 % (95%CI, 36.3 %–48.8 %), and 33.5 % (95%CI, 25.9 %–41.1 %) respectively. Compared to healthy controls, both the severe-impairment (g=−1.40 to −1.73) and moderate-impairment groups (g=−0.59 to −0.96) had significant deficits in all six cognitive domains (verbal memory, visual memory, executive functions, working memory, attention and processing speed). The good-performance subgroup had a small increase in the performance of executive functions (g=0.23) and normal functioning in all other domains. Compared to the good-performance subgroup, the severe-impairment subgroup was characterized by more severe functional impairment, more hospital admissions, a higher percentage of type I BD and antipsychotic use. The characteristics of the moderate-impairment subgroup were lying between the other two subgroups for most of the measures. The current findings support the existence of 3 cognitive subgroups in BD including severe-impairment and moderate-impairment groups which are associated with a more severe course of illness. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
27. Anchor Clustering for million-scale immune repertoire sequencing data.
- Author
-
Chang, Haiyang, Ashlock, Daniel, Graether, Steffen, and Keller, Stefan
- Subjects
Clonal relationship ,Immune repertoire ,Lymphocyte antigen receptor ,Unsupervised clustering ,Algorithms ,Cluster Analysis ,Receptors ,Antigen - Abstract
BACKGROUND: The clustering of immune repertoire data is challenging due to the computational cost associated with a very large number of pairwise sequence comparisons. To overcome this limitation, we developed Anchor Clustering, an unsupervised clustering method designed to identify similar sequences from millions of antigen receptor gene sequences. First, a Point Packing algorithm is used to identify a set of maximally spaced anchor sequences. Then, the genetic distance of the remaining sequences to all anchor sequences is calculated and transformed into distance vectors. Finally, distance vectors are clustered using unsupervised clustering. This process is repeated iteratively until the resulting clusters are small enough so that pairwise distance comparisons can be performed. RESULTS: Our results demonstrate that Anchor Clustering is faster than existing pairwise comparison clustering methods while providing similar clustering quality. With its flexible, memory-saving strategy, Anchor Clustering is capable of clustering millions of antigen receptor gene sequences in just a few minutes. CONCLUSIONS: This method enables the meta-analysis of immune-repertoire data from different studies and could contribute to a more comprehensive understanding of the immune repertoire data space.
- Published
- 2024
28. Comprehensive analysis of the relationship between RNA modification writers and immune microenvironment in head and neck squamous cell carcinoma
- Author
-
Wei Li, Ying Chen, Yao Zhang, Wen Wen, and Yingying Lu
- Subjects
RNA modification writers ,HNSCC ,Tumor microenvironment ,Unsupervised clustering ,Immunotherapy ,Immunologic diseases. Allergy ,RC581-607 - Abstract
Abstract Objectives Head and neck squamous cell carcinoma (HNSCC) is the sixth most common cancer worldwide. Four types of RNA modification writers (m6A, m1A, A-I editing, and APA) are widely involved in tumorigenesis and the TME. We aimed to comprehensively explore the role of the four RNA modification writers in the progression and immune microenvironment of HNSCC. Materials and methods We first obtained transcription profile data and transcriptional variation of the four types of RNA modification writers from The Cancer Genome Atlas (TCGA) database. HNSCC patients in TCGA dataset were divided into different clusters based on the four types of RNA modification writers. Univariate Cox and Least absolute shrinkage and selection operator (LASSO) analyses were performed to conduct a Writer-score scoring system, which was successfully verified in the GSE65858 dataset and our clinical sample dataset. Finally, we evaluated the relationship between different RNA modification clusters (Writer-score) and immunological characteristics of HNSCC. Results Two different RNA modification clusters (A and B) were obtained. These RNA modification clusters (Writer-score) were strongly associated with immunological characteristics (immunomodulators, cancer immunity cycles, infiltrating immune cells (TIICs), inhibitory immune checkpoints, and T cell inflamed score (TIS)) of HNSCC. Conclusions This study identified two different RNA modification clusters and explored the potential relationship between RNA modification clusters (Writer-score) and immunological characteristics, offering a new theoretical basis for precision immunotherapy in patients with HNSCC.
- Published
- 2024
- Full Text
- View/download PDF
29. Is Unsupervised Clustering Somehow Truer?
- Author
-
Søgaard, Anders
- Abstract
Scientists increasingly approach the world through machine learning techniques, but philosophers of science often question their epistemic status. Some philosophers have argued that the use of unsupervised clustering algorithms is more justified than the use of supervised classification, because supervised classification is more biased, and because (parametric) simplicity plays a different and more interesting role in unsupervised clustering. I call these arguments the No-Bias Argument and the Simplicity-Truth Argument. I show how both arguments are fallacious and how, on the contrary, the use of supervised classification is at least as justified as the use of unsupervised clustering. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Association Between FABP7‐5‐HT Pattern and Anxiety or Depression in Patients With Psoriasis: A Cross‐Sectional Study.
- Author
-
Huang, Dawei, Jiang, Yuxiong, Wu, Min, Ma, Rui, Yu, Yingyuan, Zhong, Xiaoyuan, Li, Ying, Chen, Jianhua, Tan, Fei, Lu, Jiajing, and Shi, Yuling
- Subjects
- *
HYPERTENSION epidemiology , *MENTAL depression risk factors , *RISK assessment , *CROSS-sectional method , *PSORIASIS , *RESEARCH funding , *BODY mass index , *MULTIPLE regression analysis , *ANXIETY , *DESCRIPTIVE statistics , *SEVERITY of illness index , *MULTIVARIATE analysis , *FATTY acid-binding proteins , *SEROTONIN , *COMPARATIVE studies , *BIOMARKERS , *SENSITIVITY & specificity (Statistics) , *DISEASE incidence , *MENTAL depression , *EVALUATION - Abstract
Psoriasis exhibits a higher incidence of anxiety and depression. However, the diagnostic process heavily relies on subjective evaluation. Fatty acid‐binding protein 7 (FABP7) and serotonin (5‐HT) are considered as potential plasma biomarkers. We aimed to investigate the potentiality of plasma FABP7 and 5‐HT as biomarkers for predicting anxiety and depression in psoriasis. Data were analysed from 140 patients with psoriasis in the Shanghai Psoriasis Effectiveness Evaluation CoHort (SPEECH). Unsupervised clustering was employed to group patients based on their FABP7 and 5‐HT profiles. Subsequently, patients were categorised into Group 1 (lower FABP7 and higher 5‐HT) or Group 2. Multivariate logistic regression was employed to investigate the correlation between the FABP7‐5‐HT pattern and anxiety or depression in psoriasis patients. Patients with psoriasis have a higher incidence of anxiety or depression, as well as higher levels of FABP7 and lower levels of 5‐HT. After clustering patients using K‐means clustering, Group 2 showed a higher body mass index, a higher incidence of hypertension, more severe psoriasis, and more significant anxiety and depression compared to Group 1. Multivariate logistic regression shows that adjusting for covariates except PASI, duration of psoriasis, and psoriatic arthritis, Group 2 had a higher risk of anxiety and depression compared to Group 1. Further adjustment for covariates yielded similar results. Pattern of FABP7‐5‐HT that may indicate an association with psoriasis accompanied by anxiety or depression. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. scRNA-seq revealed transcriptional signatures of human umbilical cord primitive stem cells and their germ lineage origin regulated by imprinted genes.
- Author
-
Jarczak, Justyna, Bujko, Kamila, Ratajczak, Mariusz Z., and Kucia, Magdalena
- Subjects
- *
POLYCOMB group proteins , *CORD blood , *EMBRYONIC stem cells , *CD45 antigen , *GENE expression , *HOMEOBOX genes , *PURINERGIC receptors - Abstract
A population of CD133+lin-CD45- and CD34+lin-CD45- very small embryonic-like stem cells (VSELs) has been identified in postnatal human tissues, including bone marrow (BM), mobilized peripheral blood (mPB) and umbilical cord blood (UCB). Under appropriate conditions, VSELs in vitro and in vivo differentiate into tissue-committed stem cells for all three germ layers. Molecular analysis of adult murine BM-purified VSELs revealed that these rare cells deposited during development in adult tissues (i) express a similar transcriptome as embryonic stem cells, (ii) share several markers characteristic for epiblast and migratory primordial germ cells (PGCs), (iii) highly express a polycomb group protein enhancer of zeste drosophila homolog 2 (Ezh2) and finally (iv) display a unique pattern of imprinting at crucial paternally inherited genes that promotes their quiescence. Here, by employing single-cell RNA sequencing we demonstrate for the first time that purified from UCB human VSELs defined by expression of CD34 or CD133 antigens and lack of lineage markers, including CD45 antigen express similar molecular signature as murine BM-derived VSELs. Specifically, unsupervised clustering revealed numerous subpopulations of VSELs including ones i) annotated to germline compartments, ii) regulated by parental imprinting, iii) responding to early developmental fate decisions, iv) transcription factors involved in differentiation and development, including homeobox family of genes, and v) expressing innate immunity and purinergic signaling genes. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Comprehensive analysis of the relationship between RNA modification writers and immune microenvironment in head and neck squamous cell carcinoma.
- Author
-
Li, Wei, Chen, Ying, Zhang, Yao, Wen, Wen, and Lu, Yingying
- Subjects
RNA modification & restriction ,SQUAMOUS cell carcinoma ,IMMUNE checkpoint proteins ,GENETIC transcription ,T cells - Abstract
Objectives: Head and neck squamous cell carcinoma (HNSCC) is the sixth most common cancer worldwide. Four types of RNA modification writers (m6A, m1A, A-I editing, and APA) are widely involved in tumorigenesis and the TME. We aimed to comprehensively explore the role of the four RNA modification writers in the progression and immune microenvironment of HNSCC. Materials and methods: We first obtained transcription profile data and transcriptional variation of the four types of RNA modification writers from The Cancer Genome Atlas (TCGA) database. HNSCC patients in TCGA dataset were divided into different clusters based on the four types of RNA modification writers. Univariate Cox and Least absolute shrinkage and selection operator (LASSO) analyses were performed to conduct a Writer-score scoring system, which was successfully verified in the GSE65858 dataset and our clinical sample dataset. Finally, we evaluated the relationship between different RNA modification clusters (Writer-score) and immunological characteristics of HNSCC. Results: Two different RNA modification clusters (A and B) were obtained. These RNA modification clusters (Writer-score) were strongly associated with immunological characteristics (immunomodulators, cancer immunity cycles, infiltrating immune cells (TIICs), inhibitory immune checkpoints, and T cell inflamed score (TIS)) of HNSCC. Conclusions: This study identified two different RNA modification clusters and explored the potential relationship between RNA modification clusters (Writer-score) and immunological characteristics, offering a new theoretical basis for precision immunotherapy in patients with HNSCC. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Inferential Tools for Assessing Dependence Across Response Categories in Multinomial Models with Discrete Random Effects.
- Author
-
Masci, Chiara, Ieva, Francesca, and Paganoni, Anna Maria
- Subjects
- *
RANDOM effects model , *DATA structures , *DISTRIBUTION (Probability theory) , *MULTILEVEL models , *REGRESSION analysis - Abstract
We propose a discrete random effects multinomial regression model to deal with estimation and inference issues in the case of categorical and hierarchical data. Random effects are assumed to follow a discrete distribution with an a priori unknown number of support points. For a K-categories response, the modelling identifies a latent structure at the highest level of grouping, where groups are clustered into subpopulations. This model does not assume the independence across random effects relative to different response categories, and this provides an improvement from the multinomial semi-parametric multilevel model previously proposed in the literature. Since the category-specific random effects arise from the same subjects, the independence assumption is seldom verified in real data. To evaluate the improvements provided by the proposed model, we reproduce simulation and case studies of the literature, highlighting the strength of the method in properly modelling the real data structure and the advantages that taking into account the data dependence structure offers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Quantifying Hybrid Failure Modes of Unreinforced Masonry Walls through Experimental Data Analysis.
- Author
-
Asjodi, Amir Hossein, Saeidi, Sepehr, Dolatshahi, Kiarash M., and Burton, Henry V.
- Subjects
- *
FAILURE mode & effects analysis , *WALL design & construction , *MECHANICAL failures , *REGRESSION analysis , *DATABASES - Abstract
Failure mode identification in unreinforced masonry (URM) walls is a challenging task because of the presence of multiple damage mechanisms, including so-called hybrid modes. This paper employed a novel application of supervised and unsupervised learning to link URM wall design and mechanical properties to the failure mode. We used a database with 330 backbone curves from cyclically loaded URM walls as well as the associated images captured during the experiments. Based on the observations documented during the experiments, information on the cyclic curves, and the associated images, the walls were first manually classified into four failure modes: bed-joint sliding, diagonal tension, rocking, and toe crushing. Then k -means clustering was used to group the damaged walls into four classes based on critical points along the backbone curves. A hybridity index was introduced to quantify the contribution of each failure mode based on the distance from the centroid of the clusters. The design and mechanical properties of the URM walls were then used to predict the hybridity index using a multioutput regression model. The hybridity prediction model determines the contribution of the various failure modes to the ultimate behavior of a damaged URM wall. The proposed framework provides a robust approach to quantifying the relative contribution of each failure mechanism to the overall performance of the URM wall. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Identifying the Vertical Stratification of Sediment Samples by Visible and Near-Infrared Spectroscopy.
- Author
-
Fan, Pingping, Jia, Zongchao, Qiu, Huimin, Wang, Hongru, and Gao, Yang
- Subjects
- *
MACHINE learning , *MARINE sediments , *OPTICAL spectroscopy , *NEAR infrared spectroscopy , *SEDIMENTATION & deposition - Abstract
Vertical stratification in marine sediment profiles indicates physical and chemical sedimentary processes and, thus, is the first step in sedimentary research and in studying their relationship with global climate change. Traditional technologies for studying vertical stratification have low efficiency; thus, new technologies are highly needed. Recently, visible and near-infrared spectroscopy (VNIR) has been explored to rapidly determine sediment parameters, such as clay content, particle size, total carbon (TC), total nitrogen (TN), and so on. Here, we explored vertical stratification in a sediment column in the South China Sea using VNIR. The sediment column was 160 cm and divided into 160 samples by 1 cm intervals. All samples were classified into three layers by depth, that is, 0–50 cm (the upper layer), 50–100 cm (the middle layer), and 100–160 cm (the bottom layer). Concentrations of TC and TN in each sample were measured by Elementa Vario EL III. Visible and near-infrared reflectance spectra of each sample were collected by Agilent Cary 5000. A global model and several classification models for vertical stratification in sediments were established by a Support Vector Machine (SVM) after the characteristic spectra were identified using Competitive Adaptive Reweighted Sampling. In the classification models, K-means clustering and Density Peak Clustering (DPC) were employed as the unsupervised clustering algorithms. The results showed that the stratification was successful by VNIR, especially when using the combination of unsupervised clustering and machine learning algorithms. The correct classification rate (CCR) was much higher in the classification models than in the global model. And the classification models had a higher CCR using K-means combined with SVM (94.8%) and using DPC combined with SVM (96.0%). The higher CCR might be derived from the chemical classification. Indeed, similar results were also found in the chemical stratification. This study provided a theoretical basis for the rapid and synchronous measurement of chemical and physical parameters in sediment profiles by VNIR. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Exploring the role of Yuxuebi tablet in neuropathic pain with the method of similarity research of drug pharmacological effects based on unsupervised machine learning.
- Author
-
Xiao Du, Chunhui Zhao, Yujie Xi, Pengfei Lin, Huihui Liu, Shuling Wang, and Feifei Guo
- Subjects
LABORATORY rats ,CHINESE medicine ,NEURALGIA ,ANIMAL experimentation ,PAIN perception ,MACHINE learning ,ION channels - Abstract
Introduction: Having multiple pharmacological effects is a characteristic of Traditional Chinese Medicine (TCM). Currently, there is a lack of suitable methods to explore and discover modern diseases suitable for TCM treatment using this characteristic. Unsupervised machine learning technology is an efficient strategy to predict the pharmacological activity of drugs. This study takes Yuxuebi Tablet (YXB) as the research object. Using the unsupervised machine learning technology of drug cell functional fingerprint similarity research, the potential pharmacological effects of YXB were discovered and verified. Methods: LC-MS combined with the in vitro intestinal absorption method was used to identify components of YXB that could be absorbed by the intestinal tract of rats. Unsupervised learning hierarchical clustering was used to calculate the degree of similarity of cellular functional fingerprints between these components and 121 marketed Western drugs whose indications are diseases and symptoms that YXB is commonly used to treat. Then, based on the Library of Integrated Network-based Cellular Signatures database, pathway analysis was performed for selected Western drugs with high similarity in cellular functional fingerprints with the components of YXB to discover the potential pharmacological effects of YXB, which were validated by animal experiments. Results: We identified 40 intestinally absorbed components of YXB. Through predictive studies, we found that they have pharmacological effects very similar to non-steroidal anti-inflammatory drugs (NSAIDs) and corticosteroids. In addition, we found that they have very similar pharmacological effects to antineuropathic pain medications (such as gabapentin, duloxetine, and pethidine) and may inhibit the NF-B signaling pathway and biological processes related to pain perception. Therefore, YXB may have an antinociceptive effect on neuropathic pain. Finally, we demonstrated that YXB significantly reduced neuropathic pain in a rat model of sciatic nerve chronic constriction injury (CCI). Transcriptome analysis further revealed that YXB regulates the expression of multiple genes involved in nerve injury repair, signal transduction, ion channels, and inflammatory response, with key regulatory targets including Sgk1, Sst, Isl1, and Shh. Conclusion: This study successfully identified and confirmed the previously unknown pharmacological activity of YXB against neuropathic pain through unsupervised learning prediction and experimental verification. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. High-parameter immunophenotyping reveals distinct immune cell profiles in pruritic dogs and cats
- Author
-
Erin McDonald, Eric Kehoe, Darcy Deines, Mary McCarthy, Brie Wright, and Susan Huse
- Subjects
flow cytometry ,pruritis ,immunophenotyping ,machine learning ,artificial intelligence ,unsupervised clustering ,Veterinary medicine ,SF600-1100 - Abstract
IntroductionImmunophenotyping is a powerful tool for grading disease severity, aiding in diagnosis, predicting clinical response, and guiding the development of novel therapeutics.MethodsThis pilot study employs high parameter immunophenotyping panels (15 markers for dog, 12 for cat) and leverages unsupervised clustering to identify immune cell populations. Our analysis uses machine learning and statistical algorithms to perform unsupervised clustering, multiple visualizations, and statistical analysis of high parameter flow cytometry data. This method reduces user bias and precisely identifies cell populations, demonstrating its potential to detect variations and differentiate populations effectively. To enhance our understanding of cat and dog biology and test the unsupervised clustering approach on real-world samples, we performed in-depth profiling of immune cell populations in blood collected from client-owned and laboratory animals [dogs (n = 55) and cats (n = 68)]. These animals were categorized based on pruritic behavior or routine check-ups (non-pruritic controls).ResultsUnsupervised clustering revealed various immune cell populations, including T-cell subsets distinguished by CD62L expression and distinct monocyte subsets. Notably, there were significant differences in monocyte subsets between pruritic and non-pruritic animals. Pruritic dogs and cats showed significant shifts in CD62LHi T-cell subsets compared to non-pruritic controls, with opposite trends observed between pruritic cats and dogs.DiscussionThese findings underscore the importance of advancing veterinary immunophenotyping, expanding our knowledge about marker expression on circulating immune cells and driving progress in understanding veterinary-specific biology and uncovering new insights into various conditions and diseases.
- Published
- 2025
- Full Text
- View/download PDF
38. The influence of social interactions in mitigating psychological distress during the COVID−19 pandemic: a study in Sri Lanka
- Author
-
Isuru Thilakasiri, Tharaka Fonseka, Isuri Mapa, Roshan Godaliyadda, Vijitha Herath, Ramila Thowfeek, Anuruddhika Rathnayake, Parakrama Ekanayake, and Janaka Ekanayake
- Subjects
COVID-19 ,demographics ,social media ,factor analysis ,unsupervised clustering ,social interactions ,Psychology ,BF1-990 - Abstract
Massive changes in many aspects related to social groups of different socioeconomic backgrounds were caused by the COVID-19 pandemic and as a result, the overall state of mental health was severely affected globally. This study examined how the pandemic affected Sri Lankan citizens representing a range of socioeconomic backgrounds in terms of their mental health. The data used in this research was gathered from 3,020 households using a nationwide face-to-face survey, from which a processed dataset of 921 responses was considered for the final analysis. Four distinct factors were identified by factor analysis (FA) that was conducted and subsequently, the population was clustered using unsupervised clustering to determine which population subgroups were affected similarly. Two such subgroups were identified where the respective relationships to the retrieved principal factors and their demographics were thoroughly examined and interpreted. This resulted in the identification of contrasting perspectives between the two groups toward the maintenance and the state of social relationships during the pandemic, which revealed that one group was more “socially connected” in nature resulting in their mental state being comparatively better in coping with the pandemic. The other group was seen to be more “socially reserved” showing an opposite reaction toward social connections while their mental well-being declined showing symptoms such as loneliness, and emptiness in response to the pandemic. The study examined the role of social media, and it was observed that social media was perceived as a substitute for the lack of social connections or primarily used as a coping mechanism in response to the challenges of the pandemic and results show that maintaining social connections physically or via online rather than the use of social media has helped one group over the other in decreasing their symptoms such as emptiness, loneliness and fear of death.
- Published
- 2025
- Full Text
- View/download PDF
39. Creation of signatures and identification of molecular subtypes of glioblastoma based on disulfidptosis-related genes for predicting patient prognosis and immunological activity
- Author
-
Dongjun Li, Xiaodong Li, Jianfeng Lv, and Shaoyi Li
- Subjects
Glioblastoma ,Programmed cell death ,Disulfidptosis ,Unsupervised clustering ,Risk score model ,Surgery ,RD1-811 - Abstract
Background: In recent times, disulfidptosis, an intricate form of cellular demise, has garnered attention due to its impact on prognosis, tumor progression and treatment response. Nevertheless, the exact significance of disulfidptosis-related genes (DisRGs) in glioblastoma (GBM) remains enigmatic. Methods: The GEO and TCGA databases provided transcriptional and clinically relevant data on tumor samples, while the GTEx database provided data on healthy tissues. Disulfidptosis-related genes (DisRGs) were procured from previous scholarly investigations. The expression profile of DisRGs was initially scrutinized among patients diagnosed with GBM, subsequent to which their prognostic value was explored. Through consensus clustering, we constructed DisRGs-related clusters and gene subtypes. Our results established that the DisRG-related clusters had differentially expressed genes, resulting in a DisulfidptosisScore model, which had a positive prognostic value. Results: The differential expression profile of 24 DisRGs between GBM samples and healthy samples was acquired. Through consensus cluster analysis, two distinct disulfidptosis subtypes, namely DisRGcluster A and DisRGcluster B, were identified. Then, the DisulfidptosisScore model including 4 characteristic genes was constructed.Notably, patients with GBM assigned with lower score demonstrated a considerably longer overall survival (OS) compared to those with higher score. Conclusion: We have effectively devised a prognostic model associated with disulfidptosis, presenting autonomous prognostic predictions for patients with GBM. These findings serve as a valuable addition to the current comprehension of disulfidptosis and offer fresh theoretical substantiation for the development of enhanced treatment strategies.
- Published
- 2024
- Full Text
- View/download PDF
40. Creation of a Spatiotemporal Algorithm and Application to COVID-19 Data
- Author
-
Natalia Bou Sakr, Gihane Mansour, and Yahia Salhi
- Subjects
spatiotemporal data ,unsupervised clustering ,COVID-19 ,Specialties of internal medicine ,RC581-951 - Abstract
This study offers an in-depth analysis of the COVID-19 pandemic’s trajectory in several member countries of the European Union (EU) in order to assess similarities in their crisis experiences. We also examine data from the United States to facilitate a larger comparison across continents. We introduce our new approach, which uses a spatiotemporal algorithm to identify five distinct and recurring phases that each country underwent at different times during the COVID-19 pandemic. These stages include: Comfort Period, characterized by minimal COVID-19 activity and limited impacts; Preventive Situation, demonstrating the implementation of proactive measures, with relatively low numbers of cases, deaths, and Intensive Care Unit (ICU) admissions; Worrying Situation, is defined by high levels of concern and preparation as deaths and cases begin to rise and reach substantial levels; Panic Situation, marked by a high number of deaths relative to the number of cases and a rise in ICU admissions, denoting a critical and alarming period of the pandemic; and finally, Epidemic Control Situation, distinguished by limited numbers of COVID-19 deaths despite a high number of new cases. By examining these phases, we identify the various waves of the pandemic, indicating periods where the health crisis had a significant impact. This comparative analysis highlights the time lags between countries as they transitioned through these different critical stages and navigated the waves of the COVID-19 pandemic.
- Published
- 2024
- Full Text
- View/download PDF
41. Single-cell analysis via manifold fitting: A framework for RNA clustering and beyond.
- Author
-
Zhigang Yao, Bingjie Li, Yukun Lu, and Shing-Tung Yau
- Subjects
- *
RNA sequencing , *GENE expression , *BANKING industry , *DEEP learning , *SPATIAL ability - Abstract
Single-cell RNA sequencing (scRNA-seq) data, susceptible to noise arising from biological variability and technical errors, can distort gene expression analysis and impact cell similarity assessments, particularly in heterogeneous populations. Current methods, including deep learning approaches, often struggle to accurately characterize cell relationships due to this inherent noise. To address these challenges, we introduce scAMF (Single-cell Analysis via Manifold Fitting), a framework designed to enhance clustering accuracy and data visualization in scRNA-seq studies. At the heart of scAMF lies the manifold fitting module, which effectively denoises scRNA-seq data by unfolding their distribution in the ambient space. This unfolding aligns the gene expression vector of each cell more closely with its underlying structure, bringing it spatially closer to other cells of the same cell type. To comprehensively assess the impact of scAMF, we compile a collection of 25 publicly available scRNA-seq datasets spanning various sequencing platforms, species, and organ types, forming an extensive RNA data bank. In our comparative studies, benchmarking scAMF against existing scRNA-seq analysis algorithms in this data bank, we consistently observe that scAMF outperforms in terms of clustering efficiency and data visualization clarity. Further experimental analysis reveals that this enhanced performance stems from scAMF's ability to improve the spatial distribution of the data and capture class-consistent neighborhoods. These findings underscore the promising application potential of manifold fitting as a tool in scRNA-seq analysis, signaling a significant enhancement in the precision and reliability of data interpretation in this critical field of study. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Gut microbial subtypes and clinicopathological value for colorectal cancer.
- Author
-
Han, Shuwen, Zhuang, Jing, Song, Yifei, Wu, Xinyue, Yu, Xiaojian, Tao, Ye, Chu, Jian, Qu, Zhanbo, Wu, Yinhang, Han, Shugao, and Yang, Xi
- Subjects
- *
FREE fatty acids , *COLORECTAL cancer , *LACTIC acid bacteria , *BACTERIAL communities , *BACTEROIDES - Abstract
Background: Gut bacteria are related to colorectal cancer (CRC) and its clinicopathologic characteristics. Objective: To develop gut bacterial subtypes and explore potential microbial targets for CRC. Methods: Stool samples from 914 volunteers (376 CRCs, 363 advanced adenomas, and 175 normal controls) were included for 16S rRNA sequencing. Unsupervised learning was used to generate gut microbial subtypes. Gut bacterial community composition and clustering effects were plotted. Differences of gut bacterial abundance were analyzed. Then, the association of CRC‐associated bacteria with subtypes and the association of gut bacteria with clinical information were assessed. The CatBoost models based on gut differential bacteria were constructed to identify the diseases including CRC and advanced adenoma (AA). Results: Four gut microbial subtypes (A, B, C, D) were finally obtained via unsupervised learning. The characteristic bacteria of each subtype were Escherichia‐Shigella in subtype A, Streptococcus in subtype B, Blautia in subtype C, and Bacteroides in subtype D. Clinical information (e.g., free fatty acids and total cholesterol) and CRC pathological information (e.g., tumor depth) varied among gut microbial subtypes. Bacilli, Lactobacillales, etc., were positively correlated with subtype B. Positive correlation of Blautia, Lachnospiraceae, etc., with subtype C and negative correlation of Coriobacteriia, Coriobacteriales, etc., with subtype D were found. Finally, the predictive ability of CatBoost models for CRC identification was improved based on gut microbial subtypes. Conclusion: Gut microbial subtypes provide characteristic gut bacteria and are expected to contribute to the diagnosis of CRC. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Assessment of TGx-DDI genes for genotoxicity in a comprehensive panel of chemicals.
- Author
-
Barutcu, A. Rasim
- Subjects
- *
FLUOROALKYL compounds , *TOXICITY testing , *HIERARCHICAL clustering (Cluster analysis) , *DNA damage , *CHEMICAL potential , *GENETIC toxicology - Abstract
Background: The TGx-DDI biomarker identifies transcripts specifically induced by primary DNA damage. Profiling similarity of TGx-DDI signatures can allow clustering compounds by genotoxic mechanism. This transcriptomics-based approach complements conventional toxicology testing by enhancing mechanistic resolution. Methods: Unsupervised hierarchical clustering and t-distributed stochastic neighbor embedding (tSNE) were utilized to assess similarity of publicly-available per- and polyfluoroalkyl substances (PFAS) and ToxCast chemicals based on TGx-DDI modulation. TempO-seq transcriptomic data after highest chemical concentrations were analyzed. Results: Clustering discriminated between genotoxic and non-genotoxic compounds while drawing similarity among chemicals with shared mechanisms. PFAS largely clustered distinctly from classical mutagens. However, dynamic range across PFAS types and durations indicated variable potential for DNA damage. tSNE visualization reinforced phenotypic groupings, with genotoxins clustering separately from non-DNA damaging agents. Discussion: Unsupervised learning approaches applied to TGx-DDI profiles effectively categorizes chemical genotoxicity potential, aiding elucidation of biological response pathways. This transcriptomics-based strategy gives further insight into the role and effect of individual TGx-DDI biomarker genes and complements existing assays by enhancing mechanistic resolution. Overall, TGx-DDI biomarker profiling holds promise for predictive safety screening. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. scVGATAE: A Variational Graph Attentional Autoencoder Model for Clustering Single-Cell RNA-seq Data.
- Author
-
Liu, Lijun, Wu, Xiaoyang, Yu, Jun, Zhang, Yuduo, Niu, Kaixing, and Yu, Anli
- Subjects
- *
ARTIFICIAL neural networks , *RNA sequencing , *SCALABILITY , *HETEROGENEITY - Abstract
Simple Summary: Due to the rapid development of single-cell RNA sequencing technology, the volume of single-cell RNA sequencing data has grown exponentially. Traditional clustering methods have proven increasingly difficult to cluster this large-scale and highly complex single-cell RNA sequencing data. Although many unsupervised clustering methods based on deep neural networks have been developed to cluster cell subpopulations, these methods are complex in models and poor in scalability. In this paper, we propose a novel clustering method for single-cell RNA sequencing, which successfully combines the advantages of these two clustering models, maintaining high clustering performance while also preserving the stable computational efficiency of traditional clustering methods. Experiments conducted on nine public datasets have demonstrated that our proposed novel clustering method for single-cell RNA sequencing outperforms both classic and state-of-the-art clustering methods. Single-cell RNA sequencing (scRNA-seq) is now a successful technology for identifying cell heterogeneity, revealing new cell subpopulations, and predicting developmental trajectories. A crucial component in scRNA-seq is the precise identification of cell subsets. Although many unsupervised clustering methods have been developed for clustering cell subpopulations, the performance of these methods is prone to be affected by dropout, high dimensionality, and technical noise. Additionally, most existing methods are time-consuming and fail to fully consider the potential correlations between cells. In this paper, we propose a novel unsupervised clustering method called scVGATAE (Single-cell Variational Graph Attention Autoencoder) for scRNA-seq data. This method constructs a reliable cell graph through network denoising, utilizes a novel variational graph autoencoder model integrated with graph attention networks to aggregate neighbor information and learn the distribution of the low-dimensional representations of cells, and adaptively determines the model training iterations for various datasets. Finally, the obtained low-dimensional representations of cells are clustered using kmeans. Experiments on nine public datasets show that scVGATAE outperforms classical and state-of-the-art clustering methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. Generalized Time‐Series Analysis for In Situ Spacecraft Observations: Anomaly Detection and Data Prioritization Using Principal Components Analysis and Unsupervised Clustering.
- Author
-
Finley, Matthew G., Martinez‐Ledesma, Miguel, Paterson, William R., Argall, Matthew R., Miles, David M., Dorelli, John C., and Zesta, Eftyhia
- Subjects
- *
PRINCIPAL components analysis , *STATISTICAL measurement , *SUPPORT vector machines , *SOLAR system , *SPACE vehicles - Abstract
In situ spacecraft observations are critical to our study and understanding of the various phenomena that couple mass, momentum, and energy throughout near‐Earth space and beyond. However, on‐orbit telemetry constraints can severely limit the capability of spacecraft to transmit high‐cadence data, and missions are often only able to telemeter a small percentage of their captured data at full rate. This presents a programmatic need to prioritize intervals with the highest probability of enabling the mission's science goals. Larger missions such as the Magnetospheric Multiscale mission (MMS) aim to solve this problem with a Scientist‐In‐The‐Loop (SITL), where a domain expert flags intervals of time with potentially interesting data for high‐cadence data downlink and subsequent study. Although suitable for some missions, the SITL solution is not always feasible, especially for low‐cost missions such as CubeSats and NanoSats. This manuscript presents a generalizable method for the detection of anomalous data points in spacecraft observations, enabling rapid data prioritization without substantial computational overhead or the need for additional infrastructure on the ground. Specifically, Principal Components Analysis and One‐Class Support Vector Machines are used to generate an alternative representation of the data and provide an indication, for each point, of the data's potential for scientific utility. The technique's performance and generalizability is demonstrated through application to intervals of observations, including magnetic field data and plasma moments, from the CASSIOPE e‐POP/Swarm‐Echo and MMS missions. Plain Language Summary: Measurements captured by spacecraft are necessary to our understanding the space environment near Earth and throughout our solar system. However, spacecraft can often only transmit a small portion of the data they capture back to Earth. This means that many spacecraft must prioritize intervals of data that have the highest probability of helping to further our understanding of these environments. Some missions utilize humans, on Earth, to help select these scientifically important intervals. This solution, called the Scientist‐In‐The‐Loop, can be too expensive or programmatically complex for many small missions to implement. This manuscript presents a technique for the detection of anomalous events in spaceflight measurements using statistical analysis and machine learning. These detected anomalies can be used to prioritize data that has a high probability of scientific relevance. Further, the proposed technique is highly generalizable and computationally lightweight, making it suitable for a variety of missions. Several case studies from multiple existing missions will be analyzed throughout this paper. Key Points: Spacecraft often cannot transmit all measurements to Earth at full cadence due to telemetry bandwidth limitationsMany missions must implement complex data prioritization schemes to ensure only the highest‐priority data is transmitted at high cadenceThe proposed data prioritization technique is highly generic, compatible with inexpensive hardware, and suitable for low‐cost missions [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. Democratizing cheminformatics: interpretable chemical grouping using an automated KNIME workflow.
- Author
-
Moreira-Filho, José T., Ranganath, Dhruv, Conway, Mike, Schmitt, Charles, Kleinstreuer, Nicole, and Mansouri, Kamel
- Subjects
- *
MACHINE learning , *FEATURE selection , *DATA analytics , *ARTIFICIAL intelligence , *DATA mining - Abstract
With the increased availability of chemical data in public databases, innovative techniques and algorithms have emerged for the analysis, exploration, visualization, and extraction of information from these data. One such technique is chemical grouping, where chemicals with common characteristics are categorized into distinct groups based on physicochemical properties, use, biological activity, or a combination. However, existing tools for chemical grouping often require specialized programming skills or the use of commercial software packages. To address these challenges, we developed a user-friendly chemical grouping workflow implemented in KNIME, a free, open-source, low/no-code, data analytics platform. The workflow serves as an all-encompassing tool, expertly incorporating a range of processes such as molecular descriptor calculation, feature selection, dimensionality reduction, hyperparameter search, and supervised and unsupervised machine learning methods, enabling effective chemical grouping and visualization of results. Furthermore, we implemented tools for interpretation, identifying key molecular descriptors for the chemical groups, and using natural language summaries to clarify the rationale behind these groupings. The workflow was designed to run seamlessly in both the KNIME local desktop version and KNIME Server WebPortal as a web application. It incorporates interactive interfaces and guides to assist users in a step-by-step manner. We demonstrate the utility of this workflow through a case study using an eye irritation and corrosion dataset. Scientific contributions This work presents a novel, comprehensive chemical grouping workflow in KNIME, enhancing accessibility by integrating a user-friendly graphical interface that eliminates the need for extensive programming skills. This workflow uniquely combines several features such as automated molecular descriptor calculation, feature selection, dimensionality reduction, and machine learning algorithms (both supervised and unsupervised), with hyperparameter optimization to refine chemical grouping accuracy. Moreover, we have introduced an innovative interpretative step and natural language summaries to elucidate the underlying reasons for chemical groupings, significantly advancing the usability of the tool and interpretability of the results. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Layer-by-layer unsupervised clustering of statistically relevant fluctuations in noisy time-series data of complex dynamical systems.
- Author
-
Becchi, Matteo, Fantolino, Federico, and Pavan, Giovanni M.
- Subjects
- *
DYNAMICAL systems , *TIME series analysis , *CLEARCUTTING , *ONIONS , *CLUSTER analysis (Statistics) - Abstract
Complex systems are typically characterized by intricate internal dynamics that are often hard to elucidate. Ideally, this requires methods that allow to detect and classify in an unsupervised way the microscopic dynamical events occurring in the system. However, decoupling statistically relevant fluctuations from the internal noise remains most often nontrivial. Here, we describe "Onion Clustering": a simple, iterative unsupervised clustering method that efficiently detects and classifies statistically relevant fluctuations in noisy time-series data. We demonstrate its efficiency by analyzing simulation and experimental trajectories of various systems with complex internal dynamics, ranging from the atomic-to the microscopic-scale, in- and out-of-equilibrium. The method is based on an iterative detect-classify-archive approach. In a similar way as peeling the external (evident) layer of an onion reveals the internal hidden ones, the method performs a first detection/classification of the most populated dynamical environment in the system and of its characteristic noise. The signal of such dynamical cluster is then removed from the time-series data and the remaining part, cleared-out from its noise, is analyzed again. At every iteration, the detection of hidden dynamical subdomains is facilitated by an increasing (and adaptive) relevance-to-noise ratio. The process iterates until no new dynamical domains can be uncovered, revealing, as an output, the number of clusters that can be effectively distinguished/classified in a statistically robust way as a function of the time-resolution of the analysis. Onion Clustering is general and benefits from clear-cut physical interpretability. We expect that it will help analyzing a variety of complex dynamical systems and time-series data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. Machine learning-based derivation and validation of three immune phenotypes for risk stratification and prognosis in community-acquired pneumonia: a retrospective cohort study.
- Author
-
Qiangqiang Qin, Haiyang Yu, Jie Zhao, Xue Xu, Qingxuan Li, Wen Gu, and Xuejun Guo
- Subjects
COMMUNITY-acquired pneumonia ,MACHINE learning ,RANDOM forest algorithms ,PROGNOSTIC models ,HOSPITAL patients - Abstract
Background: The clinical presentation of Community-acquired pneumonia (CAP) in hospitalized patients exhibits heterogeneity. Inflammation and immune responses play significant roles in CAP development. However, research on immunophenotypes in CAP patients is limited, with few machine learning (ML) models analyzing immune indicators. Methods: A retrospective cohort study was conducted at Xinhua Hospital, affiliated with Shanghai Jiaotong University. Patients meeting predefined criteria were included and unsupervised clustering was used to identify phenotypes. Patients with distinct phenotypes were also compared in different outcomes. By machine learning methods, we comprehensively assess the disease severity of CAP patients. Results: A total of 1156 CAP patients were included in this research. In the training cohort (n=809), we identified three immune phenotypes among patients: Phenotype A (42.0%), Phenotype B (40.2%), and Phenotype C (17.8%), with Phenotype C corresponding to more severe disease. Similar results can be observed in the validation cohort. The optimal prognostic model, SuperPC, achieved the highest average C-index of 0.859. For predicting CAP severity, the random forest model was highly accurate, with C-index of 0.998 and 0.794 in training and validation cohorts, respectively. Conclusion: CAP patients can be categorized into three distinct immune phenotypes, each with prognostic relevance. Machine learning exhibits potential in predicting mortality and disease severity in CAP patients by leveraging clinical immunological data. Further external validation studies are crucial to confirm applicability. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Cross-Domain Recommendation To Cold-Start Users Via Categorized Preference Transfer.
- Author
-
Liu, Xiaoyang, Fu, Xiaoyang, Meo, Pasquale De, and Fiumara, Giacomo
- Subjects
- *
DATA mapping , *EMBEDDED computer systems , *ROBUST control , *COMPUTER networks - Abstract
Most existing cross-domain recommendation (CDR) systems apply the embedding and mapping idea to tackle the cold-start user problem and, to this end, they learn a common bridge function to transfer the user preferences from the source domain into the target domain. However, sharing a bridge function for all users inevitably leads to biased recommendations. This paper proposes a novel method, named CDR to cold-start users via categorized preference transfer (CDRCPT), to overcome the shortcomings of existing approaches. First, the embeddings of users and items in both the source and target domain are learned through pretraining and we utilize preference encoder to obtain the preference embeddings of users in the source domain. Second, mini-batch clustering is applied in the source domain to group users according to their preferences; here, each cluster identifies a specific class of users, and each cluster is represented by its center. Finally, the general representation is fed into a meta network to learn a bridge function for each available class of users. Experiments on two real data sets show that our CDRCPT method is effective in improving the accuracy and robustness of recommendations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. Creation of signatures and identification of molecular subtypes of glioblastoma based on disulfidptosis-related genes for predicting patient prognosis and immunological activity.
- Author
-
Li, Dongjun, Li, Xiaodong, Lv, Jianfeng, and Li, Shaoyi
- Abstract
In recent times, disulfidptosis, an intricate form of cellular demise, has garnered attention due to its impact on prognosis, tumor progression and treatment response. Nevertheless, the exact significance of disulfidptosis-related genes (DisRGs) in glioblastoma (GBM) remains enigmatic. The GEO and TCGA databases provided transcriptional and clinically relevant data on tumor samples, while the GTEx database provided data on healthy tissues. Disulfidptosis-related genes (DisRGs) were procured from previous scholarly investigations. The expression profile of DisRGs was initially scrutinized among patients diagnosed with GBM, subsequent to which their prognostic value was explored. Through consensus clustering, we constructed DisRGs-related clusters and gene subtypes. Our results established that the DisRG-related clusters had differentially expressed genes, resulting in a DisulfidptosisScore model, which had a positive prognostic value. The differential expression profile of 24 DisRGs between GBM samples and healthy samples was acquired. Through consensus cluster analysis, two distinct disulfidptosis subtypes, namely DisRGcluster A and DisRGcluster B, were identified. Then, the DisulfidptosisScore model including 4 characteristic genes was constructed.Notably, patients with GBM assigned with lower score demonstrated a considerably longer overall survival (OS) compared to those with higher score. We have effectively devised a prognostic model associated with disulfidptosis, presenting autonomous prognostic predictions for patients with GBM. These findings serve as a valuable addition to the current comprehension of disulfidptosis and offer fresh theoretical substantiation for the development of enhanced treatment strategies. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.