Descriptor: "Community Detection" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Community Detection"' showing total 6,954 results

Start Over Descriptor "Community Detection"

6,954 results on '"Community Detection"'

1. Overlapping community detection algorithms using Modularity and the cosine

Author: Hieu, Do Duy and Duong, Phan Thi Ha
Subjects: Computer Science - Social and Information Networks, Computer Science - Discrete Mathematics, Community detection
Abstract: The issue of network community detection has been extensively studied across many fields. Most community detection methods assume that nodes belong to only one community. However, in many cases, nodes can belong to multiple communities simultaneously.This paper presents two overlapping network community detection algorithms that build on the two-step approach, using the extended modularity and cosine function. The applicability of our algorithms extends to both undirected and directed graph structures. To demonstrate the feasibility and effectiveness of these algorithms, we conducted experiments using real data.
Published: 2024

2. Practical Privacy-Preserving Community Detection in Decentralized Weighted Networks

Author: Han, Tingxuan, Tong, Wei, Niu, Jiacheng, Zhong, Sheng, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Duan, Haixin, editor, Debbabi, Mourad, editor, de Carné de Carnavalet, Xavier, editor, Luo, Xiapu, editor, Du, Xiaojiang, editor, and Au, Man Ho Allen, editor
Published: 2025
Full Text: View/download PDF

3. Exploring over a decade of systems engineering research center: A community detection and text analytics approach.

Author: Zavala, Araceli, Verma, Dinesh, and Marquez, Jose E. Ramirez
Abstract: The Systems Engineering Research Center (SERC) is a University Affiliated Research Center (UARC) of the US Department of Defense (DoD) formed in 2008 with more than 20 collaborator universities in the United States. Over the last decade, SERC has conducted research with Principal Investigators from universities within the SERC network, as reflected in technical reports (TR). These reports describe detailed information and analysis of the conducted research for every project under SERC support, such as written records of experiments or results of a scientific project. We analyzed the TRs from 2009 to early 2023 to identify research streams, topics, and evolution in systems engineering (SE) research using text mining and network analysis techniques, such as Louvain Community Detection and word similarity. As a result, we identified four major research streams over a decade of research projects, along with insights about topics and the evolution of SE within this time frame. Finally, we distinguished most profile authors and their most significant collaborations and networks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. Age‐related differences of the time‐varying features in the brain functional connectivity and cognitive aging.

Author: Yao, Furong, Zhao, Ziyang, Wang, Yin, Li, Tongtong, Chen, Miao, Yao, Zhijun, Jiao, Jin, and Hu, Bin
Subjects: *LARGE-scale brain networks, *OLDER people, *COGNITIVE aging, *MODULAR construction, *FUNCTIONAL magnetic resonance imaging
Abstract: Brain functional modular organization changes with age. Considering the brain as a dynamic system, recent studies have suggested that time‐varying connectivity provides more information on brain functions. However, the spontaneous reconfiguration of modular brain structures over time during aging remains poorly understood. In this study, we investigated the age‐related dynamic modular reconfiguration using resting‐state functional MRI data (615 participants, aged 18–88 years) from Cam‐CAN. We employed a graph‐based modularity analysis to investigate modular variability and the transition of nodes from one module to another in modular brain networks across the adult lifespan. Results showed that modular structure exhibits both linear and nonlinear age‐related trends. The modular variability is higher in early and late adulthood, with higher modular variability in the association networks and lower modular variability in the primary networks. In addition, the whole‐brain transition matrix showed that the times of transition from other networks to the dorsal attention network were the largest. Furthermore, the modular structure was closely related to the number of cognitive components and memory‐related cognitive performance, suggesting a potential contribution to flexibility cognitive function. Our findings highlighted the notable dynamic characteristics in large‐scale brain networks across the adult lifespan, which enhanced our understanding of the neural substrate in various cognitions during aging. These findings also provided further evidence that dedifferentiation and compensation are the outcomes of functional brain interactions. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. Semantic community query in a large‐scale attributed graph based on an attribute cohesiveness optimization strategy.

Author: Ge, Jinhuan, Sun, Heli, Lin, Yezhi, and He, Liang
Subjects: *EVALUATION methodology, *ALGORITHMS, *HEURISTIC
Abstract: The task of a semantic community query is to obtain a subgraph S based on a given query vertex q (or vertex set) and other query parameters in an attributed graph G such that S belongs to G, contains q and satisfies a predefined community cohesiveness model. In most cases, existing community query models based on the network structure for traditional attributed networks usually lack community semantics. However, the features of vertex attributes, especially the attributes of the query vertices, which are closely related to the community semantics, are rarely considered in an attributed graph. Existing community query algorithms based on both structure cohesiveness and attribute cohesiveness usually do not take the attributes of the query vertex as an important factor of the community cohesiveness model, which leads to weak semantics of the communities. This paper proposes a semantic community query method named SCQ in a large‐scale attributed graph. First, the k‐core structure model is adopted as the structure cohesiveness of our community query model to obtain a subgraph of the original graph. Second, we define attribute cohesiveness based on the average distance between the query vertices and other vertices in terms of attributes in the community to prune the subgraph and obtain the semantic community. In order to improve the community query efficiency in large‐scale attributed graphs, SCQ applies two heuristic pruning strategies. The experimental results show that our method outperforms the existing community query methods in multiple evaluation metrics and is ideal for querying semantic communities in large‐scale attributed graphs. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. Community detection based on influential nodes in dynamic networks.

Author: Kherad, Mahdi, dadras, Meimanat, and Mokhtari, Marjan
Subjects: *INFORMATION dissemination, *ADAPTIVE control systems, *VIRAL marketing, *DENSITY
Abstract: Communities in a network are groups of nodes that are more strongly connected to each other. This article proposes a novel method for community detection in dynamic networks, focusing on influential nodes and overlapping communities. The method, named community detection based on adaptive multi-centrality aggregation (CDAMA), tackles two key challenges identifying influential nodes and overlapping communities. CDAMA introduces the Adaptive multi-centrality aggregation (AMCA) approach to identify influential nodes. AMCA integrates multiple centrality measures. The adaptive overlap control and merging (AOC-CM) approach addresses overlapping communities. AOC-CM utilizes structural, temporal, and semantic factors to strategically merge communities while preserving those with minimal overlap. CDAMA consists of five phases: receiving network snapshots, selecting influential nodes, launching communities, checking overlap and merging communities, and updating communities. Evaluation on three benchmark datasets demonstrates that CDAMA outperforms existing state-of-th-art methods in terms of Newman modularity, Modularity with split penalty and density modularity and Execution time. This suggests CDAMA is a valuable tool for tasks like viral marketing, information diffusion analysis, and network resilience studies. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. A robust two-step algorithm for community detection based on node similarity.

Author: Lounnas, Bilal, Benazi, Makhlouf, and Kamel, Mohamed
Subjects: *SOCIAL network analysis, *SOCIAL networks, *ALGORITHMS
Abstract: The rapid development of the internet and social network platforms has given rise to a new field of research, social network analysis. This field of research has many fundamental problems, one of which is community detection. The objective of this research is to understand hidden connections among individuals. However, uncovering these connections are still challenging, despite the existence of several methods. In this paper, we propose a new algorithm called MCCD (Modified Cosine for Community Detection) for community detection in social networks based on node similarity. Our algorithm consists of two steps. In the first step, we use a novel cosine similarity formula to identify initial communities. In the second step, we merge these communities based on a new similarity measure. MCCD can be used in two different ways. The first way uses K as an input to identify the exact communities. The second way does not require K and aims to provide the best partitioning by maximizing modularity. Our algorithm has been tested on a variety of artificial and real-world networks, and the experimental results demonstrate its superiority over existing methods in detecting communities. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

8. 融合异质性和动态性的社区发现研究综述.

Author: 武永亮, 窦世卯, 李景辉, 董家浩, and 魏丹
Abstract: Copyright of Journal of Computer Engineering & Applications is the property of Beijing Journal of Computer Engineering & Applications Journal Co Ltd. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

9. Improving community detection in social networks using enhanced BSO by exploring network structure.

Author: Beldi, Zohra and Bessedik, Malika
Subjects: *SOCIAL network analysis, *SOCIAL networks, *INFORMATION dissemination, *SOCIAL facts, *BRAINSTORMING
Abstract: Community detection in social networks plays a crucial role in understanding various social phenomena. However, accurately identifying communities is challenging due to the complex nature of social networks. To address this challenge, we propose CD-BSO, an innovative metaheuristic based on brainstorming. CD-BSO leverages network knowledge through specialized initialization and solution generation operators designed to capture social network characteristics. In the initialization phase, a technique combining Depth-First Search (DFS) is employed to consider both connectivity and information diffusion, ensuring accurate community representation. The generation step introduces search operators that consider link formation and node similarity, facilitating convergence towards a community structure that closely resembles the network's actual structure. The evaluation of CD-BSO outperforms recent and well-known community detection methods. The evaluation results were further analyzed using visual analysis, providing valuable insights into the network structure. CD-BSO exhibits significant potential for accurately identifying communities and extracting meaningful information from social networks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. A multi-objective optimization approach for overlapping dynamic community detection.

Author: Bahadori, Sondos, Mirzaie, Mansooreh, and Nooraei Abadeh, Maryam
Subjects: *MATRIX decomposition, *NONNEGATIVE matrices, *TIME-varying networks
Abstract: Community detection is a valuable tool for studying the function and dynamic structure of most real-world networks. Existing techniques either concentrate on the network's topological structure or node properties without adequately addressing the dynamic aspect. As a result, in this research, we present a unique technique called Multi-Objective Optimization Overlapping Dynamic Community Detection (MOOODCD) that leverages both the topological structure and node attributes of dynamic networks. By incorporating the Dirichlet distribution to control network dynamics, we formulate dynamic community detection as a non-negative matrix factorization problem. The block coordinate ascent method is used to estimate the latent elements of the model. Our experiments on artificial and real networks indicate that MOOODCD detects overlapping communities in dynamic networks with acceptable precision and scalability. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. 应对显著变化的动态社区检测方法.

Author: 刘澳, 张玮杰, 王焕, and 张庆明
Subjects: *GENETIC algorithms, *SPIDER webs, *KNOWLEDGE transfer, *PARETO optimum, *COMMUNITY change, *DIFFERENTIAL evolution
Abstract: In real-world networks, the structure and connections are constantly evolving over time. Detecting community changes within dynamic networks has always been an important research topic. When such changes are significant, it leads to difficulty for community detection algorithms to effectively utilize valuable information from the previous network snapshot, resulting in negative transfer in the next time step. To address the issue of poor algorithm adaptability to network mutations, this paper proposed a dynamic community detection algorithm based on genetic evolution ideas and higher-order knowledge transfer strategies. Firstly, it used the adjacency matrix similarity of adjacent snapshots to determine the use of first-order or higherorder information. Then, it employed the spider Web model for population initialization, followed by the non-dominated sorting genetic algorithm NSGA-II to iteratively obtain multi-objective optimal solutions on the Pareto front. It designed a novel gene crossover method to enhance population diversity. Finally, experimental results on multiple real and simulated datasets demonstrate that, compared to existing algorithms, the proposed method achieves higher temporal smoothness in community detection results during network upheavals while maintaining a good community modularity level. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. AGCFN:基于图神经网络多层网络社团检测模型.

Author: 陈龙, 张振宇, 李晓明, and 白宏鹏
Subjects: *GRAPH neural networks, *INFORMATION networks, *KNOWLEDGE transfer
Abstract: Multiplex network community detection methods based on graph neural network face two main challenges. Firstly, how to effectively utilize the node content information of multiplex network; and secondly, how to effectively utilize the interlayer relationships in multiplex networks. Therefore, this paper proposed the multiplex network community detection model AGCFN. Firstly, the autoencoder independently extracted the node content information of each network layer and passed the extracted node content information to the graph autoencoder for fusing the node content information of the current network layer with the topology information through the transfer operator to obtain the representation of each node of the current network layer, which made full use of the node content information of the network and the topology information of the network. The modularity maximization module and graph decoder optimized the obtained node representation. Secondly, the multilayer information fusion module fused the node representations extracted from each network layer to obtain a comprehensive representation of each node. Finally, the model under went training, and it achieved community detection results through a self-training mechanism. Comparison with six models on three datasets demonstrate improvements in both ACC and NMI evaluation metrics, thereby validating the effectiveness of AGCFN. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. MNCD-KE: a novel framework for simultaneous attribute- and interaction-based geographical regionalization.

Author: Xu, Liyan, Tang, Jintong, Jiang, Hezhishi, Yu, Hongbin, Huang, Qian, Zhou, Yinsheng, and Liu, Yu
Subjects: *ALGORITHMS, *COST
Abstract: Existing regionalization methods tend to be either spatial attribute- or spatial interaction-based, while real-world tasks usually involve both considerations to satisfy multiple objectives simultaneously. In this research, we propose Multilayer Network Community Detection and Kernel Extension (MNCD-KE), a two-step regionalization framework, as a feasible solution for such tasks. First, spatial attributes are embedded into attributes of nodes in a spatial interaction-defined multilayer network, and the kernel and marginal parts of the regions are determined by giving the membership value of the regionalization units to network communities. Second, the final result is obtained through a kernel extension process considering geographical constraints, including spatial contiguity, size balance, morphological regularity, and existing boundary consistency of the regions. Empirical experiments show that the proposed method yields outcomes that, in maintaining comparable performances with most baseline algorithms with either 'attribute' or 'interaction' objectives as measured by the respective criteria, simultaneously meet the dual objectives with results intuitively comprehensible. Its low computing costs and parameter adjustment flexibility make the proposed framework a convenient approach for real-world multi-objective regionalization tasks. We conclude the research with discussions on the boundary conditions for the framework to work and their relevance to city science theories, along with practical implications. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. A clan detector algorithm to identify independent clans in the kinship networks of elite family dynasties.

Author: Armandola, Niccolò Giorgio
Subjects: SOCIAL status, ELITE (Social sciences), SOCIAL networks, HIERARCHICAL clustering (Cluster analysis), SOCIAL network analysis
Abstract: The sociology of elites has long considered families as the unit of analysis in studies of power dynamics between elite dynasties and their transmission of wealth and prestige over generations. However, the assumption that families are cohesive units with common goals and agendas does not hold, especially for large and powerful family dynasties. Internal conflicts and clan rivalries throughout history suggest that independent clans, rather than families, are the more appropriate level for aggregation. The increasing availability of large-scale genealogical datasets and advances in social network analysis allow this more fine-grained perspective to be implemented even without historical documentation on observed clan structures. This paper builds on socio-anthropological conceptualizations of kinship and on hierarchical clustering techniques to present a new method for identifying independent clans within families that relies only on network-dependent terms. I use simulated data and an empirical kinship network of families of early modern Basel, Switzerland to compare a clan detector algorithm's performance with common community detection techniques. The historical accuracy of the clan structures detected is further assessed with various status indicators. The analyses show that the proposed clan detector algorithm is more suitable for identifying historically accurate clans than the traditional approaches. The application of the new method to the kinship network of Basel families sheds light on the city's stratification into high- and low-status societies in which elite families were also divided into privileged and less privileged clans. • New method for identifying independent clans within large family dynasties. • The algorithm's performance is assessed with simulated and empirical data. • Comparison with other community detection techniques. • The detected clans in the empirical network are historically accurate. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. Too many options: How to identify coalitions in a policy network?

Author: Deguilhem, Thibaud, Schlegel, Juliette, Berrou, Jean-Philippe, Djibo, Ousmane, and Piveteau, Alain
Subjects: ADVOCACY coalition framework, POLICY analysis, MISSING data (Statistics), INFORMATION policy, SOCIAL networks
Abstract: For different currents in policy analysis as policy networks and the Advocacy Coalition Framework (ACF), identifying coalitions from policy beliefs and coordination between actors is crucial to a precise understanding of a policy process. Focusing particularly the relational dimension of ACF approaches linked with policy network analysis, determining policy subsystems from the actor collaborations and exchanges has recently begun offering fertile links with the network analysis. Studies in this way frequently apply Block Modeling and Community Detection (BMCD) strategies to define homogeneous political groups. However, the BMCD literature is growing quickly, using a wide variety of algorithms and interesting selection methods that are much more diverse than those used in the policy network analysis and particularly the ACF when this current focused on the collaboration networks before or after regarding the belief distance between actors. Identifying the best methodological option in a specific context can therefore be difficult and few ACF studies give an explicit justification. On the other hand, few BMCD publications offer a systematic comparison of real social networks and they are never applied to policy network datasets. This paper offers a new, relevant 5-Step selection method to reconcile advances in both the policy networks/ACF and BMCD. Using an application based on original African policy network data collected in Madagascar and Niger, we provide a useful set of practical recommendations for future ACF studies using policy network analysis: (i) the density and size of the policy network affect the identification process, (ii) the "best algorithm" can be rigorously determined by maximizing a novel indicator based on convergence and homogeneity between algorithm results, (iii) researchers need to be careful with missing data: they affect the results and imputation does not solve the problem. • Identifying policy coalitions is fundamental to understand a policy network. • Studies apply different strategies but identifying the best option can be difficult. • We offer a relevant and consistent 5-Step method by maximizing a novel indicator. • Further research need to be careful with missing data. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. A motif-based probabilistic approach for community detection in complex networks.

Author: Hajibabaei, Hossein, Seydi, Vahid, and Koochari, Abbas
Abstract: Community detection in complex networks is an important task for discovering hidden information in network analysis. Neighborhood density between nodes is one of the fundamental indicators of community presence in the network. A community with a high edge density will have correlations between nodes that extend beyond their immediate neighbors, denoted by motifs. Motifs are repetitive patterns of edges observed with high frequency in the network. We proposed the PCDMS method (Probabilistic Community Detection with Motif Structure) that detects communities by estimating the triangular motif in the network. This study employs structural density between nodes, a key concept in graph analysis. The proposed model has the advantage of using a probabilistic generative model that calculates the latent parameters of the probabilistic model and determines the community based on the likelihood of triangular motifs. The relationship between observing two pairs of nodes in multiple communities leads to an increasing likelihood estimation of the existence of a motif structure between them. The output of the proposed model is the intensity of each node in the communities. The efficiency and validity of the proposed method are evaluated through experimental work on both synthetic and real-world networks; the findings will show that the community identified by the proposed method is more accurate and dense than other algorithms with modularity, NMI, and F1score evaluation metrics. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. Describing group evolution in temporal data using multi-faceted events.

Author: Failla, Andrea, Cazabet, Rémy, Rossetti, Giulio, and Citraro, Salvatore
Subjects: GROUP dynamics, SOCIAL groups, DATA mining, ARCHETYPES, TAXONOMY
Abstract: Groups—such as clusters of points or communities of nodes—are fundamental when addressing various data mining tasks. In temporal data, the predominant approach for characterizing group evolution has been through the identification of "events". However, the events usually described in the literature, e.g., shrinks/growths, splits/merges, are often arbitrarily defined, creating a gap between such theoretical/predefined types and real-data group observations. Moving beyond existing taxonomies, we think of events as "archetypes" characterized by a unique combination of quantitative dimensions that we call "facets". Group dynamics are defined by their position within the facet space, where archetypal events occupy extremities. Thus, rather than enforcing strict event types, our approach can allow for hybrid descriptions of dynamics involving group proximity to multiple archetypes. We apply our framework to evolving groups from several face-to-face interaction datasets, showing it enables richer, more reliable characterization of group dynamics with respect to state-of-the-art methods, especially when the groups are subject to complex relationships. Our approach also offers intuitive solutions to common tasks related to dynamic group analysis, such as choosing an appropriate aggregation scale, quantifying partition stability, and evaluating event quality. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

18. Semantic Code Clone Detection Based on Community Detection.

Author: Wan, Zexuan, Xie, Chunli, Lv, Quanrun, and Fan, Yasheng
Subjects: ARTIFICIAL neural networks, TIME complexity, MOLECULAR cloning, TREE graphs, FLOWGRAPHS
Abstract: Semantic code clone detection is to find code snippets that are structurally or syntactically different, but semantically identical. It plays an important role in software reuse, code compression. Many existing studies have achieved good performance in non-semantic clone, but semantic clone is still a challenging task. Recently, several works have used tree or graph, such as Abstract Syntax Tree (AST), Control Flow Graph (CFG) or Program Dependency Graph (PDG) to extract semantic information from source codes. In order to reduce the complexity of tree and graph, some studies transform them into node sequences. However, this transformation will lose some semantic information. To address this issue, we propose a novel high-performance method that utilizes community detection to extract features of AST while preserving its semantic information. First, based on the AST of source code, we exploit community detection to split AST into different subtrees to extract the underlying semantics information of different code blocks, and use centrality analysis to quantify the semantic information as the weight of AST nodes. Then, the AST is converted into a sequence of tokens with weights, and a Siamese neural network model is used to detect the similarity of token sequences for semantic code clone detection. Finally, to evaluate our approach, we conduct experiments on two standard benchmark datasets, Google Code Jam (GCJ) and BigCloneBench (BCB). Experimental results show that our model outperforms the eight publicly available state-of-the-art methods in detecting code clones. It is five times faster than the tree-based method (ASTNN) in terms of time complexity. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

19. Link prediction based on depth structure in social networks.

Author: Yang, Jie and Wu, Yu
Abstract: Link prediction is an important task in social network analysis. Considering that the properties of nodes in social networks are generally inaccurate, it is more reliable and effective to use the network structure features to predict the links in the network. However, a central challenge of such methods is how to fully mine and utilize the network structure information. Here, we introduce a deep structure link prediction model (DSLP), whose idea is to integrate multiple types of community structures and multiple topology features into one probability model. We detect three types of community structures, disjoint, crisp overlap and fuzzy overlap, and then design an edge probability parameter to reflect their importance. Additionally, we propose an effective method to aggregate multiple topology features based on nodes and paths. We perform extensive experiments on artificial networks and real-world social networks to compare the proposed method with nine baseline algorithms, and the results show that our method offers higher precision than that of these well-known approaches. Finally, we discuss the method of integrating trusted node properties and feature selection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. UCAD: commUnity disCovery method in Attribute-based multicoloreD networks.

Author: Gamgne Domgue, Félicité, Tsopze, Norbert, and Ndoundam, René
Subjects: SOCIAL network analysis, SOCIAL networks
Abstract: Many hierarchical methods for community detection in multicolored networks are capable of finding clusters when there are interslice correlation between layers. However, in general, they aggregate all the links in different layer treating them as being equivalent. Therefore, such aggregation might ignore the information about the relevance of a dimension in which the node is involved. In this paper, we fill this gap by proposing a hierarchical classification-based Louvain method for interslice-multicolored networks. In particular, we define a new node centrality measure named Attractivity to describe the inter-slice correlation that incorporates within and across-dimension topological features in order to identify the relevant dimension. Then, after merging dimensions through a frequential aggregation, we group nodes by their relational and attribute similarity, where attributes correspond to their relevant dimensions. We conduct an extensive experimentation using seven real-world multicolored networks, which also includes comparison with state-of-the-art methods. Results show the significance of our proposed method in discovering relevant communities over multiple dimensions and highlight its ability in producing optimal covers with higher values of the multidimensional version of the modularity function. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

21. Multi-view clustering analysis of mega-city regions based on intercity flow networks.

Author: Wu, Zhiqiang, Zhao, Gang, Xu, Haowen, Qiao, Renlu, and Zhao, Qian
Subjects: *MEGALOPOLIS, *BIG data, *INTERNET of things, *MACHINE learning, *CLUSTER analysis (Statistics)
Abstract: With the booming of Big Data and the Internet of Things, various urban networks have been built based on intercity flow data, and how to combine them to learn a more comprehensive understanding of mega-city regions is becoming more and more indispensable. In this paper, we designed a graph-based multi-view clustering method based on graph learning to explore the mega-city region structures from multi-source data. An example of clustering analysis consists of the people flow network, cargo flow network, and information flow network, covering 88 cities from Beijing, Tianjin, Hebei Province, Shandong Province, Henan Province, Jiangsu Province, Anhui Province, Shanghai, and Zhejiang Province in China is used to illustrate the applicability of the idea in super mega-city region scale studies. Utilizing the proposed clustering method, a unified network representation is calculated, and 5 mega-city regions, Beijing-Tianjin-Hebei Cluster, Henan Cluster, Shandong Cluster, Shanghai-Jiangsu-Anhui Cluster, and Zhejiang Cluster, are detected based on intercity flow networks. City-to-city flows, including Luan-Taizhou, Lianyungang-Chuzhou, and Xuzhou-Bengbu of the people network, Shanghai-Hangzhou, Suzhou-Shanghai, and Shanghai-Ningbo of the cargo network, Shanghai-Hangzhou, Bozhou-Jinhua, and Huaibei-Bozhou of the information network, are suggested to be further enhanced to facilitate the ongoing nationwide constructions of urban agglomerations in China. The multi-view clustering method proved to be a helpful calculation framework for mega-city region analysis, which would also be considered as a substantial foundation for further urban explorations with more advanced graph learning techniques. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

22. Estimating Higher-Order Mixed Memberships via the ℓ2,∞ Tensor Perturbation Bound.

Author: Agterberg, Joshua and Zhang, Anru R.
Subjects: *SIMPLEX algorithm, *SIGNAL-to-noise ratio, *MACHINE learning, *NOISE, *GENERALIZATION
Abstract: AbstractHigher-order multiway data is ubiquitous in machine learning and statistics and often exhibits community-like structures, where each component (node) along each different mode has a community membership associated with it. In this paper we propose the (subgaussian) tensor mixed-membership blockmodel, a generalization of the tensor blockmodel positing that memberships need not be discrete, but instead are convex combinations of latent communities. We establish the identifiability of our model and propose a computationally efficient estimation procedure based on the higher-order orthogonal iteration algorithm (HOOI) for tensor SVD composed with a simplex corner-finding algorithm. We then demonstrate the consistency of our estimation procedure by providing a per-node error bound under subgaussian noise, which showcases the effect of higher-order structures on estimation accuracy. To prove our consistency result, we develop the ℓ2,∞ tensor perturbation bound for HOOI under independent, heteroskedastic, subgaussian noise that may be of independent interest. Our analysis uses a novel leave-one-out construction for the iterates, and our bounds depend only on spectral properties of the underlying low-rank tensor under nearly optimal signal-to-noise ratio conditions such that tensor SVD is computationally feasible. Finally, we apply our methodology to real and simulated data, demonstrating some effects not identifiable from the model with discrete community memberships. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

23. Personal Recovery With Bipolar Disorder: A Network Analysis.

Author: Glossop, Zoe, Campbell, Catriona, Ushakova, Anastasia, Dodd, Alyson, and Jones, Steven
Subjects: *BIPOLAR disorder, *SOCIAL network analysis, *QUESTIONNAIRES, *COMMUNITIES, *CONFIDENCE, *THEMATIC analysis, *CONVALESCENCE, *STATISTICAL reliability, *DATA analysis software, *CONFIDENCE intervals, *NONPARAMETRIC statistics, RESEARCH evaluation
Abstract: Background: Personal recovery is valued by people with bipolar disorder (BD), yet its conceptualisation is unclear. Prior work conceptualising personal recovery has focussed on qualitative evidence or clinical factors without considering broader psychosocial factors. This study used a network analysis of Bipolar Recovery Questionnaire (BRQ) responses, aiming to identify (1) independent relationships between items to identify those most "central" to personal recovery and (2) how the relationships between items reflect themes of personal recovery. Methods: The model was developed from BRQ responses (36 items) from 394 people diagnosed with bipolar disorder. The undirected network was based on a partial correlation matrix and was weighted. Strength scores were calculated for each node. Community detection analysis identified potential themes. The accuracy of the network was assessed using bootstrapping. Results: Two consistent communities were identified: "Access to meaningful activity" and "Learning from experiences." "I feel confident enough to get involved in things in life that interest me" was the strongest item, although the strength stability coefficient (0.36) suggested strength should be interpreted with caution. The average edge weight was 0.02; however, stronger edges were identified. Limitations: The network showed low stability, possibly due to sample heterogeneity. Future work could incorporate demographic variables, such as time since BD diagnosis or stage of personal recovery, into network estimation. Conclusions: Network analysis can be applied to personal recovery, not only clinical symptoms of BD. Clinical applications could include tailoring recovery‐focussed therapies towards encouraging important aspects of recovery, such as feeling confident to get involved with life. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. Partitioning Dichotomous Items Using Mokken Scale Analysis, Exploratory Graph Analysis and Parallel Analysis: A Monte Carlo Simulation.

Author: Abdelhamid, Gomaa Said Mohamed, Hidalgo, María Dolores, French, Brian F., and Gómez-Benito, Juana
Subjects: *MOKKEN model, *LEAST squares, *GENETIC algorithms, *MULTIPLE correspondence analysis (Statistics), *MONTE Carlo method
Abstract: Estimating the number of latent factors underlying a set of dichotomous items is a major challenge in social and behavioral research. Mokken scale analysis (MSA) and exploratory graph analysis (EGA) are approaches for partitioning measures consisting of dichotomous items. In this study we perform simulation-based comparisons of two EGA methods (EGA with graphical least absolute shrinkage and selector operator; EGAtmfg with triangulated maximally filtered graph algorithm), two MSA methods (AISP: automated item selection procedure; GA: genetic algorithm), and two widely used factor analytic techniques (parallel analysis with principal component analysis (PApc) and parallel analysis with principal axis factoring (PApaf)) for partitioning dichotomous items. Performance of the six methods differed significantly according to the data structure. AISP and PApc had highest accuracy and lowest bias for unidimensional structures. Moreover, AISP demonstrated the lowest rate of misclassification of items. Regarding multidimensional structures, EGA with GLASSO estimation and PApaf yielded highest accuracy and lowest bias, followed by EGAtmfg. In addition, both EGA techniques exhibited the lowest rate of misclassification of items to factors. In summary, EGA and EGAtmfg showed comparable performance to the highly accurate traditional method, parallel analysis. These findings offer guidance on selecting methods for dimensionality analysis with dichotomous indicators to optimize accuracy in factor identification. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

25. Community Detection Using Deep Learning: Combining Variational Graph Autoencoders with Leiden and K-Truss Techniques.

Author: Patil, Jyotika Hariom, Potikas, Petros, Andreopoulos, William B., and Potika, Katerina
Subjects: *K-means clustering, *ALGORITHMS, *DEEP learning
Abstract: Deep learning struggles with unsupervised tasks like community detection in networks. This work proposes the Enhanced Community Detection with Structural Information VGAE (VGAE-ECF) method, a method that enhances variational graph autoencoders (VGAEs) for community detection in large networks. It incorporates community structure information and edge weights alongside traditional network data. This combined input leads to improved latent representations for community identification via K-means clustering. We perform experiments and show that our method works better than previous approaches of community-aware VGAEs. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

26. PCABM: Pairwise Covariates-Adjusted Block Model for Community Detection.

Author: Huang, Sihan, Sun, Jiajin, and Feng, Yang
Subjects: *MAXIMUM likelihood statistics, *STOCHASTIC models, *FEATURE selection, *GENERALIZATION, *ALGORITHMS
Abstract: One of the most fundamental problems in network study is community detection. The stochastic block model (SBM) is a widely used model, and various estimation methods have been developed with their community detection consistency results unveiled. However, the SBM is restricted by the strong assumption that all nodes in the same community are stochastically equivalent, which may not be suitable for practical applications. We introduce a pairwise covariates-adjusted stochastic block model (PCABM), a generalization of SBM that incorporates pairwise covariate information. We study the maximum likelihood estimators of the coefficients for the covariates as well as the community assignments, and show they are consistent under suitable sparsity conditions. Spectral clustering with adjustment (SCWA) is introduced to efficiently solve PCABM. Under certain conditions, we derive the error bound of community detection for SCWA and show that it is community detection consistent. In addition, we investigate model selection in terms of the number of communities and feature selection for the pairwise covariates, and propose two corresponding algorithms. PCABM compares favorably with the SBM or degree-corrected stochastic block model (DCBM) under a wide range of simulated and real networks when covariate information is accessible. for this article are available online. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. Network community detection using higher-order structures.

Author: Yu, X and Zhu, J
Subjects: *STATISTICAL errors, *SUBGRAPHS, *STATISTICAL models
Abstract: In many real-world networks, it is often observed that subgraphs or higher-order structures of certain configurations, e.g. triangles and by-fans, are overly abundant compared to standard randomly generated networks (Milo et al. 2002). However, statistical models accounting for this phenomenon are limited, especially when community structure is of interest. This limitation is coupled with a lack of community detection methods that leverage subgraphs or higher-order structures. In this paper, we propose a new community detection method that effectively uses higher-order structures in a network. Furthermore, for the community detection accuracy, under an edge-dependent network model that consists of both community and triangle structures, we develop a finite-sample error bound characterized by the expected triangle degree, which leads to the consistency of the proposed method. To the best of our knowledge, this is the first statistical error bound and consistency result for community detection of a single network considering a network model with dependent edges. We also show, in both simulation studies and a real-world data example, that our method unveils network communities that are otherwise invisible to methods that ignore higher-order structures. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. Community detection in attributed networks using neighborhood information.

Author: Wang, Xiaozong, Tang, Fengqin, Wang, Yuanyuan, Li, Cuixia, and Zhao, Xuejing
Subjects: *SOCIAL influence, *STOCHASTIC models, *NEIGHBORHOODS, *TOPOLOGY, *NEIGHBORS
Abstract: Community detection is a crucial aspect in network analysis. Real-world networks are often enriched with attributes providing extensive information for nodes beyond mere topology. Integrating these nodal attributes into community detection for attributed networks poses notable challenges and remains an active research field. In this paper, we propose a novel method that incorporates structural information into fused attributes. This is achieved by defining a fusion similarity between nodes, which is a convex combination of topology similarity, pairwise attribute similarity, and attribute similarity with their immediate neighbors. One advantage of the proposed method is its flexibility in identifying communities in disassortative networks, where nodes exhibit more connections between different groups than within their own group. We employ an iterative spectral clustering technique to discover communities and assess the influence of various attributes within these communities. Our experimental results validate the effectiveness of this approach, demonstrating its utility in leveraging node attributes in diverse simulated and real-world network datasets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

29. Community detection based on improved user interaction degree, weighted quasi-local path-based similarity and frequent pattern mining.

Author: Sayari, Somaye, Harounabadi, Ali, and Banirostam, Touraj
Subjects: *SOCIAL networks
Abstract: Community detection is a significant research area in social networks. Most methods use network topology, but combining it with user interactions improves accuracy. This paper proposes a robust method to identify communities based on the improved user interaction degree, the weighted quasi-local structural similarity measure, and the frequent pattern mining on user interactions. In the community creation phase, influential users are identified based on eigenvector centrality and users who interact with them the most are extracted based on frequent pattern mining. In the community expansion phase, we introduce a measure to calculate the degree of user interactions based on the local clustering coefficient improved by interactions between common neighbors. We present two strategies to expand the community. The first strategy, a direct connection, exists between a user outside and a user inside the community. Their similarity is calculated based on the combined measure of improved user interaction degree and user degrees. The second strategy is if two users do not have a direct connection, we consider their communication paths. Therefore, we present a similarity measure combining a quasi-local path-based measure and an improved user interaction degree. Analysis of Higgs Twitter and Flickr datasets using internal density, Normalized Mutual Information, and Adjusted Rand Index shows that this paper's method outperforms the other five community detection methods. Furthermore, our method has more robustness than other relevant methods. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

30. CDCA: Community detection in RNA-seq data using centrality-based approach.

Author: Sarmah, Tonmoya and Bhattacharyya, Dhruba K
Abstract: One of the integral part of the network analysis is finding groups of nodes that exhibit similar properties. Community detection techniques are a popular choice to find such groups or communities within a network and it relies on graph-based methods to achieve this goal. Finding communities in biological networks such as gene co-expression networks are particularly important to find groups of genes where we can focus on further downstream analysis and find valuable insights regarding concerned diseases. Here, we present an effective community detection method called community detection using centrality-based approach (CDCA), designed using the graph centrality approach. The method has been tested using four benchmark bulk RNA-seq datasets for schizophrenia and bipolar disorder, and the performance has been proved superior in comparison to several other counterparts. The quality of communities are determined using intrinsic graph properties such as modularity and homogeneity. The biological significance of resultant communities is decided using the pathway enrichment analysis. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

31. Influence Maximization in Social Network using Community Detection and Node Modularity.

Author: Tyagi, Aditya Dayal and Asawa, Krishna
Subjects: ONLINE social networks, SOCIAL influence, SOCIAL networks, DEEP learning, INFORMATION dissemination
Abstract: Research in the area of identifying the most influential users in social networks is currently regarded to be one of the most important areas of study. Through the examination of the most influential users on social networks, it is possible to analyze and, in some cases, manage the dissemination of information. A technique that is both quick and scalable is proposed in this research as a means of identifying the users with maximum diffusion capabilities in online social networks. This approach is suited for directed networks as well as undirected networks. The approach that has been suggested is comprised of four stages: (1) community detection, which involves the partial partitioning of the whole social network into communities that are connected to one another by the use of the Louvain algorithm; (2) the removal of communities that are not suitable; (3) selection of prominent nodes within the particular community; and (4) selection of the top k seed nodes. Experimental research was carried out on a number of datasets, each of which was of a different complexity. Using imperfect social networks, it has been demonstrated that the findings generate better outcomes for the diffusion of influence than the current related work models, and they do so with a much less amount of processing time being required. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. Extraction and Structuring of Financial Terminology.

Author: Porta-Zamorano, Jordi, Carbajo-Coronado, Blanca, and Moreno-Sandoval, Antonio
Subjects: LANGUAGE models, SOCIAL network analysis, FINANCIAL statements, SPANISH language, TERMS & phrases
Abstract: Copyright of Procesamiento del Lenguaje Natural is the property of Sociedad Espanola para el Procesamiento del Lenguaje Natural and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

33. Anonymous group structure algorithm based on community structure.

Author: Kuang, Linghong, Si, Kunliang, and Zhang, Jing
Subjects: DATA privacy, GRAPH algorithms, SOCIAL networks, PRIVACY, ALGORITHMS
Abstract: A social network is a platform that users can share data through the internet. With the ever-increasing intertwining of social networks and daily existence, the accumulation of personal privacy information is steadily mounting. However, the exposure of such data could lead to disastrous consequences. To mitigate this problem, an anonymous group structure algorithm based on community structure is proposed in this article. At first, a privacy protection scheme model is designed, which can be adjusted dynamically according to the network size and user demand. Secondly, based on the community characteristics, the concept of fuzzy subordinate degree is introduced, then three kinds of community structure mining algorithms are designed: the fuzzy subordinate degree-based algorithm, the improved Kernighan-Lin algorithm, and the enhanced label propagation algorithm. At last, according to the level of privacy, different anonymous graph construction algorithms based on community structure are designed. Furthermore, the simulation experiments show that the three methods of community division can divide the network community effectively. They can be utilized at different privacy levels. In addition, the scheme can satisfy the privacy requirement with minor changes. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

34. Revealing the Community Structure of Urban Bus Networks: a Multi-view Graph Learning Approach.

Author: Chen, Shuaiming, Ji, Ximing, and Shao, Haipeng
Subjects: MACHINE learning, PUBLIC transit, GRAPH algorithms, URBANIZATION, ALGORITHMS
Abstract: Despite great progress in enhancing the efficiency of public transport, one still cannot seamlessly incorporate structural characteristics into existing algorithms. Moreover, comprehensively exploring the structure of urban bus networks through a single-view modelling approach is limited. In this research, a multi-view graph learning algorithm (MvGL) is proposed to aggregate community information from multiple views of urban bus system. First, by developing a single-view graph encoder module, latent community relationships can be captured during learning node embeddings. Second, inspired by attention mechanism, a multi-view graph encoder module is designed to fuse node embeddings in different views, aims to perceive more community information of urban bus network comprehensively. Then, the community assignment can be updated by using a differentiable clustering layer. Finally, a well-defined objective function, which integrates node level, community level and graph level, can help improve the quality of community detection. Experimental results demonstrated that MvGL can effectively aggregate community information from different views and further improve the quality of community detection. This research contributes to the understanding the structural characteristics of public transport networks and facilitates their operational efficiency. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

35. Root and community inference on the latent growth process of a network.

Author: Crane, Harry and Xu, Min
Subjects: SOCIAL networks, FAKE news, ATTRIBUTION of news, STATISTICAL models, ROOT growth, GIBBS sampling
Abstract: Many statistical models for networks overlook the fact that most real-world networks are formed through a growth process. To address this, we introduce the Preferential Attachment Plus Erdős–Rényi model, where we let a random network G be the union of a preferential attachment (PA) tree T and additional Erdős–Rényi (ER) random edges. The PA tree captures the underlying growth process of a network where vertices/edges are added sequentially, while the ER component can be regarded as noise. Given only one snapshot of the final network G ⁠ , we study the problem of constructing confidence sets for the root node of the unobserved growth process; the root node can be patient zero in an infection network or the source of fake news in a social network. We propose inference algorithms based on Gibbs sampling that scales to networks with millions of nodes and provide theoretical analysis showing that the size of the confidence set is small if the noise level of the ER edges is not too large. We also propose variations of the model in which multiple growth processes occur simultaneously, reflecting the growth of multiple communities; we use these models to provide a new approach to community detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

36. Multi-view clustering analysis of mega-city regions based on intercity flow networks

Author: Zhiqiang Wu, Gang Zhao, Haowen Xu, Renlu Qiao, and Qian Zhao
Subjects: Mega-city regions, Multi-view clustering, Intercity flow networks, Graph learning, Community detection, Urbanization. City and country, HT361-384, Regional planning, HT390-395, Social Sciences
Abstract: Abstract With the booming of Big Data and the Internet of Things, various urban networks have been built based on intercity flow data, and how to combine them to learn a more comprehensive understanding of mega-city regions is becoming more and more indispensable. In this paper, we designed a graph-based multi-view clustering method based on graph learning to explore the mega-city region structures from multi-source data. An example of clustering analysis consists of the people flow network, cargo flow network, and information flow network, covering 88 cities from Beijing, Tianjin, Hebei Province, Shandong Province, Henan Province, Jiangsu Province, Anhui Province, Shanghai, and Zhejiang Province in China is used to illustrate the applicability of the idea in super mega-city region scale studies. Utilizing the proposed clustering method, a unified network representation is calculated, and 5 mega-city regions, Beijing-Tianjin-Hebei Cluster, Henan Cluster, Shandong Cluster, Shanghai-Jiangsu-Anhui Cluster, and Zhejiang Cluster, are detected based on intercity flow networks. City-to-city flows, including Luan-Taizhou, Lianyungang-Chuzhou, and Xuzhou-Bengbu of the people network, Shanghai-Hangzhou, Suzhou-Shanghai, and Shanghai-Ningbo of the cargo network, Shanghai-Hangzhou, Bozhou-Jinhua, and Huaibei-Bozhou of the information network, are suggested to be further enhanced to facilitate the ongoing nationwide constructions of urban agglomerations in China. The multi-view clustering method proved to be a helpful calculation framework for mega-city region analysis, which would also be considered as a substantial foundation for further urban explorations with more advanced graph learning techniques.
Published: 2024
Full Text: View/download PDF

37. Detection of Urban Space Community Structures in Island-Oriented Cities Based on Smart Bus Card Data.

Author: Gao, Yueer, Shi, Ruizheng, Chen, Ye, and Qi, Ruizhen
Subjects: *LARGE space structures (Astronautics), *CITIES & towns, *ARCHIPELAGOES, *SMART cards, *CHOICE of transportation, *PUBLIC spaces
Abstract: The urban space community is the basic unit used to demonstrate the urban spatial structure, and the space community structure of island-oriented cities is different from that of general inland cities because they are formed based on the island and the inland, respectively. Through the case study of Xiamen, based on smart card data, this paper divides the obtained origin–destination data into local and global levels with trip cutoff thresholds. The distribution of city centers is analyzed by detecting the overall urban space community structure and the two-level intraisland and cross-island urban space community structure with the PageRank and community detection algorithms. The results revealed that: (1) in the spatial structure of the island-oriented city, the cores are clustered on the island, while the centers are distributed in bands and dots outside the island; (2) the community units are distributed across the island; and (3) the space community boundary of the island-oriented city is quite different from the artificially defined administrative unit boundary. To summarize, by enhancing the delineation of urban spatial communities using the Gaussian kernel function and least-squares cross-validation, improving the model granularity, and integrating data from multiple public transport modes, this paper achieved a more precise division of urban spatial community structures. Consequently, it revealed the distinctive urban form and developmental pattern of the island-oriented city, holding significant implications for investigating urban spatial layout studies. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. Community detection in attributed social networks using deep learning.

Author: Rashnodi, Omid, Rastegarpour, Maryam, Moradi, Parham, and Zamanifar, Azadeh
Subjects: *STOCHASTIC matrices, *REPRESENTATIONS of graphs, *SOCIAL networks, *TEST methods, *TOPOLOGY, *DEEP learning
Abstract: Existing methods for detecting communities in attributed social networks often rely solely on network topology, which leads to suboptimal accuracy in community detection, inefficient use of available data, and increased time required for identifying groups. This paper introduces the Dual Embedding-based Graph Convolution Network (DEGCN) to address these challenges. This new method uses graph embedding techniques in a new deep learning framework to improve accuracy and speed up community detection by combining the nodes' content with the network's topology. Initially, we compute the modularity and Markov matrices of the input graph. Each matrix is then processed through a graph embedding network with at least two layers to produce a condensed graph representation. As a result, a multilayer perceptron neural network classifies each node's community based on these generated embeddings. We tested the suggested method on three standard datasets: Cora, CiteSeer, and PubMed. Then, we compared the outcomes to many basic and advanced approaches using five important metrics: F1-score, adjusted rand index (ARI), normalized mutual information (NMI), and accuracy. The findings demonstrate that the DEGCN accurately captures community structure, achieves superior precision, and has higher ARI, NMI, and F1 scores, significantly outperforming existing algorithms for identifying community structures in medium-scale networks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

39. Probabilistic temporal semantic graph: a holistic framework for event detection in twitter.

Author: Bashiri, Hadis and Naderi, Hassan
Subjects: PROBABILITY density function, TEXT mining, CRISIS management, MARKETING management, MARKETING research
Abstract: Event detection on social media platforms, especially Twitter, poses significant challenges due to the dynamic nature and high volume of data. The rapid flow of tweets and the varied ways users express thoughts complicate the identification of relevant events. Accurately identifying and interpreting events from this noisy and fast-paced environment is crucial for various applications, including crisis management and market analysis. This paper presents a novel unsupervised framework for event detection on social media, designed to enhance the accuracy and efficiency of identifying significant events from Twitter data. The framework incorporates several innovative techniques, including dynamic bandwidth adjustment based on local data density, Mahalanobis distance integration, adaptive kernel density estimation, and an improved Louvain-MOMR method for community detection. Additionally, a new scoring system is implemented to accurately extract trending words that evoke strong emotions, improving the identification of event-related keywords. The proposed framework demonstrates robust performance across three diverse datasets: FACup, Super Tuesday, and US Election, showcasing its effectiveness in capturing temporal and semantic patterns within tweets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

40. Community detection in social networks using machine learning: a systematic mapping study.

Author: Nooribakhsh, Mahsa, Fernández-Diego, Marta, González-Ladrón-De-Guevara, Fernando, and Mollamotalebi, Mahdi
Subjects: SOCIAL networks, SOCIAL types, KARATE, ALGORITHMS
Abstract: One of the important issues in social networks is the social communities which are formed by interactions between its members. Three types of community including overlapping, non-overlapping, and hidden are detected by different approaches. Regarding the importance of community detection in social networks, this paper provides a systematic mapping of machine learning-based community detection approaches. The study aimed to show the type of communities in social networks along with the algorithms of machine learning that have been used for community detection. After carrying out the steps of mapping and removing useless references, 246 papers were selected to answer the questions of this research. The results of the research indicated that unsupervised machine learning-based algorithms with 41.46% (such as k means) are the most used categories to detect communities in social networks due to their low processing overheads. On the other hand, there has been a significant increase in the use of deep learning since 2020 which has sufficient performance for community detection in large-volume data. With regard to the ability of NMI to measure the correlation or similarity between communities, with 53.25%, it is the most frequently used metric to evaluate the performance of community identifications. Furthermore, considering availability, low in size, and lack of multiple edge and loops, dataset Zachary's Karate Club with 26.42% is the most used dataset for community detection research in social networks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

41. Vgasom: community detection based on self-organizing map clustering of graph's embeddings.

Author: Alshahrani, Atheer Abdullah and Alslooli, Sundus Abdulrahman
Abstract: In this paper, we proposed VGASOM, a neural network approach for community detection. Community detection refers to discovering similar nodes in a graph that form a community having similar features or attributes as opposed to nodes from other communities. The proposed approach combines the capabilities of auto-encoder neural networks, specifically a Variational graph auto-encoder (VGAE) with self-organizing maps (SOM) clustering. VGAEs have achieved great success in learning the latent representation of graphs and therefore encoding them into lower-dimensional embeddings. The self-organizing map based on competitive learning is used to find communities in the graphs' embeddings obtained by the VGAE model which further reduces its dimensionality and divides the input space into clusters that correspond to the communities in the graph. We conducted experiments to evaluate our model compared to several baseline models, our model shows promising results for the community detection task. It outperforms the state-of-the-art methods by 3.29% in terms of the accuracy and 9% in terms of F1 metric. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

42. Network, correlation, and community structure of the financial sector of Bursa Malaysia before, during, and after COVID-19

Author: Nurun Najwa Bahari, Hafizah Bahaludin, Munira Ismail, and Fatimah Abdul Razak
Subjects: modularity, louvain, community detection, malaysia stock market, correlation network, covid-19, Finance, HG1-9999, Statistics, HA1-4737
Abstract: COVID-19 triggered a worldwide economic decline and raised concerns regarding its economic consequences on stock markets across the globe, notably on the Malaysian stock market. We examined how COVID-19 impacted Malaysia's financial market using correlation and network analysis. We found a rise in correlations between stocks during the pandemic, suggesting greater interdependence. To visualize this, we created networks for pre-pandemic, during-pandemic, and post-pandemic periods. Additionally, we built a network for the during-pandemic period with a specific threshold corresponding to pre- and post-pandemic network density. The networks during the pandemic showed increased connectivity and only contained positive correlations, reflecting synchronized stock movements. Last, we analyzed the networks' modularity, revealing highest modularity during the pandemic, which suggests stronger yet risk-prone communities.
Published: 2024
Full Text: View/download PDF

43. Location method for emergency rescue node on expressways based on spatio-temporal characteristics of vehicle operation

Author: Xinghua Hu, Zhouzuo Wang, Jiahao Zhao, Ran Wang, Hao Lei, Wei Liu, and Bing Long
Subjects: Expressway, Community detection, OD data, Emergency rescue, Node location, Medicine, Science
Abstract: Abstract Expressway networks are continuously developing and emergency rescue demand is increasing proportionately. The location of expressway emergency rescue nodes needs refinement to meet changing requirements. In this study, the expressway was modeled as an expressway network. The differences in the origin destination (OD) distribution matrices for working days and major holidays were used as the bases for determining the need for temporary emergency rescue nodes. Overlapping and non-overlapping community detection algorithms were used to extract the distribution characteristics of OD during both day categories. These distributions were used to determine permanent and temporary emergency rescue sites. In this study, we considered the differences in traffic volume, distance, and impact of four vehicle types on traffic accidents to select the location of emergency rescue nodes, and allocate emergency resources. An emergency rescue node selection model for an expressway network was established based on spatio-temporal characteristics. The results based on a regional example determined that 22 permanent and 25 temporary emergency rescue nodes were appropriate. The average rescue time for traffic accidents during working days and major holidays compared to the P-center location model, was reduced by approximately 27.08% and 6.70%, respectively. The coefficient of variation of emergency rescue time was reduced by approximately 28.22% and 21.41%, respectively. The results indicated that the model satisfied the expressway emergency rescue demand requirements, and improved the rationality of the rescue center node layout.
Published: 2024
Full Text: View/download PDF

44. COMSE: analysis of single-cell RNA-seq data using community detection-based feature selection

Author: Qinhuan Luo, Yaozhu Chen, and Xun Lan
Subjects: Feature selection, Single-cell RNA-sequencing, Community detection, Biology (General), QH301-705.5
Abstract: Abstract Background Single-cell RNA sequencing enables studying cells individually, yet high gene dimensions and low cell numbers challenge analysis. And only a subset of the genes detected are involved in the biological processes underlying cell-type specific functions. Result In this study, we present COMSE, an unsupervised feature selection framework using community detection to capture informative genes from scRNA-seq data. COMSE identified homogenous cell substates with high resolution, as demonstrated by distinguishing different cell cycle stages. Evaluations based on real and simulated scRNA-seq datasets showed COMSE outperformed methods even with high dropout rates in cell clustering assignment. We also demonstrate that by identifying communities of genes associated with batch effects, COMSE parses signals reflecting biological difference from noise arising due to differences in sequencing protocols, thereby enabling integrated analysis of scRNA-seq datasets of different sources. Conclusions COMSE provides an efficient unsupervised framework that selects highly informative genes in scRNA-seq data improving cell sub-states identification and cell clustering. It identifies gene subsets that reveal biological and technical heterogeneity, supporting applications like batch effect correction and pathway analysis. It also provides robust results for bulk RNA-seq data analysis.
Published: 2024
Full Text: View/download PDF

45. Revealing multi-scale spatial synergy of mega-city region from a human mobility perspective

Author: Bichen Fang, Mingxiao Li, Zhengdong Huang, Yang Yue, Wei Tu, and Renzhong Guo
Subjects: Spatial synergy, human mobility, community detection, backbone extraction, The Pearl River Delta (PRD), Mathematical geography. Cartography, GA1-1776, Geodesy, QB275-343
Abstract: Spatial synergy is strengthened integration and connection between cities in a mega-city region, transcending administrative boundaries. The central flow theory suggests that the mega-city regions are formed by the interconnected flows of people across cities, making the spatial synergy can be measured by assessing the aggregation and intensity of flows and interactions between cities and regions. Human mobility data, such as mobile phone data and social media check-ins, enable the tracking of human movements, thus facilitating the transition of central flow theory from theoretical constructs to empirical research. To this end, this study presents an alternative data-driven framework to reveal the multi-scale spatial synergy of mega-city regions from a human mobility perspective. It uncovers homogeneously spatial communities with high inter-city integration using community detection. Strong internal spatial connections of 2.13 billion mobility are filtered using network backbone extraction. An experiment in the Pearl River Delta (PRD), China, demonstrates a multi-scale and multi-core hierarchical spatial synergy in the PRD region. The detailed findings are as follows: (1) Three cities attract the majority of human mobility. Mobility distance is short in urban centers and long in suburban areas. (2) The spatial integration pattern shows the detected communities reveal the hierarchical integration pattern with three main integrated regions: Guangzhou-Foshan-Zhaoqing, Shenzhen-Dongguan-Huizhou, and Zhuhai-Zhongshan-Jiangmen. (3) The spatial connection pattern illustrates the close ties of 9 cities and three core cities, including Guangzhou, Shenzhen, and Foshan. These results provide a human-centric understanding of urban synergy and deeper insights into central flow theory, which supports cooperative development in mega-city regions.
Published: 2024
Full Text: View/download PDF

46. Multi-scale cross-city community detection of urban agglomeration using signaling big data

Author: Wenbo Yu, Zhenfeng Shao, Xiao Huang, Deren Li, Yewen Fan, and Xiaodi Xu
Subjects: Cross-city communities, community detection, mobile big data, human interaction network, scale effect, Mathematical geography. Cartography, GA1-1776, Geodesy, QB275-343
Abstract: Many existing efforts have taken advantage of large-scale spatial-temporal data to partition cities via constructed human interaction networks. However, few studies focus on communities emerging between adjacent cities in big urban agglomerations, which we call “cross-city” communities. In this study, we introduce a novel framework to detect cross-city communities in urban agglomerations under different scales leveraging a large number of fine-grained mobile signaling data aiming to break the original administrative boundaries. Taking the Pearl River Delta (PRD) urban agglomeration in China as study area, we investigate the existence of potential communities at three scales, i.e. city-group level, city level and sub-city level. The partition results are expected to benefit transportation planning, urban zoning and administrative boundary re-delineation. The results from our study highlight the necessity of considering cross-city communities and their scale effects when examining urban spatial interactions.
Published: 2024
Full Text: View/download PDF

47. Traffic demand prediction using a social multiplex networks representation on a multimodal and multisource dataset

Author: Panagiotis Fafoutellis and Eleni I. Vlahogianni
Subjects: Multiplex networks, Community detection, Multi-layer graphs, Traffic prediction, Multimodal data, Transportation engineering, TA1001-1280
Abstract: In this paper, a meaningful representation of the road network using multiplex networks and a novel feature selection framework that enhances the predictability of future traffic conditions of an entire network are proposed. Using data on traffic volumes and tickets’ validation from the transportation network of Athens, we were able to develop prediction models that not only achieve very good performance but are also trained efficiently, do not introduce high complexity and, thus, are suitable for real-time operation. More specifically, the network’s nodes (loop detectors and subway/metro stations) are organized as a multilayer graph, each layer representing an hour of the day. Nodes with similar structural properties are then classified in communities and are exploited as features to predict the future demand values of nodes belonging to the same community. The results reveal the potential of the proposed method to provide reliable and accurate predictions.
Published: 2024
Full Text: View/download PDF

48. Improving modularity score of community detection using memetic algorithms

Author: Dongwon Lee, Jingeun Kim, and Yourim Yoon
Subjects: genetic algorithm, local search, community detection, modularity, memetic algorithm, Mathematics, QA1-939
Abstract: With the growth of online networks, understanding the intricate structure of communities has become vital. Traditional community detection algorithms, while effective to an extent, often fall short in complex systems. This study introduced a meta-heuristic approach for community detection that leveraged a memetic algorithm, combining genetic algorithms (GA) with the stochastic hill climbing (SHC) algorithm as a local optimization method to enhance modularity scores, which was a measure of the strength of community structure within a network. We conducted comprehensive experiments on five social network datasets (Zachary's Karate Club, Dolphin Social Network, Books About U.S. Politics, American College Football, and the Jazz Club Dataset). Also, we executed an ablation study based on modularity and convergence speed to determine the efficiency of local search. Our method outperformed other GA-based community detection methods, delivering higher maximum and average modularity scores, indicative of a superior detection of community structures. The effectiveness of local search was notable in its ability to accelerate convergence toward the global optimum. Our results not only demonstrated the algorithm's robustness across different network complexities but also underscored the significance of local search in achieving consistent and reliable modularity scores in community detection.
Published: 2024
Full Text: View/download PDF

49. A method for cleaning wind power anomaly data by combining image processing with community detection algorithms

Author: Qiaoling Yang, Kai Chen, Jianzhang Man, Jiaheng Duan, and Zuoqi Jin
Subjects: Wind turbine power curve, Abnormal data cleaning, Community detection, Louvain algorithm, Mathematical morphology operation, Energy conservation, TJ163.26-163.5, Energy industries. Energy policy. Fuel trade, HD9502-9502.5
Abstract: Current methodologies for cleaning wind power anomaly data exhibit limited capabilities in identifying abnormal data within extensive datasets and struggle to accommodate the considerable variability and intricacy of wind farm data. Consequently, a method for cleaning wind power anomaly data by combining image processing with community detection algorithms (CWPAD-IPCDA) is proposed. To precisely identify and initially clean anomalous data, wind power curve (WPC) images are converted into graph structures, which employ the Louvain community recognition algorithm and graph- theoretic methods for community detection and segmentation. Furthermore, the mathematical morphology operation (MMO) determines the main part of the initially cleaned wind power curve images and maps them back to the normal wind power points to complete the final cleaning. The CWPAD-IPCDA method was applied to clean datasets from 25 wind turbines (WTs) in two wind farms in northwest China to validate its feasibility. A comparison was conducted using density-based spatial clustering of applications with noise (DBSCAN) algorithm, an improved isolation forest algorithm, and an image-based (IB) algorithm. The experimental results demonstrate that the CWPAD-IPCDA method surpasses the other three algorithms, achieving an approximately 7.23% higher average data cleaning rate. The mean value of the sum of the squared errors (SSE) of the dataset after cleaning is approximately 6.887 lower than that of the other algorithms. Moreover, the mean of overall accuracy, as measured by the F1-score, exceeds that of the other methods by approximately 10.49%; this indicates that the CWPAD-IPCDA method is more conducive to improving the accuracy and reliability of wind power curve modeling and wind farm power forecasting.
Published: 2024
Full Text: View/download PDF

50. Assessing Spillover Effects of Medications for Opioid Use Disorder on HIV Risk Behaviors among a Network of People Who Inject Drugs

Author: Joseph Puleo, Ashley Buchanan, Natallia Katenka, M. Elizabeth Halloran, Samuel R. Friedman, and Georgios Nikolopoulos
Subjects: causal inference, networks, spillover, community detection, Human Immunodeficiency Virus, People Who Inject Drugs, Statistics, HA1-4737
Abstract: People who inject drugs (PWID) have an increased risk of HIV infection partly due to injection behaviors often related to opioid use. Medications for opioid use disorder (MOUD) have been shown to reduce HIV infection risk, possibly by reducing injection risk behaviors. MOUD may benefit individuals who do not receive it themselves but are connected through social, sexual, or drug use networks with individuals who are treated. This is known as spillover. Valid estimation of spillover in network studies requires considering the network’s community structure. Communities are groups of densely connected individuals with sparse connections to other groups. We analyzed a network of 277 PWID and their contacts from the Transmission Reduction Intervention Project. We assessed the effect of MOUD on reductions in injection risk behaviors and the possible benefit for network contacts of participants treated with MOUD. We identified communities using modularity-based methods and employed inverse probability weighting with community-level propensity scores to adjust for measured confounding. We found that MOUD may have beneficial spillover effects on reducing injection risk behaviors. The magnitudes of estimated effects were sensitive to the community detection method. Careful consideration should be paid to the significance of community structure in network studies evaluating spillover.
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

6,954 results on '"Community Detection"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources