Journal: algorithms / Language: english / Topic: algorithms and machine learning - Searchworks@Jio Institute Digital Library Search Results

1. Special Issue: "2022 and 2023 Selected Papers from Algorithms' Editorial Board Members".

Author: Werner, Frank
Subjects: EDITORIAL boards, ALGORITHMS, OPTIMIZATION algorithms, DIFFERENTIAL evolution, QUADRATIC assignment problem, MACHINE learning, TABU search algorithm
Abstract: This document is a special issue of the journal Algorithms, featuring selected papers from the journal's editorial board members from 2022 and 2023. The issue includes 16 research papers covering a range of topics such as game theory, fault detection in cellular networks, optimization algorithms, machine learning, cryptocurrency trading, and more. Each paper presents its own unique research findings and methodologies. The issue aims to showcase the diverse research interests and expertise of the journal's editorial board members. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

2. Artificial Intelligence Algorithms for Healthcare.

Author: Chumachenko, Dmytro and Yakovlev, Sergiy
Subjects: ARTIFICIAL intelligence, DEEP learning, ALGORITHMS, MACHINE learning, INFORMATION technology, MEDICAL care, MOTION capture (Human mechanics), MEDICAL technology
Abstract: Artificial intelligence (AI) algorithms are playing a crucial role in transforming healthcare by enhancing the quality, accessibility, and efficiency of medical care, research, and operations. These algorithms enable healthcare providers to offer more accurate diagnoses, predict outcomes, and customize treatments to individual patient needs. AI also improves operational efficiency by automating routine tasks and optimizing resource management. However, there are challenges to adopting AI in healthcare, such as data privacy concerns and potential biases in algorithms. Collaboration among stakeholders is necessary to ensure ethical use of AI and its positive impact on the field. AI also has applications in medical research, preventive medicine, and public health. It is important to recognize that AI should augment, not replace, the expertise and compassionate care provided by healthcare professionals. The ethical implications and societal impact of AI in healthcare must be carefully considered, guided by fairness, transparency, and accountability principles. Several research papers in this special issue explore the application of AI algorithms in various aspects of healthcare, such as gait analysis for Parkinson's disease diagnosis, human activity recognition, heart disease prediction, compliance assessment with clinical protocols, epidemic management, neurological complications identification, fall prevention, leukemia diagnosis, and genetic clinical pathways. These studies demonstrate the potential of AI in improving medical diagnostics, patient monitoring, and personalized care. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

3. Anomaly Detection in Blockchain Networks Using Unsupervised Learning: A Survey.

Author: Cholevas, Christos, Angeli, Eftychia, Sereti, Zacharoula, Mavrikos, Emmanouil, and Tsekouras, George E.
Subjects: DATA structures, MACHINE learning, PRIVATE networks, BLOCKCHAINS, ALGORITHMS
Abstract: In decentralized systems, the quest for heightened security and integrity within blockchain networks becomes an issue. This survey investigates anomaly detection techniques in blockchain ecosystems through the lens of unsupervised learning, delving into the intricacies and going through the complex tapestry of abnormal behaviors by examining avant-garde algorithms to discern deviations from normal patterns. By seamlessly blending technological acumen with a discerning gaze, this survey offers a perspective on the symbiotic relationship between unsupervised learning and anomaly detection by reviewing this problem with a categorization of algorithms that are applied to a variety of problems in this field. We propose that the use of unsupervised algorithms in blockchain anomaly detection should be viewed not only as an implementation procedure but also as an integration procedure, where the merits of these algorithms can effectively be combined in ways determined by the problem at hand. In that sense, the main contribution of this paper is a thorough study of the interplay between various unsupervised learning algorithms and how this can be used in facing malicious activities and behaviors within public and private blockchain networks. The result is the definition of three categories, the characteristics of which are recognized in terms of the way the respective integration takes place. When implementing unsupervised learning, the structure of the data plays a pivotal role. Therefore, this paper also provides an in-depth presentation of the data structures commonly used in unsupervised learning-based blockchain anomaly detection. The above analysis is encircled by a presentation of the typical anomalies that have occurred so far along with a description of the general machine learning frameworks developed to deal with them. Finally, the paper spotlights challenges and directions that can serve as a comprehensive compendium for future research efforts. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. Special Issue "Algorithms for Feature Selection".

Author: Khan, Muhammad Adnan
Subjects: DEEP learning, MACHINE learning, FEATURE selection, ALGORITHMS
Published: 2023
Full Text: View/download PDF

5. Forgetful Forests: Data Structures for Machine Learning on Streaming Data under Concept Drift.

Author: Yuan, Zhehu, Sun, Yinqi, and Shasha, Dennis
Subjects: MACHINE learning, DATA structures, DATABASES, MACHINE performance, PROBABILISTIC databases, ALGORITHMS
Abstract: Database and data structure research can improve machine learning performance in many ways. One way is to design better algorithms on data structures. This paper combines the use of incremental computation as well as sequential and probabilistic filtering to enable "forgetful" tree-based learning algorithms to cope with streaming data that suffers from concept drift. (Concept drift occurs when the functional mapping from input to classification changes over time). The forgetful algorithms described in this paper achieve high performance while maintaining high quality predictions on streaming data. Specifically, the algorithms are up to 24 times faster than state-of-the-art incremental algorithms with, at most, a 2% loss of accuracy, or are at least twice faster without any loss of accuracy. This makes such structures suitable for high volume streaming applications. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

6. A Learnheuristic Algorithm for the Capacitated Dispersion Problem under Dynamic Conditions.

Author: Gomez, Juan F., Uguina, Antonio R., Panadero, Javier, and Juan, Angel A.
Subjects: MACHINE learning, REINFORCEMENT learning, ALGORITHMS, TELECOMMUNICATION systems, DISPERSION (Chemistry)
Abstract: The capacitated dispersion problem, which is a variant of the maximum diversity problem, aims to determine a set of elements within a network. These elements could symbolize, for instance, facilities in a supply chain or transmission nodes in a telecommunication network. While each element typically has a bounded service capacity, in this research, we introduce a twist. The capacity of each node might be influenced by a random Bernoulli component, thereby rendering the possibility of a node having zero capacity, which is contingent upon a black box mechanism that accounts for environmental variables. Recognizing the inherent complexity and the NP-hard nature of the capacitated dispersion problem, heuristic algorithms have become indispensable for handling larger instances. In this paper, we introduce a novel approach by hybridizing a heuristic algorithm with reinforcement learning to address this intricate problem variant. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

7. Literature Review on Hybrid Evolutionary Approaches for Feature Selection.

Author: Piri, Jayashree, Mohapatra, Puspanjali, Dey, Raghunath, Acharya, Biswaranjan, Gerogiannis, Vassilis C., and Kanavos, Andreas
Subjects: FEATURE selection, METAHEURISTIC algorithms, LITERATURE reviews, MACHINE learning, MATHEMATICAL optimization, ALGORITHMS
Abstract: The efficiency and the effectiveness of a machine learning (ML) model are greatly influenced by feature selection (FS), a crucial preprocessing step in machine learning that seeks out the ideal set of characteristics with the maximum accuracy possible. Due to their dominance over traditional optimization techniques, researchers are concentrating on a variety of metaheuristic (or evolutionary) algorithms and trying to suggest cutting-edge hybrid techniques to handle FS issues. The use of hybrid metaheuristic approaches for FS has thus been the subject of numerous research works. The purpose of this paper is to critically assess the existing hybrid FS approaches and to give a thorough literature review on the hybridization of different metaheuristic/evolutionary strategies that have been employed for supporting FS. This article reviews pertinent documents on hybrid frameworks that were published in the period from 2009 to 2022 and offers a thorough analysis of the used techniques, classifiers, datasets, applications, assessment metrics, and schemes of hybridization. Additionally, new open research issues and challenges are identified to pinpoint the areas that have to be further explored for additional study. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

8. Image Quality Assessment for Gibbs Ringing Reduction.

Author: Wang, Yue and Healy, John J.
Subjects: MACHINE learning, IMAGE quality analysis, BEST practices, ALGORITHMS
Abstract: Gibbs ringing is an artefact that is inevitable in any imaging modality where the measurement is Fourier band-limited. It impacts the quality of the image by creating a ringing appearance around discontinuities. Many novel ways of suppressing the artefact have been proposed, including machine learning methods, but the quantitative comparisons of the results have frequently been lacking in rigour. In this paper, we examine image quality assessment metrics on three test images with different complexity. We determine six metrics which show promise for simultaneously assessing severity of Gibbs ringing and of other error such as blurring. We examined applying metrics to a region of interest around discontinuities in the image and use the metrics on the resulting region of interest. We demonstrate that the region of interest approach does not improve the performance of the metrics. Finally, we examine the effect of the error threshold parameter in two metrics. Our results will aid development of best practice in comparison of algorithms for the suppression of Gibbs ringing. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

9. An Adaptive Deep Learning Neural Network Model to Enhance Machine-Learning-Based Classifiers for Intrusion Detection in Smart Grids.

Author: Li, Xue Jun, Ma, Maode, and Sun, Yihan
Subjects: DEEP learning, MACHINE learning, COMPUTER network security, ENTORHINAL cortex, INFRASTRUCTURE (Economics), COMMUNICATION infrastructure, ALGORITHMS
Abstract: Modern smart grids are built based on top of advanced computing and networking technologies, where condition monitoring relies on secure cyberphysical connectivity. Over the network infrastructure, transported data containing confidential information, must be protected as smart grids are vulnerable and subject to various cyberattacks. Various machine learning based classifiers were proposed for intrusion detection in smart grids. However, each of them has respective advantage and disadvantages. Aiming to improve the performance of existing machine learning based classifiers, this paper proposes an adaptive deep learning algorithm with a data pre-processing module, a neural network pre-training module and a classifier module, which work together classify intrusion data types using their high-dimensional data features. The proposed Adaptive Deep Learning (ADL) algorithm obtains the number of layers and the number of neurons per layer by determining the characteristic dimension of the network traffic. With transfer learning, the proposed ADL algorithm can extract the original data dimensions and obtain new abstract features. By combining deep learning models with traditional machine learning-based classification models, the performance of classification of network traffic data is significantly improved. By using the Network Security Laboratory-Knowledge Discovery in Databases (NSL-KDD) dataset, experimental results show that the proposed ADL algorithm improves the effectiveness of existing intrusion detection methods and reduces the training time, indicating a promising candidate to enhance network security in smart grids. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

10. Simultaneous Feature Selection and Support Vector Machine Optimization Using an Enhanced Chimp Optimization Algorithm.

Author: Wu, Di, Zhang, Wanying, Jia, Heming, and Leng, Xin
Subjects: FEATURE selection, MATHEMATICAL optimization, CHIMPANZEES, SUPPORT vector machines, ALGORITHMS, PARTICLE swarm optimization, SEARCH algorithms, MACHINE learning
Abstract: Chimp Optimization Algorithm (ChOA), a novel meta-heuristic algorithm, has been proposed in recent years. It divides the population into four different levels for the purpose of hunting. However, there are still some defects that lead to the algorithm falling into the local optimum. To overcome these defects, an Enhanced Chimp Optimization Algorithm (EChOA) is developed in this paper. Highly Disruptive Polynomial Mutation (HDPM) is introduced to further explore the population space and increase the population diversity. Then, the Spearman's rank correlation coefficient between the chimps with the highest fitness and the lowest fitness is calculated. In order to avoid the local optimization, the chimps with low fitness values are introduced with Beetle Antenna Search Algorithm (BAS) to obtain visual ability. Through the introduction of the above three strategies, the ability of population exploration and exploitation is enhanced. On this basis, this paper proposes an EChOA-SVM model, which can optimize parameters while selecting the features. Thus, the maximum classification accuracy can be achieved with as few features as possible. To verify the effectiveness of the proposed method, the proposed method is compared with seven common methods, including the original algorithm. Seventeen benchmark datasets from the UCI machine learning library are used to evaluate the accuracy, number of features, and fitness of these methods. Experimental results show that the classification accuracy of the proposed method is better than the other methods on most data sets, and the number of features required by the proposed method is also less than the other algorithms. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

11. Skeptical Learning—An Algorithm and a Platform for Dealing with Mislabeling in Personal Context Recognition.

Author: Zhang, Wanyi, Zeni, Mattia, Passerini, Andrea, and Giunchiglia, Fausto
Subjects: USER-generated content, ALGORITHMS, INTERACTIVE learning, TRANSPORTATION of school children, STUDENTS, MACHINE learning
Abstract: Mobile Crowd Sensing (MCS) is a novel IoT paradigm where sensor data, as collected by the user's mobile devices, are integrated with user-generated content, e.g., annotations, self-reports, or images. While providing many advantages, the human involvement also brings big challenges, where the most critical is possibly the poor quality of human-provided content, most often due to the inaccurate input from non-expert users. In this paper, we propose Skeptical Learning, an interactive machine learning algorithm where the machine checks the quality of the user feedback and tries to fix it when a problem arises. In this context, the user feedback consists of answers to machine generated questions, at times defined by the machine. The main idea is to integrate three core elements, which are (i) sensor data, (ii) user answers, and (iii) existing prior knowledge of the world, and to enable a second round of validation with the user any time these three types of information jointly generate an inconsistency. The proposed solution is evaluated in a project focusing on a university student life scenario. The main goal of the project is to recognize the locations and transportation modes of the students. The results highlight an unexpectedly high pervasiveness of user mistakes in the university students life project. The results also shows the advantages provided by Skeptical Learning in dealing with the mislabeling issues in an interactive way and improving the prediction performance. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

12. Univariate Outlier Detection: Precision-Driven Algorithm for Single-Cluster Scenarios.

Author: El hairach, Mohamed Limam, Tmiri, Amal, and Bellamine, Insaf
Subjects: OUTLIER detection, DATA distribution, ALGORITHMS, DATA mining
Abstract: This study introduces a novel algorithm tailored for the precise detection of lower outliers (i.e., data points at the lower tail) in univariate datasets, which is particularly suited for scenarios with a single cluster and similar data distribution. The approach leverages a combination of transformative techniques and advanced filtration methods to efficiently segregate anomalies from normal values. Notably, the algorithm emphasizes high-precision outlier detection, ensuring minimal false positives, and requires only a few parameters for configuration. Its unsupervised nature enables robust outlier filtering without the need for extensive manual intervention. To validate its efficacy, the algorithm is rigorously tested using real-world data obtained from photovoltaic (PV) module strings with similar DC capacities, containing various outliers. The results demonstrate the algorithm's capability to accurately identify lower outliers while maintaining computational efficiency and reliability in practical applications. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering.

Author: Hämäläinen, Joonas, Jauhiainen, Susanne, and Kärkkäinen, Tommi
Subjects: PROTOTYPES, PATTERN recognition systems, MACHINE learning, ALGORITHMS, EMPIRICAL research
Abstract: Clustering is an unsupervised machine learning and pattern recognition method. In general, in addition to revealing hidden groups of similar observations and clusters, their number needs to be determined. Internal clustering validation indices estimate this number without any external information. The purpose of this article is to evaluate, empirically, characteristics of a representative set of internal clustering validation indices with many datasets. The prototype-based clustering framework includes multiple, classical and robust, statistical estimates of cluster location so that the overall setting of the paper is novel. General observations on the quality of validation indices and on the behavior of different variants of clustering algorithms will be given. [ABSTRACT FROM AUTHOR]
Published: 2017
Full Text: View/download PDF

14. Pediatric Ischemic Stroke: Clinical and Paraclinical Manifestations—Algorithms for Diagnosis and Treatment.

Author: Wessel, Niels, Sprincean, Mariana, Sidorenko, Ludmila, Revenco, Ninel, and Hadjiu, Svetlana
Subjects: ISCHEMIC stroke, SYMPTOMS, MACHINE learning, STROKE, ALGORITHMS
Abstract: Childhood stroke can lead to lifelong disability. Developing algorithms for timely recognition of clinical and paraclinical signs is crucial to ensure prompt stroke diagnosis and minimize decision-making time. This study aimed to characterize clinical and paraclinical symptoms of childhood and neonatal stroke as relevant diagnostic criteria encountered in clinical practice, in order to develop algorithms for prompt stroke diagnosis. The analysis included data from 402 pediatric case histories from 2010 to 2016 and 108 prospective stroke cases from 2017 to 2020. Stroke cases were predominantly diagnosed in newborns, with 362 (71%, 95% CI 68.99–73.01) cases occurring within the first 28 days of birth, and 148 (29%, 95% CI 26.99–31.01) cases occurring after 28 days. The findings of the study enable the development of algorithms for timely stroke recognition, facilitating the selection of optimal treatment options for newborns and children of various age groups. Logistic regression serves as the basis for deriving these algorithms, aiming to initiate early treatment and reduce lifelong morbidity and mortality in children. The study outcomes include the formulation of algorithms for timely recognition of newborn stroke, with plans to adopt these algorithms and train a fuzzy classifier-based diagnostic model using machine learning techniques for efficient stroke recognition. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. A Model-Agnostic Algorithm for Bayes Error Determination in Binary Classification.

Author: Michelucci, Umberto, Sperti, Michela, Piga, Dario, Venturini, Francesca, and Deriu, Marco A.
Subjects: RECEIVER operating characteristic curves, ALGORITHMS, CLASSIFICATION algorithms, CLASSIFICATION
Abstract: This paper presents the intrinsic limit determination algorithm (ILD Algorithm), a novel technique to determine the best possible performance, measured in terms of the AUC (area under the ROC curve) and accuracy, that can be obtained from a specific dataset in a binary classification problem with categorical features regardless of the model used. This limit, namely, the Bayes error, is completely independent of any model used and describes an intrinsic property of the dataset. The ILD algorithm thus provides important information regarding the prediction limits of any binary classification algorithm when applied to the considered dataset. In this paper, the algorithm is described in detail, its entire mathematical framework is presented and the pseudocode is given to facilitate its implementation. Finally, an example with a real dataset is given. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

16. An Introduction to Development of Centralized and Distributed Stochastic Approximation Algorithm with Expanding Truncations.

Author: Zhao, Wenxiao
Subjects: STOCHASTIC approximation, APPROXIMATION algorithms, ORDINARY differential equations, ALGORITHMS, PRINCIPAL components analysis, MACHINE learning
Abstract: The stochastic approximation algorithm (SAA), starting from the pioneer work by Robbins and Monro in 1950s, has been successfully applied in systems and control, statistics, machine learning, and so forth. In this paper, we will review the development of SAA in China, to be specific, the stochastic approximation algorithm with expanding truncations (SAAWET) developed by Han-Fu Chen and his colleagues during the past 35 years. We first review the historical development for the centralized algorithm including the probabilistic method (PM) and the ordinary differential equation (ODE) method for SAA and the trajectory-subsequence method for SAAWET. Then, we will give an application example of SAAWET to the recursive principal component analysis. We will also introduce the recent progress on SAAWET in a networked and distributed setting, named the distributed SAAWET (DSAAWET). [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

17. Optimizing Multidimensional Pooling for Variational Quantum Algorithms.

Author: Jeng, Mingyoung, Nobel, Alvir, Jha, Vinayak, Levy, David, Kneidel, Dylan, Chaudhary, Manu, Islam, Ishraq, Baumgartner, Evan, Vanderhoof, Eade, Facer, Audrey, Singh, Manish, Arshad, Abina, and El-Araby, Esam
Subjects: CIRCUIT complexity, MACHINE learning, COMPUTER vision, CONVOLUTIONAL neural networks, ALGORITHMS, MULTIDIMENSIONAL databases, QUANTUM computers
Abstract: Convolutional neural networks (CNNs) have proven to be a very efficient class of machine learning (ML) architectures for handling multidimensional data by maintaining data locality, especially in the field of computer vision. Data pooling, a major component of CNNs, plays a crucial role in extracting important features of the input data and downsampling its dimensionality. Multidimensional pooling, however, is not efficiently implemented in existing ML algorithms. In particular, quantum machine learning (QML) algorithms have a tendency to ignore data locality for higher dimensions by representing/flattening multidimensional data as simple one-dimensional data. In this work, we propose using the quantum Haar transform (QHT) and quantum partial measurement for performing generalized pooling operations on multidimensional data. We present the corresponding decoherence-optimized quantum circuits for the proposed techniques along with their theoretical circuit depth analysis. Our experimental work was conducted using multidimensional data, ranging from 1-D audio data to 2-D image data to 3-D hyperspectral data, to demonstrate the scalability of the proposed methods. In our experiments, we utilized both noisy and noise-free quantum simulations on a state-of-the-art quantum simulator from IBM Quantum. We also show the efficiency of our proposed techniques for multidimensional data by reporting the fidelity of results. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

18. NDARTS: A Differentiable Architecture Search Based on the Neumann Series.

Author: Han, Xiaoyu, Li, Chenyu, Wang, Zifan, and Liu, Guohua
Subjects: MACHINE learning, SEARCH algorithms, IMPLICIT functions, REINFORCEMENT learning, EVOLUTIONARY algorithms, ALGORITHMS
Abstract: Neural architecture search (NAS) has shown great potential in discovering powerful and flexible network models, becoming an important branch of automatic machine learning (AutoML). Although search methods based on reinforcement learning and evolutionary algorithms can find high-performance architectures, these search methods typically require hundreds of GPU days. Unlike searching in a discrete search space based on reinforcement learning and evolutionary algorithms, the differentiable neural architecture search (DARTS) continuously relaxes the search space, allowing for optimization using gradient-based methods. Based on DARTS, we propose NDARTS in this article. The new algorithm uses the Implicit Function Theorem and the Neumann series to approximate the hyper-gradient, which obtains better results than DARTS. In the simulation experiment, an ablation experiment was carried out to study the influence of the different parameters on the NDARTS algorithm and to determine the optimal weight, then the best performance of the NDARTS algorithm was searched for in the DARTS search space and the NAS-BENCH-201 search space. Compared with other NAS algorithms, the results showed that NDARTS achieved excellent results on the CIFAR-10, CIFAR-100, and ImageNet datasets, and was an effective neural architecture search algorithm. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

19. A Real-Time Car Towing Management System Using ML-Powered Automatic Number Plate Recognition.

Author: Ahmed, Ahmed Abdelmoamen and Ahmed, Sheikh
Subjects: AUTOMOBILES, INTELLIGENT transportation systems, K-nearest neighbor classification, MOBILE apps, TOLLS, AUTOMATIC systems in automobiles, USER interfaces, ALGORITHMS
Abstract: Automatic Number Plate Recognition (ANPR) has been widely used in different domains, such as car park management, traffic management, tolling, and intelligent transport systems. Despite this technology's importance, the existing ANPR approaches suffer from the accurate identification of number plats due to its different size, orientation, and shapes across different regions worldwide. In this paper, we are studying these challenges by implementing a case study for smart car towing management using Machine Learning (ML) models. The developed mobile-based system uses different approaches and techniques to enhance the accuracy of recognizing number plates in real-time. First, we developed an algorithm to accurately detect the number plate's location on the car body. Then, the bounding box of the plat is extracted and converted into a grayscale image. Second, we applied a series of filters to detect the alphanumeric characters' contours within the grayscale image. Third, the detected the alphanumeric characters' contours are fed into a K-Nearest Neighbors (KNN) model to detect the actual number plat. Our model achieves an overall classification accuracy of 95% in recognizing number plates across different regions worldwide. The user interface is developed as an Android mobile app, allowing law-enforcement personnel to capture a photo of the towed car, which is then recorded in the car towing management system automatically in real-time. The app also allows owners to search for their cars, check the case status, and pay fines. Finally, we evaluated our system using various performance metrics such as classification accuracy, processing time, etc. We found that our model outperforms some state-of-the-art ANPR approaches in terms of the overall processing time. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

20. Data Mining Algorithms for Smart Cities: A Bibliometric Analysis.

Author: Kousis, Anestis and Tjortjis, Christos
Subjects: SMART cities, DATA mining, BIBLIOMETRICS, INTERNET of things, MACHINE learning, ALGORITHMS
Abstract: Smart cities connect people and places using innovative technologies such as Data Mining (DM), Machine Learning (ML), big data, and the Internet of Things (IoT). This paper presents a bibliometric analysis to provide a comprehensive overview of studies associated with DM technologies used in smart cities applications. The study aims to identify the main DM techniques used in the context of smart cities and how the research field of DM for smart cities evolves over time. We adopted both qualitative and quantitative methods to explore the topic. We used the Scopus database to find relative articles published in scientific journals. This study covers 197 articles published over the period from 2013 to 2021. For the bibliometric analysis, we used the Biliometrix library, developed in R. Our findings show that there is a wide range of DM technologies used in every layer of a smart city project. Several ML algorithms, supervised or unsupervised, are adopted for operating the instrumentation, middleware, and application layer. The bibliometric analysis shows that DM for smart cities is a fast-growing scientific field. Scientists from all over the world show a great interest in researching and collaborating on this interdisciplinary scientific field. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

21. A Fire Detection Algorithm Based on Tchebichef Moment Invariants and PSO-SVM.

Author: Bian, Yongming, Yang, Meng, Fan, Xuying, and Liu, Yuchao
Subjects: PARTICLE swarm optimization, DETECTORS, ALGORITHMS, ARTIFICIAL neural networks, MACHINE learning
Abstract: Automatic fire detection, which can detect and raise the alarm for fire early, is expected to help reduce the loss of life and property as much as possible. Due to its advantages over traditional methods, image processing technology has been applied gradually in fire detection. In this paper, a novel algorithm is proposed to achieve fire image detection, combined with Tchebichef (sometimes referred to as Chebyshev) moment invariants (TMIs) and particle swarm optimization-support vector machine (PSO-SVM). According to the correlation between geometric moments and Tchebichef moments, the translation, rotation, and scaling (TRS) invariants of Tchebichef moments are obtained first. Then, the TMIs of candidate images are calculated to construct feature vectors. To gain the best detection performance, a PSO-SVM model is proposed, where the kernel parameter and penalty factor of support vector machine (SVM) are optimized by particle swarm optimization (PSO). Then, the PSO-SVM model is utilized to identify the fire images. Compared with algorithms based on Hu moment invariants (HMIs) and Zernike moment invariants (ZMIs), the experimental results show that the proposed algorithm can improve the detection accuracy, achieving the highest detection rate of 98.18%. Moreover, it still exhibits the best performance even if the size of the training sample set is small and the images are transformed by TRS. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

22. A New Cascade-Correlation Growing Deep Learning Neural Network Algorithm.

Author: Mohamed, Soha Abd El-Moamen, Mohamed, Marghany Hassan, Farghally, Mohammed F., and Radac, Mircea-Bogdan
Subjects: FEEDFORWARD neural networks, DEEP learning, ARTIFICIAL neural networks, PROBLEM solving, MACHINE learning, ALGORITHMS
Abstract: In this paper, a proposed algorithm that dynamically changes the neural network structure is presented. The structure is changed based on some features in the cascade correlation algorithm. Cascade correlation is an important algorithm that is used to solve the actual problem by artificial neural networks as a new architecture and supervised learning algorithm. This process optimizes the architectures of the network which intends to accelerate the learning process and produce better performance in generalization. Many researchers have to date proposed several growing algorithms to optimize the feedforward neural network architectures. The proposed algorithm has been tested on various medical data sets. The results prove that the proposed algorithm is a better method to evaluate the accuracy and flexibility resulting from it. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

23. Subgroup Discovery in Machine Learning Problems with Formal Concepts Analysis and Test Theory Algorithms.

Author: Masich, Igor, Rezova, Natalya, Shkaberina, Guzel, Mironov, Sergei, Bartosh, Mariya, and Kazakovtsev, Lev
Subjects: MACHINE learning, LEARNING problems, INDUSTRIAL clusters, INDUSTRIAL goods, ALGORITHMS
Abstract: A number of real-world problems of automatic grouping of objects or clustering require a reasonable solution and the possibility of interpreting the result. More specific is the problem of identifying homogeneous subgroups of objects. The number of groups in such a dataset is not specified, and it is required to justify and describe the proposed grouping model. As a tool for interpretable machine learning, we consider formal concept analysis (FCA). To reduce the problem with real attributes to a problem that allows the use of FCA, we use the search for the optimal number and location of cut points and the optimization of the support set of attributes. The approach to identifying homogeneous subgroups was tested on tasks for which interpretability is important: the problem of clustering industrial products according to primary tests (for example, transistors, diodes, and microcircuits) as well as gene expression data (collected to solve the problem of predicting cancerous tumors). For the data under consideration, logical concepts are identified, formed in the form of a lattice of formal concepts. Revealed concepts are evaluated according to indicators of informativeness and can be considered as homogeneous subgroups of elements and their indicative descriptions. The proposed approach makes it possible to single out homogeneous subgroups of elements and provides a description of their characteristics, which can be considered as tougher norms that the elements of the subgroup satisfy. A comparison is made with the COBWEB algorithm designed for conceptual clustering of objects. This algorithm is aimed at discovering probabilistic concepts. The resulting lattices of logical concepts and probabilistic concepts for the considered datasets are simple and easy to interpret. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

24. A Bayesian Multi-Armed Bandit Algorithm for Dynamic End-to-End Routing in SDN-Based Networks with Piecewise-Stationary Rewards.

Author: Santana, Pedro and Moura, José
Subjects: ALGORITHMS, DELAY lines, MACHINE learning, SOFTWARE-defined networking, ROUTING algorithms
Abstract: To handle the exponential growth of data-intensive network edge services and automatically solve new challenges in routing management, machine learning is steadily being incorporated into software-defined networking solutions. In this line, the article presents the design of a piecewise-stationary Bayesian multi-armed bandit approach for the online optimum end-to-end dynamic routing of data flows in the context of programmable networking systems. This learning-based approach has been analyzed with simulated and emulated data, showing the proposal's ability to sequentially and proactively self-discover the end-to-end routing path with minimal delay among a considerable number of alternatives, even when facing abrupt changes in transmission delay distributions due to both variable congestion levels on path network devices and dynamic delays to transmission links. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

25. Transfer Learning and Analogical Inference: A Critical Comparison of Algorithms, Methods, and Applications.

Author: Combs, Kara, Lu, Hongjing, and Bihl, Trevor J.
Subjects: MACHINE learning, ARTIFICIAL intelligence, PROBLEM solving, DECISION making, ALGORITHMS
Abstract: Artificial intelligence and machine learning (AI/ML) research has aimed to achieve human-level performance in tasks that require understanding and decision making. Although major advances have been made, AI systems still struggle to achieve adaptive learning for generalization. One of the main approaches to generalization in ML is transfer learning, where previously learned knowledge is utilized to solve problems in a different, but related, domain. Another approach, pursued by cognitive scientists for several decades, has investigated the role of analogical reasoning in comparisons aimed at understanding human generalization ability. Analogical reasoning has yielded rich empirical findings and general theoretical principles underlying human analogical inference and generalization across distinctively different domains. Though seemingly similar, there are fundamental differences between the two approaches. To clarify differences and similarities, we review transfer learning algorithms, methods, and applications in comparison with work based on analogical inference. Transfer learning focuses on exploring feature spaces shared across domains through data vectorization while analogical inferences focus on identifying relational structure shared across domains via comparisons. Rather than treating these two learning approaches as synonymous or as independent and mutually irrelevant fields, a better understanding of how they are interconnected can guide a multidisciplinary synthesis of the two approaches. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

26. Plum Tree Algorithm and Weighted Aggregated Ensembles for Energy Efficiency Estimation.

Author: Moldovan, Dorin
Subjects: PLUM, MACHINE learning, PARTICLE swarm optimization, ALGORITHMS, COOLING loads (Mechanical engineering), SEARCH algorithms
Abstract: This article introduces a novel nature-inspired algorithm called the Plum Tree Algorithm (PTA), which has the biology of the plum trees as its main source of inspiration. The PTA was tested and validated using 24 benchmark objective functions, and it was further applied and compared to the following selection of representative state-of-the-art, nature-inspired algorithms: the Chicken Swarm Optimization (CSO) algorithm, the Particle Swarm Optimization (PSO) algorithm, the Grey Wolf Optimizer (GWO), the Cuckoo Search (CS) algorithm, the Crow Search Algorithm (CSA), and the Horse Optimization Algorithm (HOA). The results obtained with the PTA are comparable to the results obtained by using the other nature-inspired optimization algorithms. The PTA returned the best overall results for the 24 objective functions tested. This article presents the application of the PTA for weight optimization for an ensemble of four machine learning regressors, namely, the Random Forest Regressor (RFR), the Gradient Boosting Regressor (GBR), the AdaBoost Regressor (AdaBoost), and the Extra Trees Regressor (ETR), which are used for the prediction of the heating load and cooling load requirements of buildings, using the Energy Efficiency Dataset from UCI Machine Learning as experimental support. The PTA optimized ensemble-returned results such as those returned by the ensembles optimized with the GWO, the CS, and the CSA. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

27. An Algorithm for Efficient Generation of Customized Priority Rules for Production Control in Project Manufacturing with Stochastic Job Processing Times.

Author: Kühn, Mathias, Völker, Michael, and Schmidt, Thorsten
Subjects: STOCHASTIC processes, PRODUCTION control, ALGORITHMS, DISCRETE event simulation, MACHINE learning, DETERMINISTIC algorithms
Abstract: Project Planning and Control (PPC) problems with stochastic job processing times belong to the problem class of Stochastic Resource-Constrained Multi-Project Scheduling Problems (SRCMPSP). A practical example of this problem class is the industrial domain of customer-specific assembly of complex products. PPC approaches have to compensate stochastic influences and achieve high objective fulfillment. This paper presents an efficient simulation-based optimization approach to generate Combined Priority Rules (CPRs) for determining the next job in short-term production control. The objective is to minimize project-specific objectives such as average and standard deviation of project delay or makespan. For this, we generate project-specific CPRs and evaluate the results with the Pareto dominance concept. However, generating CPRs considering stochastic influences is computationally intensive. To tackle this problem, we developed a 2-phase algorithm by first learning the algorithm with deterministic data and by generating promising starting solutions for the more computationally intensive stochastic phase. Since a good deterministic solution does not always lead to a good stochastic solution, we introduced the parameter Initial Copy Rate (ICR) to generate an initial population of copied and randomized individuals. Evaluating this approach, we conducted various computer-based experiments. Compared to Standard Priority Rules (SPRs) used in practice, the approach shows a higher objective fulfilment. The 2-phase algorithm can reduce the computation effort and increases the efficiency of generating CPRs. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

28. Understanding Contrail Business Processes through Hierarchical Clustering: A Multi-Stage Framework.

Author: Tariq, Zeeshan, Khan, Naveed, Charles, Darryl, McClean, Sally, McChesney, Ian, and Taylor, Paul
Subjects: CONDENSATION trails, PROCESS mining, ALGORITHMS
Abstract: Real-world business processes are dynamic, with event logs that are generally unstructured and contain heterogeneous business classes. Process mining techniques derive useful knowledge from such logs but translating them into simplified and logical segments is crucial. Complexity is increased when dealing with business processes with a large number of events with no outcome labels. Techniques such as trace clustering and event clustering, tend to simplify the complex business logs but the resulting clusters are generally not understandable to the business users as the business aspects of the process are not considered while clustering the process log. In this paper, we provided a multi-stage hierarchical framework for business-logic driven clustering of highly variable process logs with extensively large number of events. Firstly, we introduced a term contrail processes for describing the characteristics of such complex real-world business processes and their logs presenting contrail-like models. Secondly, we proposed an algorithm Novel Hierarchical Clustering (NoHiC) to discover business-logic driven clusters from these contrail processes. For clustering, the raw event log is initially decomposed into high-level business classes, and later feature engineering is performed exclusively based on the business-context features, to support the discovery of meaningful business clusters. We used a hybrid approach which combines rule-based mining technique with a novel form of agglomerative hierarchical clustering for the experiments. A case-study of a CRM process of the UK's renowned telecommunication firm is presented and the quality of the proposed framework is verified through several measures, such as cluster segregation, classification accuracy, and fitness of the log. We compared NoHiC technique with two trace clustering techniques using two real world process logs. The discovered clusters through NoHiC are found to have improved fitness as compared to the other techniques, and they also hold valuable information about the business context of the process log. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

29. High Per Parameter: A Large-Scale Study of Hyperparameter Tuning for Machine Learning Algorithms.

Author: Sipper, Moshe
Subjects: CLASSIFICATION algorithms, ALGORITHMS, MACHINE learning
Abstract: Hyperparameters in machine learning (ML) have received a fair amount of attention, and hyperparameter tuning has come to be regarded as an important step in the ML pipeline. However, just how useful is said tuning? While smaller-scale experiments have been previously conducted, herein we carry out a large-scale investigation, specifically one involving 26 ML algorithms, 250 datasets (regression and both binary and multinomial classification), 6 score metrics, and 28,857,600 algorithm runs. Analyzing the results we conclude that for many ML algorithms, we should not expect considerable gains from hyperparameter tuning on average; however, there may be some datasets for which default hyperparameters perform poorly, especially for some algorithms. By defining a single hp_score value, which combines an algorithm's accumulated statistics, we are able to rank the 26 ML algorithms from those expected to gain the most from hyperparameter tuning to those expected to gain the least. We believe such a study shall serve ML practitioners at large. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

30. Artificial Intelligence Algorithms for Treatment of Diabetes.

Author: Rashid, Mudassir M., Askari, Mohammad Reza, Chen, Canyu, Liang, Yueqing, Shu, Kai, and Cinar, Ali
Subjects: MEDICAL informatics, ARTIFICIAL intelligence, ELECTRONIC health records, MACHINE learning, ALGORITHMS, DIABETES
Abstract: Artificial intelligence (AI) algorithms can provide actionable insights for clinical decision-making and managing chronic diseases. The treatment and management of complex chronic diseases, such as diabetes, stands to benefit from novel AI algorithms analyzing the frequent real-time streaming data and the occasional medical diagnostics and laboratory test results reported in electronic health records (EHR). Novel algorithms are needed to develop trustworthy, responsible, reliable, and robust AI techniques that can handle the imperfect and imbalanced data of EHRs and inconsistencies or discrepancies with free-living self-reported information. The challenges and applications of AI for two problems in the healthcare domain were explored in this work. First, we introduced novel AI algorithms for EHRs designed to be fair and unbiased while accommodating privacy concerns in predicting treatments and outcomes. Then, we studied the innovative approach of using machine learning to improve automated insulin delivery systems through analyzing real-time information from wearable devices and historical data to identify informative trends and patterns in free-living data. Application examples in the treatment of diabetes demonstrate the benefits of AI tools for medical and health informatics. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

31. Idea of Using Blockchain Technique for Choosing the Best Configuration of Weights in Neural Networks.

Author: Winnicka, Alicja and Kęsik, Karolina
Subjects: ARTIFICIAL neural networks, MACHINE learning
Abstract: The blockchain technique is becoming more and more popular due to its advantages such as stability and dispersed nature. This is an idea based on blockchain activity paradigms. Another important field is machine learning, which is increasingly used in practice. Unfortunately, the training or overtraining artificial neural networks is very time-consuming and requires high computing power. In this paper, we proposed using a blockchain technique to train neural networks. This type of activity is important due to the possible search for initial weights in the network, which affect faster training, due to gradient decrease. We performed the tests with much heavier calculations to indicate that such an action is possible. However, this type of solution can also be used for less demanding calculations, i.e., only a few iterations of training and finding a better configuration of initial weights. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

32. BELMKN: Bayesian Extreme Learning Machines Kohonen Network.

Author: Senthilnath, J., Simha C, Sumanth, G, Nagaraj, Thapa, Meenakumari, and M, Indiramma
Subjects: BAYESIAN analysis, ALGORITHMS, MACHINE learning, CLUSTER analysis (Statistics), SELF-organizing maps
Abstract: This paper proposes the Bayesian Extreme Learning Machine Kohonen Network (BELMKN) framework to solve the clustering problem. The BELMKN framework uses three levels in processing nonlinearly separable datasets to obtain efficient clustering in terms of accuracy. In the first level, the Extreme Learning Machine (ELM)-based feature learning approach captures the nonlinearity in the data distribution by mapping it onto a d-dimensional space. In the second level, ELM-based feature extracted data is used as an input for Bayesian Information Criterion (BIC) to predict the number of clusters termed as a cluster prediction. In the final level, feature-extracted data along with the cluster prediction is passed to the Kohonen Network to obtain improved clustering accuracy. The main advantage of the proposed method is to overcome the problem of having a priori identifiers or class labels for the data; it is difficult to obtain labels in most of the cases for the real world datasets. The BELMKN framework is applied to 3 synthetic datasets and 10 benchmark datasets from the UCI machine learning repository and compared with the state-of-the-art clustering methods. The experimental results show that the proposed BELMKN-based clustering outperforms other clustering algorithms for the majority of the datasets. Hence, the BELMKN framework can be used to improve the clustering accuracy of the nonlinearly separable datasets. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

33. Approximately Optimal Control of Nonlinear Dynamic Stochastic Problems with Learning: The OPTCON Algorithm.

Author: Blueschke, Dmitri, Blueschke-Nikolaeva, Viktoria, and Neck, Reinhard
Subjects: *MACHINE learning, *STOCHASTIC control theory, *ALGORITHMS, *ACTIVE learning, *STOCHASTIC systems
Abstract: OPTCON is an algorithm for the optimal control of nonlinear stochastic systems which is particularly applicable to economic models. It delivers approximate numerical solutions to optimum control (dynamic optimization) problems with a quadratic objective function for nonlinear economic models with additive and multiplicative (parameter) uncertainties. The algorithm was first programmed in C# and then in MATLAB. It allows for deterministic and stochastic control, the latter with open loop (OPTCON1), passive learning (open-loop feedback, OPTCON2), and active learning (closed-loop, dual, or adaptive control, OPTCON3) information patterns. The mathematical aspects of the algorithm with open-loop feedback and closed-loop information patterns are presented in more detail in this paper. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

34. A New Hyper-Parameter Optimization Method for Power Load Forecast Based on Recurrent Neural Networks.

Author: Li, Yaru, Zhang, Yulai, and Cai, Yongping
Subjects: RECURRENT neural networks, PARTICLE swarm optimization, ALGORITHMS, MACHINE learning, FORECASTING, COMPUTATIONAL complexity, LOAD forecasting (Electric power systems)
Abstract: The selection of the hyper-parameters plays a critical role in the task of prediction based on the recurrent neural networks (RNN). Traditionally, the hyper-parameters of the machine learning models are selected by simulations as well as human experiences. In recent years, multiple algorithms based on Bayesian optimization (BO) are developed to determine the optimal values of the hyper-parameters. In most of these methods, gradients are required to be calculated. In this work, the particle swarm optimization (PSO) is used under the BO framework to develop a new method for hyper-parameter optimization. The proposed algorithm (BO-PSO) is free of gradient calculation and the particles can be optimized in parallel naturally. So the computational complexity can be effectively reduced which means better hyper-parameters can be obtained under the same amount of calculation. Experiments are done on real world power load data, where the proposed method outperforms the existing state-of-the-art algorithms, BO with limit-BFGS-bound (BO-L-BFGS-B) and BO with truncated-newton (BO-TNC), in terms of the prediction accuracy. The errors of the prediction result in different models show that BO-PSO is an effective hyper-parameter optimization method. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

35. A Simhash-Based Integrative Features Extraction Algorithm for Malware Detection.

Author: Li, Yihong, Liu, Fangzheng, Du, Zhenyu, and Zhang, Dubing
Subjects: FEATURE extraction, ALGORITHMS, MALWARE, APPLICATION program interfaces, MACHINE learning
Abstract: In the malware detection process, obfuscated malicious codes cannot be efficiently and accurately detected solely in the dynamic or static feature space. Aiming at this problem, an integrative feature extraction algorithm based on simhash was proposed, which combines the static information e.g., API (Application Programming Interface) calls and dynamic information (such as file, registry and network behaviors) of malicious samples to form integrative features. The experiment extracts the integrative features of some static information and dynamic information, and then compares the classification, time and obfuscated-detection performance of the static, dynamic and integrated features, respectively, by using several common machine learning algorithms. The results show that the integrative features have better time performance than the static features, and better classification performance than the dynamic features, and almost the same obfuscated-detection performance as the dynamic features. This algorithm can provide some support for feature extraction of malware detection. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

36. Idea of Using Blockchain Technique for Choosing the Best Configuration of Weights in Neural Networks

Author: Alicja Winnicka and Karolina Kęsik
Subjects: algorithms, blockchain, machine learning, Industrial engineering. Management engineering, T55.4-60.8, Electronic computers. Computer science, QA75.5-76.95
Abstract: The blockchain technique is becoming more and more popular due to its advantages such as stability and dispersed nature. This is an idea based on blockchain activity paradigms. Another important field is machine learning, which is increasingly used in practice. Unfortunately, the training or overtraining artificial neural networks is very time-consuming and requires high computing power. In this paper, we proposed using a blockchain technique to train neural networks. This type of activity is important due to the possible search for initial weights in the network, which affect faster training, due to gradient decrease. We performed the tests with much heavier calculations to indicate that such an action is possible. However, this type of solution can also be used for less demanding calculations, i.e., only a few iterations of training and finding a better configuration of initial weights.
Published: 2019
Full Text: View/download PDF

37. Locally Scaled and Stochastic Volatility Metropolis–Hastings Algorithms.

Author: Mongwe, Wilson Tsakane, Mbuvha, Rendani, and Marwala, Tshilidzi
Subjects: MARKOV chain Monte Carlo, RANDOM matrices, DIFFUSION processes, RANDOM walks, JUMP processes, DISTRIBUTION (Probability theory), ALGORITHMS
Abstract: Markov chain Monte Carlo (MCMC) techniques are usually used to infer model parameters when closed-form inference is not feasible, with one of the simplest MCMC methods being the random walk Metropolis–Hastings (MH) algorithm. The MH algorithm suffers from random walk behaviour, which results in inefficient exploration of the target posterior distribution. This method has been improved upon, with algorithms such as Metropolis Adjusted Langevin Monte Carlo (MALA) and Hamiltonian Monte Carlo being examples of popular modifications to MH. In this work, we revisit the MH algorithm to reduce the autocorrelations in the generated samples without adding significant computational time. We present the: (1) Stochastic Volatility Metropolis–Hastings (SVMH) algorithm, which is based on using a random scaling matrix in the MH algorithm, and (2) Locally Scaled Metropolis–Hastings (LSMH) algorithm, in which the scaled matrix depends on the local geometry of the target distribution. For both these algorithms, the proposal distribution is still Gaussian centred at the current state. The empirical results show that these minor additions to the MH algorithm significantly improve the effective sample rates and predictive performance over the vanilla MH method. The SVMH algorithm produces similar effective sample sizes to the LSMH method, with SVMH outperforming LSMH on an execution time normalised effective sample size basis. The performance of the proposed methods is also compared to the MALA and the current state-of-art method being the No-U-Turn sampler (NUTS). The analysis is performed using a simulation study based on Neal's funnel and multivariate Gaussian distributions and using real world data modeled using jump diffusion processes and Bayesian logistic regression. Although both MALA and NUTS outperform the proposed algorithms on an effective sample size basis, the SVMH algorithm has similar or better predictive performance when compared to MALA and NUTS across the various targets. In addition, the SVMH algorithm outperforms the other MCMC algorithms on a normalised effective sample size basis on the jump diffusion processes datasets. These results indicate the overall usefulness of the proposed algorithms. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

38. Adaptive Refinement in Advection–Diffusion Problems by Anomaly Detection: A Numerical Study.

Author: Falini, Antonella and Sampoli, Maria Lucia
Subjects: ANOMALY detection (Computer security), ADVECTION-diffusion equations, ARTIFICIAL intelligence, ALGORITHMS, MACHINE learning, DEGREES of freedom
Abstract: We consider advection–diffusion–reaction problems, where the advective or the reactive term is dominating with respect to the diffusive term. The solutions of these problems are characterized by the so-called layers, which represent localized regions where the gradients of the solutions are rather large or are subjected to abrupt changes. In order to improve the accuracy of the computed solution, it is fundamental to locally increase the number of degrees of freedom by limiting the computational costs. Thus, adaptive refinement, by a posteriori error estimators, is employed. The error estimators are then processed by an anomaly detection algorithm in order to identify those regions of the computational domain that should be marked and, hence, refined. The anomaly detection task is performed in an unsupervised fashion and the proposed strategy is tested on typical benchmarks. The present work shows a numerical study that highlights promising results obtained by bridging together standard techniques, i.e., the error estimators, and approaches typical of machine learning and artificial intelligence, such as the anomaly detection task. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

39. A Novel Semi-Supervised Fuzzy C-Means Clustering Algorithm Using Multiple Fuzzification Coefficients.

Author: Khang, Tran Dinh, Tran, Manh-Kien, and Fowler, Michael
Subjects: FUZZY algorithms, ALGORITHMS, MACHINE learning
Abstract: Clustering is an unsupervised machine learning method with many practical applications that has gathered extensive research interest. It is a technique of dividing data elements into clusters such that elements in the same cluster are similar. Clustering belongs to the group of unsupervised machine learning techniques, meaning that there is no information about the labels of the elements. However, when knowledge of data points is known in advance, it will be beneficial to use a semi-supervised algorithm. Within many clustering techniques available, fuzzy C-means clustering (FCM) is a common one. To make the FCM algorithm a semi-supervised method, it was proposed in the literature to use an auxiliary matrix to adjust the membership grade of the elements to force them into certain clusters during the computation. In this study, instead of using the auxiliary matrix, we proposed to use multiple fuzzification coefficients to implement the semi-supervision component. After deriving the proposed semi-supervised fuzzy C-means clustering algorithm with multiple fuzzification coefficients (sSMC-FCM), we demonstrated the convergence of the algorithm and validated the efficiency of the method through a numerical example. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

40. A General Cooperative Optimization Approach for Distributing Service Points in Mobility Applications.

Author: Jatschka, Thomas, Raidl, Günther R., and Rodemann, Tobias
Subjects: MATRIX decomposition, CAR sharing, ALGORITHMS, COOPERATIVE societies, MACHINE learning, STOCHASTIC dominance
Abstract: This article presents a cooperative optimization approach (COA) for distributing service points for mobility applications, which generalizes and refines a previously proposed method. COA is an iterative framework for optimizing service point locations, combining an optimization component with user interaction on a large scale and a machine learning component that learns user needs and provides the objective function for the optimization. The previously proposed COA was designed for mobility applications in which single service points are sufficient for satisfying individual user demand. This framework is generalized here for applications in which the satisfaction of demand relies on the existence of two or more suitably located service stations, such as in the case of bike/car sharing systems. A new matrix factorization model is used as surrogate objective function for the optimization, allowing us to learn and exploit similar preferences among users w.r.t. service point locations. Based on this surrogate objective function, a mixed integer linear program is solved to generate an optimized solution to the problem w.r.t. the currently known user information. User interaction, refinement of the matrix factorization, and optimization are iterated. An experimental evaluation analyzes the performance of COA with special consideration of the number of user interactions required to find near optimal solutions. The algorithm is tested on artificial instances, as well as instances derived from real-world taxi data from Manhattan. Results show that the approach can effectively solve instances with hundreds of potential service point locations and thousands of users, while keeping the user interactions reasonably low. A bound on the number of user interactions required to obtain full knowledge of user preferences is derived, and results show that with 50% of performed user interactions the solutions generated by COA feature optimality gaps of only 1.45% on average. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

41. COVID-19 Prediction Applying Supervised Machine Learning Algorithms with Comparative Analysis Using WEKA.

Author: Villavicencio, Charlyn Nayve, Macrohon, Julio Jerison Escudero, Inbaraj, Xavier Alphonse, Jeng, Jyh-Horng, and Hsieh, Jer-Guang
Subjects: COVID-19, MACHINE learning, COVID-19 pandemic, COMMUNICABLE diseases, SUPPORT vector machines
Abstract: Early diagnosis is crucial to prevent the development of a disease that may cause danger to human lives. COVID-19, which is a contagious disease that has mutated into several variants, has become a global pandemic that demands to be diagnosed as soon as possible. With the use of technology, available information concerning COVID-19 increases each day, and extracting useful information from massive data can be done through data mining. In this study, authors utilized several supervised machine learning algorithms in building a model to analyze and predict the presence of COVID-19 using the COVID-19 Symptoms and Presence dataset from Kaggle. J48 Decision Tree, Random Forest, Support Vector Machine, K-Nearest Neighbors and Naïve Bayes algorithms were applied through WEKA machine learning software. Each model's performance was evaluated using 10-fold cross validation and compared according to major accuracy measures, correctly or incorrectly classified instances, kappa, mean absolute error, and time taken to build the model. The results show that Support Vector Machine using Pearson VII universal kernel outweighs other algorithms by attaining 98.81% accuracy and a mean absolute error of 0.012. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

42. qRobot: A Quantum Computing Approach in Mobile Robot Order Picking and Batching Problem Solver Optimization.

Author: Atchade-Adelomou, Parfait, Alonso-Linaje, Guillermo, Albo-Canals, Jordi, and Casado-Fauli, Daniel
Subjects: QUANTUM computing, MOBILE robots, MOBILE computing, COMBINATORIAL optimization, ALGORITHMS, QUANTUM communication
Abstract: This article aims to bring quantum computing to robotics. A quantum algorithm is developed to minimize the distance traveled in warehouses and distribution centers where order picking is applied. For this, a proof of concept is proposed through a Raspberry Pi 4, generating a quantum combinatorial optimization algorithm that saves the distance travelled and the batch of orders to be made. In case of computational need, the robot will be able to parallelize part of the operations in hybrid computing (quantum + classical), accessing CPUs and QPUs distributed in a public or private cloud. We developed a stable environment (ARM64) inside the robot (Raspberry) to run gradient operations and other quantum algorithms on IBMQ, Amazon Braket (D-Wave), and Pennylane locally or remotely. The proof of concept, when run in the above stated quantum environments, showed the execution time of our algorithm with different public access simulators on the market, computational results of our picking and batching algorithm, and analyze the quantum real-time execution. Our findings are that the behavior of the Amazon Braket D-Wave is better than Gate-based Quantum Computing over 20 qubits, and that AWS-Braket has better time performance than Qiskit or Pennylane. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

43. Towards Understanding Clustering Problems and Algorithms: An Instance Space Analysis.

Author: Fernandes, Luiz Henrique dos Santos, Lorena, Ana Carolina, Smith-Miles, Kate, and Malan, Katherine
Subjects: ALGORITHMS
Abstract: Various criteria and algorithms can be used for clustering, leading to very distinct outcomes and potential biases towards datasets with certain structures. More generally, the selection of the most effective algorithm to be applied for a given dataset, based on its characteristics, is a problem that has been largely studied in the field of meta-learning. Recent advances in the form of a new methodology known as Instance Space Analysis provide an opportunity to extend such meta-analyses to gain greater visual insights of the relationship between datasets' characteristics and the performance of different algorithms. The aim of this study is to perform an Instance Space Analysis for the first time for clustering problems and algorithms. As a result, we are able to analyze the impact of the choice of the test instances employed, and the strengths and weaknesses of some popular clustering algorithms, for datasets with different structures. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

44. Molecular Subtyping and Outlier Detection in Human Disease Using the Paraclique Algorithm.

Author: Hagan, Ronald D., Langston, Michael A., Verzotto, Davide, Elloumi, Mourad, and Werner, Frank
Subjects: OUTLIER detection, ALGORITHMS, GENES, MACHINE learning, THERAPEUTICS, STATISTICAL learning
Abstract: Recent discoveries of distinct molecular subtypes have led to remarkable advances in treatment for a variety of diseases. While subtyping via unsupervised clustering has received a great deal of interest, most methods rely on basic statistical or machine learning methods. At the same time, techniques based on graph clustering, particularly clique-based strategies, have been successfully used to identify disease biomarkers and gene networks. A graph theoretical approach based on the paraclique algorithm is described that can easily be employed to identify putative disease subtypes and serve as an aid in outlier detection as well. The feasibility and potential effectiveness of this method is demonstrated on publicly available gene co-expression data derived from patient samples covering twelve different disease families. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

45. A Weighted Ensemble Learning Algorithm Based on Diversity Using a Novel Particle Swarm Optimization Approach.

Author: You, Gui-Rong, Shiue, Yeou-Ren, Yeh, Wei-Chang, Chen, Xi-Li, and Chen, Chih-Ming
Subjects: PARTICLE swarm optimization, MACHINE learning, ALGORITHMS, SEARCH algorithms, FACTOR structure
Abstract: In ensemble learning, accuracy and diversity are the main factors affecting its performance. In previous studies, diversity was regarded only as a regularization term, which does not sufficiently indicate that diversity should implicitly be treated as an accuracy factor. In this study, a two-stage weighted ensemble learning method using the particle swarm optimization (PSO) algorithm is proposed to balance the diversity and accuracy in ensemble learning. The first stage is to enhance the diversity of the individual learner, which can be achieved by manipulating the datasets and the input features via a mixed-binary PSO algorithm to search for a set of individual learners with appropriate diversity. The purpose of the second stage is to improve the accuracy of the ensemble classifier using a weighted ensemble method that considers both diversity and accuracy. The set of weighted classifier ensembles is obtained by optimization via the PSO algorithm. The experimental results on 30 UCI datasets demonstrate that the proposed algorithm outperforms other state-of-the-art baselines. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

46. Feasibility Analysis and Application of Reinforcement Learning Algorithm Based on Dynamic Parameter Adjustment.

Author: Li, Menglin, Gu, Xueqiang, Zeng, Chengyi, and Feng, Yuan
Subjects: MACHINE learning, REINFORCEMENT learning, PRACTICAL reason, DEEP learning, ALGORITHMS, INTELLIGENT control systems
Abstract: Reinforcement learning, as a branch of machine learning, has been gradually applied in the control field. However, in the practical application of the algorithm, the hyperparametric approach to network settings for deep reinforcement learning still follows the empirical attempts of traditional machine learning (supervised learning and unsupervised learning). This method ignores part of the information generated by agents exploring the environment contained in the updating of the reinforcement learning value function, which will affect the performance of the convergence and cumulative return of reinforcement learning. The reinforcement learning algorithm based on dynamic parameter adjustment is a new method for setting learning rate parameters of deep reinforcement learning. Based on the traditional method of setting parameters for reinforcement learning, this method analyzes the advantages of different learning rates at different stages of reinforcement learning and dynamically adjusts the learning rates in combination with the temporal-difference (TD) error values to achieve the advantages of different learning rates in different stages to improve the rationality of the algorithm in practical application. At the same time, by combining the Robbins–Monro approximation algorithm and deep reinforcement learning algorithm, it is proved that the algorithm of dynamic regulation learning rate can theoretically meet the convergence requirements of the intelligent control algorithm. In the experiment, the effect of this method is analyzed through the continuous control scenario in the standard experimental environment of "Car-on-The-Hill" of reinforcement learning, and it is verified that the new method can achieve better results than the traditional reinforcement learning in practical application. According to the model characteristics of the deep reinforcement learning, a more suitable setting method for the learning rate of the deep reinforcement learning network proposed. At the same time, the feasibility of the method has been proved both in theory and in the application. Therefore, the method of setting the learning rate parameter is worthy of further development and research. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

47. Fuzzy C-Means Clustering Algorithm with Multiple Fuzzification Coefficients.

Author: Khang, Tran Dinh, Vuong, Nguyen Duc, Tran, Manh-Kien, and Fowler, Michael
Subjects: ALGORITHMS, FUZZY algorithms, MACHINE learning
Abstract: Clustering is an unsupervised machine learning technique with many practical applications that has gathered extensive research interest. Aside from deterministic or probabilistic techniques, fuzzy C-means clustering (FCM) is also a common clustering technique. Since the advent of the FCM method, many improvements have been made to increase clustering efficiency. These improvements focus on adjusting the membership representation of elements in the clusters, or on fuzzifying and defuzzifying techniques, as well as the distance function between elements. This study proposes a novel fuzzy clustering algorithm using multiple different fuzzification coefficients depending on the characteristics of each data sample. The proposed fuzzy clustering method has similar calculation steps to FCM with some modifications. The formulas are derived to ensure convergence. The main contribution of this approach is the utilization of multiple fuzzification coefficients as opposed to only one coefficient in the original FCM algorithm. The new algorithm is then evaluated with experiments on several common datasets and the results show that the proposed algorithm is more efficient compared to the original FCM as well as other clustering methods. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

48. A Machine Learning Approach to Algorithm Selection for Exact Computation of Treewidth.

Author: Slavchev, Borislav, Masliankova, Evelina, and Kelk, Steven
Subjects: MACHINE learning, KEY performance indicators (Management), GRAPH algorithms, ALGORITHMS, RANDOM forest algorithms
Abstract: We present an algorithm selection framework based on machine learning for the exact computation of treewidth, an intensively studied graph parameter that is NP-hard to compute. Specifically, we analyse the comparative performance of three state-of-the-art exact treewidth algorithms on a wide array of graphs and use this information to predict which of the algorithms, on a graph by graph basis, will compute the treewidth the quickest. Experimental results show that the proposed meta-algorithm outperforms existing methods on benchmark instances on all three performance metrics we use: in a nutshell, it computes treewidth faster than any single algorithm in isolation. We analyse our results to derive insights about graph feature importance and the strengths and weaknesses of the algorithms we used. Our results are further evidence of the advantages to be gained by strategically blending machine learning and combinatorial optimisation approaches within a hybrid algorithmic framework. The machine learning model we use is intentionally simple to emphasise that speedup can already be obtained without having to engage in the full complexities of machine learning engineering. We reflect on how future work could extend this simple but effective, proof-of-concept by deploying more sophisticated machine learning models. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Region

Database

Publisher

48 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources