9,952 results
Search Results
2. One-Dimensional Convolutional Neural Networks with Infrared Spectroscopy for Classifying the Origin of Printing Paper.
- Author
-
Sung-Wook Hwang, Geungyong Park, Jinho Kim, Kwang-Ho Kang, and Won-Hee Lee
- Subjects
- *
CONVOLUTIONAL neural networks , *INFRARED spectroscopy , *SUPPORT vector machines , *MACHINE learning - Abstract
Herein, the challenge of accurately classifying the manufacturing origin of printing paper, including continent, country, and specific product, was addressed. One-dimensional convolutional neural network (1D CNN) models trained on infrared (IR) spectrum data acquired from printing paper samples were used for the task. The preprocessing of the IR spectra through a second-derivative transformation and the restriction of the spectral range to 1800 to 1200 cm-1 improved the classification performance of the model. The outcomes were highly promising. Models trained on second-derivative IR spectra in the 1800 to 1200-cm-1 range exhibited perfect classification for the manufacturing continent and country, with an impressive F1 score of 0.980 for product classification. Notably, the developed 1D CNN model outperformed traditional machine learning classifiers, such as support vector machines and feed-forward neural networks. In addition, the application of data point attribution enhanced the transparency of the decision-making process of the model, offering insights into the spectral patterns that affect classification. This study makes a considerable contribution to printing paper classification, with potential implications for accurate origin identification in various fields. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Classification Analysis of Copy Papers Using Infrared Spectroscopy and Machine Learning Modeling.
- Author
-
Yong-Ju Lee, Tai-Ju Lee, and Hyoung Jin Kim
- Subjects
- *
MACHINE learning , *INFRARED spectroscopy , *ATTENUATED total reflectance , *FORGERY , *K-nearest neighbor classification , *SUPPORT vector machines , *NEAR infrared spectroscopy - Abstract
The evaluation and classification of chemical properties in different copypaper products could significantly help address document forgery. This study analyzes the feasibility of utilizing infrared spectroscopy in conjunction with machine learning algorithms for classifying copy-paper products. A dataset comprising 140 infrared spectra of copy-paper samples was collected. The classification models employed in this study include partial least squares-discriminant analysis, support vector machine, and K-nearest neighbors. The key findings indicate that a classification model based on the use of attenuated-total-reflection infrared spectroscopy demonstrated good performance, highlighting its potential as a valuable tool in accurately classifying paper products and ensuring assisting in solving criminal cases involving document forgery. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Analysis of handmade paper by Raman spectroscopy combined with machine learning.
- Author
-
Yan, Chunsheng, Cheng, Zhongyi, Luo, Si, Huang, Chen, Han, Songtao, Han, Xiuli, Du, Yuandong, and Ying, Chaonan
- Subjects
- *
MACHINE learning , *RAMAN spectroscopy , *SUPPORT vector machines , *K-nearest neighbor classification , *PRINCIPAL components analysis , *RANDOM forest algorithms , *SPECTRAL imaging , *MULTISPECTRAL imaging - Abstract
Handmade paper is a major carrier and restoration material of traditional Chinese ancient books, calligraphies, and paintings. In this study, we carried out a Raman spectroscopy analysis of 18 types of handmade paper samples. The main components of the handmade paper were cellulose and lignin, according to the wavenumber and Raman vibration assignment. We divided its Raman spectrum into eight subbands. Five machine learning models were employed: principal component analysis (PCA), partial least squares (PLS), support vector machine (SVM), k‐nearest neighbors (KNN), and random forest (RF). The Raman spectral data were normalized, and the fluorescence envelope was subtracted using the airPLS algorithm to obtain four types of data, raw, normalized, defluorescence, and fluorescence data. An RF variable importance analysis of data processing showed that data normalization eliminated the intensity differences of fluorescence signals caused by lignin, which contained important information of raw materials and papermaking technology, let alone the data defluorescence. The data processing also reduced the importance of the average variables in almost all spectral bands. Nevertheless, the data processing is worthwhile because it significantly improves the accuracy of machine learning, and the information loss does not affect the prediction. Using the machine learning models of PCA, PLS, and SVM combined with linear regression (LR), KNN, and RF, the classification and prediction of handmade paper samples were realized. For almost all processed data, including the fluorescence data, PCA‐LR had the highest classification and prediction accuracy (R2 = 1) in almost all spectral bands. PLS‐LR and SVM‐LR had the second‐highest accuracies (R2 = 0.4–0.9), whereas KNN and RF had the lowest accuracies (R2 = 0.1–0.4) for full band spectral data. Our results suggest that the abundant information contained in Raman spectroscopy combined with powerful machine learning models could inspire further studies on handmade paper and related cultural relics. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
5. Application of Selected Machine Learning Techniques for Identification of Basic Classes of Partial Discharges Occurring in Paper-Oil Insulation Measured by Acoustic Emission Technique.
- Author
-
Boczar, Tomasz, Borucki, Sebastian, Jancarczyk, Daniel, Bernas, Marcin, and Kurtasz, Pawel
- Subjects
- *
ACOUSTIC emission , *PARTIAL discharges , *NAIVE Bayes classification , *SUPPORT vector machines , *MACHINE learning , *RANDOM forest algorithms , *CLASSIFICATION algorithms , *K-nearest neighbor classification - Abstract
The paper reports the results of a comparative assessment concerned with the effectiveness of identifying the basic forms of partial discharges (PD) measured by the acoustic emission technique (AE), carried out by application of selected machine learning methods. As part of the re-search, the identification involved AE signals registered in laboratory conditions for eight basic classes of PDs that occur in paper-oil insulation systems of high-voltage power equipment. On the basis of acoustic signals emitted by PDs and by application of the frequency descriptor that took the form of a signal power density spectrum (PSD), the assessment involved the possibility of identifying individual types of PD by the analyzed classification algorithms. As part of the research, the results obtained with the use of five independent classification mechanisms were analyzed, namely: k-Nearest Neighbors method (kNN), Naive Bayes Classification, Support Vector Machine (SVM), Random Forests and Probabilistic Neural Network (PNN). The best results were achieved using the SVM classification tuned with polynomial core, which obtained 100% accuracy. Similar results were achieved with the kNN classifier. Random Forests and Naïve Bayes obtained high accuracy over 97%. Throughout the study, identification algorithms with the highest effectiveness in identifying specific forms of PD were established. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
6. Tensor product based 2-D correlation data preprocessing methods for Raman spectroscopy of Chinese handmade paper.
- Author
-
Yan, Chunsheng, Luo, Si, Cao, Linquan, Cheng, Zhongyi, and Zhang, Hui
- Subjects
- *
TENSOR products , *RAMAN spectroscopy , *SUPPORT vector machines , *K-nearest neighbor classification , *PRINCIPAL components analysis , *MACHINE learning - Abstract
[Display omitted] • The 2-D correlation methods do not require external perturbation variables. • They are pure mathematical methods that utilize the tensor product of spectral data. • The R2 values of KNN and RF for TDACM are close to 1, indicating nearly 100% improvement. The paper introduces two new methods, namely the cross correlation method (CCM) and two-dimensional correlation method (TDCM), for preprocessing Raman spectroscopy data for analyzing Chinese handmade paper samples. CCM expands the spectral dimension from 1 × N to 1 × 2 N - 1 by taking cross-correlation between two spectral data of the same category. TDCM includes two-dimensional synchronous correlation method (TDSCM) and two-dimensional asynchronous correlation method (TDACM), which expand the spectral dimension from 1 × N to N × N by taking tensor products between two spectral data and between one spectral data and the Hilbert transformation of the other spectral data of the same category, respectively. The experimental data were preprocessed using baseline removal, CCM, TDSCM, and TDACM methods. Four machine learning models were employed to evaluate the effects of these methods: principal component analysis (PCA) combined with linear regression (LR), support vector machine (SVM) combined with LR, k-Nearest Neighbors (KNN), and random forest (RF). The results show that the R-squared values for the PCA model were nearly 1 for all types of data, indicating high accuracy. However, for SVM-LR, KNN, and RF models, the R-squared values were sorted in the order of raw data, baseline removal data, CCM, TDSCM, and TDACM preprocessed data. The R-squared values of KNN and RF machine learning models for TDACM preprocessed data were approaching 1, indicating that the accuracy of machine learning was significantly improved by nearly 100%. This has led to a remarkable improvement in the accuracy of supervised models such as KNN and RF, bringing them closer to the level of unsupervised models such as PCA. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
7. USING TEXT MINING TO CLASSIFY RESEARCH PAPERS.
- Author
-
Sulova, Snezhana, Todoranova, Latinka, Penchev, Bonimir, and Nacheva, Radka
- Subjects
SCIENTIFIC literature ,NATURAL language processing ,MACHINE learning ,SUPPORT vector machines ,NAIVE Bayes classification ,TEXT mining - Abstract
Recently, the volume of scientific literature has grown rapidly raising an imminent question about its storage and organization. Many research papers are often available only through the websites of the relevant scientific journals. It is an essential problem when different classification codes are used in order to organize these papers or when specific categorization in a certain scientific field is missing. This leads to unnecessary complications in the researchers' aims who want to quickly and easily find literature on a specific topic among the large amount of scientific publications. Simultaneously, the research interest related to the mechanisms of natural language processing is growing because much of the information they work with is unstructured and in the form of plain text. In order to improve and automate the process of organizing and classifying scientific papers we propose an approach based on the technology for natural language processing. This applies the methods of supervised machine learning and two specific algorithms for text categorization - Support Vector Machines (SVM) and Naive Bayes (NB). The proposed approach classifies the scientific literature according to its contents. To successfully execute our scientific research, we used over 200 papers, published in the last four years in the journal "Izvestiya", which is issued by the University of Economics - Varna. The articles explore different topic areas and are written in English. The experiments were conducted with the software product RapidMiner. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
8. Review on Machine Learning-based Defect Detection of Shield Tunnel Lining.
- Author
-
Guixing Kuang, Bixiong Li, Site Mo, Xiangxin Hu, and Lianghui Li
- Subjects
TUNNEL lining ,CONVOLUTIONAL neural networks ,SUPPORT vector machines ,MACHINE learning ,PAPER arts - Abstract
At present, machine learning methods are widely used in various industries for their high adaptability, optimization function, and self-learning reserve function. Besides, the world-famous cities have almost built and formed subway networks that promote economic development. This paper presents the art states of Defect detection of Shield Tunnel lining based on Machine learning (DSTM). In addition, the processing method of image data from the shield tunnel is being explored to adapt to its complex environment. Comparison and analysis are used to show the performance of the algorithms in terms of the effects of data set establishment, algorithm selection, and detection devices. Based on the analysis results, Convolutional Neural Network methods show high recognition accuracy and better adaptability to the complexity of the environment in the shield tunnel compared to traditional machine learning methods. The Support Vector Machine algorithms show high recognition performance only for small data sets. To improve detection models and increase detection accuracy, measures such as optimizing features, fusing algorithms, creating a high-quality data set, increasing the sample size, and using devices with high detection accuracy can be recommended. Finally, we analyze the challenges in the field of coupling DSTM, meanwhile, the possible development direction of DSTM is prospected. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
9. Sensorless metal object detection for wireless power transfer using machine learning
- Author
-
Gong, Yunyi, Otomo, Yoshitsugu, and Igarashi, Hajime
- Published
- 2022
- Full Text
- View/download PDF
10. ADVANCING AUTISM SPECTRUM DISORDER DIAGNOSIS THROUGH ENSEMBLE LEARNING.
- Author
-
TANVEEN, NISHAT, TRISHA, JUVVALADINNE, and SRAVANI, AMBATIPUDI DURGA
- Subjects
AUTISM spectrum disorders ,MACHINE learning ,COMPUTER algorithms ,K-nearest neighbor classification ,SUPPORT vector machines ,NAIVE Bayes classification ,LOGISTIC regression analysis ,DECISION trees - Abstract
Autism Spectrum Disorder (ASD) diagnostics requires specialized clinical expertise, posing accessibility and affordability barriers for many. To widen the availability of precise screening, this paper examines ensemble machine learning models that combine multiple algorithms for improved accuracy and generalizability. Specifically, this paper compares the performance of K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Naive Bayes, and a Voting Classifier that integrates Logistic Regression, Decision Trees, and KNN. The comparison was conducted on separate datasets of pediatric and adult ASD questionnaire responses. The ensemble model significantly outperforms individual techniques, achieving higher accuracy for both pediatrics and adults with balanced sensitivity and specificity maintained across groups, indicating the viability of accessible community-available screening to alleviate diagnostic bottlenecks. Before scale-up, further model optimization for interpretability and testing on more diverse multi-site data are warranted. Overall, findings demonstrate the feasibility of mobile distributed pre-screening systems leveraging optimized ensembles to predict ASD with high precision across ages, opening possibilities for explainable AI to lower costs and widen access compared to in-person evaluation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. MACHINE LEARNING TECHNIQUES APPLIED IN SURFACE EMG DETECTION- A SYSTEMATIC REVIEW.
- Author
-
Syed, Sidra Abid, Nasim, Shahzad, Zahid, Hira, Saifullah, Bullo, Shams, Sarmad, Tanvir, Sania, and Zaidi, Syed Jamal Haider
- Subjects
MACHINE learning ,DECISION support systems ,MEDICAL sciences ,SUPPORT vector machines ,K-nearest neighbor classification - Abstract
Surface electromyography (EMG) has emerged as a promising clisnical decision support system, enabling the extraction of muscles' electrical activity through non-invasive devices placed on the body. This study focuses on the application of machine learning (ML) techniques to preprocess and analyze EMG signals for the detection of muscle abnormalities. Notably, state-of-the-art ML algorithms, including Support Vector Machines (SVM), k-Nearest Neighbors (KNN), Artificial Neural Networks (ANN), Random Forests (RF), and Naive Bayes (NB), have been harnessed by researchers in the biomedical sciences to achieve accurate surface EMG signal detection. Within this paper, we present a meticulously conducted systematic review, employing the PRISMA method to select relevant research papers. Various databases were thoroughly searched, and multiple pertinent studies were identified for detailed examination, weighing their respective merits and drawbacks. Our survey comprehensively elucidates the latest ML techniques used in surface EMG detection, offering valuable insights for researchers in this domain. Additionally, we outline potential future directions that can guide further advancements in this critical area of research. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
12. Identification and Severity Determination of Wheat Stripe Rust and Wheat Leaf Rust Based on Hyperspectral Data Acquired Using a Black-Paper-Based Measuring Method.
- Author
-
Wang, Hui, Qin, Feng, Ruan, Liu, Wang, Rui, Liu, Qi, Ma, Zhanhong, Li, Xiaolong, Cheng, Pei, and Wang, Haiguang
- Subjects
- *
LEAF rust of wheat , *PLANT diseases , *REMOTE sensing , *LEAST squares , *SUPPORT vector machines - Abstract
It is important to implement detection and assessment of plant diseases based on remotely sensed data for disease monitoring and control. Hyperspectral data of healthy leaves, leaves in incubation period and leaves in diseased period of wheat stripe rust and wheat leaf rust were collected under in-field conditions using a black-paper-based measuring method developed in this study. After data preprocessing, the models to identify the diseases were built using distinguished partial least squares (DPLS) and support vector machine (SVM), and the disease severity inversion models of stripe rust and the disease severity inversion models of leaf rust were built using quantitative partial least squares (QPLS) and support vector regression (SVR). All the models were validated by using leave-one-out cross validation and external validation. The diseases could be discriminated using both distinguished partial least squares and support vector machine with the accuracies of more than 99%. For each wheat rust, disease severity levels were accurately retrieved using both the optimal QPLS models and the optimal SVR models with the coefficients of determination (R2) of more than 0.90 and the root mean square errors (RMSE) of less than 0.15. The results demonstrated that identification and severity evaluation of stripe rust and leaf rust at the leaf level could be implemented based on the hyperspectral data acquired using the developed method. A scientific basis was provided for implementing disease monitoring by using aerial and space remote sensing technologies. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
13. Sentiment Analysis of Coastal Karnataka Daijiworld users with Classic ML Models.
- Author
-
D., Sushma M., Geethalaxmi, and K., Ranganath
- Subjects
MACHINE learning ,SENTIMENT analysis ,K-nearest neighbor classification ,SUPPORT vector machines ,RANDOM forest algorithms - Abstract
The "Daijiworld News" forum, a well-known news website in coastal Karnataka, was the source of the comments for this paper's sentiment analysis study, which was done on about 15,000 reader comments. The comments were scraped using Beautiful Soup, a popular web scraping library and labelled as positive, negative, and neutral. Pre-processing of comments was made using techniques such as stop word removal, tokenization, stemming, lemmatization, and lowercase conversion. Logistic regression, support vector machine (SVM), naive Bayes, random forest, K-nearest neighbors (KNN), AdaBoost, gradient boosting and neural networks was used for classification. Performance metrics including accuracy, precision, recall, and F1 score were evaluated. Logistic regression achieved the highest precision (0.75), recall (0.74), accuracy (0.74), and F1 score (0.74), followed closely by the neural network classifier with a precision of 0.670, recall of 0.670, accuracy of 0.670, and F1 score of 0.669. The study demonstrates the effectiveness of logistic regression and neural networks in sentiment analysis of news forum comments, giving insightful information to grasp public opinion and improving user engagement. The findings contribute to the field of sentiment analysis, emphasising the significance of web scraping and pre-processing techniques in enhancing sentiment classification accuracy. The results serve as a reference for researchers and practitioners, assisting in the selection of appropriate classifiers for sentiment analysis in similar contexts. The study encourages further exploration of advanced techniques to enhance sentiment classification accuracy in regional news forums, paving the way for future research in sentiment analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
14. 60‐3: Invited Paper: Machine Learning Approaches to Active Stylus for Capacitive Touch Screen Panel Applications.
- Author
-
Nam, Hyoungsik, Seol, Ki-Hyuk, and Park, Seungjun
- Subjects
MACHINE learning ,SUPPORT vector machines ,ANOMALY detection (Computer security) ,TOUCH screens ,CLASSIFICATION algorithms - Abstract
This paper introduces machine learning approaches on adding the stylus‐touch to the capacitive touch screen technology. The proposed schemes can discriminate the stylus‐touch from finger‐touch as well as no‐touch by means of classification algorithms using support vector machine and anomaly detection. The high frequency pulses are sent from a stylus to a touch screen and the receiver classifies the received sample sequences into three classes of no‐touch, finger‐touch, and stylus‐touch. In addition, some possible applications of data transmission and user authentication are demonstrated. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
15. Mapping of Coral Reefs with Multispectral Satellites: A Review of Recent Papers.
- Author
-
Nguyen, Teo, Liquet, Benoît, Mengersen, Kerrie, and Sous, Damien
- Subjects
- *
CORAL reefs & islands , *MARINE biodiversity , *CORALS , *SPATIAL resolution , *REMOTE-sensing images , *SUPPORT vector machines , *THEMATIC mapper satellite - Abstract
Coral reefs are an essential source of marine biodiversity, but they are declining at an alarming rate under the combined effects of global change and human pressure. A precise mapping of coral reef habitat with high spatial and time resolutions has become a necessary step for monitoring their health and evolution. This mapping can be achieved remotely thanks to satellite imagery coupled with machine-learning algorithms. In this paper, we review the different satellites used in recent literature, as well as the most common and efficient machine-learning methods. To account for the recent explosion of published research on coral reel mapping, we especially focus on the papers published between 2018 and 2020. Our review study indicates that object-based methods provide more accurate results than pixel-based ones, and that the most accurate methods are Support Vector Machine and Random Forest. We emphasize that the satellites with the highest spatial resolution provide the best images for benthic habitat mapping. We also highlight that preprocessing steps (water column correction, sunglint removal, etc.) and additional inputs (bathymetry data, aerial photographs, etc.) can significantly improve the mapping accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
16. Comprehensive Review On The Analysis Of Various Machine Learning Algorithms For Early Detection Of Critical Diseases.
- Author
-
Chitre, Divya, Bhushan, Shivendu, and Patil, Manisha S.
- Subjects
MACHINE learning ,EARLY diagnosis ,SUPPORT vector machines ,K-nearest neighbor classification ,RANDOM forest algorithms - Abstract
Early detection of critical diseases is a pivotal aspect of modern healthcare, significantly impacting patient outcomes and healthcare costs. This research paper provides a comprehensive review and analysis of various machine learning algorithms employed in the realm of early disease detection. The study explores the strengths, limitations, and overall efficacy of prominent algorithms, including Logistic Regression, Support Vector Machines, Random Forests, Neural Networks, K-Nearest Neighbors, and Ensemble Learning. Each algorithm's suitability for early detection is assessed based on factors such as interpretability, scalability, and performance in handling diverse data types. Furthermore, the review discusses the specific applications of these algorithms in different medical contexts, highlighting their contributions to the early identification of critical diseases. By synthesizing the current state of research, this paper aims to provide valuable insights for researchers, and policymakers working towards advancing the field of early disease detection through machine learning. [ABSTRACT FROM AUTHOR]
- Published
- 2024
17. Twin support vector machines based on chaotic mapping dung beetle optimization algorithm.
- Author
-
Huang, Huajuan, Yao, Zhenhua, Wei, Xiuxi, and Zhou, Yongquan
- Subjects
OPTIMIZATION algorithms ,DUNG beetles ,SUPPORT vector machines ,CLASSIFICATION algorithms ,MACHINE learning - Abstract
Twin Support Vector Machine (TSVM) is a powerful machine learning method that is usually used to solve binary classification problems. But although the classification speed and performance of TSVM is better than that of primitive support vector machine, TSVM still faces the problem of difficult parameter selection; therefore, to overcome the problem of parameter selection of TSVM, this paper proposes a Chaotic Mapping Dung Beetle Optimization Algorithm-based Twin Support Vector Machine (CMDBO-TSVM) for automatic parameter selection. Due to the uncertainty of the random initialization population of the original Dung Beetle Optimization Algorithm, this paper additionally adds chaotic mapping initialization to improve the Dung Beetle Optimization Algorithm. Experiments on the dataset through this paper show that the classification accuracy of the CMDBO-TSVM has a better performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Heart disease prediction using ML through enhanced feature engineering with association and correlation analysis.
- Author
-
Lakshmanarao, Annemneedi, Krishna, Thotakura Venkata Sai, Kiran, Tummala Srinivasa Ravi, krishna, Chinta Venkata Murali, Ushanag, Samsani, and Supriya, Nandikolla
- Subjects
HEART diseases ,STATISTICAL correlation ,MACHINE learning ,SUPPORT vector machines ,K-nearest neighbor classification ,CLASSIFICATION algorithms - Abstract
Heart disease remains a prevalent and critical health concern globally. This paper addresses the critical task of heart disease prediction through the utilization of advanced machine learning techniques. Our approach focuses on the enhancement of feature engineering by incorporating a novel integration of association and correlation analyses. A heart disease dataset from Kaggle was used for the experiments. Association analysis was applied to the categorical and binary features in the dataset. Correlation analysis was applied to the numerical features in the dataset. Based on the insights from association analysis and correlation analysis, a new dataset was created with combinations of features. Later, newly created features are integrated with the original dataset, and classification algorithms are applied. Five machine learning (ML) classifiers, namely decision tree, k-nearest neighbors (KNN), random forest, XG-Boost, and support vector machine (SVM), were applied to the final dataset and achieved a good accuracy rate for heart disease detection. By systematically exploring associations and relationships with categorical, binary, and numerical features, this paper unveils innovative insights that contribute to a more comprehensive understanding of the heart disease dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Reviewing Machine Learning and Image Processing Based Decision-Making Systems for Breast Cancer Imaging.
- Author
-
Zerouaoui, Hasnae and Idri, Ali
- Subjects
BREAST tumor diagnosis ,ALGORITHMS ,MAMMOGRAMS ,BREAST tumors ,DECISION support systems ,DECISION trees ,DIAGNOSTIC imaging ,DIGITAL image processing ,MACHINE learning ,MAGNETIC resonance imaging ,MEDLINE ,ARTIFICIAL neural networks ,ONLINE information services ,RESEARCH funding ,SYSTEMATIC reviews ,RESEARCH bias ,SUPPORT vector machines ,DESCRIPTIVE statistics ,COMPUTER-aided diagnosis ,DEEP learning - Abstract
Breast cancer (BC) is the leading cause of death among women worldwide. It affects in general women older than 40 years old. Medical images analysis is one of the most promising research areas since it provides facilities for diagnosis and decision-making of several diseases such as BC. This paper conducts a Structured Literature Review (SLR) of the use of Machine Learning (ML) and Image Processing (IP) techniques to deal with BC imaging. A set of 530 papers published between 2000 and August 2019 were selected and analyzed according to ten criteria: year and publication channel, empirical type, research type, medical task, machine learning techniques, datasets used, validation methods, performance measures and image processing techniques which include image pre-processing, segmentation, feature extraction and feature selection. Results showed that diagnosis was the most used medical task and that Deep Learning techniques (DL) were largely used to perform classification. Furthermore, we found out that classification was the most ML objective investigated followed by prediction and clustering. Most of the selected studies used Mammograms as imaging modalities rather than Ultrasound or Magnetic Resonance Imaging with the use of public or private datasets with MIAS as the most frequently investigated public dataset. As for image processing techniques, the majority of the selected studies pre-process their input images by reducing the noise and normalizing the colors, and some of them use segmentation to extract the region of interest with the thresholding method. For feature extraction, we note that researchers extracted the relevant features using classical feature extraction techniques (e.g. Texture features, Shape features, etc.) or DL techniques (e. g. VGG16, VGG19, ResNet, etc.), and finally few papers used feature selection techniques in particular the filter methods. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
20. An Automated System for Surface Damage Detection Using Support Vector Machine.
- Author
-
Alqahtani, Hassan
- Subjects
SUPPORT vector machines ,MACHINE learning ,FATIGUE limit ,FREE surfaces ,OPTICAL measurements - Abstract
The global objective of this paper was to build an automated prediction system for surface damage. Practically, the damage initiates from the free surface because of the high-stress concentration that presents in valleys of the surface profile. Hence, the surface condition is a major factor in the fatigue strength of the metal. In this paper, the surface condition has been measured using an optical confocal measurement system (Alicona). Arithmetical mean height and Surface Flatness have been selected as input data source for the machine learning model. The machine learning model was built using the Support Vector Machine method. The role of this model is to select the best surface parameters to detect surface damage. The results show that the Surface Flatness parameter provides better prediction for surface damage than the Arithmetical mean height parameter. [ABSTRACT FROM AUTHOR]
- Published
- 2023
21. An investigation of the factors influencing cost system functionality using decision trees, support vector machines and logistic regression
- Author
-
Kuzey, Cemil, Uyar, Ali, and Delen, Dursun
- Published
- 2019
- Full Text
- View/download PDF
22. Determining the Remaining Functional Life of Power Transformers Using Multiple Methods of Diagnosing the Operating Condition Based on SVM Classification Algorithms.
- Author
-
Aciu, Ancuța-Mihaela, Nițu, Maria-Cristina, Nicola, Claudiu-Ionel, and Nicola, Marcel
- Subjects
POWER transformers ,SUPPORT vector machines ,SERVICE life ,MACHINE learning ,AUTOMATIC classification - Abstract
Starting from the current need for the safety of energy systems, in which power transformers play a key role, the study of the health of power transformers in service is a difficult and complex task, since the assessment consists of identifying indicators that can provide accurate data on the extent of degradation of transformer components and subcomponents, in order to establish a model for predicting the remaining life of transformers. Therefore, this paper proposes a model for assessing the remaining service life by diagnosing the condition of the transformer based on the health index (HI) obtained from a multi-parameter analysis. To determine the condition of power transformers, a number of methods are presented based on the combination of the combined Duval pentagon (PDC) method and ethylene concentration (C
2 H4 ) to determine the fault condition, the combination of the degree of polymerisation (DP) and moisture to determine the condition of the cellulose insulation and the use of the oil quality index (OQIN) to determine the condition of the oil. For each of the classification methods presented, applications based on machine learning (ML), in particular support vector machine (SVM), have been implemented for automatic classification using the Matlab development environment. The global algorithmic approach presented in this paper subscribes to the idea of event-based maintenance. Two case studies are also presented to validate SVM-based classification methods and algorithms. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
23. COMPARATIVE STUDY ON THE PERFORMANCE OF FACE RECOGNITION ALGORITHMS.
- Author
-
Nguyen, Truong Van and Chu, Tuan Duc
- Subjects
HUMAN facial recognition software ,COMPARATIVE studies ,SMART homes ,SUPPORT vector machines ,MACHINE learning - Abstract
Facial and object recognition are more and more applied in our life. Therefore, this field has become important to both academicians and practitioners. Face recognition systems are complex systems using features of the face to recognize. Current face recognition systems may be used to increase work efficiency in various methods, including smart homes, online banking, traffic, sports, robots, and others. With various applications like this, the number of facial recognition methods has been increasing in recent years. However, the performance of face recognition systems can be significantly affected by various factors such as lighting conditions, and different types of masks (sunglasses, scarves, hats, etc.). In this paper, a detailed comparison between face recognition techniques is exposed by listing the structure of each model, the advantages and disadvantages as well as performing experiments to demonstrate the robustness, accuracy, and complexity of each algorithm. To be detailed, let's give a performance comparison of three methods for measuring the efficacy of face recognition systems including a support vector machine (SVM), a visual geometry group with 16 layers (VGG-16), and a residual network with 50 layers (ResNet-50) in real-life settings. The efficiency of algorithms is evaluated in various environments such as normal light indoors, backlit indoors, low light indoors, natural light outdoors, and backlit outdoors. In addition, this paper also evaluates faces with hats and glasses to examine the accuracy of the methods. The experimental results indicate that the ResNet-50 has the highest accuracy to identify faces. The time to recognize is ranging from 1.1 s to 1.2 s in the normal environment. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
24. An Overview of the Special Issue "Remote Sensing Applications in Vegetation Classification".
- Author
-
Jarocińska, Anna, Marcinkowska-Ochtyra, Adriana, and Ochtyra, Adrian
- Subjects
REMOTE sensing ,VEGETATION classification ,MACHINE learning ,COMPUTER software testing ,MULTISPECTRAL imaging ,SUPPORT vector machines ,VEGETATION monitoring - Abstract
One of the ideas behind vegetation monitoring is the ability to identify different vegetation units, such as species, communities, habitats, or vegetation types. Remote sensing data allow for obtaining such information remotely, which is especially valuable in areas that are difficult to explore (such as mountains or wetlands). At the same time, such techniques allow for limiting field research, which is particularly important in this context. Remote sensing has been utilized for vegetation inventories for many decades, using airborne and spaceborne platforms. Developing newer tools, algorithms and sensors is conducive to more new applications in the vegetation identification field. The Special Issue "Remote Sensing Applications in Vegetation Classification" is an overview of the applications of remote sensing data with different resolutions for the identification of vegetation at different levels of detail. In 14 research papers, the most frequent different types of crops were analysed. In three cases, the authors recognised different types of grasslands, whereas trees were the object of the studies in two papers. The most commonly used sensors were Copernicus Sentinel-1 and Sentinel-2; however, to a lesser extent, MODIS, airborne hyperspectral and multispectral data, as well as LiDAR products, were also utilised. There were articles that tested and compared different combinations of datasets, different terms of data acquisition, or different classifiers in order to achieve the highest classification accuracy. These accuracies were assessed quite satisfactorily in each publication; the overall accuracy (OA) for the best result varied from 72% to 98%. In all of the research papers, at least one of the two commonly used machine learning algorithms, random forest (RF) and support vector machines (SVM), was applied. Additionally, one paper presented software ARTMO's machine-learning classification algorithms toolbox, which allows for the testing of 13 different classifiers. The studies published in this Special Issue can be used by the vegetation research teams and practitioners to conduct deeper analysis via the utilization of the proposed solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
25. Comparison of machine learning algorithms based on machine learning for the prediction of thermal plasma physical parameters of C4F7N and CO2 environmentally friendly gas mixtures.
- Author
-
Ding, Can, Tian, Haobo, and Yu, Donghai
- Subjects
MACHINE learning ,THERMAL plasmas ,RANDOM forest algorithms ,SUPPORT vector machines ,NAIVE Bayes classification ,GAS mixtures ,CARBON offsetting ,GREENHOUSE effect - Abstract
With the goal of "carbon peak and carbon neutrality," the need for environmentally friendly gases to replace SF
6 , a high greenhouse effect gas, is urgent. C4 F7 N, as an environmentally friendly gas with the greatest potential to replace SF6 as an arc extinguishing medium in circuit breakers, can be mixed with CO2 to greatly improve the shortcomings of its high liquefaction temperature, and the calculation of the physical parameters of the mixed gas plasma is a prerequisite for the computational simulation of the arc process in the opening of circuit breakers. Because solving the physical parameters is expensive, based on the system of differential equations, this paper adopts several machine learning algorithms by mining the relationship between the data using the known physical parameter data to predict the results of the physical parameters to be solved under certain conditions, which greatly reduces the cost of computation. The machine learning algorithms used in this paper are K-nearest-neighbor regression, decision tree, random forest, support vector machine, and gradient boosting regression, of which for the support vector machine, hyperparameters find it difficult to determine the problem of optimization using the gray wolf algorithm. The prediction results of several algorithms show that they are more accurate and that the problem can be solved better by using the method of machine learning. Finally, the comparison results show that the support vector machine exhibits better performance in most cases and that the gray wolf algorithm can make the results of the support vector machine more accurate. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
26. An aggressive driving state recognition model using EEG based on stacking ensemble learning.
- Author
-
Yang, Liu and Zhao, Qianxi
- Subjects
AGGRESSIVE driving ,ELECTROENCEPHALOGRAPHY ,FEATURE extraction ,SUPPORT vector machines ,CLASSIFICATION algorithms ,FOURIER transforms ,RANDOM forest algorithms - Abstract
An aggressive driving state impacts drivers' decisions, which could potentially lead to accidents. Real-time recognition of driving state is particularly important for improving road safety. However, the majority of modeling in existing studies relies on a single algorithm, which may lead to unreliable predictions. This paper proposes a stacking ensemble aggressive driving state recognition model using electroencephalography (EEG), which is able to combine different heterogeneous classification algorithms. Five types of classification algorithms and their variants are tested and compared to identify suitable base classifiers. All of these classifiers are optimized by Bayesian optimizer before the comparison. Three stacking ensemble recognition models using different meta-classifiers (i.e., logistic regression, random forest, and AdaBoost) and an equal-weight voting ensemble recognition model are established. The aforementioned recognition models are evaluated by using a dataset collected from a car-following simulated driving experiment. Fast Fourier transformation (FFT) and wavelet packet transformation (WPT) are adopted to extract features from raw EEG data. The results suggest that the stacking ensemble recognition models outperform the best single (i.e., support vector machine) model; the random Forest stacking recognition model achieves the best performance and the accuracy is increased from 81.21% to 84.23% using FFT features and from 86.45% to 87.38% using WPT features. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. GeoPDNN 1.0: a semi-supervised deep learning neural network using pseudo-labels for three-dimensional shallow strata modelling and uncertainty analysis in urban areas from borehole data.
- Author
-
Guo, Jiateng, Xu, Xuechuang, Wang, Luyuan, Wang, Xulei, Wu, Lixin, Jessell, Mark, Ogarko, Vitaliy, Liu, Zhibin, and Zheng, Yufei
- Subjects
SUPERVISED learning ,DEEP learning ,GEOLOGICAL modeling ,CITIES & towns ,RADIAL basis functions ,SUPPORT vector machines ,GEOLOGICAL surveys ,MACHINE learning - Abstract
Borehole data are essential for conducting precise urban geological surveys and large-scale geological investigations. Traditionally, explicit modelling and implicit modelling have been the primary methods for visualizing borehole data and constructing 3D geological models. However, explicit modelling requires substantial manual labour, while implicit modelling faces problems related to uncertainty analysis. Recently, machine learning approaches have emerged as effective solutions for addressing these issues in 3D geological modelling. Nevertheless, the use of machine learning methods for constructing 3D geological models is often limited by insufficient training data. In this paper, we propose the semi-supervised deep learning using pseudo-labels (SDLP) algorithm to overcome the issue of insufficient training data. Specifically, we construct the pseudo-labels in the training dataset using the triangular irregular network (TIN) method. A 3D geological model is constructed using borehole data obtained from a real building engineering project in Shenyang, Liaoning Province, NE China. Then, we compare the results of the 3D geological model constructed based on SDLP with those constructed by a support vector machine (SVM) method and an implicit Hermite radial basis function (HRBF) modelling method. Compared to the 3D geological models constructed using the HRBF algorithm and the SVM algorithm, the 3D geological model constructed based on the SDLP algorithm better conforms to the sedimentation patterns of the region. The findings demonstrate that our proposed method effectively resolves the issues of insufficient training data when using machine learning methods and the inability to perform uncertainty analysis when using the implicit method. In conclusion, the semi-supervised deep learning method with pseudo-labelling proposed in this paper provides a solution for 3D geological modelling in engineering project areas with borehole data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. A Robust Triboelectric Impact Sensor with Carbon Dioxide Precursor-Based Calcium Carbonate Layer for Slap Match Application.
- Author
-
Kim, Inkyum, Cho, Hyunwoo, Kitchamsetti, Narasimharao, Yun, Jonghyeon, Lee, Jeongmin, Park, Wook, and Kim, Daewon
- Subjects
CARBON dioxide detectors ,MACHINE learning ,SUPERVISED learning ,CARBON sequestration ,SUPPORT vector machines ,POLYIMIDES ,CALCIUM carbonate - Abstract
As an urgent international challenge, the sudden change in climate due to global warming needs to be addressed in the near future. This can be achieved through a reduction in fossil fuel utilization and through carbon sequestration, which reduces the concentration of CO
2 in the atmosphere. In this study, a self-sustainable impact sensor is proposed through implementing a triboelectric nanogenerator with a CaCO3 contact layer fabricated via a CO2 absorption method. The triboelectric polarity of CaCO3 with the location between the polyimide and the paper and the effects of varying the crystal structure are investigated first. The impact sensing characteristics are then confirmed at various input frequencies and under applied forces. Further, the high mechanical strength and strong adherence of CaCO3 on the surface of the device are demonstrated through enhanced durability compared to the unmodified device. For the intended application, the as-fabricated sensor is used to detect the turning state of the paper Ddakji in a slap match game using a supervised learning algorithm based on a support vector machine presenting a high classification accuracy of 95.8%. The robust CaCO3 -based triboelectric device can provide an eco-friendly advantage due to its self-powered characteristics for impact sensing and carbon sequestration. [ABSTRACT FROM AUTHOR]- Published
- 2023
- Full Text
- View/download PDF
29. Introduction: special issue of selected papers from ACML 2014.
- Author
-
Li, Hang, Phung, Dinh, Cao, Tru, Ho, Tu-Bao, and Zhou, Zhi-Hua
- Subjects
MACHINE learning ,BIBLIOGRAPHICAL citations ,SUPPORT vector machines - Abstract
An introduction is presented in which the editor discusses various reports within the issue on topics including the passive-aggressive active (PAA) learning algorithms, the combination of authors, citation network and contents in a single bibliographic model, and the support vector machine (SVM).
- Published
- 2016
- Full Text
- View/download PDF
30. Assessment of Global Forest Coverage through Machine Learning Algorithms.
- Author
-
Metkewar, P. S., Chauhan, Ravi, Prasanth, A., and Sathyamoorthy, Malathy
- Subjects
MACHINE learning ,DECISION trees ,SUPPORT vector machines ,RANDOM forest algorithms - Abstract
This exploration of paper presents an investigation of the Forest Region Inclusion Dataset that gives data on the backwoods inclusion of different nations overall from 1990 to 2020. The dataset contains country-wise information on population, population density, population development rate, total population rate, and forest region inclusion. We examined this dataset to decide the patterns in woodland region inclusion across various nations and mainlands, as well as the connection among populace and backwoods region inclusion. Our discoveries show that while certain nations have essentially expanded their forest region inclusion, others have encountered a decline. Besides, we found that population density and development rate are adversely related with forest area coverage. Authors have implemented four machine learning algorithms that are Linear Regression, Decision Tree, Random Forest and Support Vector Machine on the dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. Localization of Coordinated Cyber-Physical Attacks in Power Grids Using Moving Target Defense and Machine Learning.
- Author
-
Yu, Jian, Li, Qiang, and Li, Lei
- Subjects
ELECTRIC power distribution grids ,CONVOLUTIONAL neural networks ,MACHINE learning ,CLASSIFICATION algorithms ,SUPPORT vector machines ,RANDOM forest algorithms - Abstract
Coordinated cyber-physical attacks (CCPAs) are dangerously stealthy and have considerable destructive effects against power grids. The problem of stealthy CCPA (SCCPA) localization, specifically identifying disconnected lines in attack, is a nonlinear multi-classification problem. To the best of our knowledge, only one paper has studied the problem; nevertheless, the total number of classifications is not appropriate. In the paper, we propose several methods to solve the problem of SCCPA localization. Firstly, considering the practical constraints and abiding by one of our previous studies, we elaborately determine the total number of classifications and design an approach for generating training and testing datasets. Secondly, we develop two algorithms to solve multiple classifications via the support vector machine (SVM) and random forest (RF), respectively. Similarly, we also present a one-dimensional convolutional neural network (1D-CNN) architecture. Finally, extensive simulations are carried out for IEEE 14-bus, 30-bus, and 118-bus power system, respectively, and we verify the effectiveness of our approaches in solving the problem of SCCPA localization. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. AUTOMATIC DETECTION OF VERBAL DECEPTION IN ROMANIAN WITH ARTIFICIAL INTELLIGENCE METHODS.
- Author
-
CRUDU, MĂLINA
- Subjects
ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,DECEPTION ,NATURAL language processing ,SUPPORT vector machines - Abstract
Automatic deception detection is an important task with several applications in both direct physical human communication, as well as in computer-mediated one. The objective of this paper is to study the nature of deceptive language. The primary goal of this study is to investigate deception in Romanian written communication. We created a number of artificial intelligence models (based on Support Vector Machine, Random Forest, and Artificial Neural Network) to detect dishonesty in a topic-specific corpus. To assess the efficiency of the Linguistic Inquiry and Word Count (LIWC) categories in Romanian, we conducted a comparison between multiple text representations based on LIWC, TF-IDF, and LSA. The results show that in the case of datasets with a common subject such as the one we used regarding friendship, text categorization is more successful using general text representations such as TF-IDF or LSA. The proposed approach achieves an accuracy of the classification of 91.3%, outperforming the similar approaches presented in the literature. These findings have implications in fields like linguistics and opinion mining, where research on this subject in languages other than English is necessary. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Enhancing Command Recognition in Air Traffic Control Through Advanced Classification Techniques.
- Author
-
Srinivasan, Narayanan and Balasundaram, S. R.
- Subjects
AIR traffic control ,SUPPORT vector machines ,MACHINE learning ,ERROR detection (Information theory) ,COMPUTER algorithms - Abstract
This paper addresses the persistent challenges in speech processing within the Air Traffic Control (ATC) domain, a field where despite extensive research, issues such as handling noisy environments, accented speech, and the need for strict adherence to standard phraseology continue to undermine conventional language models. Our study employs a hybrid approach that integrates syntactic analysis with advanced machine learning classification algorithms – Logistic Regression, Lagrangian Support Vector Machine, and Naïve Bayes. By mixing and matching algorithms tailored for specific aspects of speech processing, our approach moves away from traditional reliance on a singular integrated system, illustrating through rigorous testing with the ATCOSIM dataset that such a multifaceted strategy markedly improves command recognition accuracy and adapts more effectively to the unique linguistic features of ATC speech. Results highlight the superior performance of Logistic Regression across various command recognition categories, pointing towards a promising direction for future advancements in ATC speech recognition technologies aimed at reducing human workload and increasing automation precision. This paper explores the complexities of the required analysis techniques and underscores the necessity of employing diverse algorithms in the processing pipeline to enhance overall system accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Emotions unveiled: detecting COVID-19 fake news on social media.
- Author
-
Farhoudinia, Bahareh, Ozturkcan, Selcen, and Kasap, Nihat
- Subjects
FACIAL expression ,EMOTION recognition ,FAKE news ,MACHINE learning ,EMOTIONS ,SOCIAL media ,SUPPORT vector machines - Abstract
The COVID-19 pandemic has highlighted the pernicious effects of fake news, underscoring the critical need for researchers and practitioners to detect and mitigate its spread. In this paper, we examined the importance of detecting fake news and incorporated sentiment and emotional features to detect this type of news. Specifically, we compared the sentiments and emotions associated with fake and real news using a COVID-19 Twitter dataset with labeled categories. By utilizing different sentiment and emotion lexicons, we extracted sentiments categorized as positive, negative, and neutral and eight basic emotions, anticipation, anger, joy, sadness, surprise, fear, trust, and disgust. Our analysis revealed that fake news tends to elicit more negative emotions than real news. Therefore, we propose that negative emotions could serve as vital features in developing fake news detection models. To test this hypothesis, we compared the performance metrics of three machine learning models: random forest, support vector machine (SVM), and Naïve Bayes. We evaluated the models' effectiveness with and without emotional features. Our results demonstrated that integrating emotional features into these models substantially improved the detection performance, resulting in a more robust and reliable ability to detect fake news on social media. In this paper, we propose the use of novel features and methods that enhance the field of fake news detection. Our findings underscore the crucial role of emotions in detecting fake news and provide valuable insights into how machine-learning models can be trained to recognize these features. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Effect of the One-to-Many Relationship between the Depth and Spectral Profile on Shallow Water Depth Inversion Based on Sentinel-2 Data.
- Author
-
Huang, Erhui, Chen, Benqing, Luo, Kai, and Chen, Shuhan
- Subjects
DEPTH profiling ,MACHINE learning ,STANDARD deviations ,SUPPORT vector machines ,RANDOM forest algorithms - Abstract
In shallow water, Sentinel-2 multispectral imagery has only four visible bands and limited quantization levels, which easily leads to the occurrence of the same spectral profile but different depth (SSPBDD) phenomenon, resulting in a one-to-many relationship between water depth and spectral profile. Investigating the impact of this relationship on water depth inversion models is the main objective of this paper. The Stumpf model and three machine learning models (Random Forest, Support Vector Machine, and Mixture Density Network) are employed, and the performance of these models is analysed based on the spatial distribution of the training dataset and the input information composition of these models. The results show that the root mean square errors (RMSEs) of the depth inversion of Random Forest and Support Vector Machine are significantly affected by the spatial distribution of the training dataset, while minimal effects are observed for the Stumpf model and the Mixture Density Network model. The SSPBDD phenomenon is widespread in Sentinel-2 images at all depths, particularly between 5 m and 15 m, with most of the depth maximum difference of the SSPBDD pixels ranging from 0 to 5 m. The SSPBDDs phenomenon can significantly reduce the inversion accuracy of any model. The number and the depth maximum difference of the SSPBDDs pixels are the main influencing factors. However, by increasing the visible spectral information and the spatial neighbourhood information in the input layer of machine learning models, the inversion accuracy and stability of the models can be improved to a certain extent. Among the models, the Mixture Density Network achieves the best inversion accuracy and stability. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Application of Computational Intelligence Methods for the Automated Identification of Paper-Ink Samples Based on LIBS.
- Author
-
Rzecki, Krzysztof, Sośnicki, Tomasz, Baran, Mateusz, Niedźwiecki, Michał, Król, Małgorzata, Łojewski, Tomasz, Acharya, U Rajendra, Yildirim, Özal, and Pławiak, Paweł
- Subjects
- *
LASER-induced breakdown spectroscopy , *SPECTRUM analysis , *COMPUTATIONAL intelligence , *K-nearest neighbor classification , *SUPPORT vector machines , *PATTERN recognition systems - Abstract
Laser-induced breakdown spectroscopy (LIBS) is an important analysis technique with applications in many industrial branches and fields of scientific research. Nowadays, the advantages of LIBS are impaired by the main drawback in the interpretation of obtained spectra and identification of observed spectral lines. This procedure is highly time-consuming since it is essentially based on the comparison of lines present in the spectrum with the literature database. This paper proposes the use of various computational intelligence methods to develop a reliable and fast classification of quasi-destructively acquired LIBS spectra into a set of predefined classes. We focus on a specific problem of classification of paper-ink samples into 30 separate, predefined classes. For each of 30 classes (10 pens of each of 5 ink types combined with 10 sheets of 5 paper types plus empty pages), 100 LIBS spectra are collected. Four variants of preprocessing, seven classifiers (decision trees, random forest, k-nearest neighbor, support vector machine, probabilistic neural network, multi-layer perceptron, and generalized regression neural network), 5-fold stratified cross-validation, and a test on an independent set (for methods evaluation) scenarios are employed. Our developed system yielded an accuracy of 99.08%, obtained using the random forest classifier. Our results clearly demonstrates that machine learning methods can be used to identify the paper-ink samples based on LIBS reliably at a faster rate. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
37. Study of a high-precision complex 3D geological modelling method based on a fine KNN and kriging coupling algorithm: a case study for Jiangsu, China.
- Author
-
Liu, Xiaozheng, Zhang, Peng, Guo, Yakun, Ma, Guotao, Liu, Ming, Jiang, Shui-Hua, Xue, Zhiwen, and Zheng, Jun
- Subjects
GEOLOGICAL modeling ,KRIGING ,MACHINE learning ,VORONOI polygons ,TRIANGLES ,K-nearest neighbor classification ,SUPPORT vector machines ,DATA encryption - Abstract
A high-precision, complex, three-dimensional (3D) geological model can directly express the attributes of stratum thickness, geological structure, lithology and spatial form, which can provide a reliable basis for the development and utilization of underground space and planning decisions. However, it is difficult to perform accurate modelling due to the lack of basic data. As such, this paper proposes coupling a machine learning algorithm (K-nearest neighbour (KNN)) with the kriging algorithm to construct the topological relationship between the Delaunay triangle and the Thiessen polygon in order to perform the simulation and prediction of virtual drilling. Based on KNN, support vector machine (SVM) and neural network algorithms as well as the virtual borehole encryption data, data standardization processing and analysis are carried out. Through model verification, algorithm optimization is realized, and the optimal modelling method is explored. The results show that the fine KNN algorithm improved by Bayesian optimization can effectively improve the modelling accuracy through 0.1-m encryption, standardization processing and 5-fold cross-validation. Stratum modelling combined with the fine KNN and kriging algorithms can obtain a more accurate modelling without adding virtual boreholes. The improved levels of upper and lower hybrid modelling with an appropriate number of profile boreholes can also effectively optimize model accuracy. Both modelling accuracy and efficiency can be significantly improved by using Delaunay triangles and Thiessen polygons with virtual boreholes. Stratum modelling can effectively express the geological pinch-out in areas with adequate degrees of stratification, and hybrid modelling performs well in irregular geological bodies such as karsts and lenses. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Enhancing Credit Card Fraud Detection: An Ensemble Machine Learning Approach.
- Author
-
Khalid, Abdul Rehman, Owoh, Nsikak, Uthmani, Omair, Ashawa, Moses, Osamor, Jude, and Adejoh, John
- Subjects
CREDIT card fraud ,FRAUD investigation ,MACHINE learning ,SUPPORT vector machines ,K-nearest neighbor classification - Abstract
In the era of digital advancements, the escalation of credit card fraud necessitates the development of robust and efficient fraud detection systems. This paper delves into the application of machine learning models, specifically focusing on ensemble methods, to enhance credit card fraud detection. Through an extensive review of existing literature, we identified limitations in current fraud detection technologies, including issues like data imbalance, concept drift, false positives/negatives, limited generalisability, and challenges in real-time processing. To address some of these shortcomings, we propose a novel ensemble model that integrates a Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Random Forest (RF), Bagging, and Boosting classifiers. This ensemble model tackles the dataset imbalance problem associated with most credit card datasets by implementing under-sampling and the Synthetic Over-sampling Technique (SMOTE) on some machine learning algorithms. The evaluation of the model utilises a dataset comprising transaction records from European credit card holders, providing a realistic scenario for assessment. The methodology of the proposed model encompasses data pre-processing, feature engineering, model selection, and evaluation, with Google Colab computational capabilities facilitating efficient model training and testing. Comparative analysis between the proposed ensemble model, traditional machine learning methods, and individual classifiers reveals the superior performance of the ensemble in mitigating challenges associated with credit card fraud detection. Across accuracy, precision, recall, and F1-score metrics, the ensemble outperforms existing models. This paper underscores the efficacy of ensemble methods as a valuable tool in the battle against fraudulent transactions. The findings presented lay the groundwork for future advancements in the development of more resilient and adaptive fraud detection systems, which will become crucial as credit card fraud techniques continue to evolve. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Intrusion Detection System (IDS) in Cloud Computing using Machine Learning Algorithms: A Comparative Study.
- Author
-
Rathod, Ganesh, Sabnis, Vikrant, and Jain, Jay Kumar
- Subjects
INTRUSION detection systems (Computer security) ,MACHINE learning ,CLOUD computing ,LITERATURE reviews ,SUPPORT vector machines ,RANDOM forest algorithms - Abstract
Organizations have witnessed a significant transformation in the realm of data storage and processing owing to the advent of cloud computing. Nonetheless, with its advantages, cloud computing has also brought forth new security concerns that need to be addressed. In this regard, the use of Intrusion Detection Systems (IDSs) has become indispensable for identifying and preventing a wide range of attacks that may occur in cloud computing environments, thereby ensuring data security. In modern years, machine learning (ML) algorithms have emerged as a promising approach for IDSs, as they can analyze huge amounts of data and identify patterns that may not be detectable by traditional rule-based IDSs. This review paper presents a comprehensive analysis of ML-based IDSs & Tradition-based IDSs for intrusion detection in cloud computing environments. The literature review covers various Traditional & ML algorithms used for intrusion detection, including decision trees, Neural Networks (NN), Support Vector machines (SVM), random forests, and k-nearest neighbours. The performance evaluation metrics used in this review paper include accuracy and false positive rate. These metrics are generally used to evaluate the performance of IDS and ML algorithms. The results of the analysis indicate that ML-based IDSs outperform traditional IDSs in terms of accuracy and false positive rate. However, ML-based IDSs may also have limitations, such as a high rate of false negatives and the usefulness of huge amounts of training data. Overall, the analysis suggests that ML-based IDSs have the potential to improve the usefulness of intrusion detection in cloud computing environments, but further research is needed to address the limitations of these systems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
40. Visual detection and tracking algorithms for human motion.
- Author
-
Yang, Ge and Chen, Siping
- Subjects
TRACKING algorithms ,SUPPORT vector machines ,MACHINE learning ,ARTIFICIAL satellite tracking - Abstract
In dense scenes, a large number of individuals can introduce serious complications for motion detection, such as blurred vision, chaotic scenes, and complex behaviours. For low-density pedestrian detection and tracking algorithms, the accuracy is greatly reduced for both detection and tracking. High-density detection or tracking fails too when these problems are encountered in high-density scenes. In light of the above problems, a detection algorithm and a tracking algorithm based on the human head and shoulder model are proposed. A support vector machine is used to train the classifier by machine learning. The detection algorithm proposed in this paper achieves a detection accuracy of 94% by using the MIT and INRIA datasets. The average accuracy of pedestrian tracking in high-density scenes is approximately 95%. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
41. Recent Advances and Applications of Textile Technology in Patient Monitoring.
- Author
-
Stern, Lindsay and Roshan Fekr, Atena
- Subjects
SLEEP quality ,SUPPORT vector machines ,TEXTILES ,VITAL signs ,PRESSURE ulcers ,WEARABLE technology ,MACHINE learning ,PATIENT monitoring ,SLEEP ,BODY movement ,HEART beat ,TECHNOLOGY ,ARTIFICIAL neural networks ,ALGORITHMS - Abstract
Sleep monitoring has become a prevalent area of research where body position and physiological data, such as heart rate and respiratory rate, are monitored. Numerous critical health problems are associated with poor sleep, such as pressure sore development, sleep disorders, and low sleep quality, which can lead to an increased risk of falls, cardiovascular diseases, and obesity. Current monitoring systems can be costly, laborious, and taxing on hospital resources. This paper reviews the most recent solutions for contactless textile technology in the form of bed sheets or mats to monitor body positions, vital signs, and sleep, both commercially and in the literature. This paper is organized into four categories: body position and movement monitoring, physiological monitoring, sleep monitoring, and commercial products. A detailed performance evaluation was carried out, considering the detection accuracy as well as the sensor types and algorithms used. The areas that need further research and the challenges for each category are discussed in detail. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
42. A FEATURE EXTRACTION BASED IMPROVED SENTIMENT ANALYSIS ON APACHE SPARK FOR REAL-TIME TWITTER DATA.
- Author
-
KANUNGO, PIYUSH and SINGH, HARI
- Subjects
SENTIMENT analysis ,FEATURE extraction ,CLASSIFICATION algorithms ,SUPPORT vector machines ,RANDOM forest algorithms - Abstract
This paper aims to improve the accuracy of sentiment analysis on Apache Spark for a real-time general twitter data. A lot of works exist on sentiment analysis on offline or stored twitter data that uses several classification algorithms on relevant features extracted using well-known feature extraction methodologies on pre-processed text data. However, not much works exist for sentiment analysis of real-time twitter data and especially for the generic data on big data processing platforms such as Apache Spark. This paper proposes a real-time sentiment analysis for generic twitter data through Apache Spark using six classification algorithms on N-gram and Term Frequency - Inverse Document Frequency (TF-IDF) feature extraction methodologies on the pre-processed data. An exhaustive comparison is done using Logistic Regression (LR), Multinomial Naive Bayes (MNB), Random Forest Classfier(RFC), Support Vector Machine (SVM), K-Nearest Neighbour (K-NN), and Decision Tree (DT) classification algorithms. It is observed that the trigram feature extraction method performs the best on LR and SVM and the RFC results are also comparable on the considered general tweets data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
43. APPLICATION OF MACHINE LEARNING FOR RECOGNIZING SURFACE WELDING DEFECTS IN VIDEO SEQUENCES.
- Author
-
Yemelyanova, Mariya and Smailova, Saule
- Subjects
MACHINE learning ,WELDING defects ,COMPUTER vision ,FEATURE extraction ,SUPPORT vector machines - Abstract
The paper offers a solution to the problem of detecting and recognizing surface defects in welded joints that appear during tungsten inert gas welding of metal edges. This problem belongs to the machine vision. Welding of stainless-steel edges is carried out automatically on the pipe production line. Therefore, frames of video sequences are investigated. Images of some welding defects are shown in the paper. An algorithm proposed by the authors is used to detect welding defects in the video sequence frames, the efficiency of which has been confirmed experimentally. The problem solution of welding defects recognition is based on the use of traditional machine learning methods: support vector machine and artificial neural network. To build classification models, a labeled dataset containing automatically extracted texture features from the areas of welding defects detected in the video sequences was created. An analysis was performed to identify the strength of the correlation of texture features between each other and the dependent variable in the dataset for dimensionality reduction of the feature vector. The models were trained and tested on datasets with different numbers of features. The quality of the classification models was evaluated based on the accuracy metric values. The best results were achieved by the classifier built using the support vector machine with a chi-square kernel on a training sample with two features. The build models allow automatic recognition of such welding defects as lack of fusion and metal oxidation. The computational experiments with real video sequences obtained with a digital camera confirmed the possibility of using the proposed solution for recognizing surface welding defects in the process of manufacturing stainless steel pipes. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. A review of optimization algorithms in SVM parameters.
- Author
-
Hussein, Hussein Ibrahim and Anwar, Said Amirul
- Subjects
OPTIMIZATION algorithms ,SUPPORT vector machines ,MACHINE learning - Abstract
The SVM is a widely known machine learning, which is very useful for regression applications and pattern classification. These machines have been used successfully in several domains to address numerous real-world challenges. In this context, parameter optimisation for an SVM is a widely researched topic, which has attracted attention from several research domains. Algorithms facilitating optimisation have been of greater interest compared to other algorithms. Algorithmic approaches allow the optimal parameters for an SVM to be determined, after which the model can be adapted for several other applications. During the last two decades, several enhancements have been brought about to facilitate better optimisation of SVM models to offer enhanced performance. This paper focuses on the several algorithms currently employed to optimise support vector machines in their basic and modified forms. This paper comprises a comprehensive analysis of algorithms and aims to ascertain the present challenges relating to algorithms used for SVM parameter optimisation. This study cannot evaluate all the details; however, the significant theoretical aspects are covered using references to existing literature. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
45. CNN-VAE: An intelligent text representation algorithm.
- Author
-
Xu, Saijuan, Guo, Canyang, Zhu, Yuhan, Liu, Genggeng, and Xiong, Neal
- Subjects
CONVOLUTIONAL neural networks ,BIG data ,MACHINE learning ,POLYSEMY ,SUPPORT vector machines ,K-nearest neighbor classification ,ALGORITHMS - Abstract
Collecting and analyzing data from all devices to improve the efficiency of business processes is an important task of Industrial Internet of Things (IIoT). In the age of data explosion, extensive text data generated by the IIoT have given birth to a variety of text representation methods. The task of text representation is to convert the natural language to a form that computer can understand with retaining the original semantics. However, these methods are difficult to effectively extract the semantic features among words and distinguish polysemy in natural language. Combining the advantages of convolutional neural network (CNN) and variational autoencoder (VAE), this paper proposes an intelligent CNN-VAE text representation algorithm as an advanced learning method for social big data within next-generation IIoT, which help users identify the information collected by sensors and perform further processing. This method employs the convolution layer to capture the local features of the context and uses the variational technique to reconstruct feature space to make it conform to the normal distribution. In addition, the improved word2vec model based on topical word embedding (TWE) is utilized to add topical information to word vectors to distinguish polysemy. This paper takes the social big data as an example to illustrate the way of the proposed algorithm applied in the next-generation IIoT and utilizes Cnews dataset to verify the performance of proposed method with four evaluating metrics (i.e., recall, accuracy, precision, and F1-score). Experimental results indicate that the proposed method outperforms word2vec-avg and CNN-AE in K-nearest neighbor (KNN), random forest (RF), and support vector machine (SVM) classifiers and distinguishes polysemy effectively. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
46. A Tri-Training method for lithofacies identification under scarce labeled logging data.
- Author
-
Zhu, Xinyi, Zhang, Hongbing, Ren, Quan, Zhang, Dailu, Zeng, Fanxing, Zhu, Xinjie, and Zhang, Lingyuan
- Subjects
DATA logging ,LITHOFACIES ,SUPPORT vector machines ,RANDOM forest algorithms ,SECURE Sockets Layer (Computer network protocol) ,DECISION trees ,SUPERVISED learning - Abstract
Lithofacies identification is critical to energy exploration and reservoir evaluation. Machine learning provides a way to use logging data for lithofacies intelligence identification. However, labeled logging data are usually scarce, which makes the currently used supervised algorithms less effective, so semi-supervised methods have received attention from researchers. In this paper, we propose to apply Tri-Training to the field of lithofacies recognition. The framework used Random Forest (RF), Gradient-Boosted Decision Trees (GBDT), and Support Vector Machine (SVM), as the baseline supervised classifiers, and based on the idea of inductive semi-supervised methods and ensemble learning. Baseline classifiers are trained and iterated using unlabeled data to obtain effect improvement. The final results are output in an ensemble paradigm. We used seven logging parameters from two wells as input and divide the data randomly 10 times for training and testing. With only five samples of each lithology, the prediction accuracy improved by the average of 2.1% and 14.5% in both wells compared to the baseline methods. In addition, we also compared two commonly used semi-supervised methods, label propagation algorithm (LPA) and Co-Training. The experimental results also confirm that Tri-training has the better and more stable performance. The Tri-training method in this paper can be effectively applied to lithofacies identification under scarce labeled logging data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
47. Machine Learning Applications in Surface Transportation Systems: A Literature Review.
- Author
-
Behrooz, Hojat and Hayeri, Yeganeh M.
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,INTELLIGENT transportation systems ,LITERATURE reviews ,CONVOLUTIONAL neural networks ,DEEP learning ,SUPPORT vector machines - Abstract
Surface transportation has evolved through technology advancements using parallel knowledge areas such as machine learning (ML). However, the transportation industry has not yet taken full advantage of ML. To evaluate this gap, we utilized a literature review approach to locate, categorize, and synthesize the principal concepts of research papers regarding surface transportation systems using ML algorithms, and we then decomposed them into their fundamental elements. We explored more than 100 articles, literature review papers, and books. The results show that 74% of the papers concentrate on forecasting, while multilayer perceptions, long short-term memory, random forest, supporting vector machine, XGBoost, and deep convolutional neural networks are the most preferred ML algorithms. However, sophisticated ML algorithms have been minimally used. The root-cause analysis revealed a lack of effective collaboration between the ML and transportation experts, resulting in the most accessible transportation applications being used as a case study to test or enhance a given ML algorithm and not necessarily to enhance a mobility or safety issue. Additionally, the transportation community does not define transportation issues clearly and does not provide publicly available transportation datasets. The transportation sector must offer an open-source platform to showcase the sector's concerns and build spatiotemporal datasets for ML experts to accelerate technology advancements. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
48. SVM-BASED STOCK MARKET PRICE PREDICTION METHODS: AN ADVANCED REVIEW.
- Author
-
VISHWAKARMA, VIJAY KUMAR and BHOSALE, NARAYAN P.
- Subjects
MARKET prices ,STOCK exchanges ,MARKET pricing ,MARKET sentiment ,SUPPORT vector machines - Abstract
This paper offers a concise analysis of the strategies currently in use for stock price prediction by retail investors. The price may increase or decrease according to the quarterly results, financial news flow, technical behavior, or market sentiment resulting from recent developments worldwide. This paper discussed the accuracy of many proposed approaches and methodologies for predicting stock price movement. The Support Vector Machine (SVM) is the foundation of the approaches, with additional parameters and variables. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
49. A CEEMD-ARIMA-SVM model with structural breaks to forecast the crude oil prices linked with extreme events.
- Author
-
Cheng, Yuxiang, Yi, Jiayu, Yang, Xiaoguang, Lai, Kin Keung, and Seco, Luis
- Subjects
PETROLEUM sales & prices ,STRUCTURAL models ,HILBERT-Huang transform ,SUPPORT vector machines ,MOVING average process - Abstract
This paper develops an integrated framework to forecast the volatility of crude oil prices by considering the impacts of extreme events (structural breaks). The impacts of extreme events are vital to improving prediction accuracy. Aiming to demonstrate the crude oil price fluctuation and the impacts of external events, this paper employs the complementary ensemble empirical mode decomposition (CEEMD). It decomposes the crude oil price into some constituents at various frequencies to extract a market fluctuation, a shock from extreme events and a long-term trend. The shock from extreme events is found to be the most crucial element in deciding the crude oil prices. Then we combine the iterative cumulative sum of squares (ICSS) test with the Chow test to get the structural breaks and analyze the extreme event impacts. Finally, this paper combines the structural breaks, the autoregressive integrated moving average (ARIMA) model, and the support vector machine (SVM) to make a forecast of the crude oil prices. The empirical process proves that the CEEMD-ARIMA-SVM model with structural breaks performs the best when compared with the other ARIMA-type models and SVM-type models. The framework offers an insightful view to help decision-makers and can be used in many areas. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
50. A Novel Architecture for Diabetes Patients' Prediction Using K-Means Clustering and SVM.
- Author
-
Arora, Nitin, Singh, Anupam, Al-Dabagh, Mustafa Zuhaer Nayef, and Maitra, Sumit Kumar
- Subjects
K-means clustering ,PEOPLE with diabetes ,BLOOD sugar ,SUPPORT vector machines ,MACHINE learning - Abstract
Diabetes is one of the alarming issues in today's era. It is a chronic disease that may cause many health-related problems. It is a group of syndrome that results in too much sugar in the blood. Diabetes's chronic hyperglycemia has been linked to long-term damage, organ breakdown, and organ failure, notably in the eyes, kidneys, nerves, heart, and veins. Machine learning has quickly advanced, and it is now used in many facets of medical health. The goal of this research is to create a model with the highest level of accuracy that can predict a patient's chance of developing diabetes. This paper proposes a novel architecture for predicting diabetes patients using the K-means clustering technique and support vector machine (SVM). The features extracted from K-means are then classified using an SVM classifier. A publicly available dataset, namely, the Pima Indians Diabetes Database, is tested using this approach. Accuracy of 98.7% is noted on the used dataset. On this dataset, the combined method performs better than the conventional SVM-based classification. This paper also compared the accuracy, precision, recall, and F1-score of the different machine learning techniques for classifying diabetes patients. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.