3,841 results
Search Results
2. An Effective Image Classification Method for Plant Diseases with Improved Channel Attention Mechanism aECAnet Based on Deep Learning.
- Author
-
Yang, Wenqiang, Yuan, Ying, Zhang, Donghua, Zheng, Liyuan, and Nie, Fuquan
- Subjects
DEEP learning ,IMAGE recognition (Computer vision) ,CONVOLUTIONAL neural networks ,PLANT classification ,PLANT diseases ,PLANT productivity ,NOSOLOGY - Abstract
Since plant diseases occurring during the growth process are a significant factor leading to the decline in both yield and quality, the classification and detection of plant leaf diseases, followed by timely prevention and control measures, are crucial for safeguarding plant productivity and quality. As the traditional convolutional neural network structure cannot effectively recognize similar plant leaf diseases, in order to more accurately identify the diseases on plant leaves, this paper proposes an effective plant disease image recognition method aECA-ResNet34. This method is based on ResNet34, and in the first and the last layers of this network, respectively, we add this paper's improved aECAnet with the symmetric structure. aECA-ResNet34 is compared with different plant disease classification models on the peanut dataset constructed in this paper and the open-source PlantVillage dataset. The experimental results show that the aECA-ResNet34 model proposed in this paper has higher accuracy, better performance, and better robustness. The results show that the aECA-ResNet34 model proposed in this paper is able to recognize diseases of multiple plant leaves very accurately. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Exploring the Processing Paradigm of Input Data for End-to-End Deep Learning in Tool Condition Monitoring.
- Author
-
Wang, Chengguan, Wang, Guangping, Wang, Tao, Xiong, Xiyao, Ouyang, Zhongchuan, and Gong, Tao
- Subjects
CONVOLUTIONAL neural networks ,MACHINE learning ,COMPUTER input design ,STANDARD deviations ,SIGNAL processing ,DEEP learning - Abstract
Tool condition monitoring technology is an indispensable part of intelligent manufacturing. Most current research focuses on complex signal processing techniques or advanced deep learning algorithms to improve prediction performance without fully leveraging the end-to-end advantages of deep learning. The challenge lies in transforming multi-sensor raw data into input data suitable for direct model feeding, all while minimizing data scale and preserving sufficient temporal interpretation of tool wear. However, there is no clear reference standard for this so far. In light of this, this paper innovatively explores the processing methods that transform raw data into input data for deep learning models, a process known as an input paradigm. This paper introduces three new input paradigms: the downsampling paradigm, the periodic paradigm, and the subsequence paradigm. Then an improved hybrid model that combines a convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM) was employed to validate the model's performance. The subsequence paradigm demonstrated considerable superiority in prediction results based on the PHM2010 dataset, as the newly generated time series maintained the integrity of the raw data. Further investigation revealed that, with 120 subsequences and the temporal indicator being the maximum value, the model's mean absolute error (MAE) and root mean square error (RMSE) were the lowest after threefold cross-validation, outperforming several classical and contemporary methods. The methods explored in this paper provide references for designing input data for deep learning models, helping to enhance the end-to-end potential of deep learning models, and promoting the industrial deployment and practical application of tool condition monitoring systems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Amur Tiger Individual Identification Based on the Improved InceptionResNetV2.
- Author
-
Wu, Ling, Jinma, Yongyi, Wang, Xinyang, Yang, Feng, Xu, Fu, Cui, Xiaohui, and Sun, Qiao
- Abstract
Simple Summary: Accurate identification of individual Amur tigers is vital for their conservation, as it helps us understand their population and distribution. Existing identification methods often fall short in accuracy, and our study focuses on creating a more accurate method for identifying individual Amur tigers using advanced deep learning techniques. We improved an existing neural network model called InceptionResNetV2 by adding features like dropout layers and dual-attention mechanisms to better capture the unique stripe patterns of each tiger and reduce errors during training. We tested our model on a large dataset of tiger images and found it to be highly effective, achieving an average recognition accuracy of over 95% for different body parts, with left stripes reaching the highest 99.37%. This method significantly outperforms previous models and provides a reliable tool for wildlife researchers and conservationists to monitor and protect Amur tigers. By improving the ability to track individual tigers, our research offers practical benefits for preserving this endangered species and enhancing wildlife management practices globally. Accurate and intelligent identification of rare and endangered individuals of flagship wildlife species, such as Amur tiger (Panthera tigris altaica), is crucial for understanding population structure and distribution, thereby facilitating targeted conservation measures. However, many mathematical modeling methods, including deep learning models, often yield unsatisfactory results. This paper proposes an individual recognition method for Amur tigers based on an improved InceptionResNetV2 model. Initially, the YOLOv5 model is employed to automatically detect and segment facial, left stripe, and right stripe areas from images of 107 individual Amur tigers, achieving a high average classification accuracy of 97.3%. By introducing a dropout layer and a dual-attention mechanism, we enhance the InceptionResNetV2 model to better capture the stripe features of individual tigers at various granularities and reduce overfitting during training. Experimental results demonstrate that our model outperforms other classic models, offering optimal recognition accuracy and ideal loss changes. The average recognition accuracy for different body part features is 95.36%, with left stripes achieving a peak accuracy of 99.37%. These results highlight the model's excellent recognition capabilities. Our research provides a valuable and practical approach to the individual identification of rare and endangered animals, offering significant potential for improving conservation efforts. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Short-Term Power Load Forecasting Based on Secondary Cleaning and CNN-BILSTM-Attention.
- Author
-
Wang, Di, Li, Sha, and Fu, Xiaojin
- Abstract
Accurate power load forecasting can provide crucial insights for power system scheduling and energy planning. In this paper, to address the problem of low accuracy of power load prediction, we propose a method that combines secondary data cleaning and adaptive variational mode decomposition (VMD), convolutional neural networks (CNN), bi-directional long short-term memory (BILSTM), and adding attention mechanism (AM). The Inner Mongolia electricity load data were first cleaned use the K-means algorithm, and then further refined with the density-based spatial clustering of applications with the noise (DBSCAN) algorithm. Subsequently, the parameters of the VMD algorithm were optimized using a multi-strategy Cubic-T dung beetle optimization algorithm (CTDBO), after which the VMD algorithm was employed to decompose the twice-cleaned load sequences into a number of intrinsic mode functions (IMFs) with different frequencies. These IMFs were then used as inputs to the CNN-BILSTM-Attention model. In this model, a CNN is used for feature extraction, BILSTM for extracting information from the load sequence, and AM for assigning different weights to different features to optimize the prediction results. It is proved experimentally that the model proposed in this paper achieves the highest prediction accuracy and robustness compared to other models and exhibits high stability across different time periods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Deep Learning for Pneumonia Detection in Chest X-ray Images: A Comprehensive Survey.
- Author
-
Siddiqi, Raheel and Javaid, Sameena
- Abstract
This paper addresses the significant problem of identifying the relevant background and contextual literature related to deep learning (DL) as an evolving technology in order to provide a comprehensive analysis of the application of DL to the specific problem of pneumonia detection via chest X-ray (CXR) imaging, which is the most common and cost-effective imaging technique available worldwide for pneumonia diagnosis. This paper in particular addresses the key period associated with COVID-19, 2020–2023, to explain, analyze, and systematically evaluate the limitations of approaches and determine their relative levels of effectiveness. The context in which DL is applied as both an aid to and an automated substitute for existing expert radiography professionals, who often have limited availability, is elaborated in detail. The rationale for the undertaken research is provided, along with a justification of the resources adopted and their relevance. This explanatory text and the subsequent analyses are intended to provide sufficient detail of the problem being addressed, existing solutions, and the limitations of these, ranging in detail from the specific to the more general. Indeed, our analysis and evaluation agree with the generally held view that the use of transformers, specifically, vision transformers (ViTs), is the most promising technique for obtaining further effective results in the area of pneumonia detection using CXR images. However, ViTs require extensive further research to address several limitations, specifically the following: biased CXR datasets, data and code availability, the ease with which a model can be explained, systematic methods of accurate model comparison, the notion of class imbalance in CXR datasets, and the possibility of adversarial attacks, the latter of which remains an area of fundamental research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Digital Self-Interference Cancellation for Full-Duplex Systems Based on CNN and GRU.
- Author
-
Liu, Jun and Ding, Tian
- Subjects
CONVOLUTIONAL neural networks ,TELECOMMUNICATION systems ,POLYNOMIALS ,SIGNALS & signaling - Abstract
Self-interference (SI) represents a bottleneck in the performance of full-duplex (FD) communication systems, necessitating robust offsetting techniques to unlock the potential of FD systems. Currently, deep learning has been leveraged within the communication domain to address specific challenges and enhance efficiency. Inspired by this, this paper reviews the self-interference cancellation (SIC) process in the digital domain focusing on SIC capability. The paper introduces a model architecture that integrates CNN and gated recurrent unit (GRU), while also incorporating residual networks and self-attention mechanisms to enhance the identification and elimination of SI. This model is named CGRSA-Net. Firstly, CNN is employed to capture local signal features in the time–frequency domain. Subsequently, a ResNet module is introduced to mitigate the gradient vanishing problem. Concurrently, GRU is utilized to dynamically capture and retain both long- and short-term dependencies during the communication process. Lastly, by integrating the self-attention mechanism, attention weights are flexibly assigned when processing sequence data, thereby focusing on the most important parts of the input sequence. Experimental results demonstrate that the proposed CGRSA-Net model achieves a minimum of 28% improvement in nonlinear SIC capability compared to polynomial and existing neural network-based eliminator. Additionally, through ablation experiments, we demonstrate that the various modules utilized in this paper effectively learn signal features and further enhance SIC performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Optimizing Parameter Extraction in Grid Information Models Based on Improved Convolutional Neural Networks.
- Author
-
Li, Xintong and Liu, Xiangjun
- Subjects
CONVOLUTIONAL neural networks ,ENGINEERING equipment ,ARTIFICIAL intelligence ,ENGINEERING design ,IMAGE recognition (Computer vision) ,DIGITAL technology - Abstract
With the rapid advancement of digital technology, three-dimensional designs of Grid Information Models (GIMs) are increasingly applied in the power industry. Addressing the challenges of extracting key parameters during the GIM's process for power grid equipment, this paper explores an innovative approach that integrates artificial intelligence with image recognition technologies into power design engineering. The traditional methods of "template matching, feature extraction and classification, and symbol recognition" have enabled the automated processing of electrical grid equipment engineering drawings, allowing for the extraction of key information related to grid equipment. However, these methods still rely on manually designed and selected feature regions, which limits their potential for achieving full automation. This study introduces an optimized algorithm that combines enhanced Convolutional Neural Networks (CNNs) with Depth-First Search (DFS) strategies, and is specifically designed for the automated extraction of crucial GIM parameters from power grid equipment. Implemented on the design schematics of power engineering projects, this algorithm utilizes an improved CNN to precisely identify component symbols on schematics, and continues to extract text data associated with these symbols. Utilizing a scene text detector, the text data are matched with corresponding component symbols. Finally, the DFS strategy is applied to identify connections between these component symbols in the diagram, thus facilitating the automatic extraction of key GIM parameters. Experimental results demonstrate that this optimized algorithm can accurately identify basic GIM parameters, providing technical support for the automated extraction of parameters using the GIM. This study's recognition accuracy is 91.31%, while a traditional CNN achieves 71.23% and a Faster R-CNN achieves 89.59%. Compared to existing research, the main innovation of this paper lies in the application of the combined enhanced CNN and DFS strategies for the extraction of GIM parameters in the power industry. This method not only improves the accuracy of parameter extraction but also significantly enhances processing speed, enabling the rapid and effective identification and extraction of critical information in complex power design environments. Moreover, the automated process reduces manual intervention, offering a novel solution in the field of power design. These features make this research broadly applicable and of significant practical value in the construction and maintenance of smart grids. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. On-Board Multi-Class Geospatial Object Detection Based on Convolutional Neural Network for High Resolution Remote Sensing Images.
- Author
-
Shen, Yanyun, Liu, Di, Chen, Junyi, Wang, Zhipan, Wang, Zhe, and Zhang, Qingling
- Subjects
OBJECT recognition (Computer vision) ,CONVOLUTIONAL neural networks ,REMOTE-sensing images ,REMOTE sensing ,DATA transmission systems ,URBAN planning ,OPTICAL remote sensing - Abstract
Multi-class geospatial object detection in high-resolution remote sensing images has significant potential in various domains such as industrial production, military warning, disaster monitoring, and urban planning. However, the traditional process of remote sensing object detection involves several time-consuming steps, including image acquisition, image download, ground processing, and object detection. These steps may not be suitable for tasks with shorter timeliness requirements, such as military warning and disaster monitoring. Additionally, the transmission of massive data from satellites to the ground is limited by bandwidth, resulting in time delays and redundant information, such as cloud coverage images. To address these challenges and achieve efficient utilization of information, this paper proposes a comprehensive on-board multi-class geospatial object detection scheme. The proposed scheme consists of several steps. Firstly, the satellite imagery is sliced, and the PID-Net (Proportional-Integral-Derivative Network) method is employed to detect and filter out cloud-covered tiles. Subsequently, our Manhattan Intersection over Union (MIOU) loss-based YOLO (You Only Look Once) v7-Tiny method is used to detect remote-sensing objects in the remaining tiles. Finally, the detection results are mapped back to the original image, and the truncated NMS (Non-Maximum Suppression) method is utilized to filter out repeated and noisy boxes. To validate the reliability of the scheme, this paper creates a new dataset called DOTA-CD (Dataset for Object Detection in Aerial Images-Cloud Detection). Experiments were conducted on both ground and on-board equipment using the AIR-CD dataset, DOTA dataset, and DOTA-CD dataset. The results demonstrate the effectiveness of our method. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Sentiment Analysis of Comment Data Based on BERT-ETextCNN-ELSTM.
- Author
-
Deng, Lujuan, Yin, Tiantian, Li, Zuhe, and Ge, Qingxia
- Subjects
SENTIMENT analysis ,CONVOLUTIONAL neural networks ,DATA analysis ,MATHEMATICAL convolutions ,TASK analysis - Abstract
With the rapid popularity and continuous development of social networks, users' communication and interaction through platforms such as microblogs and forums have become more and more frequent. The comment data on these platforms reflect users' opinions and sentiment tendencies, and sentiment analysis of comment data has become one of the hot spots and difficulties in current research. In this paper, we propose a BERT-ETextCNN-ELSTM (Bidirectional Encoder Representations from Transformers–Enhanced Convolution Neural Networks–Enhanced Long Short-Term Memory) model for sentiment analysis. The model takes text after word embedding and BERT encoder processing and feeds it to an optimized CNN layer for convolutional operations in order to extract local features of the text. The features from the CNN layer are then fed into the LSTM layer for time-series modeling to capture long-term dependencies in the text. The experimental results proved that compared with TextCNN (Convolution Neural Networks), LSTM (Long Short-Term Memory), TextCNN-LSTM (Convolution Neural Networks–Long Short-Term Memory), and BiLSTM-ATT (Bidirectional Long Short-Term Memory Network–Attention), the model proposed in this paper was more effective in sentiment analysis. In the experimental data, the model reached a maximum of 0.89, 0.88, and 0.86 in terms of accuracy, F1 value, and macro-average F1 value, respectively, on both datasets, proving that the model proposed in this paper was more effective in sentiment analysis of comment data. The proposed model achieved better performance in the review sentiment analysis task and significantly outperformed the other comparable models. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. Maize Leaf Disease Recognition Based on Improved Convolutional Neural Network ShuffleNetV2.
- Author
-
Zhou, Hanmi, Su, Yumin, Chen, Jiageng, Li, Jichen, Ma, Linshuang, Liu, Xingyi, Lu, Sibo, and Wu, Qi
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,CORN diseases ,CORN ,PRECISION farming ,AGRICULTURAL development - Abstract
The occurrence of maize diseases is frequent but challenging to manage. Traditional identification methods have low accuracy and complex model structures with numerous parameters, making them difficult to implement on mobile devices. To address these challenges, this paper proposes a corn leaf disease recognition model SNMPF based on convolutional neural network ShuffleNetV2. In the down-sampling module of the ShuffleNet model, the max pooling layer replaces the deep convolutional layer to perform down-sampling. This improvement helps to extract key features from images, reduce the overfitting of the model, and improve the model's generalization ability. In addition, to enhance the model's ability to express features in complex backgrounds, the Sim AM attention mechanism was introduced. This mechanism enables the model to adaptively adjust focus and pay more attention to local discriminative features. The results on a maize disease image dataset demonstrate that the SNMPF model achieves a recognition accuracy of 98.40%, representing a 4.1 percentage point improvement over the original model, while its size is only 1.56 MB. Compared with existing convolutional neural network models such as EfficientNet, MobileViT, EfficientNetV2, RegNet, and DenseNet, this model offers higher accuracy and a more compact size. As a result, it can automatically detect and classify maize leaf diseases under natural field conditions, boasting high-precision recognition capabilities. Its accurate identification results provide scientific guidance for preventing corn leaf disease and promote the development of precision agriculture. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. A Method for Identifying the Wear State of Grinding Wheels Based on VMD Denoising and AO-CNN-LSTM.
- Author
-
Xu, Kai and Feng, Dinglu
- Subjects
GRINDING wheels ,CONVOLUTIONAL neural networks ,ACOUSTIC emission ,DEEP learning - Abstract
Monitoring the condition of the grinding wheel in real-time during the grinding process is crucial as it directly impacts the precision and quality of the workpiece. Deep learning technology plays a vital role in analyzing the changes in sensor signals and identifying grinding wheel wear during the grinding process. Therefore, this paper innovatively proposes a grinding wheel wear recognition method based on Variational Mode Decomposition (VMD) denoising and Aquila Optimizer—Convolutional Neural Network—Long Short-Term Memory (AO-CNN-LSTM). The paper utilizes Acoustic Emission (AE) signals generated during grinding to identify the condition of the grinding wheel. To address noise interference, the study introduces the VMD algorithm for denoising the sample dataset, enhancing the effectiveness of neural network training. Subsequently, the dataset is fed into the designed Convolutional Neural Network—Long Short-Term Memory (CNN-LSTM) structure with AO-optimized parameters. Experimental results demonstrate that this method achieves high accuracy and performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. SE-VisionTransformer: Hybrid Network for Diagnosing Sugarcane Leaf Diseases Based on Attention Mechanism.
- Author
-
Sun, Cuimin, Zhou, Xingzhi, Zhang, Menghua, and Qin, An
- Subjects
LEAF anatomy ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,SUGAR plantations ,SUGARCANE ,SUPPORT vector machines ,HEBBIAN memory - Abstract
Sugarcane is an important raw material for sugar and chemical production. However, in recent years, various sugarcane diseases have emerged, severely impacting the national economy. To address the issue of identifying diseases in sugarcane leaf sections, this paper proposes the SE-VIT hybrid network. Unlike traditional methods that directly use models for classification, this paper compares threshold, K-means, and support vector machine (SVM) algorithms for extracting leaf lesions from images. Due to SVM's ability to accurately segment these lesions, it is ultimately selected for the task. The paper introduces the SE attention module into ResNet-18 (CNN), enhancing the learning of inter-channel weights. After the pooling layer, multi-head self-attention (MHSA) is incorporated. Finally, with the inclusion of 2D relative positional encoding, the accuracy is improved by 5.1%, precision by 3.23%, and recall by 5.17%. The SE-VIT hybrid network model achieves an accuracy of 97.26% on the PlantVillage dataset. Additionally, when compared to four existing classical neural network models, SE-VIT demonstrates significantly higher accuracy and precision, reaching 89.57% accuracy. Therefore, the method proposed in this paper can provide technical support for intelligent management of sugarcane plantations and offer insights for addressing plant diseases with limited datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. A Medium- and Long-Term Residential Load Forecasting Method Based on Discrete Cosine Transform-FEDformer.
- Author
-
Li, Dengao, Liu, Qi, Feng, Ding, and Chen, Zhichao
- Subjects
DISCRETE cosine transforms ,DISCRETE Fourier transforms ,ELECTRICITY pricing ,LOAD forecasting (Electric power systems) ,FORECASTING - Abstract
Accurate and reliable medium- and long-term load forecasting is crucial for the rational planning and operation of power systems. However, existing methods often struggle to accurately extract and capture long-term dependencies in load data, leading to poor predictive accuracy. Therefore, this paper proposes a medium- and long-term residential load forecasting method based on FEDformer, aiming to capture long-term temporal dependencies of load data in the frequency domain while considering factors such as electricity prices and temperature, ultimately improving the accuracy of medium- and long-term load forecasting. The proposed model employs Discrete Cosine Transform (DCT) for frequency domain transformation of time-series data to address the Gibbs phenomenon caused by the use of Discrete Fourier Transform (DFT) in FEDformer. Additionally, causal convolution and attention mechanisms are applied in the frequency domain to enhance the model's capability to capture long-term dependencies. The model is evaluated using real-world load data from power systems, and experimental results demonstrate that the proposed model effectively learns the temporal and nonlinear characteristics of load data. Compared to other baseline models, DCTformer improves prediction accuracy by 37.5% in terms of MSE, 26.9% in terms of MAE, and 26.24% in terms of RMSE. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Medical Image Classification with a Hybrid SSM Model Based on CNN and Transformer.
- Author
-
Hu, Can, Cao, Ning, Zhou, Han, and Guo, Bin
- Subjects
CONVOLUTIONAL neural networks ,IMAGE recognition (Computer vision) ,TRANSFORMER models ,FEATURE extraction ,CLASSIFICATION algorithms - Abstract
Medical image classification, a pivotal task for diagnostic accuracy, poses unique challenges due to the intricate and variable nature of medical images compared to their natural counterparts. While Convolutional Neural Networks (CNNs) and Transformers are prevalent in this domain, each architecture has its drawbacks. CNNs, despite their strength in local feature extraction, fall short in capturing global context, whereas Transformers excel at global information but can overlook fine-grained details. The integration of CNNs and Transformers in a hybrid model aims to bridge this gap by enabling simultaneous local and global feature extraction. However, this approach remains constrained in its capacity to model long-range dependencies, thereby hindering the efficient extraction of distant features. To address these issues, we introduce the MambaConvT model, which employs a state-space approach. It begins by locally processing input features through multi-core convolution, enhancing the extraction of deep, discriminative local details. Next, depth-separable convolution with a 2D selective scanning module (SS2D) is employed to maintain a global receptive field and establish long-distance connections, capturing the fine-grained features. The model then combines hybrid features for comprehensive feature extraction, followed by global feature modeling to emphasize on global detail information and optimize feature representation. This paper conducts thorough performance experiments on different algorithms across four publicly available datasets and two private datasets. The results demonstrate that MambaConvT outperforms the latest classification algorithms in terms of accuracy, precision, recall, F1 score, and AUC value ratings, achieving superior performance in the precise classification of medical images. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Detecting Selected Instruments in the Sound Signal.
- Author
-
Kostrzewa, Daniel, Szwajnoch, Paweł, Brzeski, Robert, and Mrozek, Dariusz
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,SOUND recording executives & producers ,INFORMATION retrieval ,DATABASES - Abstract
Detecting instruments in a music signal is often used in database indexing, song annotation, and creating applications for musicians and music producers. Therefore, effective methods that automatically solve this issue need to be created. In this paper, the mentioned task is solved using mel-frequency cepstral coefficients (MFCC) and various architectures of artificial neural networks. The authors' contribution to the development of automatic instrument detection covers the methods used, particularly the neural network architectures and the voting committees created. All these methods were evaluated, and the results are presented and discussed in the paper. The proposed automatic instrument detection methods show that the best classification quality was obtained for an extensive model, which is the so-called committee of voting classifiers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Multi-Directional Long-Term Recurrent Convolutional Network for Road Situation Recognition.
- Author
-
Dofitas Jr., Cyreneo, Gil, Joon-Min, and Byun, Yung-Cheol
- Subjects
ARTIFICIAL neural networks ,RECURRENT neural networks ,PEDESTRIANS ,ROAD safety measures ,CONVOLUTIONAL neural networks ,DEEP learning - Abstract
Understanding road conditions is essential for implementing effective road safety measures and driving solutions. Road situations encompass the day-to-day conditions of roads, including the presence of vehicles and pedestrians. Surveillance cameras strategically placed along streets have been instrumental in monitoring road situations and providing valuable information on pedestrians, moving vehicles, and objects within road environments. However, these video data and information are stored in large volumes, making analysis tedious and time-consuming. Deep learning models are increasingly utilized to monitor vehicles and identify and evaluate road and driving comfort situations. However, the current neural network model requires the recognition of situations using time-series video data. In this paper, we introduced a multi-directional detection model for road situations to uphold high accuracy. Deep learning methods often integrate long short-term memory (LSTM) into long-term recurrent network architectures. This approach effectively combines recurrent neural networks to capture temporal dependencies and convolutional neural networks (CNNs) to extract features from extensive video data. In our proposed method, we form a multi-directional long-term recurrent convolutional network approach with two groups equipped with CNN and two layers of LSTM. Additionally, we compare road situation recognition using convolutional neural networks, long short-term networks, and long-term recurrent convolutional networks. The paper presents a method for detecting and recognizing multi-directional road contexts using a modified LRCN. After balancing the dataset through data augmentation, the number of video files increased, resulting in our model achieving 91% accuracy, a significant improvement from the original dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Integrating Artificial Intelligence to Biomedical Science: New Applications for Innovative Stem Cell Research and Drug Development.
- Author
-
Kim, Minjae and Hong, Sunghoi
- Subjects
CONVOLUTIONAL neural networks ,LIFE sciences ,MEDICAL sciences ,STEM cell research ,DRUG discovery - Abstract
Artificial intelligence (AI) is rapidly advancing, aiming to mimic human cognitive abilities, and is addressing complex medical challenges in the field of biological science. Over the past decade, AI has experienced exponential growth and proven its effectiveness in processing massive datasets and optimizing decision-making. The main content of this review paper emphasizes the active utilization of AI in the field of stem cells. Stem cell therapies use diverse stem cells for drug development, disease modeling, and medical treatment research. However, cultivating and differentiating stem cells, along with demonstrating cell efficacy, require significant time and labor. In this review paper, convolutional neural networks (CNNs) are widely used to overcome these limitations by analyzing stem cell images, predicting cell types and differentiation efficiency, and enhancing therapeutic outcomes. In the biomedical sciences field, AI algorithms are used to automatically screen large compound databases, identify potential molecular structures and characteristics, and evaluate the efficacy and safety of candidate drugs for specific diseases. Also, AI aids in predicting disease occurrence by analyzing patients' genetic data, medical images, and physiological signals, facilitating early diagnosis. The stem cell field also actively utilizes AI. Artificial intelligence has the potential to make significant advances in disease risk prediction, diagnosis, prognosis, and treatment and to reshape the future of healthcare. This review summarizes the applications and advancements of AI technology in fields such as drug development, regenerative medicine, and stem cell research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Rolling Bearing Fault Diagnosis Based on CEEMDAN and CNN-SVM.
- Author
-
Shi, Lei, Liu, Wenchao, You, Dazhang, and Yang, Sheng
- Subjects
ROLLER bearings ,CONVOLUTIONAL neural networks ,FAULT diagnosis ,GREY Wolf Optimizer algorithm ,HILBERT-Huang transform ,SUPPORT vector machines - Abstract
The vibration signals collected by acceleration sensors are interspersed with noise interference, which increases the difficulty of fault diagnosis for rolling bearings. For this reason, a rolling bearing fault diagnosis method based on complete ensemble empirical model decomposition with adaptive noise (CEEMDAN) and improved convolutional neural network (CNN) is proposed. Firstly, the original vibration signal is decomposed into a series of intrinsic modal function (IMF) components using the CEEMDAN algorithm, the components are filtered according to the correlation coefficients and the signals are reconstructed. Secondly, the reconstructed signals are converted into a two-dimensional grey-scale map and input into a convolutional neural network to extract the features. Lastly, the features are inputted into a support vector machine (SVM) with the optimised parameters of the grey wolf optimiser (GWO) to perform the identification and classification. The experimental results show that the rolling bearing fault diagnosis method based on CEEMDAN and CNN-SVM proposed in this paper can significantly reduce the noise interference, and its average fault diagnosis accuracy is as high as 99.25%. Therefore, it is feasible to apply it in the field of rolling bearing fault diagnosis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Ball Screw Fault Diagnosis Based on Wavelet Convolution Transfer Learning.
- Author
-
Xie, Yifan, Liu, Chang, Huang, Liji, and Duan, Hongchun
- Subjects
FAULT diagnosis ,NUMERICAL control of machine tools ,NUTS ,SCREWS ,SENSOR placement - Abstract
The ball screw is the core component of the CNC machine tool feed system, and its health plays an important role in the feed system and even in the entire CNC machine tool. This paper studies the fault diagnosis and health assessment of ball screws. Aiming at the problem that the ball screw signal is weak and susceptible to interference, using a wavelet convolution structure to improve the network can improve the mining ability of signal time domain and frequency domain features; aiming at the challenge of ball screw sensor installation position limitation, a transfer learning method is proposed, which adopts the domain adaptation method as jointly distributed adaptation (JDA), and realizes the transfer diagnosis across measurement positions by extracting the diagnosis knowledge of different positions of the ball screw. In this paper, the adaptive batch normalization algorithm (AdaBN) is introduced to enhance the proposed model so as to improve the accuracy of migration diagnosis. Experiments were carried out using a self-made lead screw fatigue test bench. Through experimental verification, the method proposed in this paper can extract effective fault diagnosis knowledge. By collecting data under different working conditions at the bearing seat of the ball screw, the fault diagnosis knowledge is extracted and used to identify and diagnose the position fault of the nut seat. In this paper, some background noise is added to the collected data to test the robustness of the proposed network model. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. A Global Seawater Density Distribution Model Using a Convolutional Neural Network.
- Author
-
Qin Liu, Liyan Li, Yan Zhou, Shiwen Zhang, Yuliang Liu, and Xinwei Wang
- Abstract
Seawater density is an important physical property in oceanography that affects the accuracy of calculations such as gravity fields and tidal potentials and the calibration of acoustic and optical oceanographic sensors. In related studies, constant density values are frequently used, which can introduce significant errors. Therefore, this study employs a basic convolutional neural network model to construct a comprehensive model showing the seawater density distribution across the globe. The model takes into account depth, latitude, longitude, and month as inputs. Numerous real seawater datasets were used to train the model, and it has been shown that the model has an absolute mean error and root mean square error of less than 1 kg/m
3 in 99% of the test set samples. The model effectively demonstrates the influence of input parameters on the distribution of seawater density. In this paper, we present a newly developed global model for distributing seawater density which is both comprehensive and accurate, surpassing previous models. The utilization of the model presented in this paper for estimating seawater density can minimize errors in theoretical ocean models and serve as a foundation for designing and analyzing ocean exploration systems. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
22. A Refined Wind Power Forecasting Method with High Temporal Resolution Based on Light Convolutional Neural Network Architecture.
- Author
-
Zhang, Fei, Ren, Xiaoying, and Liu, Yongqian
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,WIND forecasting ,WIND power ,COMPUTER input design ,ARCHITECTURAL design - Abstract
With a large proportion of wind farms connected to the power grid, it brings more pressure on the stable operation of power systems in shorter time scales. Efficient and accurate scheduling, operation control and decision making require high time resolution power forecasting algorithms with higher accuracy and real-time performance. In this paper, we innovatively propose a high temporal resolution wind power forecasting method based on a light convolutional architecture—DC_LCNN. The method starts from the source data and novelly designs the dual-channel data input mode to provide different combinations of feature data for the model, thus improving the upper limit of the learning ability of the whole model. The dual-channel convolutional neural network (CNN) structure extracts different spatial and temporal constraints of the input features. The light global maximum pooling method replaces the flat operation combined with the fully connected (FC) forecasting method in the traditional CNN, extracts the most significant features of the global method, and directly performs data downscaling at the same time, which significantly improves the forecasting accuracy and efficiency of the model. In this paper, the experiments are carried out on the 1 s resolution data of the actual wind field, and the single-step forecasting task with 1 s ahead of time and the multi-step forecasting task with 1~10 s ahead of time are executed, respectively. Comparing the experimental results with the classical deep learning models in the current field, the proposed model shows absolute accuracy advantages on both forecasting tasks. This also shows that the light architecture design based on simple deep learning models is also a good solution in performing high time resolution wind power forecasting tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Vulnerable Road User Skeletal Pose Estimation Using mmWave Radars.
- Author
-
Zeng, Zhiyuan, Liang, Xingdong, Li, Yanlei, and Dang, Xiangwei
- Subjects
ROAD users ,TRACKING radar ,RADAR targets ,CONVOLUTIONAL neural networks ,RADAR signal processing ,DATA augmentation - Abstract
A skeletal pose estimation method, named RVRU-Pose, is proposed to estimate the skeletal pose of vulnerable road users based on distributed non-coherent mmWave radar. In view of the limitation that existing methods for skeletal pose estimation are only applicable to small scenes, this paper proposes a strategy that combines radar intensity heatmaps and coordinate heatmaps as input to a deep learning network. In addition, we design a multi-resolution data augmentation and training method suitable for radar to achieve target pose estimation for remote and multi-target application scenarios. Experimental results show that RVRU-Pose can achieve better than 2 cm average localization accuracy for different subjects in different scenarios, which is superior in terms of accuracy and time compared to existing state-of-the-art methods for human skeletal pose estimation with radar. As an essential performance parameter of radar, the impact of angular resolution on the estimation accuracy of a skeletal pose is quantitatively analyzed and evaluated in this paper. Finally, RVRU-Pose has also been extended to the task of estimating the skeletal pose of a cyclist, reflecting the strong scalability of the proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Advancing Facial Expression Recognition in Online Learning Education Using a Homogeneous Ensemble Convolutional Neural Network Approach.
- Author
-
Lawpanom, Rit, Songpan, Wararat, and Kaewyotha, Jakkrit
- Subjects
CONVOLUTIONAL neural networks ,FACIAL expression ,ONLINE education ,DEEP learning ,EMOTIONS ,USER interfaces - Abstract
Facial expression recognition (FER) plays a crucial role in understanding human emotions and is becoming increasingly relevant in educational contexts, where personalized and empathetic interactions are essential. The problems with existing approaches are typically solved using a single deep learning method, which is not robust with complex datasets, such as FER data, which have a characteristic imbalance and multi-class labels. In this research paper, an innovative approach to FER using a homogeneous ensemble convolutional neural network, called HoE-CNN, is presented for future online learning education. This paper aims to transfer the knowledge of models and FER classification using ensembled homogeneous conventional neural network architectures. FER is challenging to research because there are many real-world applications to consider, such as adaptive user interfaces, games, education, and robot integration. HoE-CNN is used to improve the classification performance on an FER dataset, encompassing seven main multi-classes (Angry, Disgust, Fear, Happy, Sad, Surprise, Neutral). The experiment shows that the proposed framework, which uses an ensemble of deep learning models, performs better than a single deep learning model. In summary, the proposed model will increase the efficiency of FER classification results and solve FER2013 at a accuracy of 75.51%, addressing both imbalanced datasets and multi-class classification to transfer the application of the model to online learning applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. An Efficient Dehazing Algorithm Based on the Fusion of Transformer and Convolutional Neural Network.
- Author
-
Xu, Jun, Chen, Zi-Xuan, Luo, Hao, and Lu, Zhe-Ming
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,IMAGE reconstruction ,COMPUTER vision ,ALGORITHMS ,TRANSFORMER models - Abstract
The purpose of image dehazing is to remove the interference from weather factors in degraded images and enhance the clarity and color saturation of images to maximize the restoration of useful features. Single image dehazing is one of the most important tasks in the field of image restoration. In recent years, due to the progress of deep learning, single image dehazing has made great progress. With the success of Transformer in advanced computer vision tasks, some research studies also began to apply Transformer to image dehazing tasks and obtained surprising results. However, both the deconvolution-neural-network-based dehazing algorithm and Transformer based dehazing algorithm magnify their advantages and disadvantages separately. Therefore, this paper proposes a novel Transformer–Convolution fusion dehazing network (TCFDN), which uses Transformer's global modeling ability and convolutional neural network's local modeling ability to improve the dehazing ability. In the Transformer–Convolution fusion dehazing network, the classic self-encoder structure is used. This paper proposes a Transformer–Convolution hybrid layer, which uses an adaptive fusion strategy to make full use of the Swin-Transformer and convolutional neural network to extract and reconstruct image features. On the basis of previous research, this layer further improves the ability of the network to remove haze. A series of contrast experiments and ablation experiments not only proved that the Transformer–Convolution fusion dehazing network proposed in this paper exceeded the more advanced dehazing algorithm, but also provided solid and powerful evidence for the basic theory on which it depends. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
26. Classification of Scientific Documents in the Kazakh Language Using Deep Neural Networks and a Fusion of Images and Text.
- Author
-
Bogdanchikov, Andrey, Ayazbayev, Dauren, and Varlamis, Iraklis
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,IMAGE fusion ,NATURAL language processing ,MACHINE learning ,CORPORA ,KNOWLEDGE graphs - Abstract
The rapid development of natural language processing and deep learning techniques has boosted the performance of related algorithms in several linguistic and text mining tasks. Consequently, applications such as opinion mining, fake news detection or document classification that assign documents to predefined categories have significantly benefited from pre-trained language models, word or sentence embeddings, linguistic corpora, knowledge graphs and other resources that are in abundance for the more popular languages (e.g., English, Chinese, etc.). Less represented languages, such as the Kazakh language, balkan languages, etc., still lack the necessary linguistic resources and thus the performance of the respective methods is still low. In this work, we develop a model that classifies scientific papers written in the Kazakh language using both text and image information and demonstrate that this fusion of information can be beneficial for cases of languages that have limited resources for machine learning models' training. With this fusion, we improve the classification accuracy by 4.4499% compared to the models that use only text or only image information. The successful use of the proposed method in scientific documents' classification paves the way for more complex classification models and more application in other domains such as news classification, sentiment analysis, etc., in the Kazakh language. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
27. A Novel Method for Rolling Bearing Fault Diagnosis Based on Gramian Angular Field and CNN-ViT.
- Author
-
Zhou, Zijun, Ai, Qingsong, Lou, Ping, Hu, Jianmin, and Yan, Junwei
- Subjects
FAULT diagnosis ,CONVOLUTIONAL neural networks ,ROLLER bearings ,TRANSFORMER models ,EDGE computing ,DIAGNOSIS methods - Abstract
Fault diagnosis is one of the important applications of edge computing in the Industrial Internet of Things (IIoT). To address the issue that traditional fault diagnosis methods often struggle to effectively extract fault features, this paper proposes a novel rolling bearing fault diagnosis method that integrates Gramian Angular Field (GAF), Convolutional Neural Network (CNN), and Vision Transformer (ViT). First, GAF is used to convert one-dimensional vibration signals from sensors into two-dimensional images, effectively retaining the fault features of the vibration signal. Then, the CNN branch is used to extract the local features of the image, which are combined with the global features extracted by the ViT branch to diagnose the bearing fault. The effectiveness of this method is validated with two datasets. Experimental results show that the proposed method achieves average accuracies of 99.79% and 99.63% on the CWRU and XJTU-SY rolling bearing fault datasets, respectively. Compared with several widely used fault diagnosis methods, the proposed method achieves higher accuracy for different fault classifications, providing reliable technical support for performing complex fault diagnosis on edge devices. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Forecasting a Short-Term Photovoltaic Power Model Based on Improved Snake Optimization, Convolutional Neural Network, and Bidirectional Long Short-Term Memory Network.
- Author
-
Wang, Yonggang, Yao, Yilin, Zou, Qiuying, Zhao, Kaixing, and Hao, Yue
- Subjects
CONVOLUTIONAL neural networks ,OPTIMIZATION algorithms ,PHOTOVOLTAIC power systems ,PEARSON correlation (Statistics) ,K-means clustering ,SNAKES ,STATISTICAL power analysis - Abstract
The precision of short-term photovoltaic power forecasts is of utmost importance for the planning and operation of the electrical grid system. To enhance the precision of short-term output power prediction in photovoltaic systems, this paper proposes a method integrating K-means clustering: an improved snake optimization algorithm with a convolutional neural network–bidirectional long short-term memory network to predict short-term photovoltaic power. Firstly, K-means clustering is utilized to categorize weather scenarios into three categories: sunny, cloudy, and rainy. The Pearson correlation coefficient method is then utilized to determine the inputs of the model. Secondly, the snake optimization algorithm is improved by introducing Tent chaotic mapping, lens imaging backward learning, and an optimal individual adaptive perturbation strategy to enhance its optimization ability. Then, the multi-strategy improved snake optimization algorithm is employed to optimize the parameters of the convolutional neural network–bidirectional long short-term memory network model, thereby augmenting the predictive precision of the model. Finally, the model established in this paper is utilized to forecast photovoltaic power in diverse weather scenarios. The simulation findings indicate that the regression coefficients of this method can reach 0.99216, 0.95772, and 0.93163 on sunny, cloudy, and rainy days, which has better prediction precision and adaptability under various weather conditions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. Short-Term Load Forecasting of Electric Vehicle Charging Stations Accounting for Multifactor IDBO Hybrid Models.
- Author
-
Tang, Minan, Wang, Changyou, Qiu, Jiandong, Li, Hanting, Guo, Xi, and Sheng, Wenxin
- Subjects
ELECTRIC vehicle charging stations ,CONVOLUTIONAL neural networks ,TRAFFIC estimation ,ELECTRIC vehicles ,OPTIMIZATION algorithms ,STANDARD deviations ,DUNG beetles - Abstract
The charging behavior of electric vehicle users is highly stochastic, which makes the short-term prediction of charging load at electric vehicle charging stations difficult. In this paper, a data-driven hybrid model optimized by the improved dung beetle optimization algorithm (IDBO) is proposed to address the problem of the low accuracy of short-term prediction. Firstly, the charging station data are preprocessed to obtain clear and organized load data, and the input feature matrix is constructed using factors such as temperature, date type, and holidays. Secondly, the optimal CNN-BiLSTM model is constructed using convolutional neural network (CNN) and Bi-directional Long Short-Term Memory (BiLSTM), which realizes the feature extraction of the input matrix and better captures the hidden patterns and regularities in it. Then, methods such as Bernoulli mapping are used to improve the DBO algorithm and its hyperparameters; for example, hidden neurons of the hybrid model are tuned to further improve the model prediction accuracy. Finally, a simulation experiment platform is established based on MATLAB R2023a to validate the example calculations on the historical data of EV charging stations in the public dataset of ANN-DATA, and comparative analyses are carried out. The results show that compared with the traditional models such as CNN, BiLSTM and PSO-CNN-BiLSTM, the coefficient of determination of the model exceeds 0.8921 and the root mean square error is maintained at about 4.413 on both the training and test sets, which proves its effectiveness and stability. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Non-Invasive Prediction of Choledocholithiasis Using 1D Convolutional Neural Networks and Clinical Data.
- Author
-
Mena-Camilo, Enrique, Salazar-Colores, Sebastián, Aceves-Fernández, Marco Antonio, Lozada-Hernández, Edgard Efrén, and Ramos-Arreguín, Juan Manuel
- Subjects
ENDOSCOPIC retrograde cholangiopancreatography ,CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,FISHER discriminant analysis ,GALLSTONES ,MACHINE learning - Abstract
This paper introduces a novel one-dimensional convolutional neural network that utilizes clinical data to accurately detect choledocholithiasis, where gallstones obstruct the common bile duct. Swift and precise detection of this condition is critical to preventing severe complications, such as biliary colic, jaundice, and pancreatitis. This cutting-edge model was rigorously compared with other machine learning methods commonly used in similar problems, such as logistic regression, linear discriminant analysis, and a state-of-the-art random forest, using a dataset derived from endoscopic retrograde cholangiopancreatography scans performed at Olive View–University of California, Los Angeles Medical Center. The one-dimensional convolutional neural network model demonstrated exceptional performance, achieving 90.77% accuracy and 92.86% specificity, with an area under the curve of 0.9270. While the paper acknowledges potential areas for improvement, it emphasizes the effectiveness of the one-dimensional convolutional neural network architecture. The results suggest that this one-dimensional convolutional neural network approach could serve as a plausible alternative to endoscopic retrograde cholangiopancreatography, considering its disadvantages, such as the need for specialized equipment and skilled personnel and the risk of postoperative complications. The potential of the one-dimensional convolutional neural network model to significantly advance the clinical diagnosis of this gallstone-related condition is notable, offering a less invasive, potentially safer, and more accessible alternative. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. Deep-Learning-Based Daytime COT Retrieval and Prediction Method Using FY4A AGRI Data.
- Author
-
Xu, Fanming, Song, Biao, Chen, Jianhua, Guan, Runda, Zhu, Rongjie, Liu, Jiayu, and Qiu, Zhongfeng
- Subjects
CONVOLUTIONAL neural networks ,PREDICTION models ,DEEP learning ,FORECASTING - Abstract
The traditional method for retrieving cloud optical thickness (COT) is carried out through a Look-Up Table (LUT). Researchers must make a series of idealized assumptions and conduct extensive observations and record features in this scenario, consuming considerable resources. The emergence of deep learning effectively addresses the shortcomings of the traditional approach. In this paper, we first propose a daytime (SOZA < 70°) COT retrieval algorithm based on FY-4A AGRI. We establish and train a Convolutional Neural Network (CNN) model for COT retrieval, CM4CR, with the CALIPSO's COT product spatially and temporally synchronized as the ground truth. Then, a deep learning method extended from video prediction models is adopted to predict COT values based on the retrieval results obtained from CM4CR. The COT prediction model (CPM) consists of an encoder, a predictor, and a decoder. On this basis, we further incorporated a time embedding module to enhance the model's ability to learn from irregular time intervals in the input COT sequence. During the training phase, we employed Charbonnier Loss and Edge Loss to enhance the model's capability to represent COT details. Experiments indicate that our CM4CR outperforms existing COT retrieval methods, with predictions showing better performance across several metrics than other benchmark prediction models. Additionally, this paper also investigates the impact of different lengths of COT input sequences and the time intervals between adjacent frames of COT on prediction performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. A Two-Stage Automatic Container Code Recognition Method Considering Environmental Interference.
- Author
-
Yu, Meng, Zhu, Shanglei, Lu, Bao, Chen, Qiang, and Wang, Tengfei
- Subjects
PATTERN recognition systems ,CONVOLUTIONAL neural networks ,RECOGNITION (Psychology) ,COMPUTATIONAL complexity - Abstract
Automatic Container Code Recognition (ACCR) is critical for enhancing the efficiency of container terminals. However, existing ACCR methods frequently fail to achieve satisfactory performance in complex environments at port gates. In this paper, we propose an approach for accurate, fast, and compact container code recognition by utilizing YOLOv4 for container region localization and Deeplabv3+ for character recognition. To enhance the recognition speed and accuracy of YOLOv4 and Deeplabv3+, and to facilitate their deployment at gate entrances, we introduce several improvements. First, we optimize the feature-extraction process of YOLOv4 and Deeplabv3+ to reduce their computational complexity. Second, we enhance the multi-scale recognition and loss functions of YOLOv4 to improve the accuracy and speed of container region localization. Furthermore, we adjust the dilated convolution rates of the ASPP module in Deeplabv3+. Finally, we replace two upsampling structures in the decoder of Deeplabv3+ with transposed convolution upsampling and sub-pixel convolution upsampling. Experimental results on our custom dataset demonstrate that our proposed method, C-YOLOv4, achieves a container region localization accuracy of 99.76% at a speed of 56.7 frames per second (FPS), while C-Deeplabv3+ achieves an average pixel classification accuracy (MPA) of 99.88% and an FPS of 11.4. The overall recognition success rate and recognition speed of our approach are 99.51% and 2.3 ms per frame, respectively. Moreover, C-YOLOv4 and C-Deeplabv3+ outperform existing methods in complex scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Flow Velocity Computation in Solid–Liquid Two-Phase Flow by a Hybrid Network CNN–RKSVM.
- Author
-
Li, Kun, Yue, Shihong, and Liu, Liping
- Subjects
CONVOLUTIONAL neural networks ,SUPPORT vector machines - Abstract
As an advanced detection technique, electrical resistive tomography (ERT) has been applied to detect the solid–liquid two-phase flow velocity based on available ERT measurements. The flow velocity computation by ERT must depend on the relative algorithms, including both the cross-correlation (CC) principle and convolutional neural networks (CNNs). However, these two types of algorithms have poor accuracy and generalization under complex measuring conditions and various flow patterns. To address this issue, in this paper, a hybrid network is proposed that combines a CNN with a reproducing kernel-based support vector machine (RKSVM) technique. The features hidden in ERT measurements are extracted using the CNN, and then the flow velocity is computed by the RKSVM in a high-dimensional feature space. According to the ERT measurements in an actual experimental platform, the results show that the hybrid network has higher accuracy and generalization ability for flow velocity computation compared with the existing CC, RKSVM, and CNN methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Dynamic Electrocardiogram Signal Quality Assessment Method Based on Convolutional Neural Network and Long Short-Term Memory Network.
- Author
-
He, Chen, Wei, Yuxuan, Wei, Yeru, Liu, Qiang, and An, Xiang
- Subjects
CONVOLUTIONAL neural networks ,ARRHYTHMIA ,HEART beat ,ELECTROCARDIOGRAPHY ,DATABASES ,DIAGNOSIS - Abstract
Cardiovascular diseases (CVDs) are highly prevalent, sudden onset, and relatively fatal, posing a significant public health burden. Long-term dynamic electrocardiography, which can continuously record the long-term dynamic ECG activities of individuals in their daily lives, has high research value. However, ECG signals are weak and highly susceptible to external interference, which may lead to false alarms and misdiagnosis, affecting the diagnostic efficiency and the utilization rate of healthcare resources, so research on the quality of dynamic ECG signals is extremely necessary. Aimed at the above problems, this paper proposes a dynamic ECG signal quality assessment method based on CNN and LSTM that divides the signal into three quality categories: the signal of the Q1 category has a lower noise level, which can be used for reliable diagnosis of arrhythmia, etc.; the signal of the Q2 category has a higher noise level, but it still contains information that can be used for heart rate calculation, HRV analysis, etc.; and the signal of the Q3 category has a higher noise level that can interfere with the diagnosis of cardiovascular disease and should be discarded or labeled. In this paper, we use the widely recognized MIT-BIH database, based on which the model is applied to realistically collect exercise experimental data to assess the performance of the model in dealing with real-world situations. The model achieves an accuracy of 98.65% on the test set, a macro-averaged F1 score of 98.5%, and a high F1 score of 99.71% for the prediction of Q3 category signals, which shows that the model has good accuracy and generalization performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. An Interference Mitigation Method for FMCW Radar Based on Time–Frequency Distribution and Dual-Domain Fusion Filtering.
- Author
-
Zhou, Yu, Cao, Ronggang, Zhang, Anqi, and Li, Ping
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,RADIO interference ,RADAR interference ,RADAR ,IMAGE reconstruction ,BISTATIC radar - Abstract
Radio frequency interference (RFI) significantly hampers the target detection performance of frequency-modulated continuous-wave radar. To address the problem and maintain the target echo signal, this paper proposes a priori assumption on the interference component nature in the radar received signal, as well as a method for interference estimation and mitigation via time–frequency analysis. The solution employs Fourier synchrosqueezed transform to implement the radar's beat signal transformation from time domain to time–frequency domain, thus converting the interference mitigation to the task of time–frequency distribution image restoration. The solution proposes the use of image processing based on the dual-tree complex wavelet transform and combines it with the spatial domain-based approach, thereby establishing a dual-domain fusion interference filter for time–frequency distribution images. This paper also presents a convolutional neural network model of structurally improved UNet++, which serves as the interference estimator. The proposed solution demonstrated its capability against various forms of RFI through the simulation experiment and showed a superior interference mitigation performance over other CNN model-based approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Minimally Distorted Adversarial Images with a Step-Adaptive Iterative Fast Gradient Sign Method.
- Author
-
Ding, Ning and Möller, Knut
- Subjects
IMAGE recognition (Computer vision) ,TRAFFIC signs & signals ,CONVOLUTIONAL neural networks ,SURGICAL equipment - Abstract
The safety and robustness of convolutional neural networks (CNNs) have raised increasing concerns, especially in safety-critical areas, such as medical applications. Although CNNs are efficient in image classification, their predictions are often sensitive to minor, for human observers, invisible modifications of the image. Thus, a modified, corrupted image can be visually equal to the legitimate image for humans but fool the CNN and make a wrong prediction. Such modified images are called adversarial images throughout this paper. A popular method to generate adversarial images is backpropagating the loss gradient to modify the input image. Usually, only the direction of the gradient and a given step size were used to determine the perturbations (FGSM, fast gradient sign method), or the FGSM is applied multiple times to craft stronger perturbations that change the model classification (i-FGSM). On the contrary, if the step size is too large, the minimum perturbation of the image may be missed during the gradient search. To seek exact and minimal input images for a classification change, in this paper, we suggest starting the FGSM with a small step size and adapting the step size with iterations. A few decay algorithms were taken from the literature for comparison with a novel approach based on an index tracking the loss status. In total, three tracking functions were applied for comparison. The experiments show our loss adaptive decay algorithms could find adversaries with more than a 90% success rate while generating fewer perturbations to fool the CNNs. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. A Remaining Useful Life Prediction Method of Mechanical Equipment Based on Particle Swarm Optimization-Convolutional Neural Network-Bidirectional Long Short-Term Memory.
- Author
-
Liu, Yong, Liu, Jiaqi, Wang, Han, Yang, Mingshun, Gao, Xinqin, and Li, Shujuan
- Subjects
REMAINING useful life ,DEEP learning ,CONVOLUTIONAL neural networks ,PARTICLE swarm optimization ,RELIABILITY in engineering - Abstract
In industry, forecast prediction and health management (PHM) is used to improve system reliability and efficiency. In PHM, remaining useful life (RUL) prediction plays a key role in preventing machine failures and reducing operating costs, especially for reliability requirements such as critical components in aviation as well as for costly equipment. With the development of deep learning techniques, many RUL prediction methods employ convolutional neural network (CNN) and long short-term memory (LSTM) networks and demonstrate superior performance. In this paper, a novel two-stream network based on a bidirectional long short-term memory neural network (BiLSTM) is proposed to establish a two-stage residual life prediction model for mechanical devices using CNN as the feature extractor and BiLSTM as the timing processor, and finally, a particle swarm optimization (PSO) algorithm is used to adjust and optimize the network structural parameters for the initial data. Under the condition of lack of professional knowledge, the adaptive extraction of the features of the data accumulated by the enterprise and the effective processing of a large amount of timing data are achieved. Comparing the prediction results with other models through examples, it shows that the model established in this paper significantly improves the accuracy and efficiency of equipment remaining life prediction. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Locating and Grading of Lidar-Observed Aircraft Wake Vortex Based on Convolutional Neural Networks.
- Author
-
Zhang, Xinyu, Zhang, Hongwei, Wang, Qichao, Liu, Xiaoying, Liu, Shouxin, Zhang, Rongchuan, Li, Rongzhong, and Wu, Songhua
- Subjects
CONVOLUTIONAL neural networks ,DOPPLER lidar ,AERONAUTICAL safety measures - Abstract
Aircraft wake vortices are serious threats to aviation safety. The Pulsed Coherent Doppler Lidar (PCDL) has been widely used in the observation of aircraft wake vortices due to its advantages of high spatial-temporal resolution and high precision. However, the post-processing algorithms require significant computing resources, which cannot achieve the real-time detection of a wake vortex (WV). This paper presents an improved Convolutional Neural Network (CNN) method for WV locating and grading based on PCDL data to avoid the influence of unstable ambient wind fields on the localization and classification results of WV. Typical WV cases are selected for analysis, and the WV locating and grading models are validated on different test sets. The consistency of the analytical algorithm and the CNN algorithm is verified. The results indicate that the improved CNN method achieves satisfactory recognition accuracy with higher efficiency and better robustness, especially in the case of strong turbulence, where the CNN method recognizes the wake vortex while the analytical method cannot. The improved CNN method is expected to be applied to optimize the current aircraft spacing criteria, which is promising in terms of aviation safety and economic benefit improvement. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Leveraging Bit-Serial Architectures for Hardware-Oriented Deep Learning Accelerators with Column-Buffering Dataflow.
- Author
-
Cheng, Xiaoshu, Wang, Yiwen, Ding, Weiran, Lou, Hongfei, and Li, Ping
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,ARRAY processing - Abstract
Bit-serial neural network accelerators address the growing need for compact and energy-efficient deep learning tools. Traditional neural network accelerators, while effective, often grapple with issues of size, power consumption, and versatility in handling a variety of computational tasks. To counter these challenges, this paper introduces an approach that hinges on the integration of bit-serial processing with advanced dataflow techniques and architectural optimizations. Central to this approach is a column-buffering (CB) dataflow, which significantly reduces access and movement requirements for the input feature map (IFM), thereby enhancing efficiency. Moreover, a simplified quantization process effectively eliminates biases, streamlining the overall computation process. Furthermore, this paper presents a meticulously designed LeNet-5 accelerator leveraging a convolutional layer processing element array (CL PEA) architecture incorporating an improved bit-serial multiply–accumulate unit (MAC). Empirically, our work demonstrates superior performance in terms of frequency, chip area, and power consumption compared to current state-of-the-art ASIC designs. Specifically, our design utilizes fewer hardware resources to implement a complete accelerator, achieving a high performance of 7.87 GOPS on a Xilinx Kintex-7 FPGA with a brief processing time of 284.13 μs. The results affirm that our design is exceptionally suited for applications requiring compact, low-power, and real-time solutions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Bearing Fault Diagnosis Using a Grad-CAM-Based Convolutional Neuro-Fuzzy Network.
- Author
-
Lin, Cheng-Jian and Jhang, Jyun-Yu
- Subjects
- *
MACHINE tools , *ELECTRONIC paper , *DEEP learning , *FAULT diagnosis , *CONVOLUTIONAL neural networks - Abstract
When a machine tool is used for a long time, its bearing experiences wear and failure due to heat and vibration, resulting in damage to the machine tool. In order to make the machine tool stable for processing, this paper proposes a smart bearing diagnosis system (SBDS), which uses a gradient-weighted class activation mapping (Grad-CAM)-based convolutional neuro-fuzzy network (GC-CNFN) to detect the bearing status of the machine tool. The developed GC-CNFN is composed of a convolutional layer and neuro-fuzzy network. The convolutional layer can automatically extract vibration signal features, which are then classified using the neuro-fuzzy network. Moreover, Grad-CAM is used to analyze the attention of the diagnosis model. To verify the performance of bearing fault classification, the 1D CNN (ODCNN) and improved 1D LeNet-5 (I1DLeNet) were adopted to compare with the proposed GC-CNFN. Experimental results showed that the proposed GC-CNFN required fewer parameters (20K), had a shorter average calculation time (117.7 s), and had a higher prediction accuracy (99.88%) in bearing fault classification. The proposed SBDS can not only accurately classify bearing faults, but also help users understand the current status of the machine tool. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
41. Analyzing Trends in Medical Imaging Using Intelligent Photonics †.
- Author
-
Sharma, Sunil, Das, Sandip, and Tharani, Lokesh
- Subjects
PHOTONIC crystal fibers ,DIAGNOSTIC imaging ,ARTIFICIAL intelligence ,TUMOR diagnosis ,FINITE element method - Abstract
The integration of photonics and artificial intelligence (AI) has led to the emergence of intelligent photonics, which offers significant advancements in medical imaging. In this paper, a Photonic Crystal Fiber (PCF)-based sensor is presented for tumor detection. The finite element method is used to simulate the proposed sensor. By varying the geometrical parameters of the proposed sensor, an optimized sensor is proposed. Meanwhile, the latest AI techniques used in medical imaging, such as deep learning (DL) and convolutional neural networks (CNN), are also analyzed to improve upon the ability of the sensor. This paper highlights the potential of intelligent photonics in improving efficiency, sensitivity, specificity and accuracy of medical imaging, particularly in the areas of tumor detection and treatment. The results show that DL has an efficiency of 95%, and CNN has shown an accuracy of 98%. Additionally, this paper discusses the challenges and limitations that need to be addressed in order to fully realize the potential of these technologies. This paper demonstrates that the integration of photonics and AI has great potential to revolutionize medical imaging. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
42. A Diagnostic Method for the Saturable Reactor Core Looseness Degree of Thyristor Converter Valves.
- Author
-
Zheng, Lin, Wei, Xiaoguang, Sun, Tianshu, and Zhang, Xiaolong
- Subjects
CONVOLUTIONAL neural networks ,NUCLEAR reactor cores ,THYRISTORS ,ROLLER bearings ,WAVELET transforms ,VALVES ,IRON - Abstract
During the long-term operation of thyristor converter valves, the saturable reactor vibration (mainly caused by magnetostriction) will lead to core looseness faults. In order to accurately evaluate the fault degradation degree, this paper proposes a vibration signal recognition model for iron core looseness based on synchrosqueezed wavelet transforms and a convolutional neural network. Firstly, vibration experiments are conducted on saturable reactors to obtain signals under different core looseness degrees. Then, the spectrogram features of vibration signals are extracted using synchrosqueezed wavelet transform. Finally, based on the high-dimensional learning ability of convolutional neural networks, the fault characteristics of the spectrogram are mined to accurately identify the core looseness degree. The research results indicate that the model in the paper has higher recognition accuracy than some other methods, which provides convenience for the monitoring and maintenance of saturable reactors. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
43. A Deep Neural Network-Based Optimal Scheduling Decision-Making Method for Microgrids.
- Author
-
Chen, Fei, Wang, Zhiyang, and He, Yu
- Subjects
MICROGRIDS ,CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,MIXED integer linear programming ,MATHEMATICAL optimization ,DECISION making - Abstract
With the rapid growth in the proportion of renewable energy access and the structural complexity of distributed energy systems, traditional microgrid (MG) scheduling methods that rely on mathematical optimization models and expert experience are facing significant challenges. Therefore, it is essential to present a novel scheduling technique with high intelligence and fast decision-making capacity to realize MGs' automatic operation and regulation. This paper proposes an optimal scheduling decision-making method for MGs based on deep neural networks (DNN). Firstly, a typical mathematical scheduling model used for MG operation is introduced, and the limitations of current methods are analyzed. Then, a two-stage optimal scheduling framework comprising day-ahead and intra-day stages is presented. The day-ahead part is solved by mixed integer linear programming (MILP), and the intra-day part uses a convolutional neural network (CNN)—bidirectional long short-term memory (Bi LSTM) for high-speed rolling decision making, with the outputs adjusted by a power correction balance algorithm. Finally, the validity of the model and algorithm of this paper are verified by arithmetic case analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. Black Ice Classification with Hyperspectral Imaging and Deep Learning.
- Author
-
Bhattacharyya, Chaitali and Kim, Sungho
- Subjects
DEEP learning ,CONVOLUTIONAL neural networks ,IMAGE recognition (Computer vision) ,ICE ,PRINCIPAL components analysis - Abstract
With the development of new technologies inside car mechanisms with various sensors connected to the IoT, a new generation of automation is attracting attention. However, there are still some factors that are difficult to detect. Among them, one of the highest risk factors is black ice. A road covered with black ice, which is hard to see from a distance, is not only the cause of damage to vehicles passing over the spot, but it also puts lives at risk. Hence, the detection of black ice is essential. A lot of research has been done on this topic with various sensors and methods. However, hyperspectral imaging has not been used for this particular purpose. Therefore, in this paper, black ice classification has been performed with the help of hyperspectral imaging in collaboration with a deep learning model for the first time. With abundant spectral and spatial information, hyperspectral imaging is a good way to analyze any material. In this paper, a 2D–3D Convolutional Neural Network (CNN) has been used to classify hyperspectral images of black ice. The spectral data were preprocessed, and the dimension of the image cube was reduced with the help of Principal Component Analysis (PCA). The proposed method was then compared with the existing method for better evaluation. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
45. Artificial Intelligence in Photovoltaic Fault Identification and Diagnosis: A Systematic Review.
- Author
-
Islam, Mahmudul, Rashel, Masud Rana, Ahmed, Md Tofael, Islam, A. K. M. Kamrul, and Tlemçani, Mouhaydine
- Subjects
ARTIFICIAL intelligence ,MACHINE learning ,DEEP learning ,CONVOLUTIONAL neural networks ,PROCESS capability ,PHOTOVOLTAIC power systems - Abstract
Photovoltaic (PV) fault detection is crucial because undetected PV faults can lead to significant energy losses, with some cases experiencing losses of up to 10%. The efficiency of PV systems depends upon the reliable detection and diagnosis of faults. The integration of Artificial Intelligence (AI) techniques has been a growing trend in addressing these issues. The goal of this systematic review is to offer a comprehensive overview of the recent advancements in AI-based methodologies for PV fault detection, consolidating the key findings from 31 research papers. An initial pool of 142 papers were identified, from which 31 were selected for in-depth review following the PRISMA guidelines. The title, objective, methods, and findings of each paper were analyzed, with a focus on machine learning (ML) and deep learning (DL) approaches. ML and DL are particularly suitable for PV fault detection because of their capacity to process and analyze large amounts of data to identify complex patterns and anomalies. This study identified several AI techniques used for fault detection in PV systems, ranging from classical ML methods like k-nearest neighbor (KNN) and random forest to more advanced deep learning models such as Convolutional Neural Networks (CNNs). Quantum circuits and infrared imagery were also explored as potential solutions. The analysis found that DL models, in general, outperformed traditional ML models in accuracy and efficiency. This study shows that AI methodologies have evolved and been increasingly applied in PV fault detection. The integration of AI in PV fault detection offers high accuracy and effectiveness. After reviewing these studies, we proposed an Artificial Neural Network (ANN)-based method for PV fault detection and classification. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
46. Environmental Sound Classification Framework Based on L-mHP Features and SE-ResNet50 Network Model.
- Author
-
Huang, Mengxiang, Wang, Mei, Liu, Xin, Kan, Ruixiang, and Qiu, Hongbing
- Subjects
CONVOLUTIONAL neural networks ,SPECTROGRAMS ,SIGNAL-to-noise ratio - Abstract
Environmental sound classification (ESC) tasks are attracting more and more attention. Due to the complexity of the scene and personnel mobility, there are some difficulties in understanding and generating environmental sound models for ESC tasks. To address these key issues, this paper proposes an audio classification framework based on L-mHP features and the SE-ResNet50 model and improves a dual-channel data enhancement scheme based on a symmetric structure for model training. Firstly, this paper proposes the L-mHP feature to characterize environmental sound. The L-mHP feature is a three-channel feature consisting of a Log-Mel spectrogram, a harmonic spectrogram, and a percussive spectrogram. The harmonic spectrogram and percussive spectrogram can be obtained by harmonic percussive source separation (HPSS) of a Log-Mel spectrogram. Then, an improved audio classification model SE-ResNet50 is proposed based on the ResNet-50 model. In this paper, a dual-channel data enhancement scheme based on a symmetric structure is promoted, which not only makes the audio variants more diversified, but also makes the model focus on learning the time–frequency mode in the acoustic features during the training process, so as to improve the generalization performance of the model. Finally, the audio classification experiment of the framework is carried out on public datasets. An experimental accuracy of 94.92%, 99.67%, and 90.75% was obtained on ESC-50, ESC-10 and UrbanSound8K datasest, respectively. In order to simulate the classification performance of the framework in the actual environment, the framework was also evaluated on a self-made sound dataset with different signal-to-noise ratios. The experimental results show that the proposed audio classification framework has good robustness and feasibility. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
47. Motor On-Line Fault Diagnosis Method Research Based on 1D-CNN and Multi-Sensor Information.
- Author
-
Gu, Yufeng, Zhang, Yongji, Yang, Mingrui, and Li, Chengshan
- Subjects
DEEP learning ,FAULT diagnosis ,DIAGNOSIS methods ,CONVOLUTIONAL neural networks ,MULTISENSOR data fusion ,FEATURE extraction ,MOTORS - Abstract
The motor is the primary impetus source of most mechanical equipment, and its failure will cause substantial economic losses and safety problems. Therefore, it is necessary to study online fault diagnosis techniques for motors, given the problems caused by shallow learning models or single-sensor fault analysis in previous motor fault diagnosis techniques, such as blurred fault features, inaccurate identification, and time and manpower consumption. In this paper, we proposed a model for motor fault diagnosis based on deep learning and multi-sensor information fusion. Firstly, a correlation adaptive weighting method is proposed in this paper, and it is used to integrate the collected multi-source homogeneous sensor information into multi-source heterogeneous sensor information through the data layer fusion. Secondly, the 1D-CNN is used to carry out feature extraction, feature layer fusion, and fault classification of multi-source heterogeneous information of the motor. Finally, the data of seven states (one healthy and six faulty) of the motor are collected by the motor drive test bench to realize the model's training, testing, and verification. The experimental results show that the fault diagnosis accuracy of the model is 99.3%. Thus, this method has important practical implications for improving the accuracy of motor fault diagnosis further. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
48. CSK-CNN: Network Intrusion Detection Model Based on Two-Layer Convolution Neural Network for Handling Imbalanced Dataset.
- Author
-
Song, Jiaming, Wang, Xiaojuan, He, Mingshu, and Jin, Lei
- Subjects
INTRUSION detection systems (Computer security) ,CONVOLUTIONAL neural networks ,K-means clustering ,MULTICASTING (Computer networks) ,COMPUTER networks ,RECEIVER operating characteristic curves - Abstract
In computer networks, Network Intrusion Detection System (NIDS) plays a very important role in identifying intrusion behaviors. NIDS can identify abnormal behaviors by analyzing network traffic. However, the performance of classifier is not very good in identifying abnormal traffic for minority classes. In order to improve the detection rate on class imbalanced dataset, we propose a network intrusion detection model based on two-layer CNN and Cluster-SMOTE + K-means algorithm (CSK-CNN) to process imbalanced dataset. CSK combines the cluster based Synthetic Minority Over Sampling Technique (Cluster-SMOTE) and K-means based under sampling algorithm. Through the two-layer network, abnormal traffic can not only be identified, but also be classified into specific attack types. This paper has been verified on UNSW-NB15 dataset and CICIDS2017 dataset, and the performance of the proposed model has been evaluated using such indicators as accuracy, recall, precision, F1-score, ROC curve, AUC value, training time and testing time. The experiment shows that the proposed CSK-CNN in this paper is obviously superior to other comparison algorithms in terms of network intrusion detection performance, and is suitable for deployment in the real network environment. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
49. Integrated Damage Location Diagnosis of Frame Structure Based on Convolutional Neural Network with Inception Module.
- Author
-
Ren, Jianhua, Cai, Chaozhi, Chi, Yaolei, and Xue, Yingfang
- Subjects
CONVOLUTIONAL neural networks ,STRUCTURAL frames ,FAULT diagnosis ,STEEL framing ,DIAGNOSIS methods - Abstract
Accurate damage location diagnosis of frame structures is of great significance to the judgment of damage degree and subsequent maintenance of frame structures. However, the similarity characteristics of vibration data at different damage locations and noise interference bring great challenges. In order to overcome the above problems and realize accurate damage location diagnosis of the frame structure, the existing convolutional neural network with training interference (TICNN) is improved in this paper, and a high-precision neural network model named convolutional neural network based on Inception (BICNN) for fault diagnosis with strong anti-noise ability is proposed by adding the Inception module to TICNN. In order to effectively avoid the overall misjudgment problem caused by using single sensor data for damage location diagnosis, an integrated damage location diagnosis method is proposed. Taking the four-story steel frame model of the University of British Columbia as the research object, the method proposed in this paper is tested and compared with other methods. The experimental results show that the diagnosis accuracy of the proposed method is 97.38%, which is higher than other methods; at the same time, it has greater advantages in noise resistance. Therefore, the method proposed in this paper not only has high accuracy, but also has strong anti-noise ability, which can solve the problem of accurate damage location diagnosis of complex frame structures under a strong noise environment. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
50. Using Traffic Sensors in Smart Cities to Enhance a Spatio-Temporal Deep Learning Model for COVID-19 Forecasting.
- Author
-
Muñoz-Organero, Mario
- Subjects
VEHICLE detectors ,INTELLIGENT sensors ,SMART cities ,DEEP learning ,COVID-19 pandemic ,COMPUTER network traffic ,MACHINE learning - Abstract
Respiratory viruses, such as COVID-19, are spread over time and space based on human-to-human interactions. Human mobility plays a key role in the propagation of the virus. Different types of sensors in smart cities are able to continuously monitor traffic-related human mobility, showing the impact of COVID-19 on traffic volumes and patterns. In a similar way, traffic volumes measured by smart traffic sensors provide a proxy variable to capture human mobility, which is expected to have an impact on new COVID-19 infections. Adding traffic data from smart city sensors to machine learning models designed to estimate upcoming COVID-19 incidence values should provide optimized results compared to models based on COVID-19 data alone. This paper proposes a novel model to extract spatio-temporal patterns in the spread of the COVID-19 virus for short-term predictions by organizing COVID-19 incidence and traffic data as interrelated temporal sequences of spatial images. The model is trained and validated with real data from the city of Madrid in Spain for 84 weeks, combining information from 4372 traffic measuring points and 143 COVID-19 PCR test centers. The results are compared with a baseline model designed for the extraction of spatio-temporal patterns from COVID-19-only sequences of images, showing that using traffic information enhances the results when forecasting a new wave of infections (MSE values are reduced by a 70% factor). The information that traffic data has on the spread of the COVID-19 virus is also analyzed, showing that traffic data alone is not sufficient for accurate COVID-19 forecasting. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.