756 results
Search Results
2. Maize Leaf Disease Recognition Based on Improved Convolutional Neural Network ShuffleNetV2.
- Author
-
Zhou, Hanmi, Su, Yumin, Chen, Jiageng, Li, Jichen, Ma, Linshuang, Liu, Xingyi, Lu, Sibo, and Wu, Qi
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,CORN diseases ,CORN ,PRECISION farming ,AGRICULTURAL development - Abstract
The occurrence of maize diseases is frequent but challenging to manage. Traditional identification methods have low accuracy and complex model structures with numerous parameters, making them difficult to implement on mobile devices. To address these challenges, this paper proposes a corn leaf disease recognition model SNMPF based on convolutional neural network ShuffleNetV2. In the down-sampling module of the ShuffleNet model, the max pooling layer replaces the deep convolutional layer to perform down-sampling. This improvement helps to extract key features from images, reduce the overfitting of the model, and improve the model's generalization ability. In addition, to enhance the model's ability to express features in complex backgrounds, the Sim AM attention mechanism was introduced. This mechanism enables the model to adaptively adjust focus and pay more attention to local discriminative features. The results on a maize disease image dataset demonstrate that the SNMPF model achieves a recognition accuracy of 98.40%, representing a 4.1 percentage point improvement over the original model, while its size is only 1.56 MB. Compared with existing convolutional neural network models such as EfficientNet, MobileViT, EfficientNetV2, RegNet, and DenseNet, this model offers higher accuracy and a more compact size. As a result, it can automatically detect and classify maize leaf diseases under natural field conditions, boasting high-precision recognition capabilities. Its accurate identification results provide scientific guidance for preventing corn leaf disease and promote the development of precision agriculture. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Closing Editorial for Computer Vision and Pattern Recognition Based on Deep Learning.
- Author
-
Yuan, Hui
- Subjects
PATTERN recognition systems ,VIDEO compression ,DEEP learning ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,OBJECT recognition (Computer vision) ,IMAGE recognition (Computer vision) - Abstract
This document is the closing editorial for a special issue of the journal Applied Sciences on computer vision and pattern recognition based on deep learning. The issue includes 31 papers covering various topics such as image and video processing, object detection, object and scene recognition, visual application technologies, classification, segmentation, compression, and more. Each paper explores different methods and techniques to improve the performance and accuracy of deep learning models in these areas. The document provides a brief summary of each paper, highlighting their contributions and findings. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
4. Editorial for the Special Issue "Machine Learning in Computer Vision and Image Sensing: Theory and Applications".
- Author
-
Chakraborty, Subrata and Pradhan, Biswajeet
- Subjects
COMPUTER vision ,MACHINE learning ,ARTIFICIAL neural networks ,DEEP learning ,CONVOLUTIONAL neural networks ,SIGNAL processing ,GAIT in humans - Abstract
This document is an editorial for a special issue titled "Machine Learning in Computer Vision and Image Sensing: Theory and Applications." The editorial highlights the diverse applications of machine learning (ML) models in various domains such as medical imaging, signal processing, remote sensing, and human activity detection. The special issue includes 11 papers that cover topics such as image segmentation, fluvial navigation, Alzheimer's disease classification, pneumothorax detection, lung cancer malignancy prediction, amniotic fluid volume detection, COVID-19 detection, and Parkinson's disease detection. The papers showcase the progress and potential of ML models in computer vision applications and provide valuable insights for future research. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
5. Special Issue: Machine Learning and Data Analysis.
- Author
-
Michalak, Marcin
- Subjects
DEEP learning ,CONVOLUTIONAL neural networks ,PATTERN recognition systems ,DATA analysis ,ARTIFICIAL neural networks ,CREDIT card fraud ,MACHINE learning - Abstract
This Special Issue contains 2 reviews and 17 research papers related to the following topics: Time series forecasting [[1], [3], [5]]; Image analysis [[6]]; Medical applications [[7]]; Knowledge graph analysis [[9]]; Cybersecurity [[11], [13]]; Traffic analysis [[14]]; Agriculture [[16]]; Environmental data analysis [[17]]. In [[2]], a time series analysis was applied in a different manner: their prediction of the high stock dividend (HSD) was based on a sequence of typical machine learning approaches instead of state-of-the-art methods such as ARIMA or SMA. The authors of [[1]] focused on short time series forecasting in the domain of crime data (thefts, shoplifting, vehicular crimes, and burglaries in Mexico). In this paper, the authors attempt to answer the questions what is air traffic complexity and which air traffic data variables have greater impacts on increases in complexity?. [Extracted from the article]
- Published
- 2023
- Full Text
- View/download PDF
6. Deep Time Series Forecasting Models: A Comprehensive Survey.
- Author
-
Liu, Xinhe and Wang, Wenmin
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,TIME series analysis ,CONVOLUTIONAL neural networks ,ARTIFICIAL intelligence ,LANGUAGE models - Abstract
Deep learning, a crucial technique for achieving artificial intelligence (AI), has been successfully applied in many fields. The gradual application of the latest architectures of deep learning in the field of time series forecasting (TSF), such as Transformers, has shown excellent performance and results compared to traditional statistical methods. These applications are widely present in academia and in our daily lives, covering many areas including forecasting electricity consumption in power systems, meteorological rainfall, traffic flow, quantitative trading, risk control in finance, sales operations and price predictions for commercial companies, and pandemic prediction in the medical field. Deep learning-based TSF tasks stand out as one of the most valuable AI scenarios for research, playing an important role in explaining complex real-world phenomena. However, deep learning models still face challenges: they need to deal with the challenge of large-scale data in the information age, achieve longer forecasting ranges, reduce excessively high computational complexity, etc. Therefore, novel methods and more effective solutions are essential. In this paper, we review the latest developments in deep learning for TSF. We begin by introducing the recent development trends in the field of TSF and then propose a new taxonomy from the perspective of deep neural network models, comprehensively covering articles published over the past five years. We also organize commonly used experimental evaluation metrics and datasets. Finally, we point out current issues with the existing solutions and suggest promising future directions in the field of deep learning combined with TSF. This paper is the most comprehensive review related to TSF in recent years and will provide a detailed index for researchers in this field and those who are just starting out. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Mapping Method of Human Arm Motion Based on Surface Electromyography Signals.
- Author
-
Zheng, Yuanyuan, Zheng, Gang, Zhang, Hanqi, Zhao, Bochen, and Sun, Peng
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,CONVOLUTIONAL neural networks ,DEEP learning ,SENSOR placement ,ARM ,FINGER joint - Abstract
This paper investigates a method for precise mapping of human arm movements using sEMG signals. A multi-channel approach captures the sEMG signals, which, combined with the accurately calculated joint angles from an Inertial Measurement Unit, allows for action recognition and mapping through deep learning algorithms. Firstly, signal acquisition and processing were carried out, which involved acquiring data from various movements (hand gestures, single-degree-of-freedom joint movements, and continuous joint actions) and sensor placement. Then, interference signals were filtered out through filters, and the signals were preprocessed using normalization and moving averages to obtain sEMG signals with obvious features. Additionally, this paper constructs a hybrid network model, combining Convolutional Neural Networks and Artificial Neural Networks, and employs a multi-feature fusion algorithm to enhance the accuracy of gesture recognition. Furthermore, a nonlinear fitting between sEMG signals and joint angles was established based on a backpropagation neural network, incorporating momentum term and adaptive learning rate adjustments. Finally, based on the gesture recognition and joint angle prediction model, prosthetic arm control experiments were conducted, achieving highly accurate arm movement prediction and execution. This paper not only validates the potential application of sEMG signals in the precise control of robotic arms but also lays a solid foundation for the development of more intuitive and responsive prostheses and assistive devices. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Deep Learning-Based Design Method for Acoustic Metasurface Dual-Feature Fusion.
- Author
-
Lv, Qiang, Zhao, Huanlong, Huang, Zhen, Hao, Guoqiang, and Chen, Wei
- Subjects
DEEP learning ,CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,SOUND design ,ACOUSTIC field ,GENETIC algorithms - Abstract
Existing research in metasurface design was based on trial-and-error high-intensity iterations and requires deep acoustic expertise from the researcher, which severely hampered the development of the metasurface field. Using deep learning enabled the fast and accurate design of hypersurfaces. Based on this, in this paper, an integrated learning approach was first utilized to construct a model of the forward mapping relationship between the hypersurface physical structure parameters and the acoustic field, which was intended to be used for data enhancement. Then a dual-feature fusion model (DFCNN) based on a convolutional neural network was proposed, in which the first feature was the high-dimensional nonlinear features extracted using a data-driven approach, and the second feature was the physical feature information of the acoustic field mined using the model. A convolutional neural network was used for feature fusion. A genetic algorithm was used for network parameter optimization. Finally, generalization ability verification was performed to prove the validity of the network model. The results showed that 90% of the integrated learning models had an error of less than 3 dB between the real and predicted sound field data, and 93% of the DFCNN models could achieve an error of less than 5 dB in the local sound field intensity. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Machine Learning Applications in Surface Transportation Systems: A Literature Review.
- Author
-
Behrooz, Hojat and Hayeri, Yeganeh M.
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,INTELLIGENT transportation systems ,LITERATURE reviews ,CONVOLUTIONAL neural networks ,DEEP learning ,SUPPORT vector machines - Abstract
Surface transportation has evolved through technology advancements using parallel knowledge areas such as machine learning (ML). However, the transportation industry has not yet taken full advantage of ML. To evaluate this gap, we utilized a literature review approach to locate, categorize, and synthesize the principal concepts of research papers regarding surface transportation systems using ML algorithms, and we then decomposed them into their fundamental elements. We explored more than 100 articles, literature review papers, and books. The results show that 74% of the papers concentrate on forecasting, while multilayer perceptions, long short-term memory, random forest, supporting vector machine, XGBoost, and deep convolutional neural networks are the most preferred ML algorithms. However, sophisticated ML algorithms have been minimally used. The root-cause analysis revealed a lack of effective collaboration between the ML and transportation experts, resulting in the most accessible transportation applications being used as a case study to test or enhance a given ML algorithm and not necessarily to enhance a mobility or safety issue. Additionally, the transportation community does not define transportation issues clearly and does not provide publicly available transportation datasets. The transportation sector must offer an open-source platform to showcase the sector's concerns and build spatiotemporal datasets for ML experts to accelerate technology advancements. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
10. An Unsupervised Character Recognition Method for Tibetan Historical Document Images Based on Deep Learning.
- Author
-
Wang, Xiaojuan and Wang, Weilan
- Subjects
DEEP learning ,PATTERN recognition systems ,ARTIFICIAL neural networks ,HISTORICAL source material ,CONVOLUTIONAL neural networks ,TIBETANS - Abstract
As there is a lack of public mark samples of Tibetan historical document image characters at present, this paper proposes an unsupervised Tibetan historical document character recognition method based on deep learning (UD-CNN). Firstly, using the Tibetan historical document character component, the Tibetan historical document character sample data set is constructed for model-aided training. Then, the character baseline information is introduced, and a fine-grained feature learning strategy is proposed. For the samples above and below the baseline, the Up-CNN recognition model and Down-CNN recognition model are established. The convolution neural network model is trained and optimized for the samples above and below the baseline, respectively, to improve the recognition accuracy. The experimental results show that the proposed method obviously affects the unmarked character classification and recognition of real Tibetan historical document images. The recognition rate of Top5 can reach 92.94%, and the recognition rate of Top1 can be increased from 82.25% to 87.27% using the CNN model only. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Advances in Facial Expression Recognition: A Survey of Methods, Benchmarks, Models, and Datasets.
- Author
-
Kopalidis, Thomas, Solachidis, Vassilios, Vretos, Nicholas, and Daras, Petros
- Subjects
DEEP learning ,FACIAL expression ,ARTIFICIAL neural networks ,COMPUTER vision ,CONVOLUTIONAL neural networks ,FEATURE extraction - Abstract
Recent technological developments have enabled computers to identify and categorize facial expressions to determine a person's emotional state in an image or a video. This process, called "Facial Expression Recognition (FER)", has become one of the most popular research areas in computer vision. In recent times, deep FER systems have primarily concentrated on addressing two significant challenges: the problem of overfitting due to limited training data availability, and the presence of expression-unrelated variations, including illumination, head pose, image resolution, and identity bias. In this paper, a comprehensive survey is provided on deep FER, encompassing algorithms and datasets that offer insights into these intrinsic problems. Initially, this paper presents a detailed timeline showcasing the evolution of methods and datasets in deep facial expression recognition (FER). This timeline illustrates the progression and development of the techniques and data resources used in FER. Then, a comprehensive review of FER methods is introduced, including the basic principles of FER (components such as preprocessing, feature extraction and classification, and methods, etc.) from the pro-deep learning era (traditional methods using handcrafted features, i.e., SVM and HOG, etc.) to the deep learning era. Moreover, a brief introduction is provided related to the benchmark datasets (there are two categories: controlled environments (lab) and uncontrolled environments (in the wild)) used to evaluate different FER methods and a comparison of different FER models. Existing deep neural networks and related training strategies designed for FER, based on static images and dynamic image sequences, are discussed. The remaining challenges and corresponding opportunities in FER and the future directions for designing robust deep FER systems are also pinpointed. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Object Detection for Industrial Applications: Training Strategies for AI-Based Depalletizer.
- Author
-
Buongiorno, Domenico, Caramia, Donato, Di Ruscio, Luca, Longo, Nicola, Panicucci, Simone, Di Stefano, Giovanni, Bevilacqua, Vitoantonio, and Brunetti, Antonio
- Subjects
OBJECT recognition (Computer vision) ,ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,STOCK-keeping unit ,CONVOLUTIONAL neural networks ,STEREO vision (Computer science) - Abstract
In the last 10 years, the demand for robot-based depalletization systems has constantly increased due to the growth of sectors such as logistics, storage, and supply chains. Since the scenarios are becoming more and more unstructured, characterized by unknown pallet layouts and stock-keeping unit shapes, the classical depalletization systems based on the knowledge of predefined positions within the pallet frame are going to be substituted by innovative and robust solutions based on 2D/3D vision and Deep Learning (DL) methods. In particular, the Convolutional Neural Networks (CNNs) are deep networks that have proven to be effective in processing 2D/3D images, for example in the automatic object detection task, and robust to the possible variability among the data. However, deep neural networks need a big amount of data to be trained. In this context, whenever deep networks are involved in object detection for supporting depalletization systems, the dataset collection represents one of the main bottlenecks during the commissioning phase. The present work aims at comparing different training strategies to customize an object detection model aiming at minimizing the number of images required for model fitting, while ensuring reliable and robust performances. Different approaches based on a CNN for object detection are proposed, evaluated, and compared in terms of the F1-score. The study was conducted considering different starting conditions in terms of the neural network's weights, the datasets, and the training set sizes. The proposed approaches were evaluated on the detection of different kinds of paper boxes placed on an industrial pallet. The outcome of the work validates that the best strategy is based on fine-tuning of a CNN-based model already trained on the detection of paper boxes, with a median F1-score greater than 85.0 % . [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
13. Deep Learning Techniques for Radar-Based Continuous Human Activity Recognition †.
- Author
-
Mehta, Ruchita, Sharifzadeh, Sara, Palade, Vasile, Tan, Bo, Daneshkhah, Alireza, and Karayaneva, Yordanka
- Subjects
DEEP learning ,HUMAN activity recognition ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,FEATURE extraction ,DOPPLER radar - Abstract
Human capability to perform routine tasks declines with age and age-related problems. Remote human activity recognition (HAR) is beneficial for regular monitoring of the elderly population. This paper addresses the problem of the continuous detection of daily human activities using a mm-wave Doppler radar. In this study, two strategies have been employed: the first method uses un-equalized series of activities, whereas the second method utilizes a gradient-based strategy for equalization of the series of activities. The dynamic time warping (DTW) algorithm and Long Short-term Memory (LSTM) techniques have been implemented for the classification of un-equalized and equalized series of activities, respectively. The input for DTW was provided using three strategies. The first approach uses the pixel-level data of frames (UnSup-PLevel). In the other two strategies, a convolutional variational autoencoder (CVAE) is used to extract Un-Supervised Encoded features (UnSup-EnLevel) and Supervised Encoded features (Sup-EnLevel) from the series of Doppler frames. The second approach for equalized data series involves the application of four distinct feature extraction methods: i.e., convolutional neural networks (CNN), supervised and unsupervised CVAE, and principal component Analysis (PCA). The extracted features were considered as an input to the LSTM. This paper presents a comparative analysis of a novel supervised feature extraction pipeline, employing Sup-ENLevel-DTW and Sup-EnLevel-LSTM, against several state-of-the-art unsupervised methods, including UnSUp-EnLevel-DTW, UnSup-EnLevel-LSTM, CNN-LSTM, and PCA-LSTM. The results demonstrate the superiority of the Sup-EnLevel-LSTM strategy. However, the UnSup-PLevel strategy worked surprisingly well without using annotations and frame equalization. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. Power Equipment Fault Diagnosis Method Based on Energy Spectrogram and Deep Learning.
- Author
-
Liu, Yiyang, Li, Fei, Guan, Qingbo, Zhao, Yang, and Yan, Shuaihua
- Subjects
DEEP learning ,FAULT diagnosis ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DIAGNOSIS methods ,ROLLER bearings - Abstract
With the development of industrial manufacturing intelligence, the role of rotating machinery in industrial production and life is more and more important. Aiming at the problems of the complex and changeable working environment of rolling bearings and limited computing ability, fault feature information cannot be effectively extracted, and the current deep learning model is difficult to be compatible with lightweight and high efficiency. Therefore, this paper proposes a fault detection method for power equipment based on an energy spectrum diagram and deep learning. Firstly, a novel two-dimensional time-frequency feature representation method and energy spectrum feature map based on wavelet packet transform is proposed, and an energy spectrum feature map dataset is made for subsequent diagnosis. This method can realize multi-resolution analysis, fully extract the feature information contained in the fault signal, and accelerate the convergence of the subsequent diagnosis model. Secondly, a lightweight residual dense convolutional neural network model (LR-DenseNet) is proposed. This model combines the advantages of residual learning and a dense connection, and can not only extract deep features more easily, but can also effectively use shallow features. Then, based on the lightweight residual dense convolutional neural network model, an LR-DenseSENet model is proposed. By introducing the transfer learning strategy and adding the channel domain, an attention mechanism is added to the channel feature fusion layer, with the accuracy of detection up to 99.4%, and the amount of parameter calculation greatly reduced to one-fifth of that of VGG. Finally, through an experimental analysis, it is verified that the fault detection model designed in this paper based on the combination of an energy spectrum feature map and LR-DenseSENet achieves a satisfactory detection effect. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
15. A Study on Gear Defect Detection via Frequency Analysis Based on DNN.
- Author
-
Kim, Jeonghyeon, Kim, Jonghoek, and Kim, Hyuntai
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,GEARING machinery vibration ,CONVOLUTIONAL neural networks - Abstract
In this paper, we introduce a gear defect detection system using frequency analysis based on deep learning. The existing defect diagnosis systems using acoustic analysis use spectrogram, scalogram, and MFCC (Mel-Frequency Cepstral Coefficient) images as inputs to the convolutional neural network (CNN) model to diagnose defects. However, using visualized acoustic data as input to the CNN models requires a lot of computation time. Although computing power has improved, there is a situation in which a processor with low performance is used for reasons such as cost-effectiveness. In this paper, only the sums of frequency bands are used as input to the deep neural network (DNN) model to diagnose the gear fault. This system diagnoses the defects using only a few specific frequency bands, so it ignores unnecessary data and does not require high performance when diagnosing defects because it uses a relatively simple deep learning model for classification. We evaluate the performance of the proposed system through experiments and verify that real-time diagnosis of gears is possible compared to the CNN model. The result showed 95.5% accuracy for 1000 test data, and it took 18.48 ms, so that verified the capability of real-time diagnosis in a low-spec environment. The proposed system is expected to be effectively used to diagnose defects in various sound-based facilities at a low cost. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
16. Automatic Remote Sensing Identification of Co-Seismic Landslides Using Deep Learning Methods.
- Author
-
Pang, Dongdong, Liu, Gang, He, Jing, Li, Weile, and Fu, Rao
- Subjects
LANDSLIDES ,REMOTE sensing ,ARTIFICIAL neural networks ,EARTHQUAKE resistant design ,CONVOLUTIONAL neural networks ,DEEP learning ,VISUAL fields - Abstract
Rapid and accurate extraction of landslide areas triggered by earthquakes has far-reaching significance for geological disaster risk assessment and emergency rescue. At present, visual interpretation and field survey are still the most-commonly used methods for landslide identification, but these methods are often time-consuming and costly. For this reason, this paper tackles the problem of co-seismic landslide identification and the fact that there is little sample information in existing studies on landslide. A landslide sample dataset with 4000 tags was produced. With the YOLOv3 algorithm as the core, a convolutional neural network model with landslide characteristics was established to automatically recognize co-seismic landslides in satellite remote sensing images. By comparing it with the graphical interpretation results of remote sensing images, we found that the remote sensing for landslide recognition model constructed in this paper demonstrated high recognition accuracy and fast speed. The F1 value was 0.93, indicating that the constructed model was stable. The research results can provide reference for emergency rescue and disaster investigation of the same co-seismic landslide disaster. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
17. A Deep Learning-Based Emergency Alert Wake-Up Signal Detection Method for the UHD Broadcasting System.
- Author
-
Song, Jin-Hyuk, Baek, Myung-Sun, Bae, Byungjun, and Song, Hyoung-Kyu
- Subjects
EMERGENCY communication systems ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,SIGNAL detection ,DEEP learning ,FAST Fourier transforms ,SOFTWARE radio - Abstract
With the increasing frequency and severity of disasters and accidents, there is a growing need for efficient emergency alert systems. The ultra-high definition (UHD) broadcasting service based on Advanced Television Systems Committee (ATSC) 3.0, a leading terrestrial digital broadcasting system, offers such capabilities, including a wake-up function for minimizing damage through early alerts. In case of a disaster situation, the emergency alert wake-up signal is transmitted, allowing UHD TVs to be activated, enabling individuals to receive emergency alerts and access emergency broadcasting content. However, conventional methods for detecting the bootstrap signal, essential for this function, typically require an ATSC 3.0 demodulator. In this paper, we propose a novel deep learning-based method capable of detecting an emergency wake-up signal without the need for an ATSC 3.0. The proposed method leverages deep learning techniques, specifically a deep neural network (DNN) structure for bootstrap detection and a convolutional neural network (CNN) structure for wake-up signal demodulation and to detect the bootstrap and 2 bit emergency alert wake-up signal. Specifically, our method eliminates the need for Fast Fourier Transform (FFT), frequency synchronization, and interleaving processes typically required by a demodulator. By applying a deep learning in the time domain, we simplify the detection process, allowing for the detection of an emergency alert signal without the full suite of demodulator components required for ATSC 3.0. Furthermore, we have verified the performance of the deep learning-based method using ATSC 3.0-based RF signals and a commercial Software-Defined Radio (SDR) platform in a real environment. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. AI-Driven Network Security and Privacy.
- Author
-
Tan, Yu-an, Zhang, Qikun, Li, Yuanzhang, and Yu, Xiao
- Subjects
DEEP learning ,INTRUSION detection systems (Computer security) ,COMPUTER network security ,ARTIFICIAL neural networks ,MACHINE learning ,CONVOLUTIONAL neural networks ,GENERATIVE adversarial networks - Abstract
This document is a summary of a special issue of the journal Electronics titled "AI-Driven Network Security and Privacy." The issue focuses on various topics related to network security and privacy, including malware detection, intrusion detection, malware classification, domain name detection, encryption schemes, model robustness evaluation, traffic obfuscation, APT attack event extraction, object detection, abnormal traffic detection, data management, sentiment analysis, deepfake dataset evaluation, evasion attacks, face recognition, remote sensing image object detection, and target tracking in UAV videos. Each contribution presents a specific method or approach to address the challenges in their respective areas. The summary provides a brief overview of each contribution's main findings and methodologies. The document also discusses the future directions of AI-driven network security and privacy, including secure data sharing, privacy protection, threat detection and prevention, and automated response and repair. The authors express their gratitude to the researchers, reviewers, and editorial staff involved in the publication of the articles. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
19. A New Method for Traffic Participant Recognition Using Doppler Radar Signature and Convolutional Neural Networks.
- Author
-
Ślesicki, Błażej and Ślesicka, Anna
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DOPPLER radar ,TRACKING radar ,PEDESTRIANS ,DATABASES - Abstract
The latest survey results show an increase in accidents on the roads involving pedestrians and cyclists. The reasons for such situations are many, the fault actually lies on both sides. Equipping vehicles, especially autonomous vehicles, with frequency-modulated continuous-wave (FMCW) radar and dedicated algorithms for analyzing signals in the time–frequency domain as well as algorithms for recognizing objects in radar imaging through deep neural networks can positively affect safety. This paper presents a method for recognizing and distinguishing a group of objects based on radar signatures of objects and a special convolutional neural network structure. The proposed approach is based on a database of radar signatures generated on pedestrian, cyclist, and car models in a Matlab environment. The obtained results of simulations and positive tests provide a basis for the application of the system in many sectors and areas of the economy. Innovative aspects of the work include the method of discriminating between multiple objects on a single radar signature, the dedicated architecture of the convolutional neural network, and the use of a method of generating a custom input database. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Development of Context-Based Sentiment Classification for Intelligent Stock Market Prediction.
- Author
-
Smatov, Nurmaganbet, Kalashnikov, Ruslan, and Kartbayev, Amandyk
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,SENTIMENT analysis ,STOCKS (Finance) ,CORPORATE finance - Abstract
This paper presents a novel approach to sentiment analysis specifically customized for predicting stock market movements, bypassing the need for external dictionaries that are often unavailable for many languages. Our methodology directly analyzes textual data, with a particular focus on context-specific sentiment words within neural network models. This specificity ensures that our sentiment analysis is both relevant and accurate in identifying trends in the stock market. We employ sophisticated mathematical modeling techniques to enhance both the precision and interpretability of our models. Through meticulous data handling and advanced machine learning methods, we leverage large datasets from Twitter and financial markets to examine the impact of social media sentiment on financial trends. We achieved an accuracy exceeding 75%, highlighting the effectiveness of our modeling approach, which we further refined into a convolutional neural network model. This achievement contributes valuable insights into sentiment analysis within the financial domain, thereby improving the overall clarity of forecasting in this field. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. Aircraft Wake Evolution Prediction Based on Parallel Hybrid Neural Network Model.
- Author
-
Deng, Leilei, Pan, Weijun, Wang, Yuhao, Luan, Tian, and Leng, Yuanfei
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,MODEL airplanes ,COMPUTATIONAL fluid dynamics ,STANDARD deviations ,PARALLEL processing ,DEEP learning - Abstract
To overcome the time-consuming drawbacks of Computational Fluid Dynamics (CFD) numerical simulations, this paper proposes a hybrid model named PA-TLA (parallel architecture combining a TCN, LSTM, and an attention mechanism) based on the concept of intelligent aerodynamics and a parallel architecture. This model utilizes CFD data to drive efficient predictions of aircraft wake evolution at different initial altitudes during the approach phase. Initially, CFD simulations of continuous initial altitudes during the approach phase are used to generate aircraft wake evolution data, which are then validated against real-world LIDAR data to verify their reliability. The PA-TLA model is designed based on a parallel architecture, combining Long Short-Term Memory (LSTM) networks, Temporal Convolutional Networks (TCNs), and a tensor concatenation module based on the attention mechanism, which ensures computational efficiency while fully leveraging the advantages of each component in a parallel processing framework. The study results show that the PA-TLA model outperforms both the LSTM and TCN models in predicting the three characteristic parameters of aircraft wake: vorticity, circulation, and Q-criterion. Compared to the serially structured TCN-LSTM, PA-TLA achieves an average reduction in mean squared error (MSE) of 6.80%, in mean absolute error (MAE) of 7.70%, and in root mean square error (RMSE) of 4.47%, with an average increase in the coefficient of determination (R
2 ) of 0.36% and a 35% improvement in prediction efficiency. Lastly, this study combines numerical simulations and the PA-TLA deep learning architecture to analyze the near-ground wake vortex evolution. The results indicate that the ground effect increases air resistance and turbulence as vortices approach the ground, thereby slowing the decay rate of the wake vortex strength at lower altitudes. The ground effect also accelerates the dissipation and movement of vortex centers, causing more pronounced changes in vortex spacing at lower altitudes. Additionally, the vortex center height at lower altitudes initially decreases and then increases, unlike the continuous decrease observed at higher altitudes. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
22. Deep Learning Approach for Pitting Corrosion Detection in Gas Pipelines.
- Author
-
Malashin, Ivan, Tynchenko, Vadim, Nelyub, Vladimir, Borodulin, Aleksei, Gantimurov, Andrei, Krysko, Nikolay V., Shchipakov, Nikita A., Kozlov, Denis M., Kusyy, Andrey G., Martysyuk, Dmitry, and Galinovsky, Andrey
- Subjects
PITTING corrosion ,CONVOLUTIONAL neural networks ,DEEP learning ,ARTIFICIAL neural networks ,PIPELINE inspection ,PIPELINE corrosion - Abstract
The paper introduces a computer vision methodology for detecting pitting corrosion in gas pipelines. To achieve this, a dataset comprising 576,000 images of pipelines with and without pitting corrosion was curated. A custom-designed and optimized convolutional neural network (CNN) was employed for binary classification, distinguishing between corroded and non-corroded images. This CNN architecture, despite having relatively few parameters compared to existing CNN classifiers, achieved a notably high classification accuracy of 98.44%. The proposed CNN outperformed many contemporary classifiers in its efficacy. By leveraging deep learning, this approach effectively eliminates the need for manual inspection of pipelines for pitting corrosion, thus streamlining what was previously a time-consuming and cost-ineffective process. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Residual-Based Multi-Stage Deep Learning Framework for Computer-Aided Alzheimer's Disease Detection.
- Author
-
Hassan, Najmul, Musa Miah, Abu Saleh, and Shin, Jungpil
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,ALZHEIMER'S disease ,CONVOLUTIONAL neural networks ,SUPPORT vector machines ,MACHINE learning - Abstract
Alzheimer's Disease (AD) poses a significant health risk globally, particularly among the elderly population. Recent studies underscore its prevalence, with over 50% of elderly Japanese facing a lifetime risk of dementia, primarily attributed to AD. As the most prevalent form of dementia, AD gradually erodes brain cells, leading to severe neurological decline. In this scenario, it is important to develop an automatic AD-detection system, and many researchers have been working to develop an AD-detection system by taking advantage of the advancement of deep learning (DL) techniques, which have shown promising results in various domains, including medical image analysis. However, existing approaches for AD detection often suffer from limited performance due to the complexities associated with training hierarchical convolutional neural networks (CNNs). In this paper, we introduce a novel multi-stage deep neural network architecture based on residual functions to address the limitations of existing AD-detection approaches. Inspired by the success of residual networks (ResNets) in image-classification tasks, our proposed system comprises five stages, each explicitly formulated to enhance feature effectiveness while maintaining model depth. Following feature extraction, a deep learning-based feature-selection module is applied to mitigate overfitting, incorporating batch normalization, dropout and fully connected layers. Subsequently, machine learning (ML)-based classification algorithms, including Support Vector Machines (SVM), Random Forest (RF) and SoftMax, are employed for classification tasks. Comprehensive evaluations conducted on three benchmark datasets, namely ADNI1: Complete 1Yr 1.5T, MIRAID and OASIS Kaggle, demonstrate the efficacy of our proposed model. Impressively, our model achieves accuracy rates of 99.47%, 99.10% and 99.70% for ADNI1: Complete 1Yr 1.5T, MIRAID and OASIS datasets, respectively, outperforming existing systems in binary class problems. Our proposed model represents a significant advancement in the AD-analysis domain. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. ECARRNet: An Efficient LSTM-Based Ensembled Deep Neural Network Architecture for Railway Fault Detection.
- Author
-
Eunus, Salman Ibne, Hossain, Shahriar, Ridwan, A. E. M., Adnan, Ashik, Islam, Md. Saiful, Karim, Dewan Ziaul, Alam, Golam Rabiul, and Uddin, Jia
- Subjects
ARTIFICIAL neural networks ,RECURRENT neural networks ,JOINT use of railroad facilities ,DEEP learning ,CONVOLUTIONAL neural networks - Abstract
Accidents due to defective railway lines and derailments are common disasters that are observed frequently in Southeast Asian countries. It is imperative to run proper diagnosis over the detection of such faults to prevent such accidents. However, manual detection of such faults periodically can be both time-consuming and costly. In this paper, we have proposed a Deep Learning (DL)-based algorithm for automatic fault detection in railway tracks, which we termed an Ensembled Convolutional Autoencoder ResNet-based Recurrent Neural Network (ECARRNet). We compared its output with existing DL techniques in the form of several pre-trained DL models to investigate railway tracks and determine whether they are defective or not while considering commonly prevalent faults such as—defects in rails and fasteners. Moreover, we manually collected the images from different railway tracks situated in Bangladesh and made our dataset. After comparing our proposed model with the existing models, we found that our proposed architecture has produced the highest accuracy among all the previously existing state-of-the-art (SOTA) architecture, with an accuracy of 93.28% on the full dataset. Additionally, we split our dataset into two parts having two different types of faults, which are fasteners and rails. We ran the models on those two separate datasets, obtaining accuracies of 98.59% and 92.06% on rail and fastener, respectively. Model explainability techniques like Grad-CAM and LIME were used to validate the result of the models, where our proposed model ECARRNet was seen to correctly classify and detect the regions of faulty railways effectively compared to the previously existing transfer learning models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Attention-Oriented CNN Method for Type 2 Diabetes Prediction.
- Author
-
Zhao, Jian, Gao, Hanlin, Yang, Chen, An, Tianbo, Kuang, Zhejun, and Shi, Lijuan
- Subjects
TYPE 2 diabetes ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,MACHINE learning ,HEALTH & Nutrition Examination Survey ,OUTLIER detection ,FORECASTING - Abstract
Diabetes is caused by insulin deficiency or impaired biological action, and long-term hyperglycemia leads to a variety of tissue damage and dysfunction. Therefore, the early prediction of diabetes and timely intervention and treatment are crucial. This paper proposes a robust framework for the prediction and diagnosis of type 2 diabetes (T2DM) to aid in diabetes applications in clinical diagnosis. The data-preprocessing stage includes steps such as outlier removal, missing value filling, data standardization, and assigning class weights to ensure the quality and consistency of the data, thereby improving the performance and stability of the model. This experiment used the National Health and Nutrition Examination Survey (NHANES) dataset and the publicly available PIMA Indian dataset (PID). For T2DM classification, we designed a convolutional neural network (CNN) and proposed a novel attention-oriented convolutional neural network (SECNN) through the channel attention mechanism. To optimize the hyperparameters of the model, we used grid search and K-fold cross-validation methods. In addition, we also comparatively analyzed various machine learning (ML) models such as support vector machine (SVM), logistic regression (LR), decision tree (DT), random forest (RF), and artificial neural network (ANN). Finally, we evaluated the performance of the model using performance evaluation metrics such as precision, recall, F1-Score, accuracy, and AUC. Experimental results show that the SECNN model has an accuracy of 94.12% on the NHANES dataset and an accuracy of 89.47% on the PIMA Indian dataset. SECNN models and CNN models show significant improvements in diabetes prediction performance compared to traditional ML models. The comparative analysis of the SECNN model and the CNN model has significantly improved performance, further verifying the advantages of introducing the channel attention mechanism. The robust diabetes prediction framework proposed in this article establishes an effective foundation for diabetes diagnosis and prediction, and has a positive impact on the development of health management and medical industries. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. A Lightweight Convolutional Neural Network Method for Two-Dimensional PhotoPlethysmoGraphy Signals.
- Author
-
Zhao, Feng, Zhang, Xudong, and He, Zhenyu
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,PHOTOPLETHYSMOGRAPHY ,DEEP learning ,INFORMATION technology security - Abstract
Data information security on wearable devices has emerged as a significant concern among users, so it becomes urgent to explore authentication methods based on wearable devices. Using PhotoPlethysmoGraphy (PPG) signals for identity authentication has been proven effective in biometric authentication. This paper proposes a convolutional neural network authentication method based on 2D PPG signals applied to wearable devices. This method uses Markov Transition Field technology to convert one-dimensional PPG signal data into two-dimensional image data, which not only retains the characteristics of the signal but also enriches the spatial information. Afterward, considering that wearable devices usually have limited resources, a lightweight convolutional neural network model is also designed in this method, which reduces resource consumption and computational complexity while ensuring high performance. It is proved experimentally that this method achieves 98.62% and 96.17% accuracy on the training set and test set, respectively, an undeniable advantage compared to the traditional one-dimensional deep learning method and the classical two-dimensional deep learning method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Age-Related Macular Degeneration Detection in Retinal Fundus Images by a Deep Convolutional Neural Network.
- Author
-
García-Floriano, Andrés and Ventura-Molina, Elías
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,MACULAR degeneration ,RETINAL degeneration ,RETINAL imaging ,FUNDUS oculi ,DEEP learning - Abstract
Featured Application: The Deep Neural Network model employed in this work will help build a system for the pre-diagnosis of retinopathies that can lead to blindness. The main intention of this type of system is to support the work performed by ophthalmology specialists. Computer-based pre-diagnosis of diseases through medical imaging is a task worked on for many years. The so-called fundus images stand out since they do not have uniform illumination and are highly sensitive to noise. One of the diseases that can be pre-diagnosed through fundus images is age-related macular degeneration, which initially manifests as the appearance of lesions called drusen. Several ways of pre-diagnosing macular degeneration have been proposed, methods based entirely on the segmentation of drusen with prior image processing have been designed and applied, and methods based on image pre-processing and subsequent conversion to feature vectors, or patterns, to be classified by a Machine-Learning model have also been developed. Finally, in recent years, the use of Deep-Learning models, particularly Convolutional Networks, has been proposed and used in classification problems where the data are only images. The latter has allowed the so-called transfer learning, which consists of using the learning achieved in the solution of one problem to solve another. In this paper, we propose the use of transfer learning through the Xception Deep Convolutional Neural Network to detect age-related macular degeneration in fundus images. The performance of the Xception model was compared against six other state-of-the-art models with a dataset created from images available in public and private datasets, which were divided into training/validation and test; with the training/validation set, the training was made using 10-fold cross-validation. The results show that the Xception neural network obtained a validation accuracy that surpasses other models, such as the VGG-16 or VGG-19 networks, and had an accuracy higher than 80% in the test set. We consider that the contributions of this work include the use of a Convolutional Neural Network model for the detection of age-related macular degeneration through the classification of fundus images in those affected by AMD (drusen) and the images of healthy patients. The performance of this model is compared against other methods featured in the state-of-the-art approaches, and the best model is tested on a test set outside the training and validation set. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Enhancing Human Activity Recognition with Siamese Networks: A Comparative Study of Contrastive and Triplet Learning Approaches.
- Author
-
Cha, Byung-Rae and Vaidya, Binod
- Subjects
ARTIFICIAL neural networks ,HUMAN activity recognition ,CONVOLUTIONAL neural networks ,DEEP learning ,FEATURE extraction - Abstract
This paper delves into the realm of human activity recognition (HAR) by leveraging the capabilities of Siamese neural networks (SNNs), focusing on the comparative effectiveness of contrastive and triplet learning approaches. Against the backdrop of HAR's growing importance in healthcare, sports, and smart environments, the need for advanced models capable of accurately recognizing and classifying complex human activities has become paramount. Addressing this, we have introduced a Siamese network architecture integrated with convolutional neural networks (CNNs) for spatial feature extraction, bidirectional LSTM (Bi-LSTM) for temporal dependency capture, and attention mechanisms to prioritize salient features. Employing both contrastive and triplet loss functions, we meticulously analyze the impact of these learning approaches on the network's ability to generate discriminative embeddings for HAR tasks. Through extensive experimentation, the study reveals that Siamese networks, particularly those utilizing triplet loss functions, demonstrate superior performance in activity recognition accuracy and F1 scores compared with baseline deep learning models. The inclusion of a stacking meta-classifier further amplifies classification efficacy, showcasing the robustness and adaptability of our proposed model. Conclusively, our findings underscore the potential of Siamese networks with advanced learning paradigms in enhancing HAR systems, paving the way for future research in model optimization and application expansion. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. Learning in Deep Radial Basis Function Networks.
- Author
-
Wurzberger, Fabian and Schwenker, Friedhelm
- Subjects
RADIAL basis functions ,ARTIFICIAL neural networks ,DEEP learning ,CONVOLUTIONAL neural networks ,EMOTION recognition ,IMAGE recognition (Computer vision) - Abstract
Learning in neural networks with locally-tuned neuron models such as radial Basis Function (RBF) networks is often seen as instable, in particular when multi-layered architectures are used. Furthermore, universal approximation theorems for single-layered RBF networks are very well established; therefore, deeper architectures are theoretically not required. Consequently, RBFs are mostly used in a single-layered manner. However, deep neural networks have proven their effectiveness on many different tasks. In this paper, we show that deeper RBF architectures with multiple radial basis function layers can be designed together with efficient learning schemes. We introduce an initialization scheme for deep RBF networks based on k-means clustering and covariance estimation. We further show how to make use of convolutions to speed up the calculation of the Mahalanobis distance in a partially connected way, which is similar to the convolutional neural networks (CNNs). Finally, we evaluate our approach on image classification as well as speech emotion recognition tasks. Our results show that deep RBF networks perform very well, with comparable results to other deep neural network types, such as CNNs. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Artificial Intelligence in Ship Trajectory Prediction.
- Author
-
Bi, Jinqiang, Cheng, Hongen, Zhang, Wenjia, Bao, Kexin, and Wang, Peiren
- Subjects
DEEP learning ,ARTIFICIAL intelligence ,ARTIFICIAL neural networks ,RECURRENT neural networks ,CONVOLUTIONAL neural networks ,MACHINE learning - Abstract
Maritime traffic is increasing more and more, creating more complex navigation environments for ships. Ship trajectory prediction based on historical AIS data is a vital method of reducing navigation risks and enhancing the efficiency of maritime traffic control. At present, employing machine learning or deep learning techniques to construct predictive models based on AIS data has become a focal point in ship trajectory prediction research. This paper systematically evaluates various trajectory prediction methods, spanning classical machine learning approaches and emerging deep learning techniques, to uncover their respective merits and drawbacks. In this work, a variety of studies were investigated that applied different algorithms in ship trajectory prediction, including regression models (RMs), artificial neural networks (ANNs), Kalman filtering (KF), and random forests (RFs) in machine learning, along with deep learning such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), long short-term memory (LSTM), gate recurrent unit (GRU) networks, and sequence-to-sequence (Seq2seq) networks. The performance of predictive models based on different algorithms in trajectory prediction tasks was graded and analyzed. Among the existing studies, deep learning methods exhibit significant performance and considerable potential application value for maritime traffic systems, which can be assessed by future work on ship trajectory prediction research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. Application Research of Bridge Damage Detection Based on the Improved Lightweight Convolutional Neural Network Model.
- Author
-
Du, Fujun, Jiao, Shuangjian, and Chu, Kaili
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,TRAFFIC safety ,FEATURE extraction - Abstract
To ensure the safety and rational use of bridge traffic lines, the existing bridge structural damage detection models are not perfect for feature extraction and have difficulty meeting the practicability of detection equipment. Based on the YOLO (You Only Look Once) algorithm, this paper proposes a lightweight target detection algorithm with enhanced feature extraction of bridge structural damage. The BIFPN (Bidirectional Feature Pyramid Network) network structure is used for multi-scale feature fusion, which enhances the ability to extract damage features of bridge structures, and uses EFL (Equalized Focal Loss) to optimize the sample imbalance processing mechanism, which improves the accuracy of bridge structure damage target detection. The evaluation test of the model has been carried out in the constructed BDD (Bridge Damage Dataset) dataset. Compared with the YOLOv3-tiny, YOLOv5S, and B-YOLOv5S models, the mAP@.5 of the BE-YOLOv5S model increased by 45.1%, 2%, and 1.6% respectively. The analysis and comparison of the experimental results prove that the BE-YOLOv5S network model proposed in this paper has a better performance and a more reliable performance in the detection of bridge structural damage. It can meet the needs of bridge structure damage detection engineering with high requirements for real-time and flexibility. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
32. Computer Aided Classifier of Colorectal Cancer on Histopatological Whole Slide Images Analyzing Deep Learning Architecture Parameters.
- Author
-
Martínez-Fernandez, Elena, Rojas-Valenzuela, Ignacio, Valenzuela, Olga, and Rojas, Ignacio
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,COLORECTAL cancer ,CONVOLUTIONAL neural networks ,MEDICAL imaging systems ,COMPUTERS - Abstract
The diagnosis of different pathologies and stages of cancer using whole histopathology slide images (WSI) is the gold standard for determining the degree of tissue metastasis. The use of deep learning systems in the field of medical images, especially histopathology images, is becoming increasingly important. The training and optimization of deep neural network models involve fine-tuning parameters and hyperparameters such as learning rate, batch size (BS), and boost to improve the performance of the model in task-specific applications. Tuning hyperparameters is a major challenge in designing deep neural network models, having a large impact on the performance. This paper analyzes how the parameters and hyperparameters of a deep learning architecture affect the classification of colorectal cancer (CRC) histopathology images using the well-known VGG19 model. This paper also discusses the pre-processing of these images, such as the use of color normalization and stretching transformations on the data set. Among these hyperparameters, the most important neural network hyperparameter is the learning rate (LR). In this paper, different strategies for the optimization of LR are analyzed (both static and dynamic) and a new experiment based on the variation of LR is proposed (the relevance of dynamic strategies over fixed LR is highlighted), after each layer of the neural network together with decreasing variations according to the epochs. The results obtained are very remarkable, obtaining in the simulation an accurate system that achieves 96.4% accuracy on test images (for nine different tissue classes) using the triangular-cyclic learning rate. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
33. Survey of Recent Deep Neural Networks with Strong Annotated Supervision in Histopathology.
- Author
-
Petríková, Dominika and Cimrák, Ivan
- Subjects
ARTIFICIAL neural networks ,BREAST ,CONVOLUTIONAL neural networks ,DEEP learning ,HISTOPATHOLOGY ,IMAGE recognition (Computer vision) - Abstract
Deep learning (DL) and convolutional neural networks (CNNs) have achieved state-of-the-art performance in many medical image analysis tasks. Histopathological images contain valuable information that can be used to diagnose diseases and create treatment plans. Therefore, the application of DL for the classification of histological images is a rapidly expanding field of research. The popularity of CNNs has led to a rapid growth in the number of works related to CNNs in histopathology. This paper aims to provide a clear overview for better navigation. In this paper, recent DL-based classification studies in histopathology using strongly annotated data have been reviewed. All the works have been categorized from two points of view. First, the studies have been categorized into three groups according to the training approach and model construction: 1. fine-tuning of pre-trained networks for one-stage classification, 2. training networks from scratch for one-stage classification, and 3. multi-stage classification. Second, the papers summarized in this study cover a wide range of applications (e.g., breast, lung, colon, brain, kidney). To help navigate through the studies, the classification of reviewed works into tissue classification, tissue grading, and biomarker identification was used. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
34. A Survey of Automatic Speech Recognition for Dysarthric Speech.
- Author
-
Qian, Zhaopeng and Xiao, Kejing
- Subjects
AUTOMATIC speech recognition ,DEEP learning ,ARTIFICIAL neural networks ,SPEECH perception ,LANGUAGE models ,RECURRENT neural networks ,CONVOLUTIONAL neural networks - Abstract
Dysarthric speech has several pathological characteristics, such as discontinuous pronunciation, uncontrolled volume, slow speech, explosive pronunciation, improper pauses, excessive nasal sounds, and air-flow noise during pronunciation, which differ from healthy speech. Automatic speech recognition (ASR) can be very helpful for speakers with dysarthria. Our research aims to provide a scoping review of ASR for dysarthric speech, covering papers in this field from 1990 to 2022. Our survey found that the development of research studies about the acoustic features and acoustic models of dysarthric speech is nearly synchronous. During the 2010s, deep learning technologies were widely applied to improve the performance of ASR systems. In the era of deep learning, many advanced methods (such as convolutional neural networks, deep neural networks, and recurrent neural networks) are being applied to design acoustic models and lexical and language models for dysarthric-speech-recognition tasks. Deep learning methods are also used to extract acoustic features from dysarthric speech. Additionally, this scoping review found that speaker-dependent problems seriously limit the generalization applicability of the acoustic model. The scarce available speech data cannot satisfy the amount required to train models using big data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
35. A Low-Cost Detail-Aware Neural Network Framework and Its Application in Mask Wearing Monitoring.
- Author
-
Cao, Silei, Long, Shun, and Liao, Fangting
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,ARTIFICIAL neural networks ,IMAGE recognition (Computer vision) ,MEDICAL masks ,PUBLIC spaces - Abstract
The use of deep learning techniques in real-time monitoring can save a lot of manpower in various scenarios. For example, mask-wearing is an effective measure to prevent COVID-19 and other respiratory diseases, especially for vulnerable populations such as children, the elderly, and people with underlying health problems. Currently, many public places such as hospitals, nursing homes, social service facilities, and schools experiencing outbreaks require mandatory mask-wearing. However, most of the terminal devices currently available have very limited GPU capability to run large neural networks. This means that we have to keep the parameter size of a neural network modest while maintaining its performance. In this paper, we propose a framework that applies deep learning techniques to real-time monitoring and uses it for the real-time monitoring of mask-wearing status. The main contributions are as follows: First, a feature fusion technique called skip layer pooling fusion (SLPF) is proposed for image classification tasks. It fully utilizes both deep and shallow features of a convolutional neural network while minimizing the growth in model parameters caused by feature fusion. On average, this technique improves the accuracy of various neural network models by 4.78% and 5.21% on CIFAR100 and Tiny-ImageNet, respectively. Second, layer attention (LA), an attention mechanism tailor-made for feature fusion, is proposed. Since different layers of convolutional neural networks make different impacts on the final prediction results, LA learns a set of weights to better enhance the contribution of important convolutional layer features. On average, it improves the accuracy of various neural network models by 2.10% and 2.63% on CIFAR100 and Tiny-ImageNet, respectively. Third, a MobileNetv2-based lightweight mask-wearing status classification model is trained, which is suitable for deployment on mobile devices and achieves an accuracy of 95.49%. Additionally, a ResNet mask-wearing status classification model is trained, which has a larger model size but achieves high accuracy of 98.14%. By applying the proposed methods to the ResNet mask-wearing status classification model, the accuracy is improved by 1.58%. Fourth, a mask-wearing status detection model is enhanced based on YOLOv5 with a spatial-frequency fusion module resulting in a mAP improvement of 2.20%. Overall, this paper presents various techniques to improve the performance of neural networks and apply them to mask-wearing status monitoring, which can help stop pandemics. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
36. New Trends in Emotion Recognition Using Image Analysis by Neural Networks, A Systematic Review.
- Author
-
Cîrneanu, Andrada-Livia, Popescu, Dan, and Iordache, Dragoș
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,EMOTION recognition ,IMAGE analysis ,RECURRENT neural networks ,GENERATIVE adversarial networks - Abstract
Facial emotion recognition (FER) is a computer vision process aimed at detecting and classifying human emotional expressions. FER systems are currently used in a vast range of applications from areas such as education, healthcare, or public safety; therefore, detection and recognition accuracies are very important. Similar to any computer vision task based on image analyses, FER solutions are also suitable for integration with artificial intelligence solutions represented by different neural network varieties, especially deep neural networks that have shown great potential in the last years due to their feature extraction capabilities and computational efficiency over large datasets. In this context, this paper reviews the latest developments in the FER area, with a focus on recent neural network models that implement specific facial image analysis algorithms to detect and recognize facial emotions. This paper's scope is to present from historical and conceptual perspectives the evolution of the neural network architectures that proved significant results in the FER area. This paper endorses convolutional neural network (CNN)-based architectures against other neural network architectures, such as recurrent neural networks or generative adversarial networks, highlighting the key elements and performance of each architecture, and the advantages and limitations of the proposed models in the analyzed papers. Additionally, this paper presents the available datasets that are currently used for emotion recognition from facial expressions and micro-expressions. The usage of FER systems is also highlighted in various domains such as healthcare, education, security, or social IoT. Finally, open issues and future possible developments in the FER area are identified. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
37. Security Risk Level Prediction of Carbofuran Pesticide Residues in Chinese Vegetables Based on Deep Learning.
- Author
-
Jiang, Tongqiang, Liu, Tianqi, Dong, Wei, Liu, Yingjie, and Zhang, Qingchuan
- Subjects
PESTICIDE residues in food ,DEEP learning ,ARTIFICIAL neural networks ,PESTICIDE pollution ,CARBOFURAN ,RECURRENT neural networks ,CONVOLUTIONAL neural networks ,CLINICAL supervision - Abstract
The supervision of security risk level of carbofuran pesticide residues can guarantee the food quality and security of residents effectively. In order to predict the potential key risk vegetables and regions, this paper constructs a security risk assessment model, combined with the k-means++ algorithm, to establish the risk security level. Then the evaluation index value of the security risk model is predicted to determine the security risk level based on the deep learning model. The model consists of a convolutional neural network (CNN) and a long short-term memory network (LSTM) optimized by an arithmetic optimization algorithm (AOA), namely, CNN-AOA-LSTM. In this paper, a comparative experiment is conducted on a small sample data set of independently constructed security risk assessment indicators. Experimental results show that the accuracy of the CNN-AOA-LSTM prediction model based on attention mechanism is 6.12% to 18.99% higher than several commonly used deep neural network models (gated recurrent unit, LSTM, and recurrent neural networks). The prediction model proposed in this paper provides scientific reference to establish the priority order of supervision, and provides forward-looking supervision for the government. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
38. Fully Convolutional Neural Network Prediction Method for Aerostatic Performance of Bluff Bodies Based on Consistent Shape Description.
- Author
-
Li, Ke, Li, Hai, Li, Shaopeng, and Chen, Zengshun
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,WIND tunnel testing ,DRAG coefficient ,WIND tunnels - Abstract
The shape of a bluff body section is of high importance to its aerostatic performance. Obtaining the aerostatic performance of a specific shape based on wind tunnel tests and CFD simulations takes a lot of time, which affects evaluation efficiency. This paper proposes a novel fully convolutional neural network model that enables rapid prediction from shape to aerostatic performance. Its main innovations are: (1) The proposal of a new shape description method in which the shape is described by the combination of the wall distance field and the space coordinate field, which can efficiently express the influencing factors of the shape on the aerostatic performance. (2) A step-by-step strategy in which the pressure field is used as the model output and then the calculation of the aerostatic coefficient is proposed. Compared with the simple direct prediction of the aerostatic coefficient, the logical connection between input and output can be enhanced and the prediction accuracy can be improved. It is found that the model proposed in this paper has good prediction accuracy, and its average relative error is 9.42% compared with the CFD calculation results. Compared with the direct use of the shape as the model input, the accuracy is improved by 13.25%; compared with the direct use of the drag coefficient as the model output, the accuracy is improved by 10%. Compared with traditional CFD calculations and wind tunnel experiments, this method can be used as a fast auxiliary screening method for the optimization of the aerodynamic shapes of bluff body sections. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
39. Short-Term Regional Temperature Prediction Based on Deep Spatial and Temporal Networks.
- Author
-
Wu, Shun, Fu, Fengchen, Wang, Lei, Yang, Minhang, Dong, Shi, He, Yongqing, Zhang, Qingqing, and Guo, Rong
- Subjects
TIME-varying networks ,GRIDS (Cartography) ,ARTIFICIAL neural networks ,DEEP learning ,CONVOLUTIONAL neural networks ,HUMAN information processing ,SPATIOTEMPORAL processes ,ATMOSPHERIC temperature - Abstract
Accurate prediction of air temperature is of great significance to outdoor activities and daily life. However, it is important and more challenging to predict air temperature in complex terrain areas because of prevailing mountain and valley winds and variable wind directions. The main innovation of this paper is to propose a regional temperature prediction method based on deep spatiotemporal networks, designing a spatiotemporal information processing module to align temperature data with regional grid points and further transforming temperature time series data into image sequences. Long Short-Term Memory network is constructed on the images to extract the depth features of the data to train the model. The experiments demonstrate that the deep learning prediction model containing the spatiotemporal information processing module and the deep learning prediction module is fully feasible in short-term regional temperature prediction. The comparison experiments show that the model proposed in this paper has better prediction results for classical models, such as convolutional neural networks and LSTM networks. The experimental conclusion shows that the method proposed in this paper can predict the distribution and change trend of temperature in the next 3 h and the next 6 h on a regional scale. The experimental result RMSE reached 0.63, showing high stability and accuracy. The model provides a new method for local regional temperature prediction, which can support the planning of production and life in advance and tend to save energy and reduce consumption. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
40. Prediction of Prospecting Target Based on ResNet Convolutional Neural Network.
- Author
-
Gao, Le, Huang, Yongjie, Zhang, Xin, Liu, Qiyuan, and Chen, Zequn
- Subjects
DEEP learning ,CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,PROSPECTING ,ORE deposits ,MINES & mineral resources ,EARTH sciences ,ARTIFICIAL intelligence - Abstract
In recent years, with the development of geological prospecting from shallow ore to deep and hidden ore, the difficulty of prospecting is increasing day by day, so the application of computer technology and new methods of geological and mineral exploration is paid more and more attention. The mining and prediction of geological prospecting information based on deep learning have become the frontier field of earth science. However, as a deep artificial intelligence algorithm, deep learning still has many problems to be solved in the big data mining and prediction of geological prospecting, such as the small number of training samples of geological and mineral images, the difficulty of building deep learning network models, and the universal applicability of deep learning models. In this paper, the training samples and convolutional neural network models suitable for geochemical element data mining are constructed to solve the above problems, and the model is successfully applied to the prediction research of gold, silver, lead and zinc polymetallic metallogenic areas in South China. Taking the Pangxidong research area in the west of Guangdong Province as an example, this paper carries out prospecting target prediction research based on a 1:50000 stream sediment survey original data. Firstly, the support vector machine (SVM) model and statistical method were used to determine the ore-related geochemical element assemblage. Secondly, the experimental data of geochemical elements were augmented and a dataset was established. Finally, ResNet-50 neural network model is used for data training and prediction research. The experimental results show that the areas numbered 9, 29, 38, 40, 95, 111, 114, 124, 144 have great metallogenic potential, and this method would be a promising tool for metallogenic prediction. By applying the ResNet-50 neural network in metallogenic prediction, it can provide a new idea for the future exploration of mineral resources. In order to verify the generality of the research method in this paper, we conducted experimental tests on the geochemical dataset of B area, another deposit research area in South China. The results show that 100% of the prediction area obtained by using the proposed method covers the known ore deposit area. This model also provides method support for further delineating the prospecting target area in study area B. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
41. Head-Integrated Detecting Method for Workers under Complex Construction Scenarios.
- Author
-
Liu, Yongyue, Zhou, Zhenzong, Wang, Yaowu, and Sun, Chengshuang
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,DEEP learning ,GRAPHICS processing units ,CONSTRUCTION management ,BUILDING sites - Abstract
Real-time detection of workers is crucial in construction safety management. Deep learning-based detecting methods are valuable, but always challenged by the possibility of target missing or identity errors under complex scenarios. To address these limitations, previous research depended on re-training for new models or datasets, which are prohibitively time-consuming and incur high computing demands. However, we demonstrate that the better detecting model might not rely on more re-training of weights; instead, a training-free model can achieve even better performance by integrating head information. In this paper, a new head-detecting branch (55 MB) is added to the Keypoint Region-based Convolutional Network (Keypoint R-CNN, 226 MB) without altering its original weights, allowing for a less occluded head to aid in body detection. We also deployed motion information and anthropometric data through a post-processing module to calculate movement relationships. This study achieved an identity F1-score (IDF1) of 97.609%, recall (Rcll) of 98.173%, precision (Prcn) of 97.052%, and accuracy of 95.329% as a state-of-the-art (SOTA) method for worker detection. This exploration breaks the inertial attitudes of re-training dependency and accelerates the application of universal models, in addition to reducing the computational difficulty for most construction sites, especially in scenarios with an insufficient graphics processing unit (GPU). More importantly, this study can address occlusion challenges effectively in the worker detection field, making it of practical significance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Hybrid Deep Neural Networks with Multi-Tasking for Rice Yield Prediction Using Remote Sensing Data.
- Author
-
Chang, Che-Hao, Lin, Jason, Chang, Jia-Wei, Huang, Yu-Shun, Lai, Ming-Hsin, and Chang, Yen-Jen
- Subjects
ARTIFICIAL neural networks ,RICE quality ,DEEP learning ,REMOTE sensing ,CONVOLUTIONAL neural networks ,CROP yields ,AGRICULTURE ,HEBBIAN memory - Abstract
Recently, data-driven approaches have become the dominant solution for prediction problems in agricultural industries. Several deep learning models have been applied to crop yield prediction in smart farming. In this paper, we proposed an efficient hybrid deep learning model that coordinates the outcomes of a classification model and a regression model in deep learning via the shared layers to predict the rice crop yield. Three statistical analyses on the features, including Pearson correlation coefficients (PCC), Shapley additive explanations (SHAP), and recursive feature elimination with cross-validation (RFECV), are proposed to select the most relevant ones for the predictive goal to reduce the model training time. The data preprocessing normalizes the features of the collected data into specific ranges of values and then reformats them into a three-dimensional matrix. As a result, the root-mean-square error (RMSE) of the proposed model in rice yield prediction has achieved 344.56 and an R-squared of 0.64. The overall performance of the proposed model is better than the other deep learning models, such as the multi-parametric deep neural networks (MDNNs) (i.e., RMSE = 370.80, R-squared = 0.59) and the artificial neural networks (ANNs) (i.e., RMSE = 550.03, R-squared = 0.09). The proposed model has demonstrated significant improvement in the predictive results of distinguishing high yield from low yield with 90% accuracy and 94% F1 score. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Fuzzy Clustering-Based Deep Learning for Short-Term Load Forecasting in Power Grid Systems Using Time-Varying and Time-Invariant Features.
- Author
-
Chan, Kit Yan, Yiu, Ka Fai Cedric, Kim, Dowon, and Abu-Siada, Ahmed
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,GRIDS (Cartography) ,ELECTRIC power distribution grids ,DEEP learning ,FUZZY neural networks ,LINEAR time invariant systems - Abstract
Accurate short-term load forecasting (STLF) is essential for power grid systems to ensure reliability, security and cost efficiency. Thanks to advanced smart sensor technologies, time-series data related to power load can be captured for STLF. Recent research shows that deep neural networks (DNNs) are capable of achieving accurate STLP since they are effective in predicting nonlinear and complicated time-series data. To perform STLP, existing DNNs use time-varying dynamics of either past load consumption or past power correlated features such as weather, meteorology or date. However, the existing DNN approaches do not use the time-invariant features of users, such as building spaces, ages, isolation material, number of building floors or building purposes, to enhance STLF. In fact, those time-invariant features are correlated to user load consumption. Integrating time-invariant features enhances STLF. In this paper, a fuzzy clustering-based DNN is proposed by using both time-varying and time-invariant features to perform STLF. The fuzzy clustering first groups users with similar time-invariant behaviours. DNN models are then developed using past time-varying features. Since the time-invariant features have already been learned by the fuzzy clustering, the DNN model does not need to learn the time-invariant features; therefore, a simpler DNN model can be generated. In addition, the DNN model only learns the time-varying features of users in the same cluster; a more effective learning can be performed by the DNN and more accurate predictions can be achieved. The performance of the proposed fuzzy clustering-based DNN is evaluated by performing STLF, where both time-varying features and time-invariant features are included. Experimental results show that the proposed fuzzy clustering-based DNN outperforms the commonly used long short-term memory networks and convolution neural networks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. Hyperspectral Image Classification Based on Mutually Guided Image Filtering.
- Author
-
Zhan, Ying, Hu, Dan, Yu, Xianchuan, and Wang, Yufeng
- Subjects
IMAGE recognition (Computer vision) ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,FEATURE extraction ,GENERATIVE adversarial networks ,HYPERSPECTRAL imaging systems ,REMOTE sensing - Abstract
Hyperspectral remote sensing images (HSIs) have both spectral and spatial characteristics. The adept exploitation of these attributes is central to enhancing the classification accuracy of HSIs. In order to effectively utilize spatial and spectral features to classify HSIs, this paper proposes a method for the spatial feature extraction of HSIs based on a mutually guided image filter (muGIF) and combined with the band-distance-grouped principal component. Firstly, aiming at the problem that previously guided image filtering cannot effectively deal with the inconsistent information structure between the guided and target information, a method for extracting spatial features using muGIF is proposed. Then, aiming at the problem of the information loss caused by a single principal component as a guided image in the traditional GIF-based spatial–spectral classification, a spatial feature-extraction framework based on the band-distance-grouped principal component is proposed. The method groups the bands according to the band distance and extracts the principal components of each set of band subsets as the guide map of the current band subset to filter the HSIs. A deep convolutional neural network model and a generative adversarial network model for the filtered HSIs are constructed and then trained using samples for HSIs' spatial–spectral classification. Experiments show that compared with the traditional methods and several popular spatial–spectral HSI classification methods based on a filter, the proposed methods based on muGIF can effectively extract the spatial–spectral features and improve the classification accuracy of HSIs. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. Enhancing Day-Ahead Cooling Load Prediction in Tropical Commercial Buildings Using Advanced Deep Learning Models: A Case Study in Singapore.
- Author
-
Kondath, Namitha, Myat, Aung, Soh, Yong Loke, Tung, Whye Loon, Eugene, Khoo Aik Min, and An, Hui
- Subjects
COMMERCIAL buildings ,ARTIFICIAL neural networks ,DEEP learning ,RECURRENT neural networks ,CONVOLUTIONAL neural networks ,COOLING loads (Mechanical engineering) ,TROPICAL climate - Abstract
Commercial buildings in hot and humid tropical climates rely significantly on cooling systems to maintain optimal occupant comfort. A well-accurate day-ahead load profile prediction plays a pivotal role in planning the energy requirements of cooling systems. Despite the pressing need for effective day-ahead cooling load predictions, current methodologies have not fully harnessed the potential of advanced deep-learning techniques. This paper aims to address this gap by investigating the application of innovative deep-learning models in day-ahead hourly cooling load prediction for commercial buildings in tropical climates. A range of multi-output deep learning techniques, including Deep Neural Networks (DNNs), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Long Short-Term Memory networks (LSTMs), are employed to enhance prediction accuracy. Furthermore, these individual deep learning techniques are synergistically integrated to create hybrid models, such as CNN-LSTM and Sequence-to-Sequence models. Experiments are conducted to choose the time horizons from the past that can serve as input to the models. In addition, the influence of various categories of input parameters on prediction performance has been assessed. Historical cooling load, calendar features, and outdoor weather parameters are found in decreasing order of influence on prediction accuracy. This research focuses on buildings located in Singapore and presents a comprehensive case study to validate the proposed models and methodologies. The sequence-to-sequence model provided better performance than all the other models. It offered a CV-RMSE of 7.4%, 10%, and 6% for SIT@Dover, SIT@NYP, and the simulated datasets, which were 2.3%, 3%, and 1% less, respectively, than the base Deep Neural Network model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. Rolling Shutter OFDM Scheme for Optical Camera Communication Considering Mobility Environment Based on Deep Learning.
- Author
-
Nguyen, Huy, Nguyen, Van Linh, Tran, Duc Hoang, and Jang, Yeong Min
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,OPTICAL communications ,CONVOLUTIONAL neural networks ,WIRELESS communications ,MULTI-carrier modulation - Abstract
This paper presents a rolling shutter orthogonal frequency-division multiplexing (RS-OFDM) optical camera communication higher rate longer range proposed in IEEE 802.15.7a Task Group (TG7a) using an image sensor as a receiver. OFDM is a digital multi-carrier modulation scheme deployed for broadband wireless communication to resolve the inter-symbol interference (ISI) effect caused by the multipath channel. In optical wireless communication systems, OFDM was applied widely for indoor applications: internet of things, e-health, vehicular, and localization systems. The mobility scenario is a big problem for OWC systems, which reduces the system performance due to the optical channel variation in the processing time. In addition to that, signal detection should be considered in the mobility environment to improve the signal-to-noise ratio of OWC systems. In this paper, we proposed the convolution neural network (CNN) for LED detection in the RS-OFDM system, considering the mobility effect. In addition to that, the deep neural network was applied to detect the start of OFDM frame instead of conventional technology (Van De Beek algorithm). By applying our approach, the RS-OFDM system can achieve long communication (18 m distance) with a low error rate in the 2 m/s velocity environment. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
47. Adaptive Modular Convolutional Neural Network for Image Recognition.
- Author
-
Wu, Wenbo and Pan, Yun
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,IMAGE recognition (Computer vision) ,ARTIFICIAL neural networks ,COMPUTER vision ,STRUCTURAL optimization ,IMAGE fusion - Abstract
Image recognition has long been one of the research hotspots in computer vision tasks. The development of deep learning is rapid in recent years, and convolutional neural networks usually need to be designed with fixed resources. If sufficient resources are available, the model can be scaled up to achieve higher accuracy, for example, VggNet, ResNet, GoogLeNet, etc. Although the accuracy of large-scale models has been improved, the following problems will occur with the expansion of model scale: (1) There may be over-fitting; (2) increasing model parameters; (3) slow model convergence. This paper proposes a design method for a modular convolutional neural network model which solves the problem of over-fitting and large model parameters by connecting multiple modules in parallel. Moreover, each module contains several submodules (three submodules in this paper) and fuses the features extracted from the submodules. The model convergence can be accelerated by using the fused features (the fused features contain more image information). In this study, we add a gate unit based on the attention mechanism to the model, which aims to optimize the structure of the model (select the optimal number of modules), allowing the model to select an optimum network structure by learning and dynamically reducing FLOPs (floating-point operations per second) of the model. Compared to VggNet, ResNet, and GoogLeNet, the structure of the model proposed in this paper is simple and the parameters are small. The proposed model achieves good results in the Kaggle datasets Cats-vs.-Dogs (99.3%), 10-Monkey Species (99.26%), and Birds-400 (99.13%). [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
48. A Seam Tracking Method Based on an Image Segmentation Deep Convolutional Neural Network.
- Author
-
Lu, Jun, Yang, Aodong, Chen, Xiaoyu, Xu, Xingwang, Lv, Ri, and Zhao, Zhuang
- Subjects
ARTIFICIAL neural networks ,PLASMA arc welding ,CONVOLUTIONAL neural networks ,IMAGE segmentation ,DEEP learning ,FEATURE extraction - Abstract
Vision-based welding seam tracking is an important and unique branch of welding automation. Active vision seam tracking systems achieve accurate feature extraction by using an auxiliary light source, but this will introduce extra costs and the real-time performance will be affected. In contrast, passive vision systems achieve better real-time performance and their structure is relatively simple. This paper proposes a passive vision welding seam tracking system in Plasma Arc Welding (PAW) based on semantic segmentation. The BiseNetV2 network is adopted in this paper and online hard example mining (OHEM) is used to improve the segmentation effect. This network structure is a lightweight structure allowing effective image feature extraction. According to the segmentation results, the offset between the welding seam and the welding torch can be calculated. The results of the experiments show that the proposed method can achieve 57 FPS and the average error of the offset calculation is within 0.07 mm, meaning it can be used for real-time seam tracking. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
49. Study on Morphological Identification of Tight Oil Reservoir Residual Oil after Water Flooding in Secondary Oil Layers Based on Convolution Neural Network.
- Author
-
Zhao, Ling, Sun, Xianda, Liu, Fang, Wang, Pengzhen, and Chang, Lijuan
- Subjects
CONVOLUTIONAL neural networks ,PETROLEUM reservoirs ,GAS fields ,ARTIFICIAL neural networks ,BASE oils ,IMAGE recognition (Computer vision) ,OIL fields ,PETROLEUM - Abstract
In this paper, a microscopic oil displacement visualization experiment based on the glass etching model to simulate the tight oil reservoir of underground rocks is carried out. At present, water flooding technology is widely used in the development of oil and gas fields, and the remaining oil content is still very high after water flooding. It is the key to improving oil recovery to identify and study the remaining oil form distribution after water flooding. The experiment result shows there are five types of residual oil after water flooding: columnar residual oil, membranous residual oil, oil droplet residual oil, blind terminal residual oil and cluster residual oil. A convolution neural network is suitable for complex image characteristics with good robustness. In recent years, it has made a breakthrough in a set of small and efficient neural networks with SqueezeNet, Google Inception and the flattened network method put forward. In order to solve the problems of low automation, low efficiency and high error rate in the traditional remaining oil form recognition algorithm after water flooding in tight oil reservoirs, an image recognition algorithm based on the MobileNets convolutional neural network model was proposed in this paper to achieve accurate recognition of the remaining oil form. Based on traditional image processing methods which, respectively, extracted the whole picture of the different types of remaining oil in the image block, it uses the MobileNets network structure to classify different types of image block and realizes the layered depth convolution neural network system. The experiment result shows that the model can accurately identify the remaining oil forms, and the overall recognition accuracy is up to 83.8% after the convergence of the network model, which infinitely identifies the remaining oil forms in the morphological library, proving the strong generalization and robustness of the model. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
50. A Fault Diagnosis Algorithm for the Dedicated Equipment Based on the CNN-LSTM Mechanism.
- Author
-
Guo, Zhannan, Hao, Yinlin, Shi, Hanwen, Wu, Zhenyu, Wu, Yuhu, and Sun, Ximing
- Subjects
FAULT diagnosis ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,ALGORITHMS - Abstract
Dedicated equipment, which is widely used in many different types of vehicles, is the core system that determines the combat capability of special vehicles. Therefore, assuring the normal operation of dedicated equipment is crucial. With the increase in battlefield complexity, the demand for equipment functions is increasing, and the complexity of dedicated equipment is also increasing. To solve the problem of fault diagnosis of dedicated equipment, a fault diagnosis algorithm based on CNN-LSTM was proposed in this paper. CNN and LSTM are used in the model adopted by the algorithm to extract spatial and temporal features from the data. CBAM is used to enhance the model's accuracy in identifying faults for dedicated equipment. Data on dedicated equipment faults were obtained from a hardware-in-loop simulation platform to verify the model. It is demonstrated that the proposed fault diagnosis algorithm has high recognition ability for dedicated equipment by comparing it to other neural network models. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.