977 results
Search Results
2. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries.
- Author
-
Folks, Ryan D., Naik, Bhiken I., Brown, Donald E., and Durieux, Marcel E.
- Subjects
- *
MEDICAL records , *ARTIFICIAL neural networks , *COMPUTER vision , *DIASTOLIC blood pressure , *MEDICAL personnel , *DEEP learning , *SYSTOLIC blood pressure - Abstract
Background: In low-middle income countries, healthcare providers primarily use paper health records for capturing data. Paper health records are utilized predominately due to the prohibitive cost of acquisition and maintenance of automated data capture devices and electronic medical records. Data recorded on paper health records is not easily accessible in a digital format to healthcare providers. The lack of real time accessible digital data limits healthcare providers, researchers, and quality improvement champions to leverage data to improve patient outcomes. In this project, we demonstrate the novel use of computer vision software to digitize handwritten intraoperative data elements from smartphone photographs of paper anesthesia charts from the University Teaching Hospital of Kigali. We specifically report our approach to digitize checkbox data, symbol-denoted systolic and diastolic blood pressure, and physiological data. Methods: We implemented approaches for removing perspective distortions from smartphone photographs, removing shadows, and improving image readability through morphological operations. YOLOv8 models were used to deconstruct the anesthesia paper chart into specific data sections. Handwritten blood pressure symbols and physiological data were identified, and values were assigned using deep neural networks. Our work builds upon the contributions of previous research by improving upon their methods, updating the deep learning models to newer architectures, as well as consolidating them into a single piece of software. Results: The model for extracting the sections of the anesthesia paper chart achieved an average box precision of 0.99, an average box recall of 0.99, and an mAP0.5-95 of 0.97. Our software digitizes checkbox data with greater than 99% accuracy and digitizes blood pressure data with a mean average error of 1.0 and 1.36 mmHg for systolic and diastolic blood pressure respectively. Overall accuracy for physiological data which includes oxygen saturation, inspired oxygen concentration and end tidal carbon dioxide concentration was 85.2%. Conclusions: We demonstrate that under normal photography conditions we can digitize checkbox, blood pressure and physiological data to within human accuracy when provided legible handwriting. Our contributions provide improved access to digital data to healthcare practitioners in low-middle income countries. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Special Issue "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023".
- Author
-
Kim, Byung-Seo, Afzal, Muhammad Khalil, and Ullah, Rehmat
- Subjects
- *
MULTICASTING (Computer networks) , *INFORMATION technology , *SENSOR networks , *ARTIFICIAL neural networks , *DEEP learning , *BEAM steering , *INTEGRATED circuit design , *COMPUTER network security - Abstract
This document is a summary of a special issue of the journal Sensors, titled "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023." The special issue features selected papers from the 10th and 11th International Conferences on Green and Human Information Technology (ICGHITs), which were held in Korea and Thailand. The conferences focused on the theme of "Emerging Artificial Intelligent (AI)+X technology" and "Hyper Automation + Human AI" respectively. The selected papers cover various topics such as network security, routing protocols, signal detection, and clustering mechanisms, all incorporating AI-based methods. The issue also includes papers on topics like secure authentication, distance estimation in RFID systems, energy optimization in smart homes, blockchain technology, and radar signal detection. The authors emphasize the importance of both technology and humanity in advancing green and information technologies. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
4. Advanced Machine Learning and Deep Learning Approaches for Remote Sensing II.
- Author
-
Jeon, Gwanggil
- Subjects
REMOTE sensing ,MACHINE learning ,ARTIFICIAL neural networks ,DEEP learning ,ARTIFICIAL intelligence ,DISTANCE education - Abstract
This document is a summary of a special issue on advanced machine learning and deep learning techniques for remote sensing. The issue includes 16 research papers that cover a range of topics, including hyperspectral image classification, moving point target detection, radar echo extrapolation, and remote sensing object detection. Each paper introduces a novel approach or model and provides extensive testing and evaluation to demonstrate its effectiveness. The insights shared in this special issue are expected to contribute to future advancements in artificial intelligence-based remote sensing research. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
5. Special Issue: Artificial Intelligence Technology in Medical Image Analysis.
- Author
-
Szilágyi, László and Kovács, Levente
- Subjects
DEEP learning ,COMPUTER-assisted image analysis (Medicine) ,IMAGE analysis ,ARTIFICIAL intelligence ,DIAGNOSTIC imaging ,ARTIFICIAL neural networks - Abstract
This document is a summary of a special issue in the journal Applied Sciences titled "Artificial Intelligence Technology in Medical Image Analysis." The special issue explores the applications of artificial intelligence (AI) in medical imaging and its impact on diagnostic and therapeutic processes. The use of AI-powered tools in image interpretation has shown exceptional capabilities in detecting and diagnosing medical conditions from imaging data, particularly in radiology. AI also contributes to improving image quality, automating routine tasks, and streamlining healthcare workflows. However, challenges such as data privacy, ethics, and regulatory frameworks need to be addressed for responsible implementation. The special issue includes several research papers that present advancements in automated medical decision support, age estimation, quality assurance, orthotic insole recommendation, tumor identification, thalamus segmentation, medical image classification, hyperparameter optimization, lung disease classification, and thoracic cavity segmentation. These papers demonstrate the potential of AI in improving accuracy, efficiency, and personalized treatment in medical image analysis. The integration of AI into healthcare requires collaboration between AI researchers, healthcare professionals, and regulatory bodies to ensure responsible and effective deployment. The future of AI in medical image analysis holds promise for improved diagnostic accuracy, early disease detection, and personalized treatment strategies. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
6. Guest Editorial: Special issue on computational methods and artificial intelligence applications in low‐carbon energy systems.
- Author
-
Wang, Yishen, Zhou, Fei, Guerrero, Josep M., Baker, Kyri, Chen, Yize, Wang, Hao, Xu, Bolun, Xu, Qianwen, Zhu, Hong, and Agwan, Utkarsha
- Subjects
ARTIFICIAL intelligence ,ARTIFICIAL neural networks ,MACHINE learning ,REINFORCEMENT learning ,DEEP reinforcement learning ,DEEP learning - Abstract
This document is a guest editorial for a special issue on computational methods and artificial intelligence applications in low-carbon energy systems. The editorial highlights the urgent need for advanced computing and artificial intelligence in the clean energy transition to improve system reliability, economics, and sustainability. The special issue includes 19 original research articles covering topics such as energy forecasting, situational awareness, multi-energy system dispatch, and power system operation. The articles present state-of-the-art methods and techniques in these areas, including wind power forecasting, demand-side flexibility, fault diagnosis of photovoltaic strings, and energy management strategies. The authors express their gratitude to the participating authors and anonymous reviewers for their contributions to the special section. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
7. Guest Editorial: Special issue on advances in representation learning for computer vision.
- Author
-
Teoh, Andrew Beng Jin, Song Ong, Thian, Lim, Kian Ming, and Lee, Chin Poo
- Subjects
COMPUTER vision ,DEEP learning ,ARTIFICIAL neural networks ,IMAGE representation ,CONVOLUTIONAL neural networks ,IMAGE recognition (Computer vision) ,DATA privacy - Abstract
This document is a guest editorial for a special issue of the CAAI Transactions on Intelligence Technology journal. The special issue focuses on advances in representation learning for computer vision. The editorial highlights the success of deep learning methods in deriving powerful representations from visual data, but also acknowledges the challenges of conducting representation learning with deep models, especially with large and noisy datasets. The document provides summaries of several research papers included in the special issue, covering topics such as cancellable biometrics, medical image analysis, watermarking for medical images, facial pattern description, multi-biometric strategies, semantic segmentation, image enhancement, image classification, and hyperspectral image super-resolution. The authors express their hope that these papers will enhance readers' understanding of current trends and guide future research in the field. The document also includes brief biographies of the authors. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
8. Time Sequence Deep Learning Model for Ubiquitous Tabular Data with Unique 3D Tensors Manipulation.
- Author
-
Gicic, Adaleta, Đonko, Dženana, and Subasi, Abdulhamit
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,ALGORITHMS ,DATA modeling - Abstract
Although deep learning (DL) algorithms have been proved to be effective in diverse research domains, their application in developing models for tabular data remains limited. Models trained on tabular data demonstrate higher efficacy using traditional machine learning models than DL models, which are largely attributed to the size and structure of tabular datasets and the specific application contexts in which they are utilized. Thus, the primary objective of this paper is to propose a method to use the supremacy of Stacked Bidirectional LSTM (Long Short-Term Memory) deep learning algorithms in pattern discovery incorporating tabular data with customized 3D tensor modeling in feeding neural networks. Our findings are empirically validated using six diverse, publicly available datasets each varying in size and learning objectives. This paper proves that the proposed model based on time-sequence DL algorithms, which were generally described as inadequate when dealing with tabular data, yields satisfactory results and competes effectively with other algorithms specifically designed for tabular data. An additional benefit of this approach is its ability to preserve simplicity while ensuring fast model training also with large datasets. Even with extremely small datasets, models can be applied to achieve exceptional predictive results and fully utilize their capacity. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Editorial for the Special Issue "Data Science and Big Data in Biology, Physical Science and Engineering".
- Author
-
Mahmoud, Mohammed
- Subjects
PHYSICAL sciences ,BIG data ,DEEP learning ,ARTIFICIAL neural networks ,DATA science ,MACHINE learning ,REINFORCEMENT learning - Abstract
This document is an editorial for a special issue of the journal "Technologies" focused on data science and big data in various fields such as biology, physical science, and engineering. The editorial highlights the importance of analyzing large amounts of data generated by digital technologies and the need for data scientists to use artificial intelligence and machine learning to extract valuable knowledge. The special issue includes 12 papers covering topics such as machine learning techniques for customer churn prediction, agile program management in the U.S. Navy, deep learning for cybersecurity in Industry 5.0, self-directed learning during the COVID-19 era, decision tree-based neural networks for data classification, data-driven governance in technology companies, and more. The papers explore different approaches, models, and tools in the context of data science and big data. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
10. Special Issue on "Process Monitoring and Fault Diagnosis".
- Author
-
Ji, Cheng and Sun, Wei
- Subjects
ARTIFICIAL neural networks ,REMAINING useful life ,CONVOLUTIONAL neural networks ,PATTERN recognition systems ,TRANSFORMER models ,DEEP learning ,STATISTICAL learning ,WATER pipelines - Abstract
This document is a summary of a special issue of the journal Processes titled "Process Monitoring and Fault Diagnosis." The issue explores the application of data analytic techniques to enhance stable operation and safety in chemical processes and related industries. The collection of research papers covers various topics, including process fault detection, bearing fault diagnosis, remaining useful life prediction, and more. The papers introduce cutting-edge methodologies and demonstrate their reliability through validation. The issue aims to foster communication and the development of advanced process monitoring techniques. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
11. Maize Leaf Disease Recognition Based on Improved Convolutional Neural Network ShuffleNetV2.
- Author
-
Zhou, Hanmi, Su, Yumin, Chen, Jiageng, Li, Jichen, Ma, Linshuang, Liu, Xingyi, Lu, Sibo, and Wu, Qi
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,CORN diseases ,CORN ,PRECISION farming ,AGRICULTURAL development - Abstract
The occurrence of maize diseases is frequent but challenging to manage. Traditional identification methods have low accuracy and complex model structures with numerous parameters, making them difficult to implement on mobile devices. To address these challenges, this paper proposes a corn leaf disease recognition model SNMPF based on convolutional neural network ShuffleNetV2. In the down-sampling module of the ShuffleNet model, the max pooling layer replaces the deep convolutional layer to perform down-sampling. This improvement helps to extract key features from images, reduce the overfitting of the model, and improve the model's generalization ability. In addition, to enhance the model's ability to express features in complex backgrounds, the Sim AM attention mechanism was introduced. This mechanism enables the model to adaptively adjust focus and pay more attention to local discriminative features. The results on a maize disease image dataset demonstrate that the SNMPF model achieves a recognition accuracy of 98.40%, representing a 4.1 percentage point improvement over the original model, while its size is only 1.56 MB. Compared with existing convolutional neural network models such as EfficientNet, MobileViT, EfficientNetV2, RegNet, and DenseNet, this model offers higher accuracy and a more compact size. As a result, it can automatically detect and classify maize leaf diseases under natural field conditions, boasting high-precision recognition capabilities. Its accurate identification results provide scientific guidance for preventing corn leaf disease and promote the development of precision agriculture. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. A miRNA-disease association prediction model based on tree-path global feature extraction and fully connected artificial neural network with multi-head self-attention mechanism.
- Author
-
Biyu, Hou, Mengshan, Li, Yuxin, Hou, Ming, Zeng, Nan, Wang, and Lixin, Guan
- Subjects
ARTIFICIAL neural networks ,FEATURE extraction ,PREDICTION models ,DATA mining ,ASSOCIATION rule mining - Abstract
Background: MicroRNAs (miRNAs) emerge in various organisms, ranging from viruses to humans, and play crucial regulatory roles within cells, participating in a variety of biological processes. In numerous prediction methods for miRNA-disease associations, the issue of over-dependence on both similarity measurement data and the association matrix still hasn't been improved. In this paper, a miRNA-Disease association prediction model (called TP-MDA) based on tree path global feature extraction and fully connected artificial neural network (FANN) with multi-head self-attention mechanism is proposed. The TP-MDA model utilizes an association tree structure to represent the data relationships, multi-head self-attention mechanism for extracting feature vectors, and fully connected artificial neural network with 5-fold cross-validation for model training. Results: The experimental results indicate that the TP-MDA model outperforms the other comparative models, AUC is 0.9714. In the case studies of miRNAs associated with colorectal cancer and lung cancer, among the top 15 miRNAs predicted by the model, 12 in colorectal cancer and 15 in lung cancer were validated respectively, the accuracy is as high as 0.9227. Conclusions: The model proposed in this paper can accurately predict the miRNA-disease association, and can serve as a valuable reference for data mining and association prediction in the fields of life sciences, biology, and disease genetics, among others. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Intelligent Stroke Disease Prediction Model Using Deep Learning Approaches.
- Author
-
Gao, Chunhua, Wang, Hui, and Mezzapesa, Domenico Maria
- Subjects
STROKE diagnosis ,RISK assessment ,RANDOM forest algorithms ,PREDICTION models ,DATABASE management ,RESEARCH funding ,SYMPTOMS ,SUPPORT vector machines ,DEEP learning ,ARTIFICIAL neural networks ,STROKE ,COMPARATIVE studies ,MACHINE learning ,DECISION trees ,REGRESSION analysis ,ALGORITHMS ,DISEASE risk factors - Abstract
Stroke is a high morbidity and mortality disease that poses a serious threat to people's health. Early recognition of the various warning signs of stroke is necessary so that timely clinical intervention can help reduce the severity of stroke. Deep neural networks have powerful feature representation capabilities and can automatically learn discriminant features from large amounts of data. This paper uses a range of physiological characteristic parameters and collaborates with deep neural networks, such as the Wasserstein generative adversarial networks with gradient penalty and regression network, to construct a stroke prediction model. Firstly, to address the problem of imbalance between positive and negative samples in the stroke public data set, we performed positive sample data augmentation and utilized WGAN‐GP to generate stroke data with high fidelity and used it for the training of the prediction network model. Then, the relationship between observable physiological characteristic parameters and the predicted risk of suffering a stroke was modeled as a nonlinear mapping transformation, and a stroke prediction model based on a deep regression network was designed. Finally, the proposed method is compared with commonly used machine learning‐based classification algorithms such as decision tree, random forest, support vector machine, and artificial neural networks. The prediction results of the proposed method are optimal in the comprehensive measurement index F. Further ablation experiments also show that the designed prediction model has certain robustness and can effectively predict stroke diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Deep learning-based classification of anti-personnel mines and sub-gram metal content in mineralized soil (DL-MMD).
- Author
-
Minhas, Shahab Faiz, Shah, Maqsood Hussain, and Khaliq, Talal
- Subjects
METAL content of soils ,ARTIFICIAL neural networks ,SUPPORT vector machines ,K-nearest neighbor classification ,DEEP learning - Abstract
De-mining operations are of critical importance for humanitarian efforts and safety in conflict-affected regions. In this paper, we address the challenge of enhancing the accuracy and efficiency of mine detection systems. We present an innovative Deep Learning architecture tailored for pulse induction-based Metallic Mine Detectors (MMD), so called DL-MMD. Our methodology leverages deep neural networks to distinguish amongst nine distinct materials with an exceptional validation accuracy of 93.5%. This high level of precision enables us not only to differentiate between anti-personnel mines, without metal plates but also to detect minuscule 0.2-g vertical paper pins in both mineralized soil and non-mineralized environments. Moreover, through comparative analysis, we demonstrate a substantial 3% and 7% improvement (approx.) in accuracy performance compared to the traditional K-Nearest Neighbors and Support Vector Machine classifiers, respectively. The fusion of deep neural networks with the pulse induction-based MMD not only presents a cost-effective solution but also significantly expedites decision-making processes in de-mining operations, ultimately contributing to improved safety and effectiveness in these critical endeavors. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Closing Editorial for Computer Vision and Pattern Recognition Based on Deep Learning.
- Author
-
Yuan, Hui
- Subjects
PATTERN recognition systems ,VIDEO compression ,DEEP learning ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,OBJECT recognition (Computer vision) ,IMAGE recognition (Computer vision) - Abstract
This document is the closing editorial for a special issue of the journal Applied Sciences on computer vision and pattern recognition based on deep learning. The issue includes 31 papers covering various topics such as image and video processing, object detection, object and scene recognition, visual application technologies, classification, segmentation, compression, and more. Each paper explores different methods and techniques to improve the performance and accuracy of deep learning models in these areas. The document provides a brief summary of each paper, highlighting their contributions and findings. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
16. Editorial for the Special Issue "Machine Learning in Computer Vision and Image Sensing: Theory and Applications".
- Author
-
Chakraborty, Subrata and Pradhan, Biswajeet
- Subjects
COMPUTER vision ,MACHINE learning ,ARTIFICIAL neural networks ,DEEP learning ,CONVOLUTIONAL neural networks ,SIGNAL processing ,GAIT in humans - Abstract
This document is an editorial for a special issue titled "Machine Learning in Computer Vision and Image Sensing: Theory and Applications." The editorial highlights the diverse applications of machine learning (ML) models in various domains such as medical imaging, signal processing, remote sensing, and human activity detection. The special issue includes 11 papers that cover topics such as image segmentation, fluvial navigation, Alzheimer's disease classification, pneumothorax detection, lung cancer malignancy prediction, amniotic fluid volume detection, COVID-19 detection, and Parkinson's disease detection. The papers showcase the progress and potential of ML models in computer vision applications and provide valuable insights for future research. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
17. Correlation coefficients of vibration signals and machine learning algorithm for structural damage assessment in beams under moving load.
- Author
-
Toan Pham-Bao and Vien Le-Ngoc
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,FEEDFORWARD neural networks ,STRUCTURAL health monitoring ,IMPULSE response ,OPTIMIZATION algorithms ,WOODEN beams ,DEEP learning - Abstract
This scientific paper explores the use of correlation coefficients of vibration signals and machine learning algorithms for structural damage assessment in beams under moving loads. The paper discusses the challenges of maintaining structural integrity and the importance of automated, nondestructive monitoring techniques. Preprocessing techniques, such as the random decrement technique (RDT), are highlighted for improving data analysis. Machine learning algorithms are identified as valuable tools for structural damage assessment. The paper concludes by emphasizing the potential of machine learning in safeguarding critical infrastructures. The text also discusses trigger points and the vibration response of a slender beam under a moving load. An artificial neural network (ANN) is proposed as a computational model for identifying non-linear features. Experimental testing on a simulated bridge girder using accelerometers collected data to identify and locate damage in the beam. The ANN achieved high accuracy in detecting damage appearance and location, but further research is needed to improve accuracy in real-world situations. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
18. An experimental study of acoustic bird repellents for reducing bird encroachment in pear orchards.
- Author
-
Qing Chen, Jingjing Xie, Qiang Yu, Can Liu, Wenqin Ding, Xiaogang Li, and Hongping Zhou
- Subjects
ARTIFICIAL neural networks ,AGRICULTURAL economics ,PEST control ,AGRICULTURAL development ,CROP yields ,DEEP learning ,COMPUTER vision - Abstract
Bird invasion will reduce the yield of high-value crops, which threatens the healthy development of agricultural economy. Sonic bird repellent has the advantages of large range, no time and geographical restrictions, and low cost, which has attracted people's attention in the field of agriculture. At present, there are few studies on the application of sonic bird repellents in pear orchards to minimize economic losses and prolong the adaptive capacity of birds. In this paper, a sound wave bird repellent system based on computer vision is designed, which combines deep learning target recognition technology to accurately identify birds and drive them away. The neural network model that can recognize birds is first trained and deployed to the server. Live video is captured by an installed webcam, and the sonic bird repellent is powered by an ESP-8266 relay switch. In a pear orchard, two experimental areas were divided into two experimental areas to test the designed sonic bird repellent device, and the number of bad fruits pecked by birds was used as an indicator to evaluate the bird repelling effect. The results showed that the pear pecked fruit rate was 6.03% in the pear orchard area that used the acoustic bird repeller based on computer recognition, 7.29% in the pear orchard area of the control group that used the acoustic bird repeller with continuous operation, and 13.07% in the pear orchard area that did not use any bird repellent device. While acoustic bird repellers based on computer vision can be more effective at repelling birds, they can be used in combination with methods such as fruit bags to reduce the economic damage caused by birds. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Influence of Temperature on Brushless Synchronous Machine Field Winding Interturn Fault Severity Estimation.
- Author
-
Pascual, Rubén, Rivero, Eduardo, Guerrero, José M., Mahtani, Kumar, and Platero, Carlos A.
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,SYNCHRONOUS generators ,ERROR rates ,WINDING machines ,STATORS - Abstract
There are numerous methods for detecting interturn faults (ITFs) in the field winding of synchronous machines (SMs). One effective approach is based on comparing theoretical and measured excitation currents. This method is unaffected by rotor temperature in static excitation SMs. However, this paper investigates the influence of rotor temperature in brushless synchronous machines (BSMs), where rotor temperature significantly impacts the exciter excitation current. Extensive experimental tests were conducted on a special BSM with measurable rotor temperature. Given the challenges of measuring rotor temperature in industrial machines, this paper explores the feasibility of using stator temperature in the exciter field current estimation model. The theoretical exciter field current is calculated using a deep neural network (DNN), which incorporates electrical brushless synchronous generator output values and stator temperature, and it is subsequently compared with the measured exciter field current. This method achieves an error rate below 0.5% under healthy conditions, demonstrating its potential for simple implementation in industrial BSMs for ITF detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Short-time photovoltaic output prediction method based on depthwise separable convolution Visual Geometry group-deep gate recurrent neural network.
- Author
-
Lei Zhang, Shuang Zhao, Guanchao Zhao, Lingyi Wang, Baolin Liu, Zhimin Na, Zhijian Liu, Zhongming Yu, Wei He, Mrzljak, Vedran, and Ling-Ling Li
- Subjects
RECURRENT neural networks ,SOLAR technology ,ARTIFICIAL neural networks ,ELECTRIC power engineering ,GENERATIVE adversarial networks ,PHOTOVOLTAIC power generation ,PHOTOVOLTAIC power systems - Abstract
In response to the issue of short-term fluctuations in photovoltaic (PV) output due to Cloud movement, this paper proposes a method for forecasting short-term PV output based on a Depthwise Separable Convolution Visual Geometry Group (DSCVGG) and a Deep Gate Recurrent Neural Network (DGN). Initially, a cloud motion prediction model is constructed using a DSCVGG, which achieves edge recognition and motion prediction of clouds by replacing the previous convolution layer of the pooling layer in VGG with a depthwise separable convolution. Subsequently, the output results of the DSCVGG network, along with historical PV output data, are introduced into a Deep Gate Recurrent Unit Network (DGN) to establish a PV output prediction model, thereby achieving precise prediction of PV output. Through experiments on actual data, the Mean Absolute Error (MAE) and Mean Squared Error (MSE) of our model are only 2.18% and 5.32 x 10
-5 , respectively, which validates the effectiveness, accuracy, and superiority of the proposed method. This provides new insights and methods for improving the stability of PV power generation. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
21. BFNet: a full-encoder skip connect way for medical image segmentation.
- Author
-
Siyu Zhan, Quan Yuan, Xin Lei, Rui Huang, Lu Guo, Ke Liu, and Rong Chen
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,COMPUTER-assisted image analysis (Medicine) ,DEEP learning ,IMAGE segmentation - Abstract
In recent years, semantic segmentation in deep learning has been widely applied in medical image segmentation, leading to the development of numerous models. Convolutional Neural Network (CNNs) have achieved milestone achievements in medical image analysis. Particularly, deep neural networks based on U-shaped architectures and skip connections have been extensively employed in various medical image tasks. U-Net is characterized by its encoder-decoder architecture and pioneering skip connections, along with multi-scale features, has served as a fundamental network architecture for many modifications. But U-Net cannot fully utilize all the information from the encoder layer in the decoder layer. U-Net++ connects mid parameters of different dimensions through nested and dense skip connections. However, it can only alleviate the disadvantage of not being able to fully utilize the encoder information and will greatly increase the model parameters. In this paper, a novel BFNet is proposed to utilize all feature maps from the encoder at every layer of the decoder and reconnects with the current layer of the encoder. This allows the decoder to better learn the positional information of segmentation targets and improves learning of boundary information and abstract semantics in the current layer of the encoder. Our proposed method has a significant improvement in accuracy with 1.4 percent. Besides enhancing accuracy, our proposed BFNet also reduces network parameters. All the advantages we proposed are demonstrated on our dataset. We also discuss how different loss functions influence this model and some possible improvements. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. FN-GNN: A Novel Graph Embedding Approach for Enhancing Graph Neural Networks in Network Intrusion Detection Systems.
- Author
-
Tran, Dinh-Hau and Park, Minho
- Subjects
ARTIFICIAL neural networks ,GRAPH neural networks ,RECURRENT neural networks ,CONVOLUTIONAL neural networks ,DEEP learning ,INTRUSION detection systems (Computer security) - Abstract
With the proliferation of the Internet, network complexities for both commercial and state organizations have significantly increased, leading to more sophisticated and harder-to-detect network attacks. This evolution poses substantial challenges for intrusion detection systems, threatening the cybersecurity of organizations and national infrastructure alike. Although numerous deep learning techniques such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and graph neural networks (GNNs) have been applied to detect various network attacks, they face limitations due to the lack of standardized input data, affecting model accuracy and performance. This paper proposes a novel preprocessing method for flow data from network intrusion detection systems (NIDSs), enhancing the efficacy of a graph neural network model in malicious flow detection. Our approach initializes graph nodes with data derived from flow features and constructs graph edges through the analysis of IP relationships within the system. Additionally, we propose a new graph model based on the combination of the graph neural network (GCN) model and SAGEConv, a variant of the GraphSAGE model. The proposed model leverages the strengths while addressing the limitations encountered by the previous models. Evaluations on two IDS datasets, CICIDS-2017 and UNSW-NB15, demonstrate that our model outperforms existing methods, offering a significant advancement in the detection of network threats. This work not only addresses a critical gap in the standardization of input data for deep learning models in cybersecurity but also proposes a scalable solution for improving the intrusion detection accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Research on electric vehicle charging load prediction method based on spectral clustering and deep learning network.
- Author
-
Fang Xin, Xie Yang, Wang Beibei, Xu Ruilin, Mei Fei, and Zheng Jianyong
- Subjects
DEEP learning ,ELECTRIC charge ,ELECTRIC vehicles ,ARTIFICIAL neural networks ,STATISTICAL sampling ,CONVOLUTIONAL neural networks - Abstract
With the increasing prominence of environmental and energy issues, electric vehicles (EVs) as representatives of clean energy vehicles have experienced rapid development in recent years, and the charging load has also exhibited statistical characteristics. Accurate prediction of EV charging load is crucial to improve grid load dispatch and intelligent level. However, current research on EV charging load prediction still faces challenges such as data reliability, complexity and variability of charging behavior, uncertainty, and lack of standardization methods. Therefore, this paper proposes an electric vehicle charging load prediction method based on spectral clustering and deep learning network (SC-CNNLSTM). Firstly, to address the insufficient amount of EV charging load data, this paper proposes to use Monte Carlo simulation to sample and simulate historical load data. Then, in order to identify the internal structure and patterns of charging load, the sampled and simulated dataset is clustered using spectral clustering, dividing the data into different clusters, where each cluster represents samples with similar charging load characteristics. Finally, based on the different sample features of each cluster, corresponding CNN-LSTM models are constructed and trained and predict using the respective data. By modifying the model parameters, the prediction accuracy of the model is improved. Through comparative experiments, the proposed method in this paper has significantly improved prediction accuracy compared to traditional prediction methods without clustering, thus validating the effectiveness and practicality of the method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Cross-Parallel Attention and Efficient Match Transformer for Aerial Tracking.
- Author
-
Deng, Anping, Han, Guangliang, Zhang, Zhongbo, Chen, Dianbing, Ma, Tianjiao, and Liu, Zhichao
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,TRACKING radar ,TRACKING algorithms ,DRONE aircraft ,ARTIFICIAL intelligence - Abstract
Visual object tracking is a key technology that is used in unmanned aerial vehicles (UAVs) to achieve autonomous navigation. In recent years, with the rapid development of deep learning, tracking algorithms based on Siamese neural networks have received widespread attention. However, because of complex and diverse tracking scenarios, as well as limited computational resources, most existing tracking algorithms struggle to ensure real-time stable operation while improving tracking performance. Therefore, studying efficient and fast-tracking frameworks, and enhancing the ability of algorithms to respond to complex scenarios has become crucial. Therefore, this paper proposes a cross-parallel attention and efficient match transformer for aerial tracking (SiamEMT). Firstly, we carefully designed the cross-parallel attention mechanism to encode global feature information and to achieve cross-dimensional interaction and feature correlation aggregation via parallel branches, highlighting feature saliency and reducing global redundancy information, as well as improving the tracking algorithm's ability to distinguish between targets and backgrounds. Meanwhile, we implemented an efficient match transformer to achieve feature matching. This network utilizes parallel, lightweight, multi-head attention mechanisms to pass template information to the search region features, better matching the global similarity between the template and search regions, and improving the algorithm's ability to perceive target location and feature information. Experiments on multiple drone public benchmark tests verified the accuracy and robustness of the proposed tracker in drone tracking scenarios. In addition, on the embedded artificial intelligence (AI) platform AGX Xavier, our algorithm achieved real-time tracking speed, indicating that our algorithm can be effectively applied to UAV tracking scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. A Comparison of Neural-Network-Based Intrusion Detection against Signature-Based Detection in IoT Networks.
- Author
-
Schrötter, Max, Niemann, Andreas, and Schnor, Bettina
- Subjects
ARTIFICIAL neural networks ,INTERNET of things ,MACHINE learning - Abstract
Over the last few years, a plethora of papers presenting machine-learning-based approaches for intrusion detection have been published. However, the majority of those papers do not compare their results with a proper baseline of a signature-based intrusion detection system, thus violating good machine learning practices. In order to evaluate the pros and cons of the machine-learning-based approach, we replicated a research study that uses a deep neural network model for intrusion detection. The results of our replicated research study expose several systematic problems with the used datasets and evaluation methods. In our experiments, a signature-based intrusion detection system with a minimal setup was able to outperform the tested model even under small traffic changes. Testing the replicated neural network on a new dataset recorded in the same environment with the same attacks using the same tools showed that the accuracy of the neural network dropped to 54%. Furthermore, the often-claimed advantage of being able to detect zero-day attacks could not be seen in our experiments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Multimodal Deep Neural Networks for Digitized Document Classification.
- Author
-
Baimakhanova, Aigerim, Zhumadillayeva, Ainur, Mukhametzhanova, Bigul, Glazyrina, Natalya, Niyazova, Rozamgul, Zhunissov, Nurseit, and Sambetbayeva, Aizhan
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,DIGITIZATION ,DOCUMENT classification (Electronic documents) ,DIGITAL technology ,MACHINE learning - Abstract
As digital technologies have advanced more rapidly, the number of paper documents recently converted into a digital format has exponentially increased. To respond to the urgent need to categorize the growing number of digitized documents, the classification of digitized documents in real time has been identified as the primary goal of our study. A paper classification is the first stage in automating document control and efficient knowledge discovery with no or little human involvement. Artificial intelligence methods such as Deep Learning are now combined with segmentation to study and interpret those traits, which were not conceivable ten years ago. Deep learning aids in comprehending input patterns so that object classes may be predicted. The segmentation process divides the input image into separate segments for a more thorough image study. This study proposes a deep learning-enabled framework for automated document classification, which can be implemented in higher education. To further this goal, a dataset was developed that includes seven categories: Diplomas, Personal documents, Journal of Accounting of higher education diplomas, Service letters, Orders, Production orders, and Student orders. Subsequently, a deep learning model based on Conv2D layers is proposed for the document classification process. In the final part of this research, the proposed model is evaluated and compared with other machine-learning techniques. The results demonstrate that the proposed deep learning model shows high results in document categorization overtaking the other machine learning models by reaching 94.84%, 94.79%, 94.62%, 94.43%, 94.07% in accuracy, precision, recall, F-score, and AUC-ROC, respectively. The achieved results prove that the proposed deep model is acceptable to use in practice as an assistant to an office worker. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Enhancing Deep Learning and Computer Image Analysis in Petrography through Artificial Self-Awareness Mechanisms.
- Author
-
Dell'Aversana, Paolo
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,IMAGE analysis ,SELF-consciousness (Awareness) ,IMAGE recognition (Computer vision) ,PETROLOGY - Abstract
In this paper, we discuss the implementation of artificial self-awareness mechanisms and self-reflection abilities in deep neural networks. While the current limitations of research prevent achieving cognitive capabilities on par with natural biological entities, the incorporation of basic self-awareness and self-reflection mechanisms in deep learning architectures offers substantial advantages in tackling specific problems across various scientific fields, including geosciences. In the first section, we outline the foundational architecture of our deep learning approach termed Self-Aware Learning (SAL). The subsequent part of the paper highlights the practical benefits of this machine learning methodology through synthetic tests and applications addressed to automatic classification and image analysis of real petrological data sets. We show how Self-Aware Learning allows enhanced accuracy, reduced overfitting problems, and improved performances compared to other existing methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Deep Learning-based DSM Generation from Dual-Aspect SAR Data.
- Author
-
Recla, Michael and Schmitt, Michael
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,SYNTHETIC aperture radar ,DATA mining ,REMOTE sensing ,GEOMETRIC modeling - Abstract
Rapid mapping demands efficient methods for a fast extraction of information from satellite data while minimizing data requirements. This paper explores the potential of deep learning for the generation of high-resolution urban elevation data from Synthetic Aperture Radar (SAR) imagery. In order to mitigate occlusion effects caused by the side-looking nature of SAR remote sensing, two SAR images from opposing aspects are leveraged and processed in an end-to-end deep neural network. The presented approach is the first of its kind to implicitly handle the transition from the SAR-specific slant range geometry to a ground-based mapping geometry within the model architecture. Comparative experiments demonstrate the superiority of the dual-aspect fusion over single-image methods in terms of reconstruction quality and geolocation accuracy. Notably, the model exhibits robust performance across diverse acquisition modes and geometries, showcasing its generalizability and suitability for height mapping applications. The study's findings underscore the potential of deep learning-driven SAR techniques in generating high-quality urban surface models efficiently and economically. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. A Unique Identification-Oriented Black-Box Watermarking Scheme for Deep Classification Neural Networks.
- Author
-
Mo, Mouke, Wang, Chuntao, and Bian, Shan
- Subjects
ARTIFICIAL neural networks ,DIGITAL watermarking ,DISCRETE cosine transforms ,SINGULAR value decomposition ,DEEP learning ,IDENTIFICATION ,WATERMARKS - Abstract
Given the substantial value and considerable training costs associated with deep neural network models, the field of deep neural network model watermarking has come to the forefront. While black-box model watermarking has made commendable strides, the current methodology for constructing poisoned images in the existing literature is simplistic and susceptible to forgery. Notably, there is a scarcity of black-box model watermarking techniques capable of discerning a unique user in a multi-user model distribution setting. For this reason, this paper proposes a novel black-box model watermarking method for unique identity identification, which is denoted as the ID watermarking of neural networks (IDwNet). Specifically, to enhance the distinguishability of deep neural network models in multi-user scenarios and mitigate the likelihood of poisoned image counterfeiting, this study develops a discrete cosine transform (DCT) and singular value decomposition (SVD)-based symmetrical embedding method to form the poisoned image. As this ID embedding method leads to indistinguishable deep features, the study constructs a poisoned adversary training strategy by simultaneously inputting clean images, poisoned images with the correct ID, and poisoned adversary images with incorrect IDs to train a deep neural network. Extensive simulation experiments show that the proposed scheme achieves excellent invisibility for the concealed ID, surpassing remarkably the state-of-the-art. In addition, the proposed scheme obtains a validation success rate exceeding 99% for the poisoned images at the cost of a marginal classification accuracy reduction of less than 0.5%. Moreover, even though there is only a 1-bit discrepancy between IDs, the proposed scheme still results in an accurate validation of user copyright. These results indicate that the proposed scheme is promising. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. An Image Classification Method Based on Semi-Supervised Classification Learning and Convolutional Neural Networks.
- Author
-
Shi, Liyan and Chen, Hairui
- Subjects
SUPERVISED learning ,IMAGE recognition (Computer vision) ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DEEP learning - Abstract
This paper aims to propose an improved image classification model to reduce the cost of model construction. Aiming at the problem that network training usually requires the support of a large number of labeled samples, an image classification model based on semi-supervised deep learning is proposed, which uses labeled samples to guide the network to learn unlabeled samples. A convolutional neural network model for simultaneous processing of labeled and unlabeled data is constructed. The tagged data is used to train the Softmax classifier and provide the initial K-means clustering center for the untagged data. The nonsubsampling contourlet layer is used to replace the first convolutional layer of the full convolutional neural network to extract multi-scale depth features, and the nonsubsampling contourlet full convolutional neural network is constructed. The network can extract multi-scale information of the images to be classified, and extract more discriminative deep image features. In addition, the parameters of the nonsubsampled contourlet layers are pre-set and do not require network training. The proposed method has higher classification accuracy than the contrast method on polarimetric SAR images using the nonsubsampled contourlet full convolutional neural network. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. Ensemble Deep Learning Technique for Detecting MRI Brain Tumor.
- Author
-
Jader, Rasool Fakhir, Kareem, Shahab Wahhab, Awla, Hoshang Qasim, and Ashraf, Imran
- Subjects
MACHINE learning ,ARTIFICIAL neural networks ,MAGNETIC resonance imaging ,DEEP learning ,EMPLOYEE handbooks - Abstract
The classification process of MRI (magnetic resonance imaging) is frequently used for making medical diagnoses for conditions including pituitary, glioma, meningioma, and no tumor. For this reason, determining the type of MRI and its quantity are significant and valuable measurements that reveal the brain's state of health. To segment and classify brain analysis, laboratory personnel employ manual examination via screen; this requires a lot of labour and time. On the other hand, the devices used by specialists are not practical or inexpensive for every doctor or institution. In recent years, a variety of computational algorithms for segmentation and classification have been developed with improved results to get around the issue. Artificial neural networks (ANNs) have the capability and promise to classify in this regard. The purpose of this paper is to create and put into practice a system for classifying different types of MRI images of brain tumor samples. As a result, this paper concentrated on the tasks of segmentation, feature extraction, classifier building, and classification into four categories using various machine learning algorithms. The authors used VGG‐16, ResNet‐50, and AlexNet models based on the transfer learning algorithm for three models to classify images as an ensemble model. As a result, MRI brain tumor segmentation is more precise because each spatial feature point can now refer to all other contextual data. In the specifics, our models outperformed every other published modern ensemble model in the official deep learning challenge without any postprocessing. The ensemble model achieved an accuracy of 99.16%, a sensitivity of 98.47%, a specificity of 98.57%, a precision of 98.74%, a recall of 98.49%, and an F1‐score of 98.18%. These results significantly surpass the accuracy of other methods such as Naive Bayes, decision tree classifier, random forest, and DNN models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Generative Adversarial Networks for SAR Automatic Target Recognition and Classification Models Enhanced Explainability: Perspectives and Challenges.
- Author
-
Remusati, Héloïse, Le Caillec, Jean-Marc, Schneider, Jean-Yves, Petit-Frère, Jacques, and Merlet, Thomas
- Subjects
ARTIFICIAL neural networks ,GENERATIVE adversarial networks ,AUTOMATIC target recognition ,SYNTHETIC aperture radar ,DEEP learning - Abstract
Generative adversarial networks (or GANs) are a specific deep learning architecture often used for different usages, such as data generation or image-to-image translation. In recent years, this structure has gained increased popularity and has been used in different fields. One area of expertise currently in vogue is the use of GANs to produce synthetic aperture radar (SAR) data, and especially expand training datasets for SAR automatic target recognition (ATR). In effect, the complex SAR image formation makes these kind of data rich in information, leading to the use of deep networks in deep learning-based methods. Yet, deep networks also require sufficient data for training. However, contrary to optical images, we generally do not have a substantial number of available SAR images because of their acquisition and labelling cost; GANs are then an interesting tool. Concurrently, how to improve explainability for SAR ATR deep neural networks and how to make their reasoning more transparent have been increasingly explored as model opacity deteriorates trust of users. This paper aims at reviewing how GANs are used with SAR images, but also giving perspectives on how GANs could be used to improve interpretability and explainability of SAR classifiers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Short-Term Photovoltaic Power Generation Based on MVMD Feature Extraction and Informer Model.
- Author
-
Xu, Ruilin, Zheng, Jianyong, Mei, Fei, Yang, Xie, Wu, Yue, and Zhang, Heng
- Subjects
ARTIFICIAL neural networks ,PHOTOVOLTAIC power generation ,DEEP learning ,POWER series ,STATISTICAL correlation - Abstract
Photovoltaic (PV) power fluctuates with weather changes, and traditional forecasting methods typically decompose the power itself to study its characteristics, ignoring the impact of multidimensional weather conditions on the power decomposition. Therefore, this paper proposes a short-term PV power generation method based on MVMD (multivariate variational mode decomposition) feature extraction and the Informer model. First, MIC correlation analysis is used to extract weather features most related to PV power. Next, to more comprehensively describe the relationship between PV power and environmental conditions, MVMD is used for time–frequency synchronous analysis of the PV power time series combined with the highest MIC correlation weather data, obtaining frequency-aligned multivariate intrinsic modes. These modes incorporate multidimensional weather factors into the data-decomposition-based forecasting method. Finally, to enhance the model's learning capability, the Informer neural network model is employed in the prediction phase. Based on the input PV IMF time series and associated weather mode components, the Informer prediction model is constructed for training and forecasting. The predicted results of different PV IMF modes are then superimposed to obtain the total PV power generation. Experiments show that this method improves PV power generation accuracy, with an MAPE value of 4.31%, demonstrating good robustness. In terms of computational efficiency, the Informer model's ability to handle long sequences with sparse attention mechanisms reduces training and prediction times by approximately 15%, making it faster than conventional deep learning models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Triple-0: Zero-shot denoising and dereverberation on an end-to-end frozen anechoic speech separation network.
- Author
-
Gul, Sania, Khan, Muhammad Salman, and Ur-Rehman, Ata
- Subjects
REVERBERATION time ,ARTIFICIAL neural networks ,SPEECH ,MACHINE learning ,SPEECH enhancement ,DEEP learning - Abstract
Speech enhancement is crucial both for human and machine listening applications. Over the last decade, the use of deep learning for speech enhancement has resulted in tremendous improvement over the classical signal processing and machine learning methods. However, training a deep neural network is not only time-consuming; it also requires extensive computational resources and a large training dataset. Transfer learning, i.e. using a pretrained network for a new task, comes to the rescue by reducing the amount of training time, computational resources, and the required dataset, but the network still needs to be fine-tuned for the new task. This paper presents a novel method of speech denoising and dereverberation (SD&D) on an end-to-end frozen binaural anechoic speech separation network. The frozen network requires neither any architectural change nor any fine-tuning for the new task, as is usually required for transfer learning. The interaural cues of a source placed inside noisy and echoic surroundings are given as input to this pretrained network to extract the target speech from noise and reverberation. Although the pretrained model used in this paper has never seen noisy reverberant conditions during its training, it performs satisfactorily for zero-shot testing (ZST) under these conditions. It is because the pretrained model used here has been trained on the direct-path interaural cues of an active source and so it can recognize them even in the presence of echoes and noise. ZST on the same dataset on which the pretrained network was trained (homo-corpus) for the unseen class of interference, has shown considerable improvement over the weighted prediction error (WPE) algorithm in terms of four objective speech quality and intelligibility metrics. Also, the proposed model offers similar performance provided by a deep learning SD&D algorithm for this dataset under varying conditions of noise and reverberations. Similarly, ZST on a different dataset has provided an improvement in intelligibility and almost equivalent quality as provided by the WPE algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Modified receiver architecture in software-defined radio for real-time modulation classification.
- Author
-
Le, Quoc Nam, Huynh, Tan Quoc, Ta, Hien Quang, Tan, Phuoc Vo, and Nguyen, Lap Luat
- Subjects
SOFTWARE radio ,ELECTRONIC modulation ,ARTIFICIAL neural networks ,DEEP learning ,SPECTRUM allocation ,TELECOMMUNICATION systems - Abstract
Automatic modulation classification (AMC) is an important process for future communication systems with prominent applications from spectrum management, and secure communication, to cognitive radio. The requirement for an efficient AMC classifier is due to its capability in blind modulation recognition, which is a difficult task in real scenarios where the limitations of traditional hardware and the complexity of channel impairments are involved. Therefore, this paper proposes a complete real-time AMC system based on software-defined radio and deep learning architecture. The system demodulation performance is verified through simulations and real channel impairment conditions to ensure reliability. With at most 6 times reduced number of parameters, two proposed models convolutional long short-term memory deep neural network and residual long short-term memory neural network also show a general improvement in classification accuracy compared with reference studies. The performance of these models at real-time AMC is tested with suitable processing time for practical applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Multi-Directional Long-Term Recurrent Convolutional Network for Road Situation Recognition.
- Author
-
Dofitas Jr., Cyreneo, Gil, Joon-Min, and Byun, Yung-Cheol
- Subjects
ARTIFICIAL neural networks ,RECURRENT neural networks ,PEDESTRIANS ,ROAD safety measures ,CONVOLUTIONAL neural networks ,DEEP learning - Abstract
Understanding road conditions is essential for implementing effective road safety measures and driving solutions. Road situations encompass the day-to-day conditions of roads, including the presence of vehicles and pedestrians. Surveillance cameras strategically placed along streets have been instrumental in monitoring road situations and providing valuable information on pedestrians, moving vehicles, and objects within road environments. However, these video data and information are stored in large volumes, making analysis tedious and time-consuming. Deep learning models are increasingly utilized to monitor vehicles and identify and evaluate road and driving comfort situations. However, the current neural network model requires the recognition of situations using time-series video data. In this paper, we introduced a multi-directional detection model for road situations to uphold high accuracy. Deep learning methods often integrate long short-term memory (LSTM) into long-term recurrent network architectures. This approach effectively combines recurrent neural networks to capture temporal dependencies and convolutional neural networks (CNNs) to extract features from extensive video data. In our proposed method, we form a multi-directional long-term recurrent convolutional network approach with two groups equipped with CNN and two layers of LSTM. Additionally, we compare road situation recognition using convolutional neural networks, long short-term networks, and long-term recurrent convolutional networks. The paper presents a method for detecting and recognizing multi-directional road contexts using a modified LRCN. After balancing the dataset through data augmentation, the number of video files increased, resulting in our model achieving 91% accuracy, a significant improvement from the original dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. A Deep-Learning-Based Method for Spectrum Sensing with Multiple Feature Combination.
- Author
-
Zhang, Yixuan and Luo, Zhongqiang
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DEEP learning ,COGNITIVE radio ,STATISTICAL learning ,RADIO networks - Abstract
Cognitive radio networks enable the detection and opportunistic access to an idle spectrum through spectrum-sensing technologies, thus providing services to secondary users. However, at a low signal-to-noise ratio (SNR), existing spectrum-sensing methods, such as energy statistics and cyclostationary detection, tend to fail or become overly complex, limiting their sensing accuracy in complex application scenarios. In recent years, the integration of deep learning with wireless communications has shown significant potential. Utilizing neural networks to learn the statistical characteristics of signals can effectively adapt to the changing communication environment. To enhance spectrum-sensing performance under low-SNR conditions, this paper proposes a deep-learning-based spectrum-sensing method that combines multiple signal features, including energy statistics, power spectrum, cyclostationarity, and I/Q components. The proposed method used these combined features to form a specific matrix, which was then efficiently learned and detected through the designed 'SenseNet' network. Experimental results showed that at an SNR of −20 dB, the SenseNet model achieved a 58.8% spectrum-sensing accuracy, which is a 3.3% improvement over the existing convolutional neural network model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Data modeling analysis of GFRP tubular filled concrete column based on small sample deep meta learning method.
- Author
-
Deng, Tianyi, Xue, Chengqi, and Zhang, Gengpei
- Subjects
ARTIFICIAL neural networks ,CONCRETE columns ,DATA modeling ,DATA analysis ,COMPOSITE columns ,DEEP learning ,DATA augmentation - Abstract
The meta-learning method proposed in this paper addresses the issue of small-sample regression in the application of engineering data analysis, which is a highly promising direction for research. By integrating traditional regression models with optimization-based data augmentation from meta-learning, the proposed deep neural network demonstrates excellent performance in optimizing glass fiber reinforced plastic (GFRP) for wrapping concrete short columns. When compared with traditional regression models, such as Support Vector Regression (SVR), Gaussian Process Regression (GPR), and Radial Basis Function Neural Networks (RBFNN), the meta-learning method proposed here performs better in modeling small data samples. The success of this approach illustrates the potential of deep learning in dealing with limited amounts of data, offering new opportunities in the field of material data analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. A Comprehensive Analysis Of Diverse Image Processing Techniques In Agriculture.
- Author
-
Goyal, Punam and Gill, Jasmeen
- Subjects
AGRICULTURAL technology ,IMAGE processing ,IMAGE analysis ,TECHNOLOGICAL progress ,DEEP learning ,ARTIFICIAL neural networks ,TECHNOLOGICAL innovations ,PLANT diseases - Abstract
Agriculture plays a crucial role in fostering sustainable growth through the integration of various technological advancements such as image processing, artificial intelligence, deep learning, and the Internet of Things (IoT). The global population is increasing on a daily basis. The increasing demand within the agriculture industry has necessitated the collective enhancement of plant cultivation and field productivity. This paper emphasizes the significance of effectively managing the crop during its initial growth phase as well as during the harvesting era. Image processing and artificial neural networks are employed as distinct methodologies for detecting illnesses on leaves. When capturing images using drones, the resulting images undergo a process of segmentation and transformation, resulting in the identification of three distinct vectors that represent diseases. These vectors include colour, texture, and morphology. This paper reviews on various disease classification strategies that can be utilized for the detection of plant diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2024
40. Slip Tendency Analysis From Sparse Stress and Satellite Data Using Physics‐Guided Deep Neural Networks.
- Author
-
Poulet, Thomas and Behnoudfar, Pouria
- Subjects
ARTIFICIAL neural networks ,GLOBAL Positioning System ,DEEP learning ,GEOLOGICAL statistics ,FEMORAL epiphysis ,CONTINENTAL drift ,DISPLACEMENT (Psychology) - Abstract
The significant risk associated with fault reactivation often necessitates slip tendency analyses for effective risk assessment. However, such analyses are challenging, particularly in large areas with limited or absent reliable stress measurements and where the cost of extensive geomechanical analyses or simulations is prohibitive. In this paper, we propose a novel approach using a physics‐informed neural network that integrates stress orientation and satellite displacement observations in a top‐down multi‐scale framework to estimate two‐dimensional slip tendency analyses even in regions lacking comprehensive stress data. Our study demonstrates that velocities derived from a continental scale analysis, combined with reliable stress orientation averages, can effectively guide models at smaller scales to generate qualitative slip tendency maps. By offering customizable data selection and stress resolution options, this method presents a robust solution to address data scarcity issues, as exemplified through a case study of the South Australian Eyre Peninsula. Plain Language Summary: Fault reactivation poses significant risks, often requiring slip tendency analyses for thorough risk assessment. Yet, such analyses face challenges, especially in large areas lacking reliable stress measurements or where extensive geomechanical analyses are too costly. Our paper suggests a new method using a physics‐based neural network. This approach combines compressive direction and satellite displacement observations to estimate slip tendencies in two dimensions, even where stress data is lacking. Our study shows that by using displacements from a continental scale analysis and reliable averages of compressive directions, we can guide models to create smaller‐scale maps indicating where faults are more likely to reactivate. This method allows for customizable data selection and stress resolution, offering a strong solution to data scarcity issues. We demonstrate its effectiveness through a case study of South Australia's Eyre Peninsula. Key Points: Physics‐based neural networks allow two‐dimensional slip tendency analyses without prior full‐stress informationA multi‐scale approach provides required displacement constraints when inferring full stresses from global navigation satellite system (GNSS) and stress orientation dataWe present a new application for GNSS data that would welcome more stations, even in seismically stable areas [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Effect Evaluation of Staged Fracturing and Productivity Prediction of Horizontal Wells in Tight Reservoirs.
- Author
-
Zhang, Yuan, Chen, Jianyang, Wu, Zhongbao, Xiao, Yuxiang, Xu, Ziyi, Cheng, Hanlie, and Zhang, Bin
- Subjects
HORIZONTAL wells ,ARTIFICIAL neural networks ,DEEP learning - Abstract
In this paper, the effect evaluation and production prediction of staged fracturing for horizontal wells in tight reservoirs are studied. Firstly, the basic characteristics and value of horizontal wells in tight reservoirs are introduced, their geological characteristics, flow mechanism and permeability model are analyzed and the application of grey theory in effect analysis is discussed. Considering the problems of staged fracturing effect evaluation and the production prediction of horizontal wells in tight reservoirs, a BP neural network model based on deep learning is proposed. Due to the interference of multiple physical parameters and the complex functional relationship in the development of tight reservoir fracturing, the traditional prediction method has low accuracy and it is difficult to establish an accurate mapping relationship. In this paper, a BP neural network is used to simulate multivariable nonlinear mapping by modifying the model, and its advantages in solving the coupling relationship of complex functions are brought into play. A neural network model with fracturing parameters as input and oil and gas production as output is designed. Through the training and testing of data sets, the accuracy and applicability of the proposed model for effect evaluation and yield prediction are verified. The research results show that the model can fit the complex mapping relationship between fracturing information and production and provide an effective evaluation and prediction tool for the development of the staged fracturing of horizontal wells in tight reservoirs. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Advanced Wireless Sensor Networks: Applications, Challenges and Research Trends.
- Author
-
Kandris, Dionisis and Anastasiadis, Eleftherios
- Subjects
WIRELESS sensor networks ,K-nearest neighbor classification ,WIRELESS LANs ,DEEP learning ,AD hoc computer networks ,ARTIFICIAL neural networks ,WIRELESS Internet - Abstract
This document serves as an introduction to a special issue on wireless sensor networks (WSNs), explaining their structure and applications in various sectors. It also discusses the challenges faced by WSNs, such as limited energy and data management. The special issue features ten research papers on advanced WSNs. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
43. Online modeling method for composite load model including EVs and battery storage based on measurement data.
- Author
-
Yin, Yanhe, Zhong, Yi, He, Yi, Li, Guohao, Li, Zhuohuan, Pan, Shixian, Xiao, Dongliang, Han, Jintao, Zhang, Fei, and Kiranmayi, R.
- Subjects
ELECTRIC charge ,STORAGE batteries ,DEEP learning ,ARTIFICIAL neural networks ,REINFORCEMENT learning ,DEEP reinforcement learning ,MACHINE learning - Abstract
Load models have a significant influence on power system simulation. However, current load modeling approaches can hardly satisfy the diversity and time- varying characteristics of loads [including electric vehicles (EVs) and battery storage] in terms of model accuracy and computing efficiency. An online modeling method for composite load models based on measurement information is proposed in this paper. Firstly, the dominant factors in load model output are analyzed based on the active subspace of parameter space. Then the clustering algorithm is applied to cluster the large number of underlying loads based on the characteristics of load daily output curves. Finally, the underlying loads are equivalently aggregated from the low voltage levels to the high voltage levels to construct the composite load model. Simulation results obtained based on PSCAD/EMTDC demonstrate that the load model constructed by the proposed approach can accurately reflect the actual load characteristics of a power system. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. ECG autoencoder based on low-rank attention.
- Author
-
Zhang, Shilin, Fang, Yixian, and Ren, Yuwei
- Subjects
ARTIFICIAL neural networks ,SINGULAR value decomposition ,ELECTROCARDIOGRAPHY ,DISEASE prevalence ,DEEP learning - Abstract
The prevalence of cardiovascular disease (CVD) has surged in recent years, making it the foremost cause of mortality among humans. The Electrocardiogram (ECG), being one of the pivotal diagnostic tools for cardiovascular diseases, is increasingly gaining prominence in the field of machine learning. However, prevailing neural network models frequently disregard the spatial dimension features inherent in ECG signals. In this paper, we propose an ECG autoencoder network architecture incorporating low-rank attention (LRA-autoencoder). It is designed to capture potential spatial features of ECG signals by interpreting the signals from a spatial perspective and extracting correlations between different signal points. Additionally, the low-rank attention block (LRA-block) obtains spatial features of electrocardiogram signals through singular value decomposition, and then assigns these spatial features as weights to the electrocardiogram signals, thereby enhancing the differentiation of features among different categories. Finally, we utilize the ResNet-18 network classifier to assess the performance of the LRA-autoencoder on both the MIT-BIH Arrhythmia and PhysioNet Challenge 2017 datasets. The experimental results reveal that the proposed method demonstrates superior classification performance. The mean accuracy on the MIT-BIH Arrhythmia dataset is as high as 0.997, and the mean accuracy and F 1 -score on the PhysioNet Challenge 2017 dataset are 0.850 and 0.843. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. Editorial: Deep learning for marine science.
- Author
-
Haiyong Zheng, Hongsheng Bi, Xuemin Cheng, and Benfield, Mark C.
- Subjects
DEEP learning ,MARINE sciences ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,UNDERWATER imaging systems ,CORAL trout ,GENERATIVE adversarial networks - Abstract
This document is an editorial published in the journal Frontiers in Marine Science, discussing the application of deep learning technology in marine science research. It highlights the potential of deep learning in various research fields, including biology, ecosystems, climate, energy, and physical and chemical interactions. The document also provides a summary of 39 research papers published in the journal, covering topics such as marine image enhancement, underwater visual recognition, dataset and labeling, and marine process prediction. The papers explore different methods and techniques to improve the quality and analysis of underwater images and data. The studies aim to enhance our understanding of marine life and ecosystems while also addressing challenges in data collection and analysis. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
46. Deep Time Series Forecasting Models: A Comprehensive Survey.
- Author
-
Liu, Xinhe and Wang, Wenmin
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,TIME series analysis ,CONVOLUTIONAL neural networks ,ARTIFICIAL intelligence ,LANGUAGE models - Abstract
Deep learning, a crucial technique for achieving artificial intelligence (AI), has been successfully applied in many fields. The gradual application of the latest architectures of deep learning in the field of time series forecasting (TSF), such as Transformers, has shown excellent performance and results compared to traditional statistical methods. These applications are widely present in academia and in our daily lives, covering many areas including forecasting electricity consumption in power systems, meteorological rainfall, traffic flow, quantitative trading, risk control in finance, sales operations and price predictions for commercial companies, and pandemic prediction in the medical field. Deep learning-based TSF tasks stand out as one of the most valuable AI scenarios for research, playing an important role in explaining complex real-world phenomena. However, deep learning models still face challenges: they need to deal with the challenge of large-scale data in the information age, achieve longer forecasting ranges, reduce excessively high computational complexity, etc. Therefore, novel methods and more effective solutions are essential. In this paper, we review the latest developments in deep learning for TSF. We begin by introducing the recent development trends in the field of TSF and then propose a new taxonomy from the perspective of deep neural network models, comprehensively covering articles published over the past five years. We also organize commonly used experimental evaluation metrics and datasets. Finally, we point out current issues with the existing solutions and suggest promising future directions in the field of deep learning combined with TSF. This paper is the most comprehensive review related to TSF in recent years and will provide a detailed index for researchers in this field and those who are just starting out. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Ancient mural dynasty recognition algorithm based on a neural network architecture search.
- Author
-
Cao, Jianfang, Jin, Mengyan, Tian, Yun, Cao, Zhen, and Peng, Cunhe
- Subjects
ARTIFICIAL neural networks ,MURAL art ,RECOGNITION (Psychology) ,IMAGE recognition (Computer vision) ,ALGORITHMS - Abstract
A neural network model needs to be manually designed for ancient mural dynasty recognition, and this paper proposes an ancient mural dynasty recognition algorithm that is based on a neural architecture search (NAS). First, the structural edge information of mural images is extracted for use by the neural network model in recognizing mural missions. Second, an NAS algorithm that is based on contrast selection (CS) simplifies the architecture search to an incremental CS and then searches for the optimal network architecture on the mural dataset. Finally, the identified optimal network architecture is used for training and testing to complete the mural dynasty recognition task. The results show that the top accuracy of the proposed method on the mural dataset is 88.10%, the recall rate is 87.52%, and the precision rate is 87.69%. Each evaluation index used by the neural network model is superior to that of classical network models such as AlexNet and ResNet-50. Compared with NAS methods such as ASNG and MIGO, the accuracy of mural dynasty recognition is higher by an average of 4.27% when using the proposed method. The proposed method is verified on CIFAR-10, CIFAR-100, ImageNet16-120 and other datasets and achieves a good recognition accuracy in the NAS-bench-201 search space, which averages 93.26%, 70.73% and 45.34%, respectively, on the abovementioned datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. Analysis of slope stochastic fields using a novel deep learning model with attention mechanism.
- Author
-
Ning Ma and Zaizhen Yao
- Subjects
DEEP learning ,STOCHASTIC analysis ,MONTE Carlo method ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,POLYNOMIAL chaos ,POISSON'S ratio - Abstract
This paper proposes a novel deep learning model incorporating attention mechanisms for the analysis of slope stochastic fields. Initially, a deep learning model is designed to digitally image the stochastic field features of soil strength variability. This is achieved by discretizing the slope soil stochastic field using the Karhunen-Loeve expansion method and transforming the discrete results into digital images. These images are then used to establish a Convolutional Neural Network (CNN) surrogate model that maps the implicit relationship between stochastic field images and slope functional function values, thus calculating the probability of slope failure. The precision of the CNN surrogate model is enhanced through Bayesian optimization and five-fold cross-validation. Moreover, to overcome the limitations of existing data-driven landslide stability prediction models, this study also introduces a Spatial-Temporal Attention (STA) mechanism. By combining the CNN with Long Short-Term Memory (LSTM) networks, the model can accurately approximate the actual results of slope stability calculations in scenarios of high-dimensional representation imaging of stochastic fields and low-probability slope instability. Consequently, this significantly improves the computational efficiency of slope reliability analysis considering stochastic field simulations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. A Multi-Stance Detection Method by Fusing Sentiment Features.
- Author
-
Huang, Weidong and Yang, Jinyuan
- Subjects
ARTIFICIAL neural networks ,FEATURE extraction ,MARKETING strategy ,PUBLIC opinion ,GOVERNMENT policy - Abstract
Stance information has a significant influence on market strategy, government policy, and public opinion. Users differ not only in their polarity but also in the degree to which they take a stand. The traditional classification of stances is quite simple and cannot fully depict the diversity of stances. At the same time, traditional approaches ignore user sentiment features when expressing their stances. As a result, this paper develops a multi-stance detection model by fusing sentiment features. First, a five-category stance indicator system is built based on the LDA model, then sentiment features are extracted from the reviews using the sentiment lexicon, and finally, stance detection is implemented using a hybrid neural network model. The experiment shows that the proposed method can classify stances into five categories and perform stance detection more accurately. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. Water Pipeline Leakage Detection Based on Coherent φ-OTDR and Deep Learning Technology.
- Author
-
Zhang, Shuo, Xiong, Zijian, Ji, Boyuan, Li, Nan, Yu, Zhangwei, Wu, Shengnan, and He, Sailing
- Subjects
WATER pipelines ,WATER leakage ,LEAK detection ,DEEP learning ,OPTICAL time-domain reflectometry ,ARTIFICIAL neural networks - Abstract
Leakage in water supply pipelines remains a significant challenge. It leads to resource and economic waste. Researchers have developed several leak detection methods, including the use of embedded sensors and pressure prediction. The former approach involves pre-installing detectors inside pipelines to detect leaks. This method allows for the precise localization of leak points. The stability is compromised because of the wireless signal strength. The latter approach, which relies on pressure measurements to predict leak events, does not achieve precise leak point localization. To address these challenges, in this paper, a coherent optical time-domain reflectometry (φ-OTDR) system is employed to capture vibration signal phase information. Subsequently, two pre-trained neural network models based on CNN and Resnet18 are responsible for processing this information to accurately identify vibration events. In an experimental setup simulating water pipelines, phase information from both leaking and non-leaking pipe segments is collected. Using this dataset, classical CNN and ResNet18 models are trained, achieving accuracy rates of 99.7% and 99.5%, respectively. The multi-leakage point experiment results indicate that the Resnet18 model has better generalization compared to the CNN model. The proposed solution enables long-distance water-pipeline precise leak point localization and accurate vibration event identification. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.