Descriptor: "bidirectional gated recurrent unit" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"bidirectional gated recurrent unit"' showing total 346 results

Start Over Descriptor "bidirectional gated recurrent unit"

346 results on '"bidirectional gated recurrent unit"'

1. DHG-BiGRU: Dual-attention based hierarchical gated BiGRU for software defect prediction

Author: Malhotra, Ruchika and Singh, Priya
Published: 2025
Full Text: View/download PDF

2. Industrial process fault diagnosis using dilated convolutional stacking bidirectional gated recurrent unit with high and low-order feature fusion

Author: Zhu, Yaoqian, Zhang, Ridong, and Gao, Furong
Published: 2025
Full Text: View/download PDF

3. A hybrid optimization method based on DBO-tuning BiGRU assisted AKF for seamless INS/GPS navigation.

Author: Wei, Xiaokai, Lang, Ping, Wang, Qikun, Li, Jie, Feng, Kaiqiang, and Zhan, Ying
Abstract: To enhance the navigation accuracy and continuity of the integrated navigation system (INS)/global positioning system (GPS) in satellite denied conditions, the study proposes a hybrid optimization seamless navigation strategy that utilizes dung beetle optimizer (DBO) to optimize bidirectional gated recurrent unit (BiGRU) assisted maximum versoria criterion (MVC)-based adaptive Kalman filter. Initially, for the information fusion challenge of INS/GPS integrated navigation system in complex environments, an adaptive Kalman filter based on MVC is designed, exhaustively considering the complexity of actual measurement noise and key parameters of each sensor in the INS/GPS system, thereby enhancing the accuracy and robustness of information fusion and providing satisfactory information samples for subsequent neural network training. Subsequently, the DBO is adopted to optimize the BiGRU, thus predicting the velocity and position observation information of the INS/GPS and addressing its performance deterioration during GPS outages. The BiGRU hyper-parameters are fine-tuned with the DBO to optimize the neural network’s structure and enhance its prediction performance and robustness for time series information, which enables it to learn from the uncertainty of the system smoothly and accurately. Finally, a vehicle navigation platform and targeted ground vehicle experiments are designed to evaluate the proposed method. The experimental results demonstrate that the proposed strategy can effectively improve navigation accuracy and maintain navigation continuity in the actual complex environment. [ABSTRACT FROM AUTHOR]
Published: 2025
Full Text: View/download PDF

4. A deep learning method based on CNN-BiGRU and attention mechanism for proton exchange membrane fuel cell performance degradation prediction.

Author: Zhou, Jiaming, Shu, Xing, Zhang, Jinming, Yi, Fengyan, Jia, Chunchun, Zhang, Caizhi, Kong, Xianghao, Zhang, Junling, and Wu, Guangping
Subjects: *CONVOLUTIONAL neural networks, *DEEP learning
Abstract: The performance of proton exchange membrane fuel cells (PEMFCs) will gradually deteriorate during long-term operation. Accurate performance degradation prediction is crucial for extending the lifespan and improve the durability of fuel cells. This paper proposes a deep learning method (CNN-BiGRU-AM) that incorporates convolutional neural network (CNN), bidirectional gated recurrent unit (BiGRU) and attention mechanism (AM) for fuel cell degradation prediction. In the proposed method, CNN extracts complex features from the input data through convolutional operations, BiGRU models temporal information by considering both forward and reverse directions of the input sequence, and attention mechanism highlights key information in the input data through weight allocation. The proposed method is validated using long-term experimental data from fuel cells under steady-state and quasi-dynamic conditions. The results indicate that the absolute error of the proposed method is less than 1.2 mV for 97.94% of the data samples under steady-state conditions and less than 1.2 mV for 94.82% of the data samples under quasi-dynamic conditions. The prediction accuracy and stability of the proposed method are significantly improved compared to other deep learning prediction methods. • The effects of multiple operating parameters on fuel cell degradation are considered. • The complex features in the input data are extracted by convolutional operations. • A PEMFC degradation method combining CNN, BiGRU and attention mechanism is proposed. • The performance and stability of the proposed method are verified in various aspects. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. Harnessing Deep Learning and Snow Cover Data for Enhanced Runoff Prediction in Snow-Dominated Watersheds.

Author: Adnan, Rana Muhammad, Mo, Wang, Kisi, Ozgur, Heddam, Salim, Al-Janabi, Ahmed Mohammed Sami, and Zounemat-Kermani, Mohammad
Subjects: *CONVOLUTIONAL neural networks, *LONG short-term memory, *MODIS (Spectroradiometer), *RECURRENT neural networks, *STANDARD deviations, *WATERSHEDS
Abstract: Predicting streamflow is essential for managing water resources, especially in basins and watersheds where snowmelt plays a major role in river discharge. This study evaluates the advanced deep learning models for accurate monthly and peak streamflow forecasting in the Gilgit River Basin. The models utilized were LSTM, BiLSTM, GRU, CNN, and their hybrid combinations (CNN-LSTM, CNN-BiLSTM, CNN-GRU, and CNN-BiGRU). Our research measured the model's accuracy through root mean square error (RMSE), mean absolute error (MAE), Nash–Sutcliffe efficiency (NSE), and the coefficient of determination (R2). The findings indicated that the hybrid models, especially CNN-BiGRU and CNN-BiLSTM, achieved much better performance than traditional models like LSTM and GRU. For instance, CNN-BiGRU achieved the lowest RMSE (71.6 in training and 95.7 in testing) and the highest R2 (0.962 in training and 0.929 in testing). A novel aspect of this research was the integration of MODIS-derived snow-covered area (SCA) data, which enhanced model accuracy substantially. When SCA data were included, the CNN-BiLSTM model's RMSE improved from 83.6 to 71.6 during training and from 108.6 to 95.7 during testing. In peak streamflow prediction, CNN-BiGRU outperformed other models with the lowest absolute error (108.4), followed by CNN-BiLSTM (144.1). This study's results reinforce the notion that combining CNN's spatial feature extraction capabilities with the temporal dependencies captured by LSTM or GRU significantly enhances model accuracy. The demonstrated improvements in prediction accuracy, especially for extreme events, highlight the potential for these models to support more informed decision-making in flood risk management and water allocation. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. A new method based on generative adversarial networks for multivariate time series prediction.

Author: Qin, Xiwen, Shi, Hongyu, Dong, Xiaogang, and Zhang, Siqi
Subjects: *CONVOLUTIONAL neural networks, *GENERATIVE adversarial networks, *FOURIER transforms, *FOREIGN exchange rates, *AIR quality
Abstract: Multivariate time series have more complex and high‐dimensional characteristics, which makes it difficult to analyze and predict the data accurately. In this paper, a new multivariate time series prediction method is proposed. This method is a generative adversarial networks (GAN) method based on Fourier transform and bi‐directional gated recurrent unit (Bi‐GRU). First, the Fourier transform is utilized to extend the data features, which helps the GAN to better learn the distributional features of the original data. Second, in order to guide the model to fully learn the distribution of the original time series data, Bi‐GRU is introduced as the generator of GAN. To solve the problems of mode collapse and gradient vanishing that exist in GAN, Wasserstein distance is used as the loss function of GAN. Finally, the proposed method is used for the prediction of air quality, stock price and RMB exchange rate. The experimental results show that the model can effectively predict the trend of the time series compared with the other nine baseline models. It significantly improves the accuracy and flexibility of multivariate time series forecasting and provides new ideas and methods for accurate time series forecasting in industrial, financial and environmental fields. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. Integration of attention mechanism and CNN-BiGRU for TDOA/FDOA collaborative mobile underwater multi-scene localization algorithm.

Author: Peng, Duo, Liu, Ming Shuo, and Xie, Kun
Subjects: CONVOLUTIONAL neural networks, SPEED of sound, OCEAN waves, RADIATION sources, LEAST squares
Abstract: The aim of this study is to address the issue of TDOA/FDOA measurement accuracy in complex underwater environments, which is affected by multipath effects and variations in water sound velocity induced by the challenging nature of the underwater environment. To this end, a novel cooperative localisation algorithm has been developed, integrating the attention mechanism and convolutional neural network-bidirectional gated recurrent unit (CNN-BiGRU) with TDOA/FDOA and two-step weighted least squares (ImTSWLS). This algorithm is designed to enhance the accuracy of TDOA/FDOA measurements in complex underwater environments. The algorithm initially makes use of the considerable capacity of a convolutional neural network (CNN) to extract profound spatial and frequency domain characteristics from multimodal data. These features are of paramount importance for the characterisation of underwater signal propagation, particularly in complex environments. Subsequently, through the use of a bidirectional gated recurrent unit (BiGRU), the algorithm is able to effectively capture long-term dependencies in time series data. This enables a more comprehensive analysis and understanding of the changing pattern of signals over time. Furthermore, the incorporation of an attention mechanism within the algorithm enables the model to focus more on the signal features that have a significant impact on localisation, while simultaneously suppressing the interference of extraneous information. This further enhances the efficiency of identifying and utilising the key signal features. ImTSWLS is employed to resolve the position and velocity data following the acquisition of the predicted TDOA/FDOA, thereby enabling the accurate estimation of the position and velocity of the mobile radiation source. The algorithm was subjected to a series of tests in a variety of simulated underwater environments, including different sea states, target motion speeds and base station configurations. The experimental results demonstrate that the algorithm exhibits a deviation of only 2.88 m/s in velocity estimation and 2.58 m in position estimation when the noise level is 20 dB. The algorithm presented in this paper demonstrates superior performance in both position and velocity estimation compared to other algorithms. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

8. Multimodal Fusion of Optimized GRU–LSTM with Self-Attention Layer for Hydrological Time Series Forecasting.

Author: Kilinc, Huseyin Cagan, Apak, Sina, Ozkan, Furkan, Ergin, Mahmut Esad, and Yurtsever, Adem
Subjects: ENVIRONMENTAL impact analysis, PARTICLE swarm optimization, FLOOD control, AGRICULTURAL development, AGRICULTURAL policy
Abstract: Accurate flow forecasting is crucial for effective basin management, regional agricultural policy development, environmental impact analysis, soil and water conservation studies, and flood protection planning. This study proposes a novel approach that integrates particle swarm optimization (PSO) with bidirectional long short-term memory (Bi-LSTM) and bidirectional gated recurrent unit (Bi-GRU) architectures, augmented by feature fusion and attention layers. Our approach consistently outperforms traditional methods across multiple datasets, including Ahmethacı, Büyükincirli, and Ersil, thereby achieving lower RMSE, MAE, and higher KGE and BF scores. Specifically, in Ahmethacı, our method yields an RMSE of 3.448, MAE of 1.224, and an R2 of 0.886. In Büyükincirli, it records an RMSE of 0.085, MAE of 0.040, and an R2 of 0.964. In Ersil, it achieves an RMSE of 1.495, MAE of 0.565, and R2 of 0.883. These results underscore the effectiveness of the proposed approach in flow forecasting. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

9. SRCAE-STCBiGRU: a fused deep learning model for remaining useful life prediction of rolling bearings.

Author: Deng, Linfeng, Yan, Xinhui, and Li, Wei
Abstract: The intelligent prediction of bearing remaining useful life (RUL) plays a critical role in bearing maintenance. Therefore, it is particularly significant to accurately estimate the RUL of bearings in order to ensure the reliability and safety of mechanical systems. And deep learning techniques have been successfully applied in the RUL prediction. However, there are unresolved problems of information loss during feature extraction and hardly effectively extracting spatio-temporal sequence information during bearing degradation process for the convolutional neural networks. To solve the problem, this paper proposes a RUL prediction framework based on stacked residual convolutional autoencoder and spatio-temporal convolutional bidirectional gated recurrent unit. The method adopts continuous wavelet transform technology to convert the acquired raw vibration signals into two-dimensional time–frequency images, constructs a deep network using stacked residual convolutional networks to extract feature information at different levels, and learns the spatio-temporal information in the time series information in the past and future states through spatio-temporal convolutional bi-directional gated recurrent units to more accurately predict the remaining service life of rolling bearings. In experimental verification, by comparing with existing RUL prediction methods and utilizing the PHM2012 and XJYU-SY public datasets, the superiority and effectiveness of our proposed method were well validated. The experimental results indicate that the proposed RUL prediction approach exhibits excellent performance in terms of accuracy and generalization. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. Vessel Traffic Flow Prediction in Port Waterways Based on POA-CNN-BiGRU Model.

Author: Chang, Yumiao, Ma, Jianwen, Sun, Long, Ma, Zeqiu, and Zhou, Yue
Subjects: CONVOLUTIONAL neural networks, OPTIMIZATION algorithms, TRAFFIC flow, PREDICTION models, BLUEGRASSES (Plants)
Abstract: Vessel traffic flow forecasting in port waterways is critical to improving safety and efficiency of port navigation. Aiming at the stage characteristics of vessel traffic in port waterways in time sequence, which leads to complexity of data in the prediction process and difficulty in adjusting the model parameters, a convolutional neural network (CNN) based on the optimization of the pelican algorithm (POA) and the combination of bi-directional gated recurrent units (BiGRUs) is proposed as a prediction model, and the POA algorithm is used to search for optimized hyper-parameters, and then the iterative optimization of the optimal parameter combinations is input into the best combination of iteratively found parameters, which is input into the CNN-BiGRU model structure for training and prediction. The results indicate that the POA algorithm has better global search capability and faster convergence than other optimization algorithms in the experiment. Meanwhile, the BiGRU model is introduced and compared with the CNN-BiGRU model prediction; the POA-CNN-BiGRU combined model has higher prediction accuracy and stability; the prediction effect is significantly improved; and it can provide more accurate prediction information and cycle characteristics, which can serve as a reference for the planning of ships' routes in and out of ports and optimizing the management of ships' organizations. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. SMGformer: integrating STL and multi-head self-attention in deep learning model for multi-step runoff forecasting

Author: Wen-chuan Wang, Miao Gu, Yang-hao Hong, Xiao-xue Hu, Hong-fei Zang, Xiao-nan Chen, and Yan-guo Jin
Subjects: Monthly runoff forecast, Multi-step forecast, Seasonal and Trend decomposition using Loess, Informer, Bidirectional gated recurrent unit, Multi-head self-attention, Medicine, Science
Abstract: Abstract Accurate runoff forecasting is of great significance for water resource allocation flood control and disaster reduction. However, due to the inherent strong randomness of runoff sequences, this task faces significant challenges. To address this challenge, this study proposes a new SMGformer runoff forecast model. The model integrates Seasonal and Trend decomposition using Loess (STL), Informer’s Encoder layer, Bidirectional Gated Recurrent Unit (BiGRU), and Multi-head self-attention (MHSA). Firstly, in response to the nonlinear and non-stationary characteristics of the runoff sequence, the STL decomposition is used to extract the runoff sequence’s trend, period, and residual terms, and a multi-feature set based on ‘sequence-sequence’ is constructed as the input of the model, providing a foundation for subsequent models to capture the evolution of runoff. The key features of the input set are then captured using the Informer’s Encoder layer. Next, the BiGRU layer is used to learn the temporal information of these features. To further optimize the output of the BiGRU layer, the MHSA mechanism is introduced to emphasize the impact of important information. Finally, accurate runoff forecasting is achieved by transforming the output of the MHSA layer through the Fully connected layer. To verify the effectiveness of the proposed model, monthly runoff data from two hydrological stations in China are selected, and eight models are constructed to compare the performance of the proposed model. The results show that compared with the Informer model, the 1th step MAE of the SMGformer model decreases by 42.2% and 36.6%, respectively; RMSE decreases by 37.9% and 43.6% respectively; NSE increases from 0.936 to 0.975 and from 0.487 to 0.837, respectively. In addition, the KGE of the SMGformer model at the 3th step are 0.960 and 0.805, both of which can maintain above 0.8. Therefore, the model can accurately capture key information in the monthly runoff sequence and extend the effective forecast period of the model.
Published: 2024
Full Text: View/download PDF

12. Prediction of small-scale leak flow rate in LOCA situations using bidirectional GRU

Author: Hye Seon Jo, Sang Hyun Lee, and Man Gyun Na
Subjects: Leak flow rate, Bidirectional gated recurrent unit, Loss-of-coolant accidents, Nuclear engineering. Atomic power, TK9001-9401
Abstract: It is difficult to detect a small-scale leakage in a nuclear power plant (NPP) quickly and take appropriate action. Delaying these procedures can have adverse effects on NPPs. In this paper, we propose leak flow rate prediction using the bidirectional gated recurrent unit (Bi-GRU) method to detect leakage quickly and accurately in small-scale leakage situations because large-scale leak rates are known to be predicted accurately. The data were acquired by simulating small loss-of-coolant accidents (LOCA) or small-scale leakage situations using the modular accident analysis program (MAAP) code. In addition, to improve prediction performance, data were collected by distinguishing the break sizes in more detail. In addition, the prediction accuracy was improved by performing both LOCA diagnosis and leak flow rate prediction in small LOCA situations. The prediction model developed using the Bi-GRU showed a superior prediction performance compared with other artificial intelligence methods. Accordingly, the accurate and effective prediction model for small-scale leakage situations proposed herein is expected to support operators in decision-making and taking actions.
Published: 2024
Full Text: View/download PDF

13. Research on a CNN-BiGRU disk fault prediction method integrating attention mechanism

Author: WANG Yan, LIU Yadong, PI Chanjuan, and SHI Junhao
Subjects: attention mechanism, disk failure prediction, convolutional neural network, bidirectional gated recurrent unit, focal loss function, Electronic computers. Computer science, QA75.5-76.95
Abstract: Disk, as a crucial storage medium, can result in significant data loss if it malfunctions, causing immeasurable losses for individuals and businesses. Existing models for predicting disk failures have problems such as imbalanced disk data samples and underutilization of the temporal characteristics of the data. In this study, we focused on real disk data provided by the Backblaze cloud storage company and proposed a disk failure prediction model that combines a convolutional neural network (CNN) with a bidirectional gated recurrent unit(BiGRU) network, incorporating an attention mechanism.In terms of data preprocessing, we employed negative sampling and a focal loss function to balance positive and negative samples. Subsequently, we utilized CNN for feature extraction and combined it with BiGRU to effectively handle temporal data. The integration of an attention mechanism enables the model to quickly capture more critical feature informations. The selected features were then trained with the input data into the model. Compared to other fault prediction models, the proposed model in this paper demonstrates a performance improvement of 1% to 7% on four evaluation indicators, such as precision. This provides a robust support for enhancing disk storage reliability.
Published: 2024
Full Text: View/download PDF

14. Integration of attention mechanism and CNN-BiGRU for TDOA/FDOA collaborative mobile underwater multi-scene localization algorithm

Author: Duo Peng, Ming Shuo Liu, and Kun Xie
Subjects: Underwater localization, Convolutional neural network, Bidirectional gated recurrent unit, Two-step weighted least squares, Multimodal, Electronic computers. Computer science, QA75.5-76.95, Information technology, T58.5-58.64
Abstract: Abstract The aim of this study is to address the issue of TDOA/FDOA measurement accuracy in complex underwater environments, which is affected by multipath effects and variations in water sound velocity induced by the challenging nature of the underwater environment. To this end, a novel cooperative localisation algorithm has been developed, integrating the attention mechanism and convolutional neural network-bidirectional gated recurrent unit (CNN-BiGRU) with TDOA/FDOA and two-step weighted least squares (ImTSWLS). This algorithm is designed to enhance the accuracy of TDOA/FDOA measurements in complex underwater environments. The algorithm initially makes use of the considerable capacity of a convolutional neural network (CNN) to extract profound spatial and frequency domain characteristics from multimodal data. These features are of paramount importance for the characterisation of underwater signal propagation, particularly in complex environments. Subsequently, through the use of a bidirectional gated recurrent unit (BiGRU), the algorithm is able to effectively capture long-term dependencies in time series data. This enables a more comprehensive analysis and understanding of the changing pattern of signals over time. Furthermore, the incorporation of an attention mechanism within the algorithm enables the model to focus more on the signal features that have a significant impact on localisation, while simultaneously suppressing the interference of extraneous information. This further enhances the efficiency of identifying and utilising the key signal features. ImTSWLS is employed to resolve the position and velocity data following the acquisition of the predicted TDOA/FDOA, thereby enabling the accurate estimation of the position and velocity of the mobile radiation source. The algorithm was subjected to a series of tests in a variety of simulated underwater environments, including different sea states, target motion speeds and base station configurations. The experimental results demonstrate that the algorithm exhibits a deviation of only 2.88 m/s in velocity estimation and 2.58 m in position estimation when the noise level is 20 dB. The algorithm presented in this paper demonstrates superior performance in both position and velocity estimation compared to other algorithms.
Published: 2024
Full Text: View/download PDF

15. Neuro connect: Integrating data-driven and BiGRU classification for enhanced autism prediction from fMRI data.

Author: Rajaram, Pavithra and Marimuthu, Mohanapriya
Subjects: *FUNCTIONAL magnetic resonance imaging, *ANIMAL herds, *AUTISM spectrum disorders, *SYMPTOMS, *EARLY diagnosis
Abstract: Autism Spectrum Disorder (ASD) poses a significant challenge in early diagnosis and intervention due to its multifaceted clinical presentation and lack of objective biomarkers. This research presents a novel approach, termed Neuro Connect, which integrates data-driven techniques with Bidirectional Gated Recurrent Unit (BiGRU) classification to enhance the prediction of ASD using functional Magnetic Resonance Imaging (fMRI) data. This study uses both structural and functional neuroimaging data to investigate the complex brain underpinnings of autism spectrum disorder (ASD). They use an Auto-Encoder (AE) to efficiently reduce dimensionality while retaining critical information by learning and compressing important characteristics from high-dimensional data. We treat the feature-extracted data using a BiGRU model for the classification task of predicting ASD. They provide a new optimization strategy, the Horse Herd Algorithm (HHA), and show that it outperforms other established optimizers, such SGD and Adam, in order to improve classification accuracy. The model’s performance is greatly enhanced by the HHA’s novel optimization technique, which more precisely refines weight modifications made during training. The proposed ASD and EEG dataset accuracy value is 99.5%, and 99.3 compared to the existing method the proposed has a high accuracy value. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. Hyperspectral crop image classification via ensemble of classification model with optimal training.

Author: Lavanya P, Venkata, Tripathi, Mukesh Kumar, E P, Hemand, K, Sangeetha, and Ramesh, Janjhyam Venkata Naga
Subjects: *IMAGE recognition (Computer vision), *OPTIMIZATION algorithms, *RECURRENT neural networks, *CONVOLUTIONAL neural networks, *FEATURE extraction, *IMAGE segmentation
Abstract: Agriculture is a significant source of income, and categorizing the crop has turned into vital factor that aids more in the crop production sector. Traditionally, crop development stage determination is done manually by eye inspection. However, producing high-quality crop type maps using modern approaches remains difficult. In this paper, the hyperspectral crop image classification model is proposed that includes four stages, they are (a) preprocessing, (b) segmentation, (c) feature extraction and (d) classification. In the preprocessing step, the hyperspectral image is provided as input, where the filtering process will carried out using median filtering. The filtered image is then used as the segmentation's input. The image is segmented in the segmentation step using the enhanced entropy-based fuzzy c-means technique. Subsequently, spectral spatial features and vegetation index-based features are derived from segmented images. The final step is the classification, where the ensemble of classification model will be used that includes models like Convolutional Neural Networks (CNN), Deep Maxout (DMO), Recurrent Neural Networks (RNN), and Bidirectional Gated Recurrent Unit (Bi-GRU), respectively. The proposed Self Improved Tasmanian devil Optimization (SI-TDO) approach has optimally adjusted the Bi-GRU model's training weights to enhance ensemble classification performance. Finally, the effectiveness of the proposed SI-TDO method compared to the traditional algorithm is examined for several metrics. The SI-TDO obtained the greatest accuracy of 94.68% in training rate 80, while other existing models have the lowest ratings. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. Ultra-short-term Load Forecasting Based on XGBoost-BiGRU.

Author: Shuyi Chen, Guo Li, Kaixuan Chang, Xiang Hu, Peiqi Li, Yujue Wang, and Yantao Zhang
Abstract: High-precision load forecasting serves as the foundation for power grid scheduling planning and safe economic operation. In scenarios where only historical power load data is available without other external information, fully exploiting meaningful features from the temporal load sequence is crucial for improving the accuracy of load forecasting. Therefore, an ultra-short-term load forecasting method that combines eXtreme gradient boosting (XGBoost) and bidirectional gated recurrent unit (BiGRU) is proposed in this paper. Considering various factors that affect loads, a candidate feature set is established, which includes temporal information and historical loads. XGBoost is used to select the features that contribute significantly to load forecasting, forming an optimal feature set. These optimal features are then used as inputs to the BiGRU, and the bayesian optimization algorithm is applied to optimize the network hyperparameters. Then the load forecasting model for the next 15 minutes based on BiGRU is generated by training iteratively. The proposed XGBoost-BiGRU method is validated on real load data from a province in China. Experimental results demonstrate that the method can effectively avoid the impact of redundant features, improving both prediction accuracy and efficiency. The research has significant importance for guiding real-time supply-demand balance calculations and scheduling in power grids. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

18. 改进 TCN 结合 Bi-GRU 的人体动作识别方法.

Author: 路永乐, 罗毅, 肖轩, 粟萍, 李娜, and 修蔚然
Subjects: HUMAN activity recognition, RECOGNITION (Psychology), GENERALIZATION, FEATURE extraction
Abstract: Copyright of Journal of Chongqing University of Posts & Telecommunications (Natural Science Edition) is the property of Chongqing University of Posts & Telecommunications and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

19. Prediction of Sea Level Using Double Data Decomposition and Hybrid Deep Learning Model for Northern Territory, Australia.

Author: Raj, Nawin, Murali, Jaishukh, Singh-Peterson, Lila, and Downs, Nathan
Subjects: *CONVOLUTIONAL neural networks, *SEA level, *BEACH erosion, *PRINCIPAL components analysis, *ARTIFICIAL intelligence
Abstract: Sea level rise (SLR) attributed to the melting of ice caps and thermal expansion of seawater is of great global significance to vast populations of people residing along the world's coastlines. The extent of SLR's impact on physical coastal areas is determined by multiple factors such as geographical location, coastal structure, wetland vegetation and related oceanic changes. For coastal communities at risk of inundation and coastal erosion due to SLR, the modelling and projection of future sea levels can provide the information necessary to prepare and adapt to gradual sea level rise over several years. In the following study, a new model for predicting future sea levels is presented, which focusses on two tide gauge locations (Darwin and Milner Bay) in the Northern Territory (NT), Australia. Historical data from the Australian Bureau of Meteorology (BOM) from 1990 to 2022 are used for data training and prediction using artificial intelligence models and computation of mean sea level (MSL) linear projection. The study employs a new double data decomposition approach using Multivariate Variational Mode Decomposition (MVMD) and Successive Variational Mode Decomposition (SVMD) with dimensionality reduction techniques of Principal Component Analysis (PCA) for data modelling using four artificial intelligence models (Support Vector Regression (SVR), Adaptive Boosting Regressor (AdaBoost), Multilayer Perceptron (MLP), and Convolutional Neural Network–Bidirectional Gated Recurrent Unit (CNN-BiGRU). It proposes a deep learning hybrid CNN-BiGRU model for sea level prediction, which is benchmarked by SVR, AdaBoost, and MLP. MVMD-SVMD-CNN-BiGRU hybrid models achieved the highest performance values of 0.9979 (d), 0.996 (NS), 0.9409 (L); and 0.998 (d), 0.9959 (NS), 0.9413 (L) for Milner Bay and Darwin, respectively. It also attained the lowest error values of 0.1016 (RMSE), 0.0782 (MABE), 2.3699 (RRMSE), and 2.4123 (MAPE) for Darwin and 0.0248 (RMSE), 0.0189 (MABE), 1.9901 (RRMSE), and 1.7486 (MAPE) for Milner Bay. The mean sea level (MSL) trend analysis showed a rise of 6.1 ± 1.1 mm and 5.6 ± 1.5 mm for Darwin and Milner Bay, respectively, from 1990 to 2022. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. Research on Short-Term Power Load Forecasting Model Based on NGO-CNN-BIGRU-AT.

Author: Qingyun Yuan, Pan Yu, Liu Tan, Yonggang Wang, and Heming Zhang
Subjects: GREY Wolf Optimizer algorithm, CONVOLUTIONAL neural networks, OPTIMIZATION algorithms, FEATURE extraction, FORECASTING, LOAD forecasting (Electric power systems)
Abstract: A CNN-BIGRU-AT model for power load forecasting is constructed by introducing Attention Mechanism (AT) into a model combining Convolutional Neural Network (CNN) and Bidirectional Gated Recurrent Unit (BIGRU), which has high prediction accuracy. In this model, CNN is first used to extract relevant features from power load data, and then BIGRU is applied to capture the inherent complex nonlinear dynamic patterns of power load. Additionally, Attention Mechanism (AT) is incorporated to further refine the feature extraction of power load data. The North Grey Wolf Optimizer Algorithm (NGO) is used to optimize parameters such as the number of hidden layer nodes, initial learning rate, and regularization coefficient in BIGRU, thus obtaining the modeling method of NGO-CNN-BIGRU-AT. Finally, the validity and feasibility of the model are substantiated using actual power load data from a certain region in China. The results demonstrate that the model has better performance than conventional models, including BP, BIGRU, CNN-BIGRU, and CNN-BIGRU-AT. [ABSTRACT FROM AUTHOR]
Published: 2024

21. Rotating machinery fault classification based on one-dimensional residual network with attention mechanism and bidirectional gated recurrent unit.

Author: Dong, Zhilin, Zhao, Dezun, and Cui, Lingli
Subjects: ROTATING machinery, CONVOLUTIONAL neural networks, FAULT diagnosis, CLASSIFICATION, PLANETARY gearing
Abstract: Conventional convolutional neural networks (CNNs) predominantly emphasize spatial features of signals and often fall short in prioritizing sequential features. As the number of layers increases, they are prone to issues such as vanishing or exploding gradients, leading to training instability and subsequent erratic fluctuations in loss values and recognition rates. To address this issue, a novel hybrid model, termed one-dimensional (1D) residual network with attention mechanism and bidirectional gated recurrent unit (BGRU) is developed for rotating machinery fault classification. First, a novel 1D residual network with optimized structure is constructed to obtain spatial features and mitigate the gradient vanishing or exploding. Second, the attention mechanism (AM) is designed to catch important impact characteristics for fault samples. Next, temporal features are mined through the BGRU. Finally, feature information is summarized through global average pooling, and the fully connected layer is utilized to output the final classification result for rotating machinery fault diagnosis. The developed technique which is tested on one set of planetary gear data and three different sets of bearing data, has achieved classification accuracy of 98.5%, 100%, 100%, and 100%, respectively. Compared with other methods, including CNN, CNN-BGRU, CNN-AM, and CNN with an AM-BGRU, the proposed technique has the highest recognition rate and stable diagnostic performance. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

22. An integrated neural network method for landslide susceptibility assessment based on time-series InSAR deformation dynamic features

Author: Yi He, Zhan’ao Zhao, Qing Zhu, Tao Liu, Qing Zhang, Wang Yang, Lifeng Zhang, and Qiang Wang
Subjects: Landslide susceptibility assessment, InSAR, time distributed convolutional neural network, bidirectional gated recurrent unit, remote sensing, Mathematical geography. Cartography, GA1-1776
Abstract: ABSTRACTWe develop an integrated neural network landslide susceptibility assessment (LSA) method that integrates temporal dynamic features of interferometry synthetic aperture radar (InSAR) deformation data and the spatial features of landslide influencing factors. We construct a time-distributed convolutional neural network (TD-CNN) and bidirectional gated recurrent unit (Bi-GRU) to better understand the temporal dynamic features of InSAR cumulative deformation, and construct a multi-scale convolutional neural network (MSCNN) to determine the spatial features of landslide influencing factors, and construct a parallel unified deep learning network model to fuse these temporal and spatial features for LSA. Compared with the traditional MSCNN method, the accuracy of the proposed model is improved by 1.20%. The performance of the proposed model is preferable to MSCNN. The area under the receiver operating characteristic curve (AUC) of the testing set reaches 0.91. Our LSA results show that the proposed model clearly depicts areas with very high susceptibility landslides. Further, only 10.18% of the study area accurately covers 84.79% of historical landslide areas. Subjective consequences and objective indicators show that the proposed model that is integrated time-series InSAR deformation dynamic features can make full use of landslide characteristics and effectively improve the reliability of LSA.
Published: 2024
Full Text: View/download PDF

23. Construction and Application of Intelligent Forecasting Model of Metallurgical Performance Based on CNN-BIGRU-Attention Algorithm

Author: Li, Fumin, Zhang, Feng, Liu, Song, Liu, Xiaojie, Jin, Yatao, and Lv, Qing
Published: 2024
Full Text: View/download PDF

24. Prediction and scheduling of multi-energy microgrid based on BiGRU self-attention mechanism and LQPSO

Author: Yuchen Duan, Peng Li, and Jing Xia
Subjects: Microgrid, Bidirectional gated recurrent unit, Self-attention, Lévy-quantum particle swarm optimization, Multi- objective optimization, Energy conservation, TJ163.26-163.5, Energy industries. Energy policy. Fuel trade, HD9502-9502.5
Abstract: To predict renewable energy sources such as solar power in microgrids more accurately, a hybrid power prediction method is presented in this paper. First, the self-attention mechanism is introduced based on a bidirectional gated recurrent neural network (BiGRU) to explore the time-series characteristics of solar power output and consider the influence of different time nodes on the prediction results. Subsequently, an improved quantum particle swarm optimization (QPSO) algorithm is proposed to optimize the hyperparameters of the combined prediction model. The final proposed LQPSO-BiGRU-self-attention hybrid model can predict solar power more effectively. In addition, considering the coordinated utilization of various energy sources such as electricity, hydrogen, and renewable energy, a multi-objective optimization model that considers both economic and environmental costs was constructed. A two-stage adaptive multi- objective quantum particle swarm optimization algorithm aided by a Lévy flight, named MO-LQPSO, was proposed for the comprehensive optimal scheduling of a multi-energy microgrid system. This algorithm effectively balances the global and local search capabilities and enhances the solution of complex nonlinear problems. The effectiveness and superiority of the proposed scheme are verified through comparative simulations.
Published: 2024
Full Text: View/download PDF

25. CBGA: A deep learning method for power grid communication networks service activity prediction.

Author: Liu, Shangdong, Zhou, Longfei, Shao, Sisi, Zuo, Jun, and Ji, Yimu
Subjects: *ELECTRIC power distribution grids, *TELECOMMUNICATION systems, *CONVOLUTIONAL neural networks, *POWER resources, *RECURRENT neural networks, *DATA extraction, *DEEP learning, *FEATURE extraction
Abstract: The prediction of power equipment activity plays a vital role in optimizing power resource dispatch, ensuring supply and demand balance, and guiding network planning and management. However, due to the complex nonlinear, multi-scale, and multivariate characteristics of power grid communication networks service activity (PCNSA) data, it is often challenging to capture its intrinsic patterns and dynamic changes through a simple model. To address this issue, this paper proposes a power grid communication network service activity prediction method based on convolutional neural network, bidirectional gated recurrent unit, and attention mechanism, referred to as CBGA. CNN is used to extract features from time-series data, BiGRU is used to capture long-term feature changes in the data, and the attention mechanism is used to enhance the extraction of important information. To validate the performance of the proposed method, experiments and comparisons were conducted on three real power grid communication network datasets, demonstrating that our method shows better performance and robustness. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

26. Shear-Wave Velocity Prediction Based on the CNN-BiGRU Integrated Network with Spatiotemporal Attention Mechanism.

Author: Liu, Yaqi, Gao, Chuqiao, and Zhao, Bin
Subjects: CONVOLUTIONAL neural networks, SHEAR waves, PROPERTIES of fluids, ELECTRONIC data processing, VELOCITY, DATA logging, RECURRENT neural networks
Abstract: Shear wave velocity is one of the important parameters reflecting the lithological and physical properties of reservoirs, and it is widely used in the fields of lithology and fluid property identification, reservoir evaluation, seismic data processing, and interpretation. However, due to the high cost and challenge of obtaining shear wave velocity, only a few key wells are measured. Considering the intricate nonlinear mapping relationship between shear wave velocity and conventional logging data, an integrated network incorporating an attention mechanism, a convolutional neural network, and a bidirectional gated recurrent unit (STACBiN) is proposed for predicting shear wave velocity. The impact of conventional logging data on shear wave velocity is analyzed, thus employing the attention mechanism to focus on data correlated with shear wave velocity, which can enable the prediction results of the method proposed superior to those of conventional methods. Additionally, the prediction results of this method are compared with the prediction results of the two-dimensional convolutional neural network (2DCNN) and bidirectional gated recurrent unit (BiGRU). It is verified that the network proposed can effectively predict the shear wave velocity, with minimal error between predicted and true values. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. Handwritten digit and Roman string recognition using gated CNN with federated learning.

Author: K, Sruthi, M, Meianbu, S R, Naveen, and Krishna R, Nidhish
Abstract: Handwritten number recognition has been extensively studied in the fields of machine literacy and computer vision, with datasets like MNIST serving as benchmarks. However, handwritten Roman numeral recognition presents unique challenges due to the diverse forms and structures of Roman numerals. In this paper, we propose a novel approach that combines Federated Learning with advanced neural network architectures to tackle this challenge effectively. Our methodology involves data acquisition and preprocessing, including the normalization of handwritten number and Roman numeral images. We design a hybrid neural network architecture that integrates Gated Convolutional Neural Networks (CNNs) for pixel-level feature extraction and Bidirectional Gated Recurrent Units (BGRUs) for sequence modeling. This architecture is essential for handling the complexity of recognizing both image and sequence data. Federated Learning is incorporated into our approach to train the model across multiple decentralized devices or servers while preserving data privacy. This ensures that sensitive handwritten data remains secure throughout the training process. By allowing model updates to be computed locally and aggregated without sharing raw data, Federated Learning maintains privacy and security in distributed learning environments. During training, each device computes gradients based on its local data and shares only the model updates with the central server. The central server aggregates these updates to update the global model, which is then sent back to the participating devices for further refinement. This iterative process continues until the model converges, while metrics such as accuracy, precision, recall, and F1-score are used to evaluate the model’s performance on a separate test dataset. Our approach demonstrates promising results in accurately recognizing both handwritten integers and Roman numerals, even in the presence of noise and variability in writing styles. By combining Federated Learning with advanced neural network architectures, our approach not only achieves state-of-the-art performance but also ensures data privacy and security in distributed learning environments. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. A dual path hybrid neural network framework for remaining useful life prediction of aero‐engine.

Author: Lu, Xinhua, Pan, Haobo, Zhang, Lingxiao, Ma, Li, and Wan, Hui
Subjects: *REMAINING useful life, *TURBOFAN engines, *COMPUTATIONAL complexity, *FEATURE extraction, *FORECASTING, *TIME series analysis
Abstract: Predicting the remaining useful life (RUL) of an engine is one of the key tasks of Prognostics and health management (PHM). Modern mechanical equipment typically operates in complex operating conditions and fault modes, leading to dispersed distribution of sensor data and challenges for feature extraction. To improve the accuracy of predicting the RUL under the complex scenarios, this paper proposes a multi‐scale convolutional network (CNN) and bidirectional gated recurrent unit (MSC‐BiGRU) mode under a dual path framework with temporal attention. Specifically, the multi‐scale CNN in the first path is to learn complex features, and Swish Activation function is used to improve the prediction ability of the network; the bidirectional gated recurrent unit (BiGRU) in the second path can handle both forward and backward time series, and adaptively capture the importance of outputs at different times using temporal attention, enhancing the model's feature extraction ability in the temporal dimension. A feature fusion mechanism is developed to connect two paths in parallel, overcoming the overfitting and high computational complexity in deep complex models. We verify the effectiveness of the proposed method using a simulated turbofan engine dataset, especially on datasets FD002 and FD004 under complex operating conditions and fault modes, the RMSE values were reduced by 17.37% and 9.97%, respectively, compared to the BiGRU‐TSAM. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

29. 典型调峰/调频工况下储能电池组荷电状态估计.

Author: 朱沐雨, 马宏忠, 郭鹏宇, and 宣文婧
Abstract: Copyright of Electric Power is the property of Electric Power Editorial Office and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

30. Weighted bidirectional gated recurrent network for event detection.

Author: Mary Vidya, R. and Ramakrishna, M.
Subjects: INFORMATION technology, FEATURE selection, BIG data
Abstract: Modern information technology is able to store enormous amounts of information even at high speeds and volumes. Meanwhile, handling continuous data streams becomes a complicated task and thus, a hybrid weighted recurrent neural network as well as bidirectional gated recurrent unit (hybrid WRBG) method is proposed for ideal feature sub-selection from the hyperspace of big data. Here, three datasets are utilized, namely as MAVEN dataset, Climate Change Twitter dataset, and event detection dataset to examine the proposed hybrid WRBG method. By utilizing the fuzzy elephant herding optimization (FEHO) which is a form of swarm search which delivers higher analytical accuracy within a practical processing time, the feature selection is specifically created for detection of events. Also, by attaining a tradeoff in the range of bias and variance terms, the classifier error is reduced through the FEHO algorithm. A weighted recurrent neural network (weighted RNN) ensemble with a bidirectional gated recurrent unit classifier is employed in order to automatically update current concepts in big data streams. The proposed model achieves an accuracy of 98.97%, a precision of 98.87%, an f-score of 98.72%, and a kappa score value is 0.92. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

31. Fine-grained hate speech detection in Arabic using transformer-based models.

Author: Bensoltane, Rajae and Zaki, Taher
Subjects: HATE speech, TRANSFORMER models, SOCIAL media, NATURAL language processing, VIRTUAL communities, DEAF children
Abstract: With the proliferation of social media platforms, characterized by features such as anonymity, user-friendly access, and the facilitation of online community building and discourse, the matter of detecting and monitoring hate speech has emerged as an increasingly formidable challenge for society, individuals, and researchers. Despite the crucial importance of hate speech detection task, the majority of work in this field has been conducted in English, with insufficient focus on other languages, particularly Arabic. Furthermore, most existing studies on Arabic hate speech detection have addressed this task as a binary classification problem, which is unreliable. Therefore, the aim of this study is to provide an enhanced model for detecting fine-grained hate speech in Arabic. To this end, three transformerbased models were evaluated to generate contextualized word embeddings from input sequence. Additionally, these models were combined with a bidirectional gated recurrent unit (BiGRU) layer to further improve the extracted semantic and context features. The experiments were conducted on an Arabic reference dataset provided by the open-source Arabic corpora and processing tools (OSACT-5) shared task. A comparative analysis indicates the efficiency of the proposed model over the baseline and related work models by achieving a macro F1-score of 61.68%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. Detection of False Data Injection Attacks on Smart Grids Based on A-BiTG Approach.

Author: He, Wei, Liu, Weifeng, Wen, Chenglin, and Yang, Qingqing
Subjects: DEEP learning, ELECTRIC power distribution grids, GRIDS (Cartography), TEST systems, TIME series analysis
Abstract: A false data injection attack (FDIA) is the main attack method that threatens the security of smart grids. FDIAs mislead the control center to make wrong judgments by modifying the measurement data of the power grid system. Therefore, the effective and accurate detection of FDIAs is crucial for the safe operation of smart grids. However, the current deep learning-based methods do not fully exploit the short-term local characteristics and long-term dependencies of power grid data and have poor correlation with past and future time series information, resulting in a lack of credibility in the detection results. In view of this, an FDIA detection model combining a bidirectional temporal convolutional network and bidirectional gated recurrent unit with an attention mechanism (A-BiTG) was proposed. The proposed model utilizes a bidirectional time convolutional network (BiTCN) and bidirectional gated recurrent unit (BiGRU) to consider past and future temporal information in the grid. This enhances the ability of the model to capture long-term dependencies and extract features, while also solving the model's problem of exploding and vanishing gradients. In addition, an attention mechanism (AM) was added to dynamically assign weights to the extracted feature information and retain the most valuable features to improve the detection accuracy of the model. Finally, the proposed method was compared with existing methods on the IEEE 14-bus and IEEE 118-bus test systems. The results show that the proposed detection model is more robust and superior under different noise environments and FDIA signals with different intensities. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. Predicting Tool Wear with ParaCRN-AMResNet: A Hybrid Deep Learning Approach.

Author: Guo, Lian and Wang, Yongguo
Subjects: CONVOLUTIONAL neural networks, RECURRENT neural networks, DEEP learning, SEQUENTIAL learning, PARALLEL processing, FEATURE extraction
Abstract: In the manufacturing sector, tool wear substantially affects product quality and production efficiency. While traditional sequential deep learning models can handle time-series tasks, their neglect of complex temporal relationships in time-series data often leads to errors accumulating in continuous predictions, which reduces their forecasting accuracy for tool wear. For addressing these limitations, the parallel convolutional and recurrent neural networks with attention-modulated residual learning (ParaCRN-AMResNet) model is introduced. Compared with conventional deep learning models, ParaCRN-AMResNet markedly enhances the efficiency and precision of feature extraction from time-series data through its innovative parallel architecture. The model adeptly combines dilated convolution neural network and bidirectional gated recurrent units, effectively addressing distance dependencies and enriching the quantity and dimensions of extracted features. The strength of ParaCRN-AMResNet lies in its refined ability to capture the complex dynamics of time-series data, significantly boosting the model's accuracy and generalization capability. The model's efficacy was validated through comprehensive milling experiments and vibration signal analyses, showcasing ParaCRN-AMResNet's superior performance. In evaluation metrics, the model achieved a MAE of 2.6015, MSE of 15.1921, R2 of 0.9897, and MAPE of 2.7997%, conclusively proving its efficiency and accuracy in the precise prediction of tool wear. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

34. State of Health (SOH) Estimation of Lithium-Ion Batteries Based on ABC-BiGRU.

Author: Li, Hao, Chen, Chao, Wei, Jie, Chen, Zhuo, Lei, Guangzhou, and Wu, Lingling
Subjects: LITHIUM-ion batteries, LITHIUM cells, ELECTRIC vehicles, ELECTRIC vehicle batteries, ARTIFICIAL neural networks, STANDARD deviations, PEARSON correlation (Statistics), TRAFFIC safety
Abstract: As a core component of new energy vehicles, accurate estimation of the State of Health (SOH) of lithium-ion power batteries is essential. Correctly predicting battery SOH plays a crucial role in extending the lifespan of new energy vehicles, ensuring their safety, and promoting their sustainable development. Traditional physical or electrochemical models have low accuracy in measuring the SOH of lithium batteries and are not suitable for the complex driving conditions of real-world vehicles. This study utilized the black-box characteristics of deep learning models to explore the intrinsic correlations in the historical cycling data of lithium batteries, thereby eliminating the need to consider the internal chemical reactions of lithium batteries. Through Pearson correlation analysis, this study selects health indicators (HIs) from lithium battery cycling data that significantly impact SOH as input features. In the field of lithium batteries, this paper applies ABC-BiGRU for the first time to SOH prediction. Compared with other recursive neural network models, ABC-BiGRU demonstrates superior predictive performance, with maximum root mean square error and mean absolute error of only 0.016799317 and 0.012626847, respectively. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

35. Speaker Identification Under Noisy Conditions Using Hybrid Deep Learning Model

Author: Lambamo, Wondimu, Srinivasagan, Ramasamy, Jifara, Worku, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Debelee, Taye Girma, editor, Ibenthal, Achim, editor, Schwenker, Friedhelm, editor, and Megersa Ayano, Yehualashet, editor
Published: 2024
Full Text: View/download PDF

36. Botnet Detection Method Based on NSA and DRN

Author: Yin, Zhanhong, Qin, Renchao, Ye, Chengzhuo, He, Fei, Zhang, Lan, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Luo, Biao, editor, Cheng, Long, editor, Wu, Zheng-Guang, editor, Li, Hongyi, editor, and Li, Chaojie, editor
Published: 2024
Full Text: View/download PDF

37. Harnessing Deep Learning and Snow Cover Data for Enhanced Runoff Prediction in Snow-Dominated Watersheds

Author: Rana Muhammad Adnan, Wang Mo, Ozgur Kisi, Salim Heddam, Ahmed Mohammed Sami Al-Janabi, and Mohammad Zounemat-Kermani
Subjects: streamflow forecasting, long short-term memory, bidirectional long short-term memory, gated recurrent unit, bidirectional gated recurrent unit, convolutional neural network, Meteorology. Climatology, QC851-999
Abstract: Predicting streamflow is essential for managing water resources, especially in basins and watersheds where snowmelt plays a major role in river discharge. This study evaluates the advanced deep learning models for accurate monthly and peak streamflow forecasting in the Gilgit River Basin. The models utilized were LSTM, BiLSTM, GRU, CNN, and their hybrid combinations (CNN-LSTM, CNN-BiLSTM, CNN-GRU, and CNN-BiGRU). Our research measured the model’s accuracy through root mean square error (RMSE), mean absolute error (MAE), Nash–Sutcliffe efficiency (NSE), and the coefficient of determination (R2). The findings indicated that the hybrid models, especially CNN-BiGRU and CNN-BiLSTM, achieved much better performance than traditional models like LSTM and GRU. For instance, CNN-BiGRU achieved the lowest RMSE (71.6 in training and 95.7 in testing) and the highest R2 (0.962 in training and 0.929 in testing). A novel aspect of this research was the integration of MODIS-derived snow-covered area (SCA) data, which enhanced model accuracy substantially. When SCA data were included, the CNN-BiLSTM model’s RMSE improved from 83.6 to 71.6 during training and from 108.6 to 95.7 during testing. In peak streamflow prediction, CNN-BiGRU outperformed other models with the lowest absolute error (108.4), followed by CNN-BiLSTM (144.1). This study’s results reinforce the notion that combining CNN’s spatial feature extraction capabilities with the temporal dependencies captured by LSTM or GRU significantly enhances model accuracy. The demonstrated improvements in prediction accuracy, especially for extreme events, highlight the potential for these models to support more informed decision-making in flood risk management and water allocation.
Published: 2024
Full Text: View/download PDF

38. Vessel Traffic Flow Prediction in Port Waterways Based on POA-CNN-BiGRU Model

Author: Yumiao Chang, Jianwen Ma, Long Sun, Zeqiu Ma, and Yue Zhou
Subjects: port waterways, vessel traffic flow, pelican optimization algorithm, convolutional neural network, bidirectional gated recurrent unit, Naval architecture. Shipbuilding. Marine engineering, VM1-989, Oceanography, GC1-1581
Abstract: Vessel traffic flow forecasting in port waterways is critical to improving safety and efficiency of port navigation. Aiming at the stage characteristics of vessel traffic in port waterways in time sequence, which leads to complexity of data in the prediction process and difficulty in adjusting the model parameters, a convolutional neural network (CNN) based on the optimization of the pelican algorithm (POA) and the combination of bi-directional gated recurrent units (BiGRUs) is proposed as a prediction model, and the POA algorithm is used to search for optimized hyper-parameters, and then the iterative optimization of the optimal parameter combinations is input into the best combination of iteratively found parameters, which is input into the CNN-BiGRU model structure for training and prediction. The results indicate that the POA algorithm has better global search capability and faster convergence than other optimization algorithms in the experiment. Meanwhile, the BiGRU model is introduced and compared with the CNN-BiGRU model prediction; the POA-CNN-BiGRU combined model has higher prediction accuracy and stability; the prediction effect is significantly improved; and it can provide more accurate prediction information and cycle characteristics, which can serve as a reference for the planning of ships’ routes in and out of ports and optimizing the management of ships’ organizations.
Published: 2024
Full Text: View/download PDF

39. SMGformer: integrating STL and multi-head self-attention in deep learning model for multi-step runoff forecasting

Author: Wang, Wen-chuan, Gu, Miao, Hong, Yang-hao, Hu, Xiao-xue, Zang, Hong-fei, Chen, Xiao-nan, and Jin, Yan-guo
Published: 2024
Full Text: View/download PDF

40. A fault diagnosis method for flexible converter valve equipment based on DSC-BIGRU-MA.

Author: Guo, Jianbao, Liu, Hang, Feng, Lei, Zu, Lifeng, Yang, Xing, Wang, Shunli, and Deng, Fangming
Subjects: DIAGNOSIS methods, FAULT diagnosis, VALVES, PREDICTION models
Abstract: Introduction: Precise fault diagnosis is crucial for enhancing the reliability and lifespan of the flexible converter valve equipment. To address this issue, depthwise separable convolution, bidirectional gate recurrent unit, and multi- head attention module (DSC-BiGRU-MAM) based fault diagnosis approach is proposed. Methods: By DSC and BiGRU operation, the model can capture the correlation between local features and temporal information when processing sequence data, thereby enhancing the representation ability and predictive performance of the model for complex sequential data. In addition, by incorporating a multi-head attention module, the proposed method dynamically learns important information from different time intervals and channels. The proposed MAM continuously stimulates fault features in both time and channel dimensions during training, while suppressing fault independent expressions. As a result, it has made an important contribution to improving the performance of the fault diagnosis model. Results and Discussion: Experimental results demonstrate that the proposed method achieves higher accuracy compared to existing methods, with an average accuracy of 95.45%, average precision of 88.67%, and average recall of 89.03%. Additionally, the proposed method has a moderate number of model parameters (17,626) and training time (935 s). Results indicate that the proposed method accurately diagnoses faults in flexible converter valve equipment, especially in real-world situations with noise overlapping signals. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

41. Enhancing Arabic offensive language detection with BERTBiGRU model.

Author: Bensoltane, Rajae and Zaki, Taher
Subjects: LANGUAGE models, CYBERBULLYING, NATURAL language processing, ARABIC language, LANGUAGE research, WEB 2.0
Abstract: With the advent of Web 2.0, various platforms and tools have been developed to allow internet users to express their opinions and thoughts on diverse topics and occurrences. Nevertheless, certain users misuse these platforms by sharing hateful and offensive speeches, which has a negative impact on the mental health of internet society. Thus, the detection of offensive language has become an active area of research in the field of natural language processing. Rapidly detecting offensive language on the internet and preventing it from spreading is of great practical significance in reducing cyberbullying and self-harm behaviors. Despite the crucial importance of this task, limited work has been done in this field for non-English languages such as Arabic. Therefore, in this paper, we aim to improve the results of Arabic offensive language detection without the need for laborious preprocessing or feature engineering work. To achieve this, we combine the bidirectional encoder representations from transformers (BERT) model model with a bidirectional gated recurrent unit (BiGRU) layer to further enhance the extracted context and semantic features. The experiments were conducted on the Arabic dataset provided by the SemEval 2020 Task 12. The evaluation results show the effectiveness of our model compared to the baseline and related work models by achieving a macro F1- score of 93.16%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

42. Text summarization using modified generative adversarial network1.

Author: Srivastava, Jyoti, Srivastava, Ashish Kumar, Muthu Kumar, B., and Anandaraj, S.P.
Subjects: *TEXT summarization, *AUTOMATIC summarization, *CONVOLUTIONAL neural networks, *GENERATIVE adversarial networks, *RESEARCH personnel, *TEXT messages
Abstract: Text summarizing (TS) takes key information from a source text and condenses it for the user while retaining the primary material. When it comes to text summaries, the most difficult problem is to provide broad topic coverage and diversity in a single summary. Overall, text summarization addresses the fundamental need to distill large volumes of information into more manageable and digestible forms, making it a crucial technology in the era of information abundance. It benefits individuals, businesses, researchers, and various other stakeholders by enhancing efficiency and comprehension in dealing with textual data. In this paper, proposed a novel Modified Generative adversarial network (MGAN) for summarize the text. The proposed model involves three stages namely pre-processing, Extractive summarization, and summary generation. In the first Phase, the Text similarity dataset is pre-processed using Lowering Casing, Tokenization, Lemmatization, and, Stop Word Removal. In the second Phase, the Extractive summarization is done in three steps Generating similarity metrics, Sentence Ranking, and Sentence Extractive. In the third stage, a generative adversarial network (GAN) employs summary generation to jointly train the discriminative model D and the generative model G. To classify texts and annotate their syntax, Generative Model G employs a convolutional neural network called Bidirectional Gated Recursive Unit (CNN-BiGRU). The performance analysis of the proposed MGAN is calculated based on the parameters like accuracy, specificity, Recall, and Precision metrics. The proposed MGAN achieves an accuracy range of 99%. The result shows that the proposed MGAN improves the overall accuracy better than 9%, 6.5% and 5.4% is DRM, LSTM, and CNN respectively. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. A Hybrid Short‐Term Wind Power Forecasting Model Considering Significant Data Loss.

Author: Goh, Hui Hwang, Ding, Chunyang, Dai, Wei, Xie, Daiyu, Wen, Fangjun, Li, Keqiang, and Xia, Wenjiao
Abstract: Accurate wind power forecasting (WPF) is pivotal for the power system dominated by high penetration of renewable energy. Most forecasting techniques require sufficient data samples as a premise for achieving accurate prediction. Due to equipment faults during data collection, complete data is not always available, resulting that the forecasting accuracy is greatly diminished. To address this issue, this paper proposes a novel two‐stage hybrid forecasting approach including data restoration stage and forecasting stage. For the data restoration stage, the bidirectional long short‐term memory (Bi‐LSTM) is integrated into generative adversarial network (GAN) to recover the missing data with consideration of the complex time dynamics and correlations among heterogeneous data. To improve the prediction accuracy, the complete generated wind power sequence is decomposed into multiple time sequences with low volatility based on the enhanced variational mode decomposition (VMD). For the forecasting stage, a hybrid forecasting algorithm that combines convolutional neural network (CNN) and bidirectional gated recurrent unit (BiGRU) with an improved attention mechanism is proposed, strengthening the forecasting performance by assigning optimal weights to key features. The proposed hybrid forecasting method outperforms traditional methods based on real wind farm data with different shares of data loss from Guangxi province in China. © 2023 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. BSTFNet: An Encrypted Malicious Traffic Classification Method Integrating Global Semantic and Spatiotemporal Features.

Author: Hong Huang, Xingxing Zhang, Ye Lu, Ze Li, and Shaohua Zhou
Subjects: LANGUAGE models, DEEP learning, CONVOLUTIONAL neural networks, ENCRYPTION protocols, MACHINE learning
Abstract: While encryption technology safeguards the security of network communications, malicious traffic also uses encryption protocols to obscure its malicious behavior. To address the issues of traditional machine learning methods relying on expert experience and the insufficient representation capabilities of existing deep learning methods for encrypted malicious traffic, we propose an encrypted malicious traffic classification method that integrates global semantic features with local spatiotemporal features, called BERT-based Spatio-Temporal Features Network (BSTFNet). At the packet-level granularity, the model captures the global semantic features of packets through the attention mechanism of the Bidirectional Encoder Representations from Transformers (BERT)model. At the byte-level granularity, we initially employ the Bidirectional Gated Recurrent Unit (BiGRU) model to extract temporal features from bytes, followed by the utilization of the Text Convolutional Neural Network (TextCNN) model with multi-sized convolution kernels to extract local multi-receptive field spatial features. The fusion of features from both granularities serves as the ultimate multidimensional representation of malicious traffic. Our approach achieves accuracy and F1-score of 99.39% and 99.40%, respectively, on the publicly available USTC-TFC2016 dataset, and effectively reduces sample confusion within the Neris and Virut categories. The experimental results demonstrate that our method has outstanding representation and classification capabilities for encrypted malicious traffic. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. Text summarization using modified generative adversarial network1.

Author: Srivastava, Jyoti, Srivastava, Ashish Kumar, Muthu Kumar, B., and Anandaraj, S.P.
Subjects: TEXT summarization, AUTOMATIC summarization, CONVOLUTIONAL neural networks, GENERATIVE adversarial networks, RESEARCH personnel, TEXT messages
Abstract: Text summarizing (TS) takes key information from a source text and condenses it for the user while retaining the primary material. When it comes to text summaries, the most difficult problem is to provide broad topic coverage and diversity in a single summary. Overall, text summarization addresses the fundamental need to distill large volumes of information into more manageable and digestible forms, making it a crucial technology in the era of information abundance. It benefits individuals, businesses, researchers, and various other stakeholders by enhancing efficiency and comprehension in dealing with textual data. In this paper, proposed a novel Modified Generative adversarial network (MGAN) for summarize the text. The proposed model involves three stages namely pre-processing, Extractive summarization, and summary generation. In the first Phase, the Text similarity dataset is pre-processed using Lowering Casing, Tokenization, Lemmatization, and, Stop Word Removal. In the second Phase, the Extractive summarization is done in three steps Generating similarity metrics, Sentence Ranking, and Sentence Extractive. In the third stage, a generative adversarial network (GAN) employs summary generation to jointly train the discriminative model D and the generative model G. To classify texts and annotate their syntax, Generative Model G employs a convolutional neural network called Bidirectional Gated Recursive Unit (CNN-BiGRU). The performance analysis of the proposed MGAN is calculated based on the parameters like accuracy, specificity, Recall, and Precision metrics. The proposed MGAN achieves an accuracy range of 99%. The result shows that the proposed MGAN improves the overall accuracy better than 9%, 6.5% and 5.4% is DRM, LSTM, and CNN respectively. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

46. Convolutional Neural Network-Based Bidirectional Gated Recurrent Unit–Additive Attention Mechanism Hybrid Deep Neural Networks for Short-Term Traffic Flow Prediction.

Author: Liu, Song, Lin, Wenting, Wang, Yue, Yu, Dennis Z., Peng, Yong, and Ma, Xianting
Abstract: To more accurately predict short-term traffic flow, this study posits a sophisticated integrated prediction model, CNN-BiGRU-AAM, based on the additive attention mechanism of a convolutional bidirectional gated recurrent unit neural network. This model seeks to enhance the precision of traffic flow prediction by integrating both historical and prospective data. Specifically, the model achieves prediction through two steps: encoding and decoding. In the encoding phase, convolutional neural networks are used to extract spatial correlations between weather and traffic flow in the input sequence, while the BiGRU model captures temporal correlations in the time series. In the decoding phase, an additive attention mechanism is introduced to weigh and fuse the encoded features. The experimental results demonstrate that the CNN-BiGRU model, coupled with the additive attention mechanism, is capable of dynamically capturing the temporal patterns of traffic flow, and the introduction of isolation forests can effectively handle data anomalies and missing values, improving prediction accuracy. Compared to benchmark models such as GRU, the CNN-BiGRU-AAM model shows significant improvement on the test set, with a 47.49 reduction in the Root Mean Square Error (RMSE), a 30.72 decrease in the Mean Absolute Error (MAE), and a 5.27% reduction in the Mean Absolute Percentage Error (MAPE). The coefficient of determination ( R 2 ) reaches 0.97, indicating the high accuracy of the CNN-BiGRU-AAM model in traffic flow prediction. It provides a good solution for short-term traffic flow with spatio-temporal features, thereby enhancing the efficiency of traffic management and planning and promoting the sustainable development of transportation. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

47. Forecasting model for short-term wind speed using robust local mean decomposition, deep neural networks, intelligent algorithm, and error correction.

Author: Li, Jiawen, Liu, Minghao, Wen, Lei, Tian, Zhongda, and Ramirez, Carlos Andrés Perez
Subjects: ARTIFICIAL neural networks, WIND speed, DEEP learning, MACHINE learning, ALGORITHMS, WIND power, BIOCHEMICAL oxygen demand
Abstract: Wind power generation has aroused widespread concern worldwide. Accurate prediction of wind speed is very important for the safe and economic operation of the power grid. This paper presents a short-term wind speed prediction model which includes data decomposition, deep learning, intelligent algorithm optimization, and error correction modules. First, the robust local mean decomposition (RLMD) is applied to the original wind speed data to reduce the non-stationarity of the data. Then, the salp swarm algorithm (SSA) is used to determine the optimal parameter combination of the bidirectional gated recurrent unit (BiGRU) to ensure prediction quality. In order to eliminate the predictable components of the error further, a correction module based on the improved salp swarm algorithm (ISSA) and deep extreme learning machine (DELM) is constructed. The exploration and exploitation capability of the original SSA is enhanced by introducing a crazy operator and dynamic learning strategy, and the input weights and thresholds in the DELM are optimized by the ISSA to improve the generalization ability of the model. The actual data of wind farms are used to verify the advancement of the proposed model. Compared with other models, the results show that the proposed model has the best prediction performance. As a powerful tool, the developed forecasting system is expected to be further used in the energy system. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

48. 融合多粒度信息的用户画像生成方法.

Author: 邵一博, 秦玉华, 崔永军, 高宝勇, and 赵彪
Subjects: *FEATURE extraction
Abstract: Most of the existing user profile methods lack different granularity text information representation, and there is a noise problem in the feature extraction stage, resulting in the inaccurate construction of the profile. To address these issues, this paper proposed a user profile method based on multi-granularity information fusion, called UP-MGIF. Firstly, it integrated the character-level granularity and the word-level granularity representation vectors in the embedding layer to expand feature content. Secondly, based on the improved bi-directional gated recurrent unit network(Bi-GRU), it designed a hybrid feature extraction model called Bi-GRU-DAE-Attention by combining denoising autoencoder (DAE) and attention mechanism to achieve feature denoising and semantic enhancement. Finally, it input the robust feature vectors into the classifier to achieve user profile generation. Experiments show that the user profile generation method achieves higher classification accuracy than other baseline methods on two profile datasets in the medical and Internet domains, and validate the effectiveness of each module through ablation experiments. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

49. Bidirectional Gated Recurrent Unit with Glove Embedding and Attention Mechanism for Movie Review Classification.

Author: Greeshma, M and Simon, Philomina
Subjects: USER-generated content, FILM reviewing, MACHINE learning, SOCIAL media, NATURAL language processing, CONSUMER attitudes
Abstract: During the current Internet era, enormous volume of structured and unstructured textual data is generated and exchanged online that made the text classification more crucial. With the massive amount of user-generated content on social media platforms, businesses and organizations can leverage sentiment analysis to understand customer opinions and attitudes towards the products, services, or brand. Text classification is a natural language processing task in which a machine learning model is trained to categorize the text into predefined classes or categories. Due to the advent of promising Deep Learning techniques, intelligent and accurate text classification system can be developed. In movie review classification, algorithms are used to automatically categorize movie reviews into positive, negative, or sometimes neutral sentiments based on the opinions expressed in the text. The goal is to automate the process of determining whether a review reflects a positive or negative sentiment, helping users quickly understand the overall reception of a movie. This work proposes an automated movie review classification system based on Bidirectional Gated Recurrent Unit encoder with attention module. This method encodes the words using GloVe word embedding technique. Context information is captured in an efficient manner and the encoding is more reliable. Bidirectional GRU layer captures long-term dependencies in sentences, followed by a custom Attention layer that assigns higher weights to key components. The attention mechanism allows the model to selectively focus on important parts of the input sequence, by dynamically determining attention weights based on task relevance. The model was individually trained and evaluated using popular benchmark IMDb dataset. Experimental studies have proved the better performance of proposed movie review classification system that obtained accuracy of 98%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

50. BiHGCA: A Novel SRS-Based Bidirectional Hyperbolic Graph Capsule Co-Attention Network for User Preference Drift

Author: Nikorn Kannikaklang, Wachirawut Thamviset, and Sartra Wongthanavasu
Subjects: Bidirectional capsule attention network, bidirectional gated recurrent unit, bidirectional hyperbolic graph attention network, user preference drift, movie recommendation, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Social and sequential recommendations employing bidirectional attention architecture represent a notable advancement in deep learning, enhancing recommender system performance. This breakthrough facilitates the representation learning of interactions, both forward and backward, concerning dynamic user preferences influenced at the social and sequential levels. Despite previous research efforts to accurately model user preference changes, they encounter two primary shortcomings: 1) an insufficient to account for user preference shifts using unidirectional approaches, and 2) a disregard for the impact of social factors on user preference changes, leading to suboptimal outcomes. In this study, we propose a novel framework, the Bidirectional Hyperbolic Graph Capsule Co-Attention Network (BiHGCA), addressing the challenges posed by social and sequential dynamics in user preference changes. This model integrates a bidirectional hyperbolic graph attention network and a bidirectional capsule attention network with a newly designed gate, T-BiGRUDense, which stands for Time-aware Bidirectional Gated Recurrent Unit and Dense neural network. The bidirectional hyperbolic graph attention network aims to capture shifts in user preferences at the social level. Simultaneously, the bidirectional capsule attention network is tailored to model user preference dynamics at the sequential level. Moreover, T-BiGRUDense is innovatively crafted to merge the collaborative attention signals from both the social and sequential levels, enhancing next movie recommendation predictions. Empirical evaluations conducted on movie benchmark datasets illustrate the superiority of BiHGCA over existing state-of-the-art methods in delivering Top-N recommendations and its effectiveness across various levels of data sparsity.
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

346 results on '"bidirectional gated recurrent unit"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources