980 results on '"Long Short-Term Memory"'
Search Results
2. Enhancing drug-target interaction predictions in context of neurodegenerative diseases using bidirectional long short-term memory in male Swiss albino mice pharmaco-EEG analysis
- Author
-
Qureshi, Shahnawaz, Iqbal, Syed Muhammad Zeeshan, Ameer, Asif, Karrila, Seppo, Ghadi, Yazeed Yasin, and Shah, Syed Aziz
- Published
- 2024
- Full Text
- View/download PDF
3. Development of residual learning in deep neural networks for computer vision: A survey
- Author
-
Xu, Guoping, Wang, Xiaxia, Wu, Xinglong, Leng, Xuesong, and Xu, Yongchao
- Published
- 2025
- Full Text
- View/download PDF
4. An efficient data fusion model based on Bayesian model averaging for robust water quality prediction using deep learning strategies
- Author
-
Alizamir, Meysam, Moradveisi, Kayhan, Othman Ahmed, Kaywan, Bahrami, Jamil, Kim, Sungwon, and Heddam, Salim
- Published
- 2025
- Full Text
- View/download PDF
5. Research on precise lithium battery state of charge estimation method based on CALSE-LSTM model and pelican algorithm
- Author
-
Ding, Zujun, Hu, Daiming, Jing, Yang, Ma, Mengyu, Xie, Yingqi, Yin, Qingyuan, Zeng, Xiaoyu, Zhang, Chu, Peng, Tian, and Ji, Jie
- Published
- 2024
- Full Text
- View/download PDF
6. Spatial Temporal Signatures: A Hybrid CNN-LSTM Architecture for Improved Sign Language Recognition
- Author
-
Sukhavasi, Vidyullatha, Shanmuga Sundari, M., Nithya, K. S. Yamini, Bairu, Pragna, Li, Gang, Series Editor, Filipe, Joaquim, Series Editor, Xu, Zhiwei, Series Editor, Ortiz-Rodriguez, Fernando, editor, Tiwari, Sanju, editor, Krisnadhi, Adila Alfa, editor, Medina-Quintero, Jose Melchor, editor, and Valle-Cruz, David, editor
- Published
- 2025
- Full Text
- View/download PDF
7. An Efficient Approach to Lip-Reading with 3D CNN and Bi-LSTM Fusion Model
- Author
-
Joshi, Rohit Chandra, Juyal, Aayush, Jain, Vishal, Chaturvedi, Saumya, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Gonçalves, Paulo J. Sequeira, editor, Singh, Pradeep Kumar, editor, Tanwar, Sudeep, editor, and Epiphaniou, Gregory, editor
- Published
- 2025
- Full Text
- View/download PDF
8. State of the Art Recurrent Neural Network with Bidirectional Long Short-Term Memory for Cursive Handwriting Recognition
- Author
-
Jose, Manju, Udupi, Prakash Kumar, Li, Gang, Series Editor, Filipe, Joaquim, Series Editor, Ghosh, Ashish, Series Editor, Xu, Zhiwei, Series Editor, T., Shreekumar, editor, L., Dinesha, editor, and Rajesh, Sreeja, editor
- Published
- 2025
- Full Text
- View/download PDF
9. A novel neural network architecture and cross-model transfer learning for multi-task autonomous driving
- Author
-
Li, Youwei and Qu, Jian
- Published
- 2024
- Full Text
- View/download PDF
10. A hybrid univariate data preprocessing using overlapping flexible sliding window and DWT for rainfall prediction using deep learning ensemble techniques.
- Author
-
Mohammed, Mansur, Abiyev, Rahib H, Ameen, Zubaida Said, and Mubarak, Auwalu Saleh
- Abstract
Accurately predicting rainfall is still a difficult but crucial task in meteorological forecasting, with significant implications for agriculture, disaster planning, and water resource management. In the past, statistical methods were used to forecast rainfall, however these methods were less accurate due to data dynamics and other problems. To address the data issue in this study, Discrete Wavelet Transform (DWT) was employed, it improves rainfall data preprocessing by decomposing data, eliminating noise, risk of overfitting and concentrating on significant patterns. Overlapping Flexible Sliding Window (OFSW) technique is essential since it dynamically modifies window sizes according to data properties unlike the overlapping sliding windows. Leveraging the advancements in Deep Learning (DL), this paper presents rainfall prediction by employing DL algorithms Convolutional Neural Networks (CNNs), Long Short-Term Memory Networks (LSTMs), and Multilayer Perceptron (MLPs). Furthermore, the standalone models were fused to form a weighted average ensemble. The monthly data from Nigeria and the Kaduna region were gathered and used in forecasting. The Bi-LSTM model has shown better performance compared to other models in predicting rainfall univariate time series data. In this study, these models are integrated into an ensemble structure for prediction purposes. The obtained ensemble learning model harmonizes the advantages of CNNs, Bi-LSTMs, and MLPs. The results of the weighted average ensemble demonstrated the benefits of the ensemble model in rainfall prediction in Kaduna by achieving the least MSE of 0.0018,0.04242, 0.0303 of MSE, RMSE and MAE in the training phase respectively, while in the testing phase, it achieves, 0.0041, 0.0640 and 0.0447 of MSE, RMSE and MAE in the testing phase respectively. MSE, RMSE and MAE of 0.0017, 0.0412, and 0.0301 for Nigeria in the training phase respectively, while in the testing phase, 0.0042, 0.0648 and 0.0457 of MSE, RMSE and MAE were achieved. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
11. Enhanced PM2.5 prediction in Delhi using a novel optimized STL-CNN-BILSTM-AM hybrid model.
- Author
-
Sreenivasulu, T. and Rayalu, G. Mokesh
- Subjects
AIR pollution control ,PARTICULATE matter ,AIR pollution ,LONG short-term memory ,AIR quality - Abstract
Accurate air pollution predictions in urban areas facilitate the implementation of efficient actions to control air pollution and the formulation of strategies to mitigate contamination. This includes establishing an early warning system to notify the public. Creating precise estimates for PM2.5 air pollutants in large cities is a challenging task because of the numerous relevant factors and quick fluctuations. This study introduces a novel hybrid model named STL-CNN-BILSTM-AM. It combines the seasonal-trend decomposition method with LOESS (STL) to simplify learning tasks and increase prediction accuracy for complex, nonlinear time-series data. Convolutional neural networks (CNNs) extract features from decomposed components of PM2.5 and other feature variables, such as pollutants and meteorological variables. Bidirectional long-short-term memory (BILSTM) uses these features to extract temporal relationships, enabling the forecasting of daily PM2.5 levels at four locations in Delhi. This hybrid model uses attention mechanisms to extract the most significant information, as well as Bayesian optimization to tune the hyperparameters. The suggested model greatly improved performance in all four regions used in this study, as evidenced by the findings. We compared it with the CNN-BILSTM, BILSTM, LSTM, and CNN models, and the suggested model outperformed the state-of-the-art models by utilizing STL decomposition components and other features. The overall results show that the STL-CNN-BILSTM-AM is better at predicting air quality, especially the concentration of PM2.5 in cities when the data has a high seasonal trend and is complex. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Improving Solar Radiation Forecasting in Cloudy Conditions by Integrating Satellite Observations.
- Author
-
Bu, Qiangsheng, Zhuang, Shuyi, Luo, Fei, Ye, Zhigang, Yuan, Yubo, Ma, Tianrui, and Da, Tao
- Subjects
- *
CONVOLUTIONAL neural networks , *LONG short-term memory , *SOLAR radiation , *STANDARD deviations , *SOLAR energy - Abstract
Solar radiation forecasting is the basis of building a robust solar power system. Most ground-based forecasting methods are unable to consider the impact of cloud changes on future solar radiation. To alleviate this limitation, this study develops a hybrid network which relies on a convolutional neural network to extract cloud motion patterns from time series of satellite observations and a long short-term memory neural network to establish the relationship between future solar radiation and cloud information, as well as antecedent measurements. We carefully select the optimal scales to consider the spatial and temporal correlations of solar radiation and design test experiments at ten stations to check the model performance in various climate zones. The results demonstrate that the solar radiation forecasting accuracy is considerably improved, particularly in cloudy conditions, compared with purely ground-based models. The maximum magnitude of improvements reaches up to 50 W/m2 (15%) in terms of the (relative) root mean squared error (RMSE) for 1 h ahead forecasts. The network achieves superior forecasts with correlation coefficients varying from 0.96 at 1 h ahead to 0.85 at 6 h ahead. Forecast errors are related to cloud regimes, of which the cloud amount leads to a maximum relative RMSE difference of about 50% with an additional 5% from cloud variability. This study ascertains that multi-source data fusion contributes to a better simulation of cloud impacts and a combination of different deep learning techniques enables more reliable forecasts of solar radiation. In addition, multi-step forecasts with a low latency make the advance planning and management of solar energy possible in practical applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Emotion Recognition Model of EEG Signals Based on Double Attention Mechanism.
- Author
-
Ma, Yahong, Huang, Zhentao, Yang, Yuyao, Zhang, Shanwen, Dong, Qi, Wang, Rongrong, and Hu, Liangliang
- Subjects
- *
AFFECTIVE computing , *LONG short-term memory , *CONVOLUTIONAL neural networks , *EMOTION recognition , *DEEP learning , *RECOGNITION (Psychology) - Abstract
Background: Emotions play a crucial role in people's lives, profoundly affecting their cognition, decision-making, and interpersonal communication. Emotion recognition based on brain signals has become a significant challenge in the fields of affective computing and human-computer interaction. Methods: Addressing the issue of inaccurate feature extraction and low accuracy of existing deep learning models in emotion recognition, this paper proposes a multi-channel automatic classification model for emotion EEG signals named DACB, which is based on dual attention mechanisms, convolutional neural networks, and bidirectional long short-term memory networks. DACB extracts features in both temporal and spatial dimensions, incorporating not only convolutional neural networks but also SE attention mechanism modules for learning the importance of different channel features, thereby enhancing the network's performance. DACB also introduces dot product attention mechanisms to learn the importance of spatial and temporal features, effectively improving the model's accuracy. Results: The accuracy of this method in single-shot validation tests on the SEED-IV and DREAMER (Valence-Arousal-Dominance three-classification) datasets is 99.96% and 87.52%, 90.06%, and 89.05%, respectively. In 10-fold cross-validation tests, the accuracy is 99.73% and 84.26%, 85.40%, and 85.02%, outperforming other models. Conclusions: This demonstrates that the DACB model achieves high accuracy in emotion classification tasks, demonstrating outstanding performance and generalization ability and providing new directions for future research in EEG signal recognition. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Comparative Analysis of Multiple Deep Learning Models for Forecasting Monthly Ambient PM 2.5 Concentrations: A Case Study in Dezhou City, China.
- Author
-
He, Zhenfang and Guo, Qingchun
- Subjects
- *
ARTIFICIAL neural networks , *CONVOLUTIONAL neural networks , *LONG short-term memory , *AIR pollution prevention , *AIR pollution control , *DEEP learning - Abstract
Ambient air pollution affects human health, vegetative growth and sustainable socio-economic development. Therefore, air pollution data in Dezhou City in China are collected from January 2014 to December 2023, and multiple deep learning models are used to forecast air pollution PM2.5 concentrations. The ability of the multiple models is evaluated and compared with observed data using various statistical parameters. Although all eight deep learning models can accomplish PM2.5 forecasting assignments, the precision accuracy of the CNN-GRU-LSTM forecasting method is 34.28% higher than that of the ANN forecasting method. The result shows that CNN-GRU-LSTM has the best forecasting performance compared to the other seven models, achieving an R (correlation coefficient) of 0.9686 and an RMSE (root mean square error) of 4.6491 μg/m3. The RMSE values of CNN, GRU and LSTM models are 57.00%, 35.98% and 32.78% higher than that of the CNN-GRU-LSTM method, respectively. The forecasting results reveal that the CNN-GRU-LSTM predictor remarkably improves the performances of benchmark CNN, GRU and LSTM models in overall forecasting. This research method provides a new perspective for predictive forecasting of ambient air pollution PM2.5 concentrations. The research results of the predictive model provide a scientific basis for air pollution prevention and control. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Harnessing Deep Learning and Snow Cover Data for Enhanced Runoff Prediction in Snow-Dominated Watersheds.
- Author
-
Adnan, Rana Muhammad, Mo, Wang, Kisi, Ozgur, Heddam, Salim, Al-Janabi, Ahmed Mohammed Sami, and Zounemat-Kermani, Mohammad
- Subjects
- *
CONVOLUTIONAL neural networks , *LONG short-term memory , *MODIS (Spectroradiometer) , *RECURRENT neural networks , *STANDARD deviations , *WATERSHEDS - Abstract
Predicting streamflow is essential for managing water resources, especially in basins and watersheds where snowmelt plays a major role in river discharge. This study evaluates the advanced deep learning models for accurate monthly and peak streamflow forecasting in the Gilgit River Basin. The models utilized were LSTM, BiLSTM, GRU, CNN, and their hybrid combinations (CNN-LSTM, CNN-BiLSTM, CNN-GRU, and CNN-BiGRU). Our research measured the model's accuracy through root mean square error (RMSE), mean absolute error (MAE), Nash–Sutcliffe efficiency (NSE), and the coefficient of determination (R2). The findings indicated that the hybrid models, especially CNN-BiGRU and CNN-BiLSTM, achieved much better performance than traditional models like LSTM and GRU. For instance, CNN-BiGRU achieved the lowest RMSE (71.6 in training and 95.7 in testing) and the highest R2 (0.962 in training and 0.929 in testing). A novel aspect of this research was the integration of MODIS-derived snow-covered area (SCA) data, which enhanced model accuracy substantially. When SCA data were included, the CNN-BiLSTM model's RMSE improved from 83.6 to 71.6 during training and from 108.6 to 95.7 during testing. In peak streamflow prediction, CNN-BiGRU outperformed other models with the lowest absolute error (108.4), followed by CNN-BiLSTM (144.1). This study's results reinforce the notion that combining CNN's spatial feature extraction capabilities with the temporal dependencies captured by LSTM or GRU significantly enhances model accuracy. The demonstrated improvements in prediction accuracy, especially for extreme events, highlight the potential for these models to support more informed decision-making in flood risk management and water allocation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Monitoring and Analyzing Driver Physiological States Based on Automotive Electronic Identification and Multimodal Biometric Recognition Methods.
- Author
-
Zhou, Shengpei, Zhang, Nanfeng, Duan, Qin, Liu, Xiaosong, Xiao, Jinchao, Wang, Li, and Yang, Jingfeng
- Subjects
- *
CONVOLUTIONAL neural networks , *LONG short-term memory , *ELECTRIC vehicles , *DEEP learning , *PATIENT monitoring , *BIOMETRIC identification - Abstract
In an intelligent driving environment, monitoring the physiological state of drivers is crucial for ensuring driving safety. This paper proposes a method for monitoring and analyzing driver physiological characteristics by combining electronic vehicle identification (EVI) with multimodal biometric recognition. The method aims to efficiently monitor the driver's heart rate, breathing frequency, emotional state, and fatigue level, providing real-time feedback to intelligent driving systems to enhance driving safety. First, considering the precision, adaptability, and real-time capabilities of current physiological signal monitoring devices, an intelligent cushion integrating MEMSs (Micro-Electro-Mechanical Systems) and optical sensors is designed. This cushion collects heart rate and breathing frequency data in real time without disrupting the driver, while an electrodermal activity monitoring system captures electromyography data. The sensor layout is optimized to accommodate various driving postures, ensuring accurate data collection. The EVI system assigns a unique identifier to each vehicle, linking it to the physiological data of different drivers. By combining the driver physiological data with the vehicle's operational environment data, a comprehensive multi-source data fusion system is established for a driving state evaluation. Secondly, a deep learning model is employed to analyze physiological signals, specifically combining Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks. The CNN extracts spatial features from the input signals, while the LSTM processes time-series data to capture the temporal characteristics. This combined model effectively identifies and analyzes the driver's physiological state, enabling timely anomaly detection. The method was validated through real-vehicle tests involving multiple drivers, where extensive physiological and driving behavior data were collected. Experimental results show that the proposed method significantly enhances the accuracy and real-time performance of physiological state monitoring. These findings highlight the effectiveness of combining EVI with multimodal biometric recognition, offering a reliable means for assessing driver states in intelligent driving systems. Furthermore, the results emphasize the importance of personalizing adjustments based on individual driver differences for more effective monitoring. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Attention-CNN-LSTM based intrusion detection system (ACL-IDS) for in-vehicle networks: Attention-CNN-LSTM based Intrusion...: A. Taneja, G. Kumary.
- Author
-
Taneja, Amit and Kumar, Gulshan
- Subjects
- *
LONG short-term memory , *CONVOLUTIONAL neural networks , *COMPUTER network traffic , *TELECOMMUNICATION , *ELECTRONIC control , *INTRUSION detection systems (Computer security) - Abstract
Modern vehicles rely on electronic control units (ECUs) communicating through the controller area network (CAN) bus protocol. However, increased connectivity through Wi-Fi, Bluetooth, and onboard diagnostics (OBD) ports has heightened cybersecurity risks due to the CAN bus protocol's inherent security vulnerabilities. Addressing this challenge requires an in-vehicle intrusion detection system (IDS) with high accuracy and minimal false alarms. Existing IDSs for in-vehicle networks often fall short due to inadequate extraction of network traffic features' dependencies in a time-series context.To overcome this limitation, this study presents ACL-IDS, a hybrid intrusion detection system for in-vehicle network traffic. Leveraging deep learning techniques like CNN, LSTM, and attention mechanisms, ACL-IDS effectively captures short-term and long-term dependencies within network traffic, enhancing intrusion detection accuracy. Extensive experiments on a real benchmark dataset demonstrate ACL-IDS's superior performance compared to individual, ensemble, and state-of-the-art methods. With detection accuracy reaching up to 99%, ACL-IDS emerges as a robust solution for analyzing and detecting intrusions in in-vehicle network traffic. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Deep learning models to predict primary open‐angle glaucoma.
- Author
-
Zhou, Ruiwen, Philip Miller, J., Gordon, Mae, Kass, Michael, Lin, Mingquan, Peng, Yifan, Li, Fuhai, Feng, Jiarui, and Liu, Lei
- Subjects
- *
OPEN-angle glaucoma , *DEEP learning , *CONVOLUTIONAL neural networks , *VISUAL fields , *VISION disorders , *OCULAR hypertension - Abstract
Summary: Glaucoma is a major cause of blindness and vision impairment worldwide, and visual field (VF) tests are essential for monitoring the conversion of glaucoma. While previous studies have primarily focused on using VF data at a single time point for glaucoma prediction, there has been limited exploration of longitudinal trajectories. Additionally, many deep learning techniques treat the time‐to‐glaucoma prediction as a binary classification problem (glaucoma Yes/No), resulting in the misclassification of some censored subjects into the nonglaucoma category and decreased power. To tackle these challenges, we propose and implement several deep‐learning approaches that naturally incorporate temporal and spatial information from longitudinal VF data to predict time‐to‐glaucoma. When evaluated on the Ocular Hypertension Treatment Study (OHTS) dataset, our proposed convolutional neural network (CNN)‐long short‐term memory (LSTM) emerged as the top‐performing model among all those examined. The implementation code can be found online (https://github.com/rivenzhou/VF_prediction). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Research on a Bearing Fault Diagnosis Method Based on a CNN-LSTM-GRU Model.
- Author
-
Han, Kaixu, Wang, Wenhao, and Guo, Jun
- Subjects
CONVOLUTIONAL neural networks ,LONG short-term memory ,FAULT diagnosis ,ROLLER bearings ,TIME series analysis ,DEEP learning - Abstract
In view of the problem of the insufficient performance of deep learning models in time series prediction and poor comprehensive space–time feature extraction, this paper proposes a diagnostic method (CNN-LSTM-GRU) that integrates convolutional neural network (CNN), long short-term memory (LSTM) network, and gated recurrent unit (GRU) models. In this study, a convolutional neural network (CNN) model is used to process two-dimensional image data in both time and frequency domains, and a convolutional core attention mechanism is introduced to extract spatial features, such as peaks, cliffs, and waveforms, from the samples. A long short-term memory (LSTM) network is embedded in the output processing of the convolutional neural network (CNN) to analyze the long-sequence variation characteristics of rolling bearing vibration signals and enable long-term time series prediction by capturing long-term dependencies in the sequence. In addition, a gated recurrent unit (GRU) is used to refine long-term time series predictions, providing local fine-tuning and improving the accuracy of fault diagnosis. Using a dataset obtained from Case Western Reserve University (CWRU), the average accuracy of CNN-LSTM-GRU fault vibration is greater than 99%, and its superior performance in a noisy environment is demonstrated. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Short-term power load forecasting using SSA-CNN-LSTM method.
- Author
-
Wang, Yonggang, Hao, Yue, Zhang, Biying, and Zhang, Nannan
- Subjects
CONVOLUTIONAL neural networks ,LONG short-term memory ,PREDICTION algorithms ,TIME series analysis ,SEARCH algorithms ,LOAD forecasting (Electric power systems) - Abstract
The short-term power load forecasting provides an essential foundation for the dispatching management of the power system, which is crucial for enhancing economy and ensuring operational stability. To enhance the precision of the short-term power load forecasting, this paper proposes a hybrid prediction algorithm based on sparrow search algorithm (SSA), convolutional neural network (CNN) and long short-term memory (LSTM). First, feature datasets are constructed based on date information, meteorological data, similar days. The CNN performs effective feature extraction on the data and feeds the results into the LSTM for time series data analysis. Second, eight key parameters are optimized by SSA for improving the prediction precision of the CNN-LSTM prediction model. Simulation results show that the R2 of the proposed model exhibits a substantial enhancement in comparison to other models, reaching 0.9919 and presents a remarkable decrease in MAPE resulting in a value of 1.2%. Furthermore, RMSE and MAE have decreased to 1.17MW and 0.97MW respectively. Therefore, the proposed method has the ability to improve the prediction accuracy, due to the advantages in data mining of CNN, good time series data fitting ability of LSTM, and excellent optimization ability of SSA. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. Optimizing Bearing Fault Detection: CNN-LSTM with Attentive TabNet for Electric Motor Systems.
- Author
-
Khawaja, Alaa U., Shaf, Ahmad, Thobiani, Faisal Al, Ali, Tariq, Irfan, Muhammad, Pirzada, Aqib Rehman, and Shahkeel, Unza
- Subjects
CONVOLUTIONAL neural networks ,ELECTRIC faults ,RANDOM forest algorithms ,ELECTRIC motors ,AUTOMATION - Abstract
Electric motor-driven systems are core components across industries, yet they're susceptible to bearing faults. Manual fault diagnosis poses safety risks and economic instability, necessitating an automated approach. This study proposes FTCNNLSTM (Fine-Tuned TabNet Convolutional Neural Network Long Short-Term Memory), an algorithm combining Convolutional Neural Networks, Long Short-Term Memory Networks, and Attentive Interpretable Tabular Learning. The model preprocesses the CWRU (Case Western Reserve University) bearing dataset using segmentation, normalization, feature scaling, and label encoding. Its architecture comprises multiple 1D Convolutional layers, batch normalization, max-pooling, and LSTM blocks with dropout, followed by batch normalization, dense layers, and appropriate activation and loss functions. Fine-tuning techniques prevent overfitting. Evaluations were conducted on 10 fault classes from the CWRU dataset. FTCNNLSTM was benchmarked against four approaches: CNN, LSTM, CNN-LSTM with random forest, and CNN-LSTM with gradient boosting, all using 460 instances. The FTCNNLSTM model, augmented with TabNet, achieved 96% accuracy, outperforming other methods. This establishes it as a reliable and effective approach for automating bearing fault detection in electric motor-driven systems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Dynamic scenario deduction analysis for hazardous chemical accident based on CNN‐LSTM model with attention mechanism.
- Author
-
Chen, Guohua, Ding, Xu, Gao, Xiaoming, Li, Xiaofeng, Zhou, Lixing, Zhao, Yimeng, and Lv, Hongpeng
- Subjects
CONVOLUTIONAL neural networks ,EMERGENCY management ,HAZARDOUS substances ,DATABASES ,ANALYTICAL chemistry - Abstract
The evolution of hazardous chemical accidents (HCAs) is characterized by uncertainty and complexity. It is challenging for decision‐makers to expeditiously adapt emergency response plans in response to dynamically changing scenario states. This study proposes a data‐driven methodology for constructing accident scenarios and develops a novel hybrid deep learning model for scenario deduction analysis. This model aids in accurately predicting the evolution of HCAs, enabling emergency responders to prepare and implement targeted interventions proactively. First, a framework for constructing an accident scenario database is presented, based on the time‐sequential characteristics of accident progression. This framework employs a data‐driven approach to describe the evolution process of accident scenarios. Second, a deep learning model (CNN‐LSTM‐Attention) that integrates convolutional neural network (CNN), long short‐term memory (LSTM), and attention mechanism (AM) is developed for accident scenario deduction analysis. Finally, to illustrate practical application, a scenario database for HCAs is established. A major HCA case study is conducted to demonstrate the ability of this model to analyze various scenarios, thereby improving emergency decision‐making efficiency. Compared with algorithms such as CNN, LSTM, and CNN‐LSTM, the prediction accuracy of this method ranges from 86% to 93%, signifying an improvement of over 7%. This work provides a reliable framework for supporting decision‐making in emergency management. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Spatio-Temporal Feature Extraction for Pipeline Leak Detection in Smart Cities Using Acoustic Emission Signals: A One-Dimensional Hybrid Convolutional Neural Network–Long Short-Term Memory Approach.
- Author
-
Ullah, Saif, Ullah, Niamat, Siddique, Muhammad Farooq, Ahmad, Zahoor, and Kim, Jong-Myon
- Subjects
CONVOLUTIONAL neural networks ,ACOUSTIC emission ,LEAK detection ,SMART cities ,INDUSTRIAL safety - Abstract
Pipeline leakage represents a critical challenge in smart cities and various industries, leading to severe economic, environmental, and safety consequences. Early detection of leaks is essential for overcoming these risks and ensuring the safe operation of pipeline systems. In this study, a hybrid convolutional neural network–long short-term memory (CNN-LSTM) model for pipeline leak detection that uses acoustic emission signals was designed. In this model, acoustic emission signals are initially preprocessed using a Savitzky–Golay filter to reduce noise. The filtered signals are input into the hybrid model, where spatial features are extracted using a CNN. The features are then passed to an LSTM network, which extracts temporal features from the signals. Based on these features, the presence or absence of a leakage is determined. The performance of the proposed model was compared with two alternative approaches: a method that employs combined features from the time domain and LSTM and a bidirectional gated recurrent unit model. The proposed approach demonstrated superior performance, as evidenced by lower validation loss, higher validation accuracy, enhanced confusion matrices, and improved t-distributed stochastic neighbor embedding plots compared to the other models when tested on industrial data. The findings indicate that the proposed model is more effective in accurately detecting pipeline leaks, offering a promising solution for enhancing smart cities and industrial safety. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Lower Limb Motion Recognition Based on sEMG and CNN-TL Fusion Model.
- Author
-
Zhou, Zhiwei, Tao, Qing, Su, Na, Liu, Jingxuan, Chen, Qingzheng, and Li, Bowen
- Subjects
- *
CONVOLUTIONAL neural networks , *MACHINE learning , *TRANSFORMER models , *ELECTROMYOGRAPHY , *RECOGNITION (Psychology) - Abstract
To enhance the classification accuracy of lower limb movements, a fusion recognition model integrating a surface electromyography (sEMG)-based convolutional neural network, transformer encoder, and long short-term memory network (CNN-Transformer-LSTM, CNN-TL) was proposed in this study. By combining these advanced techniques, significant improvements in movement classification were achieved. Firstly, sEMG data were collected from 20 subjects as they performed four distinct gait movements: walking upstairs, walking downstairs, walking on a level surface, and squatting. Subsequently, the gathered sEMG data underwent preprocessing, with features extracted from both the time domain and frequency domain. These features were then used as inputs for the machine learning recognition model. Finally, based on the preprocessed sEMG data, the CNN-TL lower limb action recognition model was constructed. The performance of CNN-TL was then compared with that of the CNN, LSTM, and SVM models. The results demonstrated that the accuracy of the CNN-TL model in lower limb action recognition was 3.76%, 5.92%, and 14.92% higher than that of the CNN-LSTM, CNN, and SVM models, respectively, thereby proving its superior classification performance. An effective scheme for improving lower limb motor function in rehabilitation and assistance devices was thus provided. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. A deep learning LSTM-based approach for AMD classification using OCT images.
- Author
-
Hamid, Laila, Elnokrashy, Amgad, Abdelhay, Ehab H., and Abdelsalam, Mohamed M.
- Subjects
- *
CONVOLUTIONAL neural networks , *MACULAR degeneration , *OPTICAL coherence tomography , *VISION disorders , *DEEP learning - Abstract
Age-related macular degeneration (AMD) is an age-related, persistent, painless eye disease that impairs central vision. The central area (macula) of the retina, located at the back of the eye, sustains damage that is the cause of loss of vision. The early detection of AMD can increase the probability of treatment and prevent vision loss. The AMD can be classified into dry and wet AMD based on the absence of neovascularization. This study introduces a new methodology for the classification of AMD using optical coherence tomography (OCT) retinal images. The proposed methodology is based on three stages. The first stage is the data preparation stage for resizing and normalizing the used images. The second stage is the image processing stage for enhancing the image quality as contrast and resolution these enhancements have been checked by the weighted peak signal-to-noise ratio (WPSNR) methodology. The third stage is the deep feature extraction and classification stage, which consists of two sub-models. The first model is MobileNet V1 which has been used as a deep feature extractor. The second model is LSTM (long short-term memory), fed with deep features to classify the AMD stages. A multi-classification with six separate trials has been employed with the proposed methodology, and compared with other models like DenseNet201 and InceptionV3. The proposed model has been tested on a sample of benchmark data with 4005 grayscale images labeled into three classes. The proposed methodology has achieved an accuracy of 98.85%, a sensitivity of 99.09%, and a specificity of 99.1%. To ensure the effectiveness of the proposed methodology, a comparative analysis has been established with previous approaches in the related field, and the results demonstrated the superiority of the proposed system in AMD multi-classification. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. The Deep Learning Based Epileptic Seizure Detection Using 2-layer Convolutional Network with Long Short-Term Memory.
- Author
-
Vaithilingam, Sonia Devi and Regulagedda, Pallavi
- Subjects
CONVOLUTIONAL neural networks ,SEIZURES (Medicine) ,EPILEPSY ,CENTRAL nervous system ,NEUROLOGICAL disorders ,DEEP learning - Abstract
Epilepsy is a pervasive chronic neurological disorder characterized through irregular electrical discharges in the brain which causes seizures. Epilepsy seizure is a disorder that affects the brain cells with an influence on an effectiveness of central nervous system. Electroencephalography (EEG) is a majorly utilized method for epileptic seizure detection and diagnosis. In this research, Deep Learning (DL) methods of 2-layer Convolutional Neural Network (CNN) with Long Short-Term Memory (LSTM) are proposed for an automatic detection and diagnosis of an epileptic seizure. In the pre-processing phase, a Butterworth filter method of order 2 is used to remove noise in the EEG signal. The 2-layer CNN is used for the process of feature extraction. In 2-layer LSTM, one layer is utilized to perform short-term dependencies, while another layer is utilized to perform long term dependencies. In the end, the proposed method classifies seizures into epileptic and non-epileptic. The results demonsrates that the proposed method delivers performance metrics of better accuracy of 99.90% and sensitivity of 90.06% using CHB-MIT and Bonn datasets which contains EEG signals as compared to the existing methods like CNN and Epilepsy-Net. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Short-term train arrival delay prediction: a data-driven approach
- Author
-
Fu, Qingyun, Ding, Shuxin, Zhang, Tao, Wang, Rongsheng, Hu, Ping, and Pu, Cunlai
- Published
- 2024
- Full Text
- View/download PDF
28. Hybrid deep learning for estimation of state-of-health in lithium-ion batteries.
- Author
-
Eka Cahyani, Denis, Gumilar, Langlang, Nur Afandi, Arif, Wibawa, Aji Prasetya, and Junoh, Ahmad Kadri
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,RECURRENT neural networks ,ENERGY density ,ENERGY storage ,LITHIUM-ion batteries - Abstract
Lithium-ion (li-ion) batteries have a high energy density and a long cycle life. Lithium-ion batteries have a finite lifespan, and their energy storage capacity diminishes with use. In order to properly plan battery maintenance, the state of health (SoH) of lithium-ion batteries is crucial. This study aims to combine two deep learning techniques (hybrid deep learning), namely convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM), for SoH estimation in li-ion batteries. This study contrasts hybrid deep learning methods to single deep learning models so that the most suitable model for accurately measuring the SoH in lithium-ion batteries can be determined. In comparison to other methodologies, CNN-BiLSTM yields the best results. The CNN-BiLSTM algorithm yields RMSE, mean square error (MSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) in the following order: 0.00916, 0.000084, 0.0048, and 0.00603. This indicates that CNN-BiLSTM, as a hybrid deep learning model, is able to calculate the approximate capacity of the lithium-ion battery more accurately than other methods. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
29. Hybrid attention-based deep neural networks for short-term wind power forecasting using meteorological data in desert regions
- Author
-
Moussa Belletreche, Nadjem Bailek, Mostafa Abotaleb, Kada Bouchouicha, Bilel Zerouali, Mawloud Guermoui, Alban Kuriqi, Amal H. Alharbi, Doaa Sami Khafaga, Mohamed EL-Shimy, and El-Sayed M. El-kenawy
- Subjects
Wind power forecasting ,Convolutional neural network ,Hybrid deep learning ,Energy transition ,Long short-term memory ,Attention mechanism ,Medicine ,Science - Abstract
Abstract This study introduces an optimized hybrid deep learning approach that leverages meteorological data to improve short-term wind energy forecasting in desert regions. Over a year, various machine learning and deep learning models have been tested across different wind speed categories, with multiple performance metrics used for evaluation. Hyperparameter optimization for the LSTM and Conv-Dual Attention Long Short-Term Memory (Conv-DA-LSTM) architectures was performed. A comparison of the techniques indicates that the deep learning methods consistently outperform the classical techniques, with Conv-DA-LSTM yielding the best overall performance with a clear margin. This method obtained the lowest error rates (RMSE: 71.866) and the highest level of accuracy (R2: 0.93). The optimization clearly works for higher wind speeds, achieving a remarkable improvement of 22.9%. When we look at the monthly performance, all the months presented at least some level of consistent enhancement (RRMSE reductions from 1.6 to 10.2%). These findings highlight the potential of advanced deep learning techniques in enhancing wind energy forecasting accuracy, particularly in challenging desert environments. The hybrid method developed in this study presents a promising direction for improving renewable energy management. This allows for more efficient resource allocation and improves wind resource predictability.
- Published
- 2024
- Full Text
- View/download PDF
30. Constructing the Public Opinion Crisis Prediction Model Using CNN and LSTM Techniques Based on Social Network Mining
- Author
-
Lou Yan, Zhipeng Ren, Yong Zhang, Zhonghui Tao, and Yiwu Zhao
- Subjects
convolutional neural network ,deep learning ,inappropriate remarks ,internet of things ,long short-term memory ,social network ,Technology - Abstract
This research endeavors to address the persistent dissemination of public opinion within social networks, mitigate the propagation of inappropriate content on these platforms, and enhance the overall service quality of social networks. To achieve these objectives, Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) techniques are employed in this research to develop a predictive model for anticipating public opinion crises in social network mining. This model furnishes users with a valuable reference for subsequent decisionmaking processes. The initial phase of this research involves the collection of user behavior data from social networks using IoT technologies, serving as the basis for extensive big data analysis and neural network research. Subsequently, a social network text categorization model is constructed by amalgamating the Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM) architecture, elucidating the training procedures of deep learning methodologies within CNN and LSTM networks. The effectiveness of this approach is subsequently validated through comparisons with other deep learning techniques. Based on the obtained results and findings, the CNN-LSTM model demonstrates a noteworthy accuracy rate of 92.19% and an exceptionally low loss value of 0.4075. Of particular significance is the classification accuracy of the CNN-LSTM algorithm within social network datasets, which surpasses that of alternative algorithms, including CNN (by 6.31%), LSTM (by 4.43%), RNN (by 3.51%), Transformer (by 40.29%), and Generative Adversarial Network (GAN) (by 4.49%). This underscores the effectiveness of the CNN-LSTM algorithm in the realm of social network text classification.
- Published
- 2024
- Full Text
- View/download PDF
31. Short-term train arrival delay prediction: a data-driven approach
- Author
-
Qingyun Fu, Shuxin Ding, Tao Zhang, Rongsheng Wang, Ping Hu, and Cunlai Pu
- Subjects
Train delay prediction ,Intelligent dispatching command ,Deep learning ,Convolutional neural network ,Long short-term memory ,Attention mechanism ,Transportation engineering ,TA1001-1280 ,Railroad engineering and operation ,TF1-1620 - Abstract
Purpose – To optimize train operations, dispatchers currently rely on experience for quick adjustments when delays occur. However, delay predictions often involve imprecise shifts based on known delay times. Real-time and accurate train delay predictions, facilitated by data-driven neural network models, can significantly reduce dispatcher stress and improve adjustment plans. Leveraging current train operation data, these models enable swift and precise predictions, addressing challenges posed by train delays in high-speed rail networks during unforeseen events. Design/methodology/approach – This paper proposes CBLA-net, a neural network architecture for predicting late arrival times. It combines CNN, Bi-LSTM, and attention mechanisms to extract features, handle time series data, and enhance information utilization. Trained on operational data from the Beijing-Tianjin line, it predicts the late arrival time of a target train at the next station using multidimensional input data from the target and preceding trains. Findings – This study evaluates our model's predictive performance using two data approaches: one considering full data and another focusing only on late arrivals. Results show precise and rapid predictions. Training with full data achieves a MAE of approximately 0.54 minutes and a RMSE of 0.65 minutes, surpassing the model trained solely on delay data (MAE: is about 1.02 min, RMSE: is about 1.52 min). Despite superior overall performance with full data, the model excels at predicting delays exceeding 15 minutes when trained exclusively on late arrivals. For enhanced adaptability to real-world train operations, training with full data is recommended. Originality/value – This paper introduces a novel neural network model, CBLA-net, for predicting train delay times. It innovatively compares and analyzes the model's performance using both full data and delay data formats. Additionally, the evaluation of the network's predictive capabilities considers different scenarios, providing a comprehensive demonstration of the model's predictive performance.
- Published
- 2024
- Full Text
- View/download PDF
32. Student academic performance prediction enhancement using t-SIDSBO and Triple Voter Network.
- Author
-
Muthuselvan, S., Rajaprakash, S., Jaichandran, R., Antony, Johns, U, Amal P, and A, Ijas V
- Subjects
CONVOLUTIONAL neural networks ,RECURRENT neural networks ,DEEP learning ,ACADEMIC achievement ,PREDICTION models - Abstract
Today's educational environment considers student academic performance prediction to be extremely important in educational organizations. It is a key problem for an academic system at all educational stages. Every educational system that wants to enhance its students' studying experience and academic success must be able to forecast their academic performance. The majority of current research on predicting student performance uses traditional feature selection strategies that involve extracting characteristics and feeding them to a classifier. Scholars can now extract meaningful high-level features from unprocessed data thanks to deep learning (DL). Performance evaluation on difficult tasks is made possible by such sophisticated feature selection strategies. This work proposed a combined Triple Voter Network and t-Self Improved Distribution-based Satin Bowerbird Optimization (t-SIDSBO) predict student academic achievement. Here, the deep LSTM model, CNN model, RNN model which are based on advanced feature prediction models, is used for effective classification, and the best features are chosen using a t-SIDSBO-based feature selection strategy. In this paper, the detailed explanation of the student academic prediction and the feature selection of t-SIDSBO using DL is explained step by step procedure. Following that, the anticipated performance is assessed and improved using performance metrics like accuracy, F_ score, recall, specificity, and precision. The program is implemented using the Python platform. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. 3D word embedding vector feature extraction and hybrid CNNLSTM for natural disaster reports identification.
- Author
-
Faisal, Mohammad Reza, Nugrahadi, Dodon Turianto, Budiman, Irwan, Muliadi, Delimayanti, Mera Kartika, Prastya, Septyan Eka, and Tahyudin, Imam
- Subjects
- *
CONVOLUTIONAL neural networks , *MACHINE learning , *ARTIFICIAL intelligence , *CLASSIFICATION algorithms , *NATURAL disasters - Abstract
Social media contain various information, such as natural disaster reports. Artificial intelligence is used to identify reports from eyewitnesses early for disaster warning systems. The artificial intelligence system includes a text classification model with feature extraction and classification algorithms. Word embedding-based feature extraction is widely used for 1-dimensional (1D) and 2-dimensional (2D) data, suitable for traditional or deep learning algorithms. However, applying feature extraction to 3-dimensional (3D) data for text classification is limited. Previous studies focused on word embedding for 1D, 2D, and 3D outputs with convolutional neural network (CNN). Yet, using 3D data and CNN did not perform well. Despite using CNN and 3D variants, identifying natural disaster reports remains below 80% accuracy. This research aims to improve identifying earthquakes, floods, and forest fires with 3D data and hybrid CNN long short-term memory (LSTM). The study found models with accuracies of 83.38%, 83.72%, and 89.03% for each disaster type. Hybrid CNN LSTM significantly enhanced identification compared to CNN alone, supported by statistical tests with P value less than 0.0001. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Spatial-spectral feature extraction for in-field chlorophyll content estimation using hyperspectral imaging.
- Author
-
Zhao, Ruomei, Tang, Weijie, Liu, Mingjia, Wang, Nan, Sun, Hong, Li, Minzan, and Ma, Yuntao
- Subjects
- *
CONVOLUTIONAL neural networks , *IMAGE segmentation , *DEEP learning , *FEATURE extraction , *CROP management - Abstract
In-situ leaf chlorophyll content (LCC) estimation based on hyperspectral imaging (HSI) is crucial to track the growth status of crops for field management. However, spatial and spectral features of HSI data, suffering from interference of growth dynamic effect and soil, pose the challenge on accuracy and robustness of LCC estimation in several years and growth stages. Therefore, a joint spectral-spatial feature extraction method was proposed by cascade of three-dimensional convolutional neural network (3DCNN) and long short-term memory (LSTM) to reduce the interference for optimising the LCC estimation. Firstly, crop pixels were separated from soil with vegetation index segmentation method. Secondly, when raw images and segmented pixels were input, sensitive bands were selected by random frog (RF bands), and 3DCNN-LSTM was used to extract the joint spectral-spatial features. Finally, models established by RF bands, 3DCNN and 3DCNN-LSTM were compared, and robustness in individual years and stages was validated. Results showed that RF bands and 3DCNN obtained R P 2 of 0.76 and 0.84 when not segmented. After segmentation, performance of 3DCNN improved (R P 2 = 0.85) compared to RF bands (R P 2 = 0.80). Spectral-spatial features by 3DCNN reduced the interference of soil. 3DCNN-LSTM without and with segmentation obtained good performance with R P 2 of 0.95 and 0.96, and the proposed method could reduce the image segmentation process. The optimal model achieved R P 2 above 0.93 in individual years (R P 2 = 0.96 in 2021, R P 2 = 0.94 in 2021) and R P 2 in the range of 0.87–0.97 at individual stages. This paper provides a method to track growth variability between soil and crop for the LCC estimation optimisation. • 3DCNN-LSTM method was proposed to extract spectral-spatial feature for LCC estimation. • 3DCNN reduced the interference of soil and crop difference. • 3DCNN-LSTM method can reduce the image segmentation procedure. • LSTM captured growth dynamics and improved robustness in multiple years and stages. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Big Data Driven Map Reduce Framework for Automated Flood Disaster Detection Based on Heuristic-Based Ensemble Learning.
- Author
-
Ali Shatat, Abdallah Saleh, Mobin Akhtar, Md., Zamani, Abu Sarwar, Dilshad, Sara, and Samdani, Faizan
- Subjects
- *
ARTIFICIAL neural networks , *CONVOLUTIONAL neural networks , *OPTIMIZATION algorithms , *FEATURE selection , *FINANCIAL crises - Abstract
Flooding disaster causes huge impacts on the socio-economic world. In the inundated area, some geo-referenced images are shared through some media posts, which assist in providing alertness to the critical volunteers and managing the financial loss crisis. In this work, the Adaptive Billiards-Inspired Optimization (A-BIO) and Optimized Ensemble-learning-based detection (OED) with map reduce framework is proposed for flood disaster detection. Initially, the big data is gathered and processed for detection. During the map phase, data preprocessing is performed to enhance the performance of the data, which helps in removing the noise or unwanted attributes. Furthermore, the reduction phase can be done through weighted feature selection, where the features to be selected and the weight is optimized through A-BIO, which assists in getting the most significant features for improving the performance and reducing the complexity of the designed model. Finally, OED is performed by a set of classifiers like Convolutional Neural Networks, Adaboost, XGBoost, Long Short-Term Memory, and Deep Neural Networks, where the parameters of ensemble learning classifiers are optimized by A-BIO algorithm. Finally, through the performance analysis, this detection model can provide high accuracy and better detection performance to avoid the huge impacts of flood disasters. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Mid-term forecasting of crude oil prices using the hybrid CEEMDAN and CNN_LSTM deep learning model.
- Author
-
GANDHI, Herry Kartika
- Subjects
PETROLEUM ,PETROLEUM sales & prices ,INTERNATIONAL economic relations ,RENEWABLE energy sources ,TRANSPORTATION - Abstract
Copyright of Energy Policy Journal / Polityka Energetyczna is the property of Mineral & Energy Economy Research Institute of the Polish Academy of Sciences and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2024
- Full Text
- View/download PDF
37. Leveraging Hybrid Deep Learning Models for Enhanced Multivariate Time Series Forecasting.
- Author
-
Mahmoud, Amal and Mohammed, Ammar
- Subjects
MACHINE learning ,CONVOLUTIONAL neural networks ,SUPPLY chain management ,TIME series analysis ,TRAFFIC flow ,DEEP learning - Abstract
Time series forecasting is crucial in various domains, ranging from finance and economics to weather prediction and supply chain management. Traditional statistical methods and machine learning models have been widely used for this task. However, they often face limitations in capturing complex temporal dependencies and handling multivariate time series data. In recent years, deep learning models have emerged as a promising solution for overcoming these limitations. This paper investigates how deep learning, specifically hybrid models, can enhance time series forecasting and address the shortcomings of traditional approaches. This dual capability handles intricate variable interdependencies and non-stationarities in multivariate forecasting. Our results show that the hybrid models achieved lower error rates and higher R 2 values, signifying their superior predictive performance and generalization capabilities. These architectures effectively extract spatial features and temporal dynamics in multivariate time series by combining convolutional and recurrent modules. This study evaluates deep learning models, specifically hybrid architectures, for multivariate time series forecasting. On two real-world datasets - Traffic Volume and Air Quality - the TCN-BiLSTM model achieved the best overall performance. For Traffic Volume, the TCN-BiLSTM model achieved an R 2 score of 0.976, and for Air Quality, it reached an R 2 score of 0.94. These results highlight the model's effectiveness in leveraging the strengths of Temporal Convolutional Networks (TCNs) for capturing multi-scale temporal patterns and Bidirectional Long Short-Term Memory (BiLSTMs) for retaining contextual information, thereby enhancing the accuracy of time series forecasting. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Development of an algorithm for identifying the autism spectrum based on features using deep learning methods.
- Author
-
Amirbay, Aizat, Mukhanova, Ayagoz, Baigabylov, Nurlan, Kudabekov, Medet, Mukhambetova, Kuralay, Baigusheva, Kanagat, Baibulova, Makbal, and Ospanova, Tleugaisha
- Subjects
CONVOLUTIONAL neural networks ,MACHINE learning ,AUTISM spectrum disorders ,MEDICAL protocols ,EYE tracking ,AUTISM ,DEEP learning - Abstract
The presented scientific work describes the results of the development and evaluation of two deep learning algorithms: long short-term memory with a convolutional neural network (LSTM+CNN) and long short-term memory with an autoencoder (LSTM+AE), designed for the diagnosis of autism spectrum disorders. The study focuses on the use of eye tracking technology to collect data on participants' eye movements while interacting with animated objects. These data were saved in NumPy array format (.npy) for ease of later analysis. The algorithms were evaluated in terms of their accuracy, generalization ability, and training time, which was confirmed by experts. The main goal of the study is to improve the diagnosis of autism, making it more accurate and effective. The convolutional neural network long short-term memory and autoencoder-long short-term memory models have shown promise as tools for achieving this goal, with the autoencoder model standing out for its ability to identify internal relationships in data. The article also discusses potential clinical applications of these algorithms and directions for future research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. A Performance Comparison Study on Climate Prediction in Weifang City Using Different Deep Learning Models.
- Author
-
Guo, Qingchun, He, Zhenfang, Wang, Zhaosheng, Qiao, Shuaisen, Zhu, Jingshu, and Chen, Jiaxin
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,WATER management ,DEEP learning ,STANDARD deviations - Abstract
Climate change affects the water cycle, water resource management, and sustainable socio-economic development. In order to accurately predict climate change in Weifang City, China, this study utilizes multiple data-driven deep learning models. The climate data for 73 years include monthly average air temperature (MAAT), monthly average minimum air temperature (MAMINAT), monthly average maximum air temperature (MAMAXAT), and monthly total precipitation (MP). The different deep learning models include artificial neural network (ANN), recurrent NN (RNN), gate recurrent unit (GRU), long short-term memory neural network (LSTM), deep convolutional NN (CNN), hybrid CNN-GRU, hybrid CNN-LSTM, and hybrid CNN-LSTM-GRU. The CNN-LSTM-GRU for MAAT prediction is the best-performing model compared to other deep learning models with the highest correlation coefficient (R = 0.9879) and lowest root mean square error (RMSE = 1.5347) and mean absolute error (MAE = 1.1830). These results indicate that The hybrid CNN-LSTM-GRU method is a suitable climate prediction model. This deep learning method can also be used for surface water modeling. Climate prediction will help with flood control and water resource management. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Hybrid attention-based deep neural networks for short-term wind power forecasting using meteorological data in desert regions.
- Author
-
Belletreche, Moussa, Bailek, Nadjem, Abotaleb, Mostafa, Bouchouicha, Kada, Zerouali, Bilel, Guermoui, Mawloud, Kuriqi, Alban, Alharbi, Amal H., Khafaga, Doaa Sami, EL-Shimy, Mohamed, and El-kenawy, El-Sayed M.
- Subjects
- *
ARTIFICIAL neural networks , *CONVOLUTIONAL neural networks , *WIND forecasting , *WIND power , *DESERTS , *DEEP learning - Abstract
This study introduces an optimized hybrid deep learning approach that leverages meteorological data to improve short-term wind energy forecasting in desert regions. Over a year, various machine learning and deep learning models have been tested across different wind speed categories, with multiple performance metrics used for evaluation. Hyperparameter optimization for the LSTM and Conv-Dual Attention Long Short-Term Memory (Conv-DA-LSTM) architectures was performed. A comparison of the techniques indicates that the deep learning methods consistently outperform the classical techniques, with Conv-DA-LSTM yielding the best overall performance with a clear margin. This method obtained the lowest error rates (RMSE: 71.866) and the highest level of accuracy (R2: 0.93). The optimization clearly works for higher wind speeds, achieving a remarkable improvement of 22.9%. When we look at the monthly performance, all the months presented at least some level of consistent enhancement (RRMSE reductions from 1.6 to 10.2%). These findings highlight the potential of advanced deep learning techniques in enhancing wind energy forecasting accuracy, particularly in challenging desert environments. The hybrid method developed in this study presents a promising direction for improving renewable energy management. This allows for more efficient resource allocation and improves wind resource predictability. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Speaker recognition using Improved Butterfly Optimization Algorithm with hybrid Long Short Term Memory network.
- Author
-
Gade, Venkata Subba Reddy and Manickam, Sumathi
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,OPTIMIZATION algorithms ,LONG short-term memory - Abstract
Speaker recognition is extensively applied in several applications, namely identity verification, electronic voice eavesdropping, surveillance, voice recognition, etc. In an effective speaker recognition system, the extraction and selection of salient and discriminative features is an essential process for accurately identifying the speakers. Therefore, a novel hybrid framework is introduced in this research manuscript. Initially, the input data are acquired from the three-benchmark databases: THUYG-20 SRE corpus, Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), and LibriSpeech. Further, the emotional features are extracted by utilizing hybrid feature extraction techniques which are, amplitude, zero cross rate, energy, Root Mean Square (RMS), statistical moments, autocorrelation, and Mel-Frequency Cepstral Coefficients (MFCC). Then, the feature optimization is carried out using Improved Butterfly Optimization Algorithm (IBOA) that decreases the computational time and complexity of the recognition model. At last, a hybrid classifier: Convolutional Neural Network (CNN) with Long Short Term Memory (LSTM) is implemented for speaker recognition, and its performance is analyzed in terms of F1-score, specificity, accuracy, Positive Predictive Value (PPV), and sensitivity. The empirical investigation demonstrated that the IBOA-based hybrid LSTM network achieved 92.65%, 96.97% and 96.98% of recognition accuracy on the LibriSpeech, RAVDESS and THUYG-20 SRE corpus databases. These results are more impressive than the comparative models, Deep Neural Network (DNN), random forest, K-Nearest Neighbor (KNN), LSTM, Multi class Support Vector Machine (MSVM), Deep Convolutional Recurrent Neural Network (DCRNN), Golden Ratio aided Neural Network (GRaNN), deep sequential LSTM, and Probabilistic Neural Network (PNN). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Underwater Acoustic Orthogonal Frequency-Division Multiplexing Communication Using Deep Neural Network-Based Receiver: River Trial Results.
- Author
-
Thenginthody Hassan, Sabna, Chen, Peng, Rong, Yue, and Chan, Kit Yan
- Subjects
- *
ARTIFICIAL neural networks , *CONVOLUTIONAL neural networks , *UNDERWATER acoustic communication , *DOPPLER effect , *MULTIPATH channels , *CHANNEL estimation - Abstract
In this article, a deep neural network (DNN)-based underwater acoustic (UA) communication receiver is proposed. Conventional orthogonal frequency-division multiplexing (OFDM) receivers perform channel estimation using linear interpolation. However, due to the significant delay spread in multipath UA channels, the frequency response often exhibits strong non-linearity between pilot subcarriers. Since the channel delay profile is generally unknown, this non-linearity cannot be modeled precisely. A neural network (NN)-based receiver effectively tackles this challenge by learning and compensating for the non-linearity through NN training. The performance of the DNN-based UA communication receiver was tested recently in river trials in Western Australia. The results obtained from the trials prove that the DNN-based receiver performs better than the conventional least-squares (LS) estimator-based receiver. This paper suggests that UA communication using DNN receivers holds great potential for revolutionizing underwater communication systems, enabling higher data rates, improved reliability, and enhanced adaptability to changing underwater conditions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Radar-Based Target Tracking Using Deep Learning Approaches with Unscented Kalman Filter.
- Author
-
Patrick, Uwigize, Rao, S. Koteswara, Jagan, B. Omkar Lakshmi, Rai, Hari Mohan, Agarwal, Saurabh, and Pak, Wooguil
- Subjects
CONVOLUTIONAL neural networks ,MACHINE learning ,MONTE Carlo method ,RADAR targets ,KALMAN filtering ,PYTHON programming language ,DEEP learning - Abstract
Machine learning, a rapidly growing field, has attracted numerous researchers for its ability to automatically learn from and make predictions based on data. This manuscript presents an innovative approach to estimating the covariance matrix of noise in radar measurements for target tracking, resulting from collaborative efforts. Traditionally, researchers have assumed that the covariance matrix of noise in sonar measurements is present in the vast majority of literature related to target tracking. On the other hand, this research aims to estimate it by employing deep learning algorithms with noisy measurements in range, bearing, and elevation from radar sensors. This collaborative approach, involving multiple disciplines, provides a more precise and accurate covariance matrix estimate. Additionally, the unscented Kalman filter was combined with the gated recurrent unit, multilayer perceptron, convolutional neural network, and long short-term memory to accomplish the task of 3D target tracking in an airborne environment. The quantification of the results was achieved through the use of Monte Carlo simulations, which demonstrated that the convolutional neural network performed better than any other approach. The system was simulated using a Python program, and the proposed method offers higher accuracy and faster convergence time than conventional target tracking methods. This is a demonstration of the potential that collaboration can have in research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. A new hybrid approach for pneumonia detection using chest X-rays based on ACNN-LSTM and attention mechanism.
- Author
-
Lafraxo, Samira, El Ansari, Mohamed, and Koutti, Lahcen
- Subjects
CONVOLUTIONAL neural networks ,LONG short-term memory ,COMPUTER-aided diagnosis ,ADAPTIVE filters ,FEATURE extraction ,DEEP learning - Abstract
Pneumonia is a serious inflammatory disease that causes lung ulcers, and it is one of the leading reasons for pediatric death in the world. Chest X-rays are perhaps the most commonly utilized modalities to recognize pneumonia. Generally, the illness could be analyzed by a specialist radiologist. But for some reason, the diagnosis may be subjective. Thus, the physicians must be guided by computer-aided diagnosis frameworks in this challenging task. In this study, we propose a combined deep learning architecture to identify pneumonia in chest radiography images. We first, use Adaptive Median Filter for images enhancement, then we employ a regularized Convolutional Neural Network for features extraction, and then we use Long Short Term Memory as a classifier. Finally, the attention mechanism is used to direct the network attention to relevant features. The suggested approach was tested on two publicly available pneumonia X-ray datasets provided by Kermany and the Radiological Society of North America. On the Kermany and RSNA datasets, the suggested technique attained accuracy rates of 99.91% and 88.86%, respectively. In the last stage of our experiments, we employed a Grad-CAM-based color visualization technique to precisely interpret the detection of pneumonia in radiological images. The results outperformed those of state-of-the-art approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. A Short-Term Power Prediction Method for Photovoltaics Based on Similar Day Clustering and Spatio-Temporal Feature Extraction.
- Author
-
Huang, Xu, Wang, Leying, Ge, Leijiao, Hou, Luyang, Du, Tianshuo, Zheng, Yiwen, and Chen, Yanbo
- Subjects
CONVOLUTIONAL neural networks ,SOLAR power plants ,FEATURE extraction ,PREDICTION models ,SEARCH algorithms - Abstract
Accurate PV power prediction is crucial for enhancing grid planning, optimizing dispatch operations, and advancing management strategies. In pursuit of this objective, this study proposes a short-term distributed PV power prediction method that incorporates temporal and spatial feature extraction as well as similar day analysis. Firstly, to address the poor adaptability of traditional clustering methods to time-series data, the K-shape clustering algorithm is employed to categorize the time series into different weather types. Secondly, to overcome the challenges posed by varying time resolutions in similar day analysis, a novel method based on Dynamic Time Warping (DTW) is proposed. This method calculates the similarity between the target days and the days to be collected, considering both the time of day and the day of the week. Subsequently, a PV power generation prediction model based on a convolutional long short-term memory (CNN-LSTM) network is developed to enhance prediction accuracy. To tackle the difficulty of manual hyperparameter tuning, the chaos reverse sparrow search algorithm (CRSSA) is introduced. Finally, a case study is conducted on the measured data of a distributed photovoltaic power station in a certain region of China. By comparing RMSE and MAPE, compared with other prediction models, the proposed prediction model and solving algorithm effectively reduced the relative error by more than 1%, verifying the effectiveness of the proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. Constructing the Public Opinion Crisis Prediction Model Using CNN and LSTM Techniques Based on Social Network Mining.
- Author
-
Yan Lou, Zhipeng Ren, Yong Zhang, Zhonghui Tao, and Yiwu Zhao
- Subjects
PUBLIC opinion ,CONVOLUTIONAL neural networks ,SOCIAL networks ,GENERATIVE adversarial networks ,DEEP learning - Abstract
This research endeavors to address the persistent dissemination of public opinion within social networks, mitigate the propagation of inappropriate content on these platforms, and enhance the overall service quality of social networks. To achieve these objectives, Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) techniques are employed in this research to develop a predictive model for anticipating public opinion crises in social network mining. This model furnishes users with a valuable reference for subsequent decisionmaking processes. The initial phase of this research involves the collection of user behavior data from social networks using IoT technologies, serving as the basis for extensive big data analysis and neural network research. Subsequently, a social network text categorization model is constructed by amalgamating the Convolutional Neural Network-Long Short-Term Memory (CNN-LSTM) architecture, elucidating the training procedures of deep learning methodologies within CNN and LSTM networks. The effectiveness of this approach is subsequently validated through comparisons with other deep learning techniques. Based on the obtained results and findings, the CNN-LSTM model demonstrates a noteworthy accuracy rate of 92.19% and an exceptionally low loss value of 0.4075. Of particular significance is the classification accuracy of the CNN-LSTM algorithm within social network datasets, which surpasses that of alternative algorithms, including CNN (by 6.31%), LSTM (by 4.43%), RNN (by 3.51%), Transformer (by 40.29%), and Generative Adversarial Network (GAN) (by 4.49%). This underscores the effectiveness of the CNN-LSTM algorithm in the realm of social network text classification. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Efficient traffic monitoring and congestion control with GGA and deep CNN-LSTM using VANET.
- Author
-
Budholiya, Akanksha and Manwar, Avinash Balkrishna
- Subjects
CONVOLUTIONAL neural networks ,TRAFFIC monitoring ,LONG short-term memory ,TRAFFIC congestion ,CLASSIFICATION algorithms - Abstract
In the fast modernised world, usage of vehicles increased day by day, leads to vehicle traffic and congestions. Traditional way of monitoring traffic is less efficient and requires man power. Besides, safety of traffic controllers is the major concern in the manual monitoring. For that reason, effective prediction of vehicular traffic and monitoring the level of congestion is significant in VANET (Vehicular Adhoc Network) for mitigating the delays and danger of accidents. In order to make clear prediction of vehicles and the collision free path in the network, numerous existing algorithms have been provided for prediction and classification. However, the conventional techniques have faced complications in satisfying the accuracy while making predictions of path images. The accuracy in prediction along with the faster computational time have been addressed as a drawback which hinders in making appropriate travel path decisions. In order to overcome such challenges, proposed system employed Yolo v5 algorithm, GGA (Greedy based Genetic Algorithm) and Deep CNN (Convolutional Neural Network) with LSTM (Long Short Term Memory) for traffic monitoring and congestion control in VANET. The Yolo v5 algorithm is utilised for the vehicle prediction mechanism, for the capability of detection with high speed and accuracy. GGA is utilised for the feature selection for the capability of handling numerous features in data and to increase the computational speed. The Deep CNN with LSTM is utilised for Classification for the capability of handling larger datasets and to enhance the accuracy. Though, DCNN is the effective algorithm for classification, it has few limitations like slow processing time. To resolve this, DCNN is utilised with LSTM for enhancing the speed in the projected system. Vehicle detection dataset is used in the proposed system. Further, the experimental evaluations and comparative analysis exhibits the efficiency of the system in terms of accuracy, precision, recall and F1 score when compared to several existing algorithms. The proposed model holds the potential in efficient vehicular congestion management with expected accuracy and it is intended to contribute in to the path identification for vehicles in VANET applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. Cognitive workload estimation using physiological measures: a review.
- Author
-
Das Chakladar, Debashis and Roy, Partha Pratim
- Abstract
Estimating cognitive workload levels is an emerging research topic in the cognitive neuroscience domain, as participants' performance is highly influenced by cognitive overload or underload results. Different physiological measures such as Electroencephalography (EEG), Functional Magnetic Resonance Imaging, Functional near-infrared spectroscopy, respiratory activity, and eye activity are efficiently used to estimate workload levels with the help of machine learning or deep learning techniques. Some reviews focus only on EEG-based workload estimation using machine learning classifiers or multimodal fusion of different physiological measures for workload estimation. However, a detailed analysis of all physiological measures for estimating cognitive workload levels still needs to be discovered. Thus, this survey highlights the in-depth analysis of all the physiological measures for assessing cognitive workload. This survey emphasizes the basics of cognitive workload, open-access datasets, the experimental paradigm of cognitive tasks, and different measures for estimating workload levels. Lastly, we emphasize the significant findings from this review and identify the open challenges. In addition, we also specify future scopes for researchers to overcome those challenges. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Hybrid convolutional neural network-long short-term memory combined model for arrhythmia classification.
- Author
-
Badiger, Raghavendra and Manickam, Prabhakar
- Subjects
CONVOLUTIONAL neural networks ,FEATURE extraction ,CLASSIFICATION algorithms ,MACHINE learning ,CARDIOMYOPATHIES ,ARRHYTHMIA - Abstract
The automated examination of electrocardiogram (ECG) signals holds significant importance within the medical field for managing various critical cardiac conditions. Identifying cardiomyopathy and arrhythmias is presently recognized as a challenging endeavor. While machine learning techniques have garnered substantial attention for categorizing these patterns, a predominant focus has been on the classification of arrhythmias. However, existing studies have overlooked instances where arrhythmia leads to cardiomyopathy, a specific cardiac disease scenario. In our research, we introduce an innovative method aimed at distinguishing between cardiomyopathy and cardiomyopathy accompanied by arrhythmia by employing a convolutional neural network (CNN-based) model. This novel approach fills the gap in existing literature by addressing the critical need to classify cases where arrhythmia induces cardiomyopathy, thereby presenting a potential advancement in accurately identifying and managing complex cardiac conditions. The proposed model uses convolution-based CNN model for feature extraction and combines these features with temporal features. Further, a CNN combined long short-term memory (CNN-LSTM) model is presented for classification where CNN models help to obtain the spatial information and LSTM helps to retain the temporal information resulting in improved classification accuracy. the experimental analysis is carried out into two phases where we have classified the rhythms and arrhythmias. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. Enhancing drug-target interaction predictions in context of neurodegenerative diseases using bidirectional long short-term memory in male Swiss albino mice pharmaco-EEG analysis
- Author
-
Shahnawaz Qureshi, Syed Muhammad Zeeshan Iqbal, Asif Ameer, Seppo Karrila, Yazeed Yasin Ghadi, and Syed Aziz Shah
- Subjects
Drug target interaction ,Local field potential ,Convolutional neural network ,Intracranial EEG ,Long short-term memory ,Science (General) ,Q1-390 ,Social sciences (General) ,H1-99 - Abstract
Background and Objective: Emerging diseases like Parkinson or Alzheimer's, which are not curable, endanger human mental health and are challenging to research. Drug target interactions (DTI) are pivotal in the screening of candidate drugs and focus on a small pool of drug targets. Electroencephalogram shows the responses to psychotropic medicines in the brain bioelectric activity. Synaptic activity can be analyzed by using Local Field Potential recordings obtained from micro-electrodes implanted in the brain. The aim is to evaluate the effects of drug on brain bioelectric activity and increase the drug classification accuracy. The ultimate goal is to advance our understanding of how drugs affect synaptic activity and open the door to more focused treatment for neurodegenerative diseases. Methods: In this study, Pharmaco-EEG recordings are processed using Advanced neural network models, particularly Convolutional Neural Networks, to assess the effects of medications. The five different medicines used in this study are Ephedrine, Fluoxetine, Kratom, Morphine, and Saline. The signals observed are local field potential signals. To overcome some limits of DTI prediction, we propose Bidirectional Long Short-Term Memory (LSTM) for the categorization of intracranial EEG (i-EEG) data, departing from standard approaches. Similar EEG patterns are presumably caused by drugs that work by homologous pharmacological pathways, producing similar psychotropic effects. To improve accuracy and reduce training loss, our study introduces a bidirectional LSTM model for classification along with Bayesian optimization Results: High recall, precision, and F1-Scores, particularly a 95% F1-Score for morphine, ephedrine, fluoxetine, and saline, suggest good performance in predicting these drug classes. Kratom produces a somewhat lower recall of 94%, but a high F1-Score of 97% and perfect precision of 1.00. The weighted average F1-Score, macro average, and overall accuracy are all consistently high (around 97%), indicating that the model works well throughout the spectrum of drugs. Conclusions: Improved model performance was demonstrated by using a diversified dataset with five drug categories and bidirectional LSTM boosted with Bayesian optimization for hyperparameter tuning. From earlier limited-category models, it represents a substantial advancement.
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.