Publication Type: Periodicals / Publication Year Range: This year / Topic: activation function - Searchworks@Jio Institute Digital Library Search Results

Showing total 84 results

Start Over Topic activation function Publication Year Range This year Publication Type Periodicals

84 results

1. End-to-End Image Compression Through Machine Semantics

Author: Liu, Jianran, Zhang, Chang, Ji, Wen, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Zhai, Guangtao, editor, Zhou, Jun, editor, Ye, Long, editor, Yang, Hua, editor, An, Ping, editor, and Yang, Xiaokang, editor
Published: 2024
Full Text: View/download PDF

2. An empirical assessment of customer satisfaction of internet banking service quality – Hybrid model approach

Author: Kashyap, Sachin, Gupta, Sanjeev, and Chugh, Tarun
Published: 2024
Full Text: View/download PDF

3. An analysis of weight initialization methods in connection with different activation functions forfeedforward neural networks.

Author: Wong, Kit, Dornberger, Rolf, and Hanne, Thomas
Abstract: The selection of weight initialization in an artificial neural network is one of the key aspects and affects the learning speed, convergence rate and correctness of classification by an artificial neural network. In this paper, we investigate the effects of weight initialization in an artificial neural network. Nguyen-Widrow weight initialization, random initialization, and Xavier initialization method are paired with five different activation functions. This paper deals with a feedforward neural network, consisting of an input layer, a hidden layer, and an output layer. The paired combination of weight initialization methods with activation functions are examined and tested and compared based on their best achieved loss rate in training. This work aims to better understand how weight initialization methods in neural networks, in combination with activation functions, affect the learning speed in comparison after a fixed number of training epochs. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. Seatbelt Detection Algorithm Improved with Lightweight Approach and Attention Mechanism.

Author: Qiu, Liankui, Rao, Jiankun, and Zhao, Xiangzhe
Subjects: SEAT belts, ALGORITHMS, PETRI nets
Abstract: Precise and rapid detection of seatbelts is an essential research field for intelligent traffic management. In order to improve the detection precision of seatbelts and speed up algorithm inference velocity, a lightweight seatbelt detection algorithm is proposed. Firstly, by adding the G-ELAN module designed in this paper to the YOLOv7-tiny network, the optimization of construction and reduction of parameters are accomplished, and the ResNet is compressed with the channel pruning approach to decrease computational overheads. Then, the Mish activation function is utilized to replace the Leaky Relu in the neck to enhance the non-linear competence of the network. Finally, the triplet attention module is integrated into the model after pruning to make up for the underlying performance reduction caused by the previous stage and upgrade overall detection precision. The experimental results based on the self-built seatbelt dataset showed that, compared to the initial network, the Mean Average Precision (mAP) achieved by the proposed GM-YOLOv7 was improved by 3.8%, while the volume and the computation amount were lowered by 20% and 24.6%, respectively. Compared with YOLOv3, YOLOX, and YOLOv5, the mAP of GM-YOLOv7 increased by 22.4%, 4.6%, and 4.2%, respectively, and the number of computational operations decreased by 25%, 63%, and 38%, respectively. In addition, the accuracy of the improved RST-Net increased to 98.25%, while the parameter value was reduced by 48% compared to the basic model, effectively improving the detection performance and realizing a lightweight structure. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. ASPP + -LANet: A Multi-Scale Context Extraction Network for Semantic Segmentation of High-Resolution Remote Sensing Images.

Author: Hu, Lei, Zhou, Xun, Ruan, Jiachen, and Li, Supeng
Subjects: FEATURE extraction, IMAGE processing, URBAN planning, SPATIAL resolution, IMAGE segmentation
Abstract: Semantic segmentation of remote sensing (RS) images is a pivotal branch in the realm of RS image processing, which plays a significant role in urban planning, building extraction, vegetation extraction, etc. With the continuous advancement of remote sensing technology, the spatial resolution of remote sensing images is progressively improving. This escalation in resolution gives rise to challenges like imbalanced class distributions among ground objects in RS images, the significant variations of ground object scales, as well as the presence of redundant information and noise interference. In this paper, we propose a multi-scale context extraction network, ASPP+-LANet, based on the LANet for semantic segmentation of high-resolution RS images. Firstly, we design an ASPP+ module, expanding upon the ASPP module by incorporating an additional feature extraction channel, redesigning the dilation rates, and introducing the Coordinate Attention (CA) mechanism so that it can effectively improve the segmentation performance of ground object targets at different scales. Secondly, we introduce the Funnel ReLU (FReLU) activation function for enhancing the segmentation effect of slender ground object targets and refining the segmentation edges. The experimental results show that our network model demonstrates superior segmentation performance on both Potsdam and Vaihingen datasets, outperforming other state-of-the-art (SOTA) methods. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. modSwish: a new activation function for neural network

Author: Kalim, Heena, Chug, Anuradha, and Singh, Amit Prakash
Published: 2024
Full Text: View/download PDF

7. Hybrid sigmoid activation function and transfer learning assisted breast cancer classification on histopathological images

Author: Singh, Manoj Kumar and Chand, Satish
Published: 2024
Full Text: View/download PDF

8. On the universal approximation property of radial basis function neural networks

Author: Ismayilova, Aysu and Ismayilov, Muhammad
Published: 2024
Full Text: View/download PDF

9. High-Performance Binocular Disparity Prediction Algorithm for Edge Computing.

Author: Cheng, Yuxi, Song, Yang, Liu, Yi, Zhang, Hui, and Liu, Feng
Subjects: EDGE computing, DATA compression, DISTRIBUTION costs, COMPUTATIONAL complexity, FORECASTING
Abstract: End-to-end disparity estimation algorithms based on cost volume deployed in edge-end neural network accelerators have the problem of structural adaptation and need to ensure accuracy under the condition of adaptation operator. Therefore, this paper proposes a novel disparity calculation algorithm that uses low-rank approximation to approximately replace 3D convolution and transposed 3D convolution, WReLU to reduce data compression caused by the activation function, and unimodal cost volume filtering and a confidence estimation network to regularize cost volume. It alleviates the problem of disparity-matching cost distribution being far away from the true distribution and greatly reduces the computational complexity and number of parameters of the algorithm while improving accuracy. Experimental results show that compared with a typical disparity estimation network, the absolute error of the proposed algorithm is reduced by 38.3%, the three-pixel error is reduced to 1.41%, and the number of parameters is reduced by 67.3%. The calculation accuracy is better than that of other algorithms, it is easier to deploy, and it has strong structural adaptability and better practicability. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. ErfReLU: adaptive activation function for deep neural network.

Author: Rajanand, Ashish and Singh, Pradeep
Abstract: Recent research has found that the activation function (AF) plays a significant role in introducing non-linearity to enhance the performance of deep learning networks. Researchers recently started developing activation functions that can be trained throughout the learning process, known as trainable, or adaptive activation functions (AAF). Research on AAF that enhances the outcomes is still in its early stages. In this paper, a novel activation function ‘ErfReLU’ has been developed based on the erf function and ReLU. This function leverages the advantages of both the Rectified Linear Unit (ReLU) and the error function (erf). A comprehensive overview of activation functions like Sigmoid, ReLU, Tanh, and their properties have been briefly explained. Adaptive activation functions like Tanhsoft1, Tanhsoft2, Tanhsoft3, TanhLU, SAAF, ErfAct, Pserf, Smish, and Serf is also presented. Lastly, comparative performance analysis of 9 trainable activation functions namely Tanhsoft1, Tanhsoft2, Tanhsoft3, TanhLU, SAAF, ErfAct, Pserf, Smish, and Serf with the proposed one has been performed. These activation functions are used in MobileNet, VGG16, and ResNet models and their performance is evaluated on benchmark datasets such as CIFAR-10, MNIST, and FMNIST. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. A Neural-Network-Based Watermarking Method Approximating JPEG Quantization.

Author: Yamauchi, Shingo and Kawamura, Masaki
Subjects: DIGITAL watermarking, JPEG (Image coding standard), WATERMARKS, RECURRENT neural networks, BIT error rate, IMAGE compression, TANGENT function
Abstract: We propose a neural-network-based watermarking method that introduces the quantized activation function that approximates the quantization of JPEG compression. Many neural-network-based watermarking methods have been proposed. Conventional methods have acquired robustness against various attacks by introducing an attack simulation layer between the embedding network and the extraction network. The quantization process of JPEG compression is replaced by the noise addition process in the attack layer of conventional methods. In this paper, we propose a quantized activation function that can simulate the JPEG quantization standard as it is in order to improve the robustness against the JPEG compression. Our quantized activation function consists of several hyperbolic tangent functions and is applied as an activation function for neural networks. Our network was introduced in the attack layer of ReDMark proposed by Ahmadi et al. to compare it with their method. That is, the embedding and extraction networks had the same structure. We compared the usual JPEG compressed images and the images applying the quantized activation function. The results showed that a network with quantized activation functions can approximate JPEG compression with high accuracy. We also compared the bit error rate (BER) of estimated watermarks generated by our network with those generated by ReDMark. We found that our network was able to produce estimated watermarks with lower BERs than those of ReDMark. Therefore, our network outperformed the conventional method with respect to image quality and BER. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. An analytical approach for unsupervised learning rate estimation using rectified linear units.

Author: Chaoxiang Chen, Golovko, Vladimir, Kroshchanka, Aliaksandr, Mikhno, Egor, Chodyka, Marta, and Lichograj, Piotr
Subjects: BOLTZMANN machine, TRANSFER functions
Abstract: Unsupervised learning based on restricted Boltzmann machine or autoencoders has become an important research domain in the area of neural networks. In this paper mathematical expressions to adaptive learning step calculation for RBM with ReLU transfer function are proposed. As a result, we can automatically estimate the step size that minimizes the loss function of the neural network and correspondingly update the learning step in every iteration. We give a theoretical justification for the proposed adaptive learning rate approach, which is based on the steepest descent method. The proposed technique for adaptive learning rate estimation is compared with the existing constant step and Adam methods in terms of generalization ability and loss function. We demonstrate that the proposed approach provides better performance. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. 基于分段线性激活的多任务行人目标检测识别算法研究.

Author: 朱亚旋, 张达明, 尹荣彬, and 吴继超
Abstract: Copyright of Automotive Digest is the property of Automotive Digest Editorial Office and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

14. Optimizing GANs for Cryptography: The Role and Impact of Activation Functions in Neural Layers Assessing the Cryptographic Strength.

Author: Singh, Purushottam, Dutta, Sandip, and Pranav, Prashant
Subjects: CRYPTOGRAPHY, HYBRID systems, GENERATIVE adversarial networks, STRENGTH training, MATHEMATICAL optimization
Abstract: Generative Adversarial Networks (GANs) have surfaced as a transformative approach in the domain of cryptography, introducing a novel paradigm where two neural networks, the generator (akin to Alice) and the discriminator (akin to Bob), are pitted against each other in a cryptographic setting. A third network, representing Eve, attempts to decipher the encrypted information. The efficacy of this encryption–decryption process is deeply intertwined with the choice of activation functions employed within these networks. This study conducted a comparative analysis of four widely used activation functions within a standardized GAN framework. Our recent explorations underscore the superior performance achieved when utilizing the Rectified Linear Unit (ReLU) in the hidden layers combined with the Sigmoid activation function in the output layer. The non-linear nature introduced by the ReLU provides a sophisticated encryption pattern, rendering the deciphering process for Eve intricate. Simultaneously, the Sigmoid function in the output layer guarantees that the encrypted and decrypted messages are confined within a consistent range, facilitating a straightforward comparison with original messages. The amalgamation of these activation functions not only bolsters the encryption strength but also ensures the fidelity of the decrypted messages. These findings not only shed light on the optimal design considerations for GAN-based cryptographic systems but also underscore the potential of investigating hybrid activation functions for enhanced system optimization. In our exploration of cryptographic strength and training efficiency using various activation functions, we discovered that the "ReLU and Sigmoid" combination significantly outperforms the others, demonstrating superior security and a markedly efficient mean training time of 16.51 s per 2000 steps. This highlights the enduring effectiveness of established methodologies in cryptographic applications. This paper elucidates the implications of these choices, advocating for their adoption in GAN-based cryptographic models, given the superior results they yield in ensuring security and accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. XGL-T transformer model for intelligent image captioning.

Author: Sharma, Dhruv, Dhiman, Chhavi, and Kumar, Dinesh
Abstract: Image captioning extracts multiple semantic features from an image and integrates them into a sentence-level description. For efficient description of the captions, it becomes necessary to learn higher order interactions between detected objects and the relationship among them. Most of the existing systems take into account the first order interactions while ignoring the higher order ones. It is challenging to extract discriminant higher order semantics visual features in images with highly populated objects for caption generation. In this paper, an efficient higher order interaction learning framework is proposed using encoder-decoder based image captioning. A scaled version of Gaussian Error Linear Unit (GELU) activation function, x-GELU is introduced that controls the vanishing gradients and enhances the feature learning. To leverage higher order interactions among multiple objects, an efficient XGL Transformer (XGL-T) model is introduced that exploits both spatial and channel-wise attention by integrating four XGL attention modules in image encoder and one in Bilinear Long Short-Term Memory guided sentence decoder. The proposed model captures rich semantic concepts from objects, attributes, and their relationships. Extensive experiments are conducted on publicly available MSCOCO Karapathy test split and the best performance of the work is observed as 81.5 BLEU@1, 67.1 BLEU@2, 51.6 BLEU@3, 39.9 BLEU@4, 134 CIDEr, 59.9 ROUGE-L, 29.8 METEOR, 23.8 SPICE using CIDEr-D Score Optimization Strategy. The scores validate the significant improvements over state-of-the-art results. An ablation study is also carried out to support the experimental observations. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. Application to Activation Functions through Fixed-Circle Problems with Symmetric Contractions.

Author: Anjum, Rizwan, Abbas, Mujahid, Safdar, Hira, Din, Muhammad, Zhou, Mi, and Radenović, Stojan
Subjects: BANACH spaces, CIRCLE
Abstract: In this paper, our main aim is to present innovative fixed-point theorems that provide solutions to the fixed-circle problem with symmetric contractions. We accomplish this by employing operator enrichment techniques within the context of Banach spaces. Furthermore, we demonstrate the practical application of these theorems by showcasing their relevance to the rectified linear unit (ReLU) activation function. By exploring the connection between fixed points and activation functions, our work contributes to a deeper understanding of the behavior and properties of these fundamental mathematical concepts. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. Performance analysis of multimodal medical image fusion using AMT-DWT-based pre-processing and customized CNN for denoising

Author: Ghosh, Tanima and N., Jayanthi
Published: 2024
Full Text: View/download PDF

18. Rogue wave, lump, kink, periodic and breather-like solutions of the (2+1)-dimensional KdV equation.

Author: Zheng, Wanguang, Liu, Yaqing, and Chu, Jingyi
Subjects: *ROGUE waves, *ARTIFICIAL neural networks, *WATER waves, *WATER depth
Abstract: In this paper, the (2+1)-dimensional KdV equation is investigated by using the bilinear neural network method (BNNM). We construct six neural network models, extending beyond single hidden layer models to create deeper and broader network structures (e.g. [3-3-1], [3-4-1], [3-1-3-1], [3-4-1-1], [3-2-2-1] and [3-2-3-1-1] models). Introducing specific activation functions into the neural network model enables the generation of various test functions, resulting in novel solutions for equations that include rogue wave solutions, lump-kink solutions, periodic soliton solution, breather-like solutions and lump solutions. The physical properties of these novel solutions are vividly depicted through three-dimensional plots, density plots, and curve plots. The findings contribute to a better understanding of the propagation behavior of shallow water waves. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

19. A classification model for power corridors based on the improved PointNetþþ network.

Author: Li Bo, Liu Siyuan, Wang Xiangfeng, and Zou Cunyu
Subjects: DEEP learning, CLASSIFICATION, ELECTRIC lines, POINT cloud
Abstract: Aiming at the existing deep learning classification model for power corridor point cloud still need to improve the classification efficiency and the robustness of the classification model to meet the requirements of practical applications. An improved classification model based on PointNetþþ is proposed. Based on the fact that the main features of the power corridor scene are power lines, poles, and vegetation, the initial data are first optimally filtered, and then the ensemble abstraction module of the classical PointNetþþ is modified to better adapt to the power corridor scene. Finally, h-Swish is used as the activation function to realize the accurate classification of the features of the power corridor scene, and the training time of deep learning is also greatly reduced. The experimental results show that the improved algorithm achieves an average F1 value of 97.58%, which is 3.62 percentage points higher than the classical PointNetþþ. Therefore, the algorithm has great potential in point cloud classification. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. Neural network optimizer of proportional-integral-differential controller parameters.

Author: Siddikov, Isamiddin, Nashvandova, Gulruxsor, and Alimova, Gulchekhra
Subjects: VOLTAGE regulators, PID controllers, BACK propagation
Abstract: Wide application of proportional-integral-differential (PID)-regulator in industry requires constant improvement of methods of its parameters adjustment. The paper deals with the issues of optimization of PID-regulator parameters with the use of neural network technology methods. A methodology for choosing the architecture (structure) of neural network optimizer is proposed, which consists in determining the number of layers, the number of neurons in each layer, as well as the form and type of activation function. Algorithms of neural network training based on the application of the method of minimizing the mismatch between the regulated value and the target value are developed. The method of back propagation of gradients is proposed to select the optimal training rate of neurons of the neural network. The neural network optimizer, which is a superstructure of the linear PID controller, allows increasing the regulation accuracy from 0.23 to 0.09, thus reducing the power consumption from 65% to 53%. The results of the conducted experiments allow us to conclude that the created neural superstructure may well become a prototype of an automatic voltage regulator (AVR)-type industrial controller for tuning the parameters of the PID controller. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

21. Human Behavior Recognition Algorithm Based on HD-C3D Model

Author: Xie, Zhihao, Yu, Lei, Wang, Qi, Ma, Ziji, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Wu, Celimuge, editor, Chen, Xianfu, editor, Feng, Jie, editor, and Wu, Zhen, editor
Published: 2024
Full Text: View/download PDF

22. A comparative analysis of various activation functions and optimizers in a convolutional neural network for hyperspectral image classification

Author: Seyrek, Eren Can and Uysal, Murat
Published: 2024
Full Text: View/download PDF

23. An optimized radial basis function neural network with modulation-window activation function

Author: Lin, Haijun, Dai, Houde, Mao, Yihan, and Wang, Lucai
Published: 2024
Full Text: View/download PDF

24. A Precision-Aware Neuron Engine for DNN Accelerators

Author: Vishwakarma, Sudheer, Raut, Gopal, Jaiswal, Sonu, Vishvakarma, Santosh Kumar, and Ghai, Dhruva
Published: 2024
Full Text: View/download PDF

25. Research on the Comparative Development of Modern Popular Music and Traditional Music Culture in Colleges and Universities in the Age of Artificial Intelligence

Author: Li Lin
Subjects: forward neural network, multi-feature fusion algorithm, activation function, sentiment classification model, music culture, 97m80, Mathematics, QA1-939
Abstract: In this paper, the forward neural network multi-feature fusion algorithm is used to extract the emotional features of music culture on artificial intelligence technology, considering the diversity and intermittency of the emotional features of the study, which needs to be parameterized. In the forward neural network architecture, the activation value obtained by using the nonlinear activation function is used, and the results obtained are passed to the next layer of data to realize layer-by-layer forward computation, which leads to the back-propagation activation function. The music culture emotion classification model is constructed based on the propagation mode of the forward neural network to determine the emotion recognition process. The research object is selected, the research process is determined, and in order to ensure the true validity of the research, it is necessary to test the reliability and validity of the research design scheme and to develop an empirical analysis of the comparison between popular music and traditional music culture. The results show that on the model, especially in the recognition of sacred, sad, passionate emotion type of music classification accuracy reached more than 88.2%. This paper’s model can improve the classification accuracy of music emotion to a certain extent. In the ontological knowledge analysis of popular music and traditional music culture, all three editions of textbooks show that general knowledge of music is predominant and has a large proportion, appreciation knowledge and extended knowledge are also considerable, and music knowledge is the least and has a small proportion. This study demonstrates the synergistic development of traditional culture and modern popular music, which is of great significance to the development of music education in colleges and universities.
Published: 2024
Full Text: View/download PDF

26. Design of a perceptron model-based physical fitness index monitoring system for sports training

Author: Li Wenming
Subjects: perceptron model, forward propagation algorithm, activation function, indicator monitoring system, 05b30, Mathematics, QA1-939
Abstract: The purpose of designing a sports training fitness index monitoring system is to grasp better the physical performance data of athletes in the training process to ensure training safety. In this paper, the principle of multilayer perceptron is explained based on the perceptron model, the optimal loss function of multilayer perceptron is solved by using the activation function and forward propagation algorithm, the sensor data collection module is constructed, and the physical fitness index monitoring system for sports training is built by this method. To verify the feasibility of the detection system in this paper, experimental analysis was conducted from three aspects: the distribution of physical fitness index monitoring information density, physical fitness index data and monitoring data accuracy. The index monitoring density distribution was between 0.11 and 2.09 from the monitoring information density. Regarding physical performance indicators, the average values of maximum oxygen uptake, heart rate, relative energy metabolism level, and exercise intensity were 41.02, 121.58, and 11.84, respectively. From the accuracy of indicator monitoring data, the accuracy of the system in this paper was 93.63%, which was 21.57 and 11.03 percentage points higher than that of GAN and MCNN algorithms, respectively. The physical fitness index monitoring system constructed based on the perceptron model can effectively realize the monitoring of physical fitness indexes, help trainers master the training rhythm, and improve the safety of sports training.
Published: 2024
Full Text: View/download PDF

27. Innovative Strategies for the Development of International Chinese Language Education Based on Deep Learning Models

Author: Wang Tiantian
Subjects: deep learning model, convolutional neural network, activation function, international chinese language education, 68m01, Mathematics, QA1-939
Abstract: The development of international Chinese language education greatly impacts the building of a global Chinese cultural environment. This paper constructs a model structure based on a convolutional neural network with a deep learning model and explains it in detail, and uses it to analyze the current development situation of international Chinese language education. The paper also presents the performance evaluation of the convolutional neural network model and discusses the problems in the development of international Chinese language education and the innovative development direction of online teaching. From the characteristics of innovative online teaching materials, rich media, mobility, interactivity, personalization, timeliness, and open sharing become the main melodies of the development of international Chinese language education, and the five-year average values of each characteristic are 53.59%, 51.17%, 49.77%, 47.84%, 45.94%, and 42.19%, respectively. The convolutional neural network model based on deep learning can effectively analyze the problems of international Chinese language education and the direction of innovation, providing an effective technology to help the development of Chinese culture in China.
Published: 2024
Full Text: View/download PDF

28. Bionic-inspired oil price prediction: Auditory multi-feature collaboration network.

Author: Wu, Han, Liang, Yan, Gao, Xiao-Zhi, and Heng, Jia-Ni
Subjects: *PETROLEUM sales & prices, *DECOMPOSITION method, *INDIVIDUAL investors, *FEATURE extraction, *CEREBRAL cortex, *POLYMER networks
Abstract: Predictions of the oil price are critical to support intelligent decision-making for individual investors, governments, and corporations, but a challenging task since there are complex nonlinear and random fluctuations involved in series. Additionally, most existing methods mainly focus on the data-layer fitting, and inherit uninterpretable design flowcharts and unclear layer functions. Based on the fact that living things evolve numerous advanced cognition processes after long-term interactions with environments, it is a promising problem to design bionic deep prediction networks. In this paper, an a uditory-inspired m ulti- f eature c ollaboration network (AMFC-Net) is explored to predict the oil price, and includes the feature extraction, brain memory, and comprehensive prediction blocks. Specifically, through imitating that complex sounds are captured by auditory canals and converted into electrical impulses, the first block adopts four typical activation functions to extract multi-group complementary features for detecting nonlinear changes. Through imitating that electrical impulses are transformed to the left and right hemispheres for handling and analyzing, the second block realizes gating and cooperation mechanisms to learn long short-term dependencies and highlight core information. Through imitating that information is sent to higher cerebral cortices for sensing the environment, the third block maps relationships from features to targets for producing final predictions. In summary, multi-channel convolution operations establish the imitations of both form (clear functional layer) and spirit (layer-by-layer collaboration), which not only ensure the prediction effectiveness but also improve the AMFC-Net interpretability. Three experiments (comparison with 12 machine learning methods), three experiments (comparison with 8 hybrid methods under decomposition plus ensemble framework), and seven discussions under two real-world oil price datasets all indicate that the proposed AMFC-Net is superior and suitable to predict the oil price. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

29. Computational analysis of electrode structure and configuration for efficient and localized neural stimulation.

Author: Choi, Ji Hoon, Moon, Jeongju, Park, Young Hoon, and Eom, Kyungsik
Abstract: Neuromodulation technique using electric stimulation is widely applied in neural prosthesis, therapy, and neuroscience research. Various stimulation techniques have been developed to enhance stimulation efficiency and to precisely target the specific area of the brain which involves optimizing the geometry and the configuration of the electrode, stimulation pulse type and shapes, and electrode materials. Although the effects of electrode shape, size, and configuration on the performance of neural stimulation have individually been characterized, to date, there is no integrative investigation of how this factor affects neural stimulation. In this study, we computationally modeled the various types of electrodes with varying shapes, sizes, and configurations and simulated the electric field to calculate the activation function. The electrode geometry is then integratively assessed in terms of stimulation efficiency and stimulation focality. We found that stimulation efficiency is enhanced by making the electrode sharper and smaller. A center-to-vertex distance exceeding 100 µm shows enhanced stimulation efficiency in the bipolar configuration. Additionally, the separation distance of less than 1 mm between the reference and stimulation electrodes exhibits higher stimulation efficiency compared to the monopolar configuration. The region of neurons to be stimulated can also be modified. We found that sharper electrodes can locally activate the neuron. In most cases, except for the rectangular electrode shape with a center-to-vertex distance smaller than 100 µm, the bipolar electrode configuration can locally stimulate neurons as opposed to the monopolar configuration. These findings shed light on the optimal selection of neural electrodes depending on the target applications. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

30. A Comparative Analysis of Deep Learning Parameters for Enhanced Detection of Yellow Rust in Wheat.

Author: Adem, Kemal, Yılmaz, Esra Kavalcı, Ölmez, Fatih, Çelik, Kübra, and Bakır, Halit
Subjects: STRIPE rust, WHEAT diseases & pests, DEEP learning, DECISION support systems, MATHEMATICAL optimization
Abstract: Copyright of International Journal of Engineering Research & Development (IJERAD) is the property of International Journal of Engineering Research & Development and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

31. A novel dynamic scene deblurring framework based on hybrid activation and edge-assisted dual-branch residuals.

Author: Li, Zihan, Cui, Guangmang, Liu, Haoyu, Chen, Ziyi, and Zhao, Jufeng
Subjects: GENERATIVE adversarial networks, SOURCE code
Abstract: Existing learning-based image deblurring algorithms tend to focus on single source of image information, and the network structure and dynamic scene blur characteristics make it difficult to recover the missing details of the image. Therefore, a novel dynamic scene deblurring framework is proposed based on hybrid activation and edge-assisted dual-branch residuals. Specifically, the network's ability to learn nonlinear features is enhanced by different activation functions, and the feature utilization at different semantic levels is improved by improving the traditional residual structure. In particular, the fixed-parameter training method is adopted to reduce ringing artifacts. And a new dual-source edge extraction algorithm is designed that organically combines edge information from different sources as network inputs. The experimental results demonstrate that our algorithm not only shows advantages in objective evaluation metrics PSNR, SSIM and VIF, but also achieves satisfactory results in subjective visual effects. Source code is publicly available at: https://github.com/Mangolzh/HN.git. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. An Accelerated Dual-Integral Structure Zeroing Neural Network Resistant to Linear Noise for Dynamic Complex Matrix Inversion.

Author: Yang, Feixiang, Wang, Tinglei, and Huang, Yun
Subjects: MATRIX inversion, COMPLEX matrices, NOISE
Abstract: The problem of inverting dynamic complex matrices remains a central and intricate challenge that has garnered significant attention in scientific and mathematical research. The zeroing neural network (ZNN) has been a notable approach, utilizing time derivatives for real-time solutions in noiseless settings. However, real-world disturbances pose a significant challenge to a ZNN's convergence. We design an accelerated dual-integral structure zeroing neural network (ADISZNN), which can enhance convergence and restrict linear noise, particularly in complex domains. Based on the Lyapunov principle, theoretical analysis proves the convergence and robustness of ADISZNN. We have selectively integrated the SBPAF activation function, and through theoretical dissection and comparative experimental validation we have affirmed the efficacy and accuracy of our activation function selection strategy. After conducting numerous experiments, we discovered oscillations and improved the model accordingly, resulting in the ADISZNN-Stable model. This advanced model surpasses current models in both linear noisy and noise-free environments, delivering a more rapid and stable convergence, marking a significant leap forward in the field. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. Trish: an efficient activation function for CNN models and analysis of its effectiveness with optimizers in diagnosing glaucoma

Author: Közkurt, Cemil, Diker, Aykut, Elen, Abdullah, Kılıçarslan, Serhat, Dönmez, Emrah, and Demir, Fahrettin Burak
Published: 2024
Full Text: View/download PDF

34. A Comparative Analysis on Various Modified Deep Convolution Neural Networks on Maize Plant Leaf Disease Classification

Author: Kumar, H. R. Sunil, Poornima, K. M., Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Tan, Kay Chen, Series Editor, Shrivastava, Vivek, editor, and Bansal, Jagdish Chand, editor
Published: 2024
Full Text: View/download PDF

35. Hardware Implementation of Three-Layered Perceptron Using FPGA

Author: Tiwari, Rishabh, Bhingarde, Abhishek, Kulkarni, Atharva, Kulkarni, Rahul, Joshi, Manisha, Charniya, Nadir, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Zhang, Junjie James, Series Editor, Tan, Kay Chen, Series Editor, Shrivastava, Vivek, editor, and Bansal, Jagdish Chand, editor
Published: 2024
Full Text: View/download PDF

36. SignReLU neural network and its approximation ability.

Author: Li, Jianfei, Feng, Han, and Zhou, Ding-Xuan
Subjects: *ARTIFICIAL neural networks, *SIGNAL processing, *RESEARCH personnel
Abstract: Deep neural networks (DNNs) have garnered significant attention in various fields of science and technology in recent years. Activation functions define how neurons in DNNs process incoming signals for them. They are essential for learning non-linear transformations and for performing diverse computations among successive neuron layers. In the last few years, researchers have investigated the approximation ability of DNNs to explain their power and success. In this paper, we explore the approximation ability of DNNs using a different activation function, called SignReLU. Our theoretical results demonstrate that SignReLU networks outperform rational and ReLU networks in terms of approximation performance. Numerical experiments are conducted comparing SignReLU with the existing activations such as ReLU, Leaky ReLU, and ELU, which illustrate the competitive practical performance of SignReLU. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

37. Physics-informed kernel function neural networks for solving partial differential equations.

Author: Fu, Zhuojia, Xu, Wenzhi, and Liu, Shuainan
Subjects: *PARTIAL differential equations, *KERNEL functions, *GREEN'S functions, *RADIAL basis functions, *NONLINEAR differential equations, *SHALLOW-water equations, *EMBEDDING theorems
Abstract: This paper proposes an improved version of physics-informed neural networks (PINNs), the physics-informed kernel function neural networks (PIKFNNs), to solve various linear and some specific nonlinear partial differential equations (PDEs). It can also be considered as a novel radial basis function neural network (RBFNN). In the proposed PIKFNNs, it employs one-hidden-layer shallow neural network with the physics-informed kernel functions (PIKFs) as the customized activation functions. The PIKFs fully or partially contain PDE information, which can be chosen as fundamental solutions, green's functions, T-complete functions, harmonic functions, radial Trefftz functions, probability density functions and even the solutions of some linear simplified PDEs and so on. The main difference between the PINNs and the proposed PIKFNNs is that the PINNs add PDE constraints to the loss function, and the proposed PIKFNNs embed PDE information into the activation functions of the neural network. The feasibility and accuracy of the proposed PIKFNNs are validated by some benchmark examples. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. On the ideal number of groups for isometric gradient propagation.

Author: Kim, Bum Jun, Choi, Hyeyeon, Jang, Hyeonah, and Kim, Sang Woo
Subjects: *ARTIFICIAL neural networks
Abstract: Recently, various normalization layers have been proposed to stabilize the training of deep neural networks. Among them, group normalization is a generalization of layer normalization and instance normalization by allowing a degree of freedom in the number of groups it uses. However, to determine the optimal number of groups, trial-and-error-based hyperparameter tuning is required, and such experiments are time-consuming. In this study, we discuss a reasonable method for setting the number of groups. First, we find that the number of groups influences the gradient behavior of the group normalization layer. Based on this observation, we derive the ideal number of groups, which calibrates the gradient scale to facilitate gradient descent optimization. This paper is the first to propose an optimal number of groups that is theoretically grounded, architecture-aware, and can provide a proper value in a layer-wise manner for all layers. The proposed method exhibited improved performance over existing methods in numerous neural network architectures, tasks, and datasets. • We propose a method to determine the number of groups of group normalization. • A theoretical analysis of group normalization with activation function is provided. • The proposed method is validated against various practical problems. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

39. Artificial Neural Network Modeling for Predicting Thermal Conductivity of EG/Water-Based CNC Nanofluid for Engine Cooling Using Different Activation Functions.

Author: Hasan, Md. Munirul, Rahman, Md Mustafizur, Islam, Mohammad Saiful, Chan, Wong Hung, Alginahi, Yasser M., Kabir, Muhammad Nomani, Bakar, Suraya Abu, and Ramasamy, Devarajan
Subjects: ARTIFICIAL neural networks, THERMAL conductivity, NANOFLUIDS, ETHYLENE glycol, INTERNAL combustion engines
Abstract: A vehicle engine cooling system is of utmost importance to ensure that the engine operates in a safe temperature range. In most radiators that are used to cool an engine, water serves as a cooling fluid. The performance of a radiator in terms of heat transmission is significantly influenced by the incorporation of nanoparticles into the cooling water. Concentration and uniformity of nanoparticle distribution are the two major factors for the practical use of nanofluids. The shape and size of nanoparticles also have a great impact on the performance of heat transfer. Many researchers are investigating the impact of nanoparticles on heat transfer. This study aims to develop an artificial neural network (ANN) model for predicting the thermal conductivity of an ethylene glycol (EG)/water-based crystalline nanocellulose (CNC) nanofluid for cooling internal combustion engine. The implementation of an artificial neural network considering different activation functions in the hidden layer is made to find the best model for the cooling of an engine using the nanofluid. Accuracies of the model with different activation functions in artificial neural networks are analyzed for different nanofluid concentrations and temperatures. In artificial neural networks, Levenberg–Marquardt is an optimization approach used with activation functions, including Tansig and Logsig functions in the training phase. The findings of each training, testing, and validation phase are presented to demonstrate the network that provides the highest level of accuracy. The best result was obtained with Tansig, which has a correlation of 0.99903 and an error of 3.7959 ×10. It has also been noticed that the Logsig function can also be a good model due to its correlation of 0.99890 and an error of 4.9218 ×10. Thus our ANN with Tansig and Logsig functions demonstrates a high correlation between the actual output and the predicted output. Graphic Abstract [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

40. HF-YOLO: Advanced Pedestrian Detection Model with Feature Fusion and Imbalance Resolution.

Author: Pan, Lihu, Diao, Jianzhong, Wang, Zhengkui, Peng, Shouxin, and Zhao, Cunhui
Abstract: Pedestrian detection is crucial for various applications, including intelligent transportation and video surveillance systems. Although recent research has advanced pedestrian detection models like the YOLO series, they still face limitations in handling diverse pedestrian scales, leading to performance challenges. To address these issues, we propose HF-YOLO, an advanced pedestrian detection model. HF-YOLO tackles the complexities of pedestrian detection in complex scenes by addressing scale variations and occlusions among pedestrians. In the feature fusion stage, our algorithm leverages both shallow localization information and deep semantic information. This involves fusing P2 layer features and adding a high-resolution detection layer, significantly improving the detection of small-scale pedestrians and occluded instances. To enhance feature representation, HF-YOLO incorporates the HardSwish activation function, introducing more non-linear factors and strengthening the model’s ability to represent complex and discriminative features. Additionally, to address regression imbalance, a balance factor is introduced to the CIoU loss function. This modification effectively resolves the imbalance problem and enhances pedestrian localization accuracy. Experimental results demonstrate the effectiveness of our proposed algorithm. HF-YOLO achieves notable improvements, including a 3.52% increase in average precision, a 1.35% boost in accuracy, and a 4.83% enhancement in recall. Moreover, the algorithm maintains real-time performance with a detection time of 8.5ms, meeting the stringent requirements of real-time applications. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

41. Estimation of the Regression Model Using M-Estimation Method and Artificial Neural Networks in the Presence of Outliers.

Author: Jawad, Hussein Talib and Saleh, Rabab Abdul-Ridha
Subjects: ARTIFICIAL neural networks, REGRESSION analysis, EXTREME value theory
Abstract: Copyright of Journal of Economics & Administrative Sciences is the property of Republic of Iraq Ministry of Higher Education & Scientific Research (MOHESR) and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

42. On an (i, x0)-Generalized Logistic-Type Function.

Author: Karateke, Seda
Subjects: LOGISTIC functions (Mathematics), SOFT computing, COMPUTER programming, PYTHON programming language, PARAMETER estimation
Abstract: In this article, some mathematical properties of (i, x0)-generalized logistic-type function are presented. This four-parameter generalized function can be considered as a statistical phenomenon enhancing more vigorous survival analysis models. Moreover, the behaviors of the relevant parametric functions obtained are examined with graphics using computer programming language Python 3.9. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. BI-TST_YOLOv5: Ground Defect Recognition Algorithm Based on Improved YOLOv5 Model.

Author: Qin, Jiahao, Yang, Xiaofeng, Zhang, Tianyi, and Bi, Shuilan
Subjects: ALGORITHMS, NETWORK performance, ROUTING algorithms
Abstract: Pavement defect detection technology stands as a pivotal component within intelligent driving systems, demanding heightened precision and rapid detection rates. Addressing the complexities arising from diverse defect types and intricate backgrounds in visual sensing, this study introduces an enhanced approach to augment the network structure and activation function within the foundational YOLOv5 algorithm. Initially, modifications to the YOLOv5′s architecture incorporate an adjustment to the Leaky ReLU activation function, thereby enhancing regression stability and accuracy. Subsequently, the integration of bi-level routing attention into the network's head layer optimizes the attention mechanism, notably improving overall efficiency. Additionally, the replacement of the YOLOv5 backbone layer's C3 module with the C3-TST module enhances initial convergence efficiency in target detection. Comparative analysis against the original YOLOv5s network reveals a 2% enhancement in map50 and a 1.8% improvement in F1, signifying an overall advancement in network performance. The initial convergence rate of the algorithm has been improved, and the accuracy and operational efficiency have also been greatly improved, especially on models with small-scale training sets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. Medical Image Classifications Using Convolutional Neural Networks: A Survey of Current Methods and Statistical Modeling of the Literature.

Author: Mohammed, Foziya Ahmed, Tune, Kula Kekeba, Assefa, Beakal Gizachew, Jett, Marti, and Muhie, Seid
Subjects: IMAGE recognition (Computer vision), CONVOLUTIONAL neural networks, DIAGNOSTIC imaging, MACHINE learning, MEDICAL coding
Abstract: In this review, we compiled convolutional neural network (CNN) methods which have the potential to automate the manual, costly and error-prone processing of medical images. We attempted to provide a thorough survey of improved architectures, popular frameworks, activation functions, ensemble techniques, hyperparameter optimizations, performance metrics, relevant datasets and data preprocessing strategies that can be used to design robust CNN models. We also used machine learning algorithms for the statistical modeling of the current literature to uncover latent topics, method gaps, prevalent themes and potential future advancements. The statistical modeling results indicate a temporal shift in favor of improved CNN designs, such as a shift from the use of a CNN architecture to a CNN-transformer hybrid. The insights from statistical modeling point that the surge of CNN practitioners into the medical imaging field, partly driven by the COVID-19 challenge, catalyzed the use of CNN methods for detecting and diagnosing pathological conditions. This phenomenon likely contributed to the sharp increase in the number of publications on the use of CNNs for medical imaging, both during and after the pandemic. Overall, the existing literature has certain gaps in scope with respect to the design and optimization of CNN architectures and methods specifically for medical imaging. Additionally, there is a lack of post hoc explainability of CNN models and slow progress in adopting CNNs for low-resource medical imaging. This review ends with a list of open research questions that have been identified through statistical modeling and recommendations that can potentially help set up more robust, improved and reproducible CNN experiments for medical imaging. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. Automatic assignment of microgenres to movies using a word embedding-based approach

Author: González-Santos, Carlos, Vega-Rodríguez, Miguel A., López-Muñoz, Joaquín M., Martínez-Sarriegui, Iñaki, and Pérez, Carlos J.
Published: 2024
Full Text: View/download PDF

46. An Improved Activation Function in Convolution Neural Network to Estimate the Hazardous Air Pollutant Based on Images

Author: Bhimavarapu, Usharani
Published: 2024
Full Text: View/download PDF

47. Approximation of One-Dimensional Darcy–Brinkman–Forchheimer Model by Physics Informed Deep Learning Feedforward Artificial Neural Network and Finite Element Methods: A Comparative Study

Author: Martinez, Mara, Rao, B. Veena S. N., and Mallikarjunaiah, S. M.
Published: 2024
Full Text: View/download PDF

48. Radar Signal Recognition Based on CSRDNN Network

Author: Zheng Zhang, Chuan Wan, Yi Chen, Fang Zhou, Xiaofei Zhu, Wenchao Zhai, and Daying Quan
Subjects: Low probability of intercept, radar signal recognition, stacked recurrent neural networks, activation function, training algorithm, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: It is essential to achieve the high-accuracy recognition of low probability of intercept (LPI) radar signals in modern electronic warfare. However, under low signal-to-noise ratio (SNR), the recognition accuracy of the LPI radar signals is relatively low. In this paper, a novel radar signal recognition method based on Convolutional Stacked Recurrent Deep Neural Network (CSRDNN) is proposed. Firstly, we design a Convolutional Neural Network (CNN) to expand the feature space of input time domain signals, the features extracted by CNN were then used as inputs of the Stacked Recurrent Neural Networks (SRNN) module. In the SRNN module, we sequentially stack GRU, LSTM, and BGRU, enabling the model to better handle the short-term and long-term dependence of signal features and effectively solve asynchronous problems in unidirectional RNN networks. Subsequently, a Fully Connected Deep Neural Network (FCDNN) was employed to accomplish the recognition task. In addition, we design a training algorithm composed of the Nesterov-Adaptive Moment Estimation (Nadam) algorithm and the CosineAnnealing Learning Rate (LR) adjustment strategy to improve the training efficiency of the model. The experimental results demonstrate that the proposed model has higher recognition accuracy at low SNR compared to other models, with an overall recognition accuracy of 92.96% at −4 dB.
Published: 2024
Full Text: View/download PDF

49. A Triple Noise Tolerant Zeroing Neural Network for Time-Varing Matrix Inverse

Author: Feixiang Yang and Yun Huang
Subjects: Activation function, matrix inverse, noise tolerant, time-variant problems, zeroing neural network, double integral, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Matrix inversion is a fundamental operation utilized across numerous disciplines such as mathematics, engineering, and control theory. The original zeroing neural network (OZNN) method has proven effective in tackling the challenge of time-varying matrix inversion (TVMI) under ideal conditions. The integration-enhanced zeroing neural network (IEZNN) is commonly used to handle TVMI issues in the presence of various types of noise. In this paper, we have enhanced the IEZNN model’s tolerance to noise by introducing a dual integral component, resulting in the dual noise tolerant zeroing neural network (DNTZNN) model. We have further improved this model by incorporating a positive odd activation function to create the triple noise tolerant zeroing neural network (TNTZNN). This advancement enables the TNTZNN to effectively solve TVMI problems despite various noise disturbances. Consequently, the TNTZNN model demonstrates excellent convergence and robustness even under noisy conditions. Furthermore, theoretical analysis grounded on the Lyapunov theorem validates the convergence and resilience of the TNTZNN model against diverse forms of noise. Computational simulations further substantiate the superior efficacy of the proposed TNTZNN model in resolving TVMI problems.
Published: 2024
Full Text: View/download PDF

50. Compressible Non-Newtonian Fluid Based Road Traffic Flow Equation Solved by Physical-Informed Rational Neural Network

Author: Zan Yang, Dan Li, Wei Nai, Lu Liu, Jingjing Sun, and Xiaowei Lv
Subjects: Traffic flow analysis, compressible non-Newtonian fluid, partial differential equation (PDE) solution, physical-informed rational neural network (PIRNN), activation function, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: The study of road traffic flow theory utilizes physics and applied mathematics to analyze relevant parameters and their relationships quanlitatively and quantitatively, in order to explore their dynamic changes. The fluid dynamics model used for traffic flow analysis is highly favored by scholars due to its solid mathematical foundation and good simulation results. However, existing models have two main shortcomings: firstly, existing research is mostly limited to non-viscoelastic fluid equation or incompressible non-Newtonian fluid equation, making it difficult to accurately describe the viscosity state and micro cluster properties of the actual traffic flow; secondly, the existing non-Newtonian fluid partial differential equations (PDEs) rely heavily on the finite element method (FEM) for solving, requiring higher computational cost, larger storage space, and more constraint conditions. Thus, in this paper, a traffic flow equation based on compressible non-Newtonian fluid has been constructed, and it has been solved by using physical-informed rational neural network (PIRNN) and noise heavy-ball acceleration gradient descent (NHAGD) to ensure learning and training speed and accuracy. Numerical results indicate that the proposed method can truly reflect the gradual change process in the viscosity of traffic flow, and has better solving performance than traditional FEM and physical-informed neural network (PINN) with activation functions; under the same conditions, the prediction error of the proposed method is also smaller than that of traditional traffic flow models.
Published: 2024
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

84 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources