1,983 results
Search Results
2. Comparing and Improving Active Learning Uncertainty Measures for Transformer Models by Discarding Outliers
- Author
-
Gonsior, Julius, Falkenberg, Christian, Magino, Silvio, Reusch, Anja, Hartmann, Claudio, Thiele, Maik, and Lehner, Wolfgang
- Published
- 2024
- Full Text
- View/download PDF
3. Deep Neural Network Regression‐Assisted Pressure Sensor for Decoupling Thermal Variations at Different Operating Temperatures.
- Author
-
Bang, Joohyung, Baek, Keuntae, Lim, Jaeyoung, Han, Yongha, and So, Hongyun
- Subjects
ARTIFICIAL neural networks ,PRESSURE sensors - Abstract
Decoupling environment‐dependent response in sensing techniques is essential for the diverse practical applications. This work presents a novel thermal effect decoupling method for sponge pressure sensors based on a deep neural network (DNN) regression model, which is difficult to achieve owing to the material‐ and structure‐related complex effects of the sponge‐based pressure sensor. A poly(3,4‐ethylenedioxythiophene):poly(styrenesulfonate)‐based multifunctional device is fabricated with a both pressure and thermally responsive part and an only thermally responsive part; and a DNN model with two input features is adapted to implement the substantial pressure prediction system without thermal interference. Proposed model shows the robust decoupled pressure‐sensing capability with high accuracy of ≈96.23% using two input features. It also enables accurate pressure prediction under both the thermally steady and transition regions, which indicates significant potential for a precise measurement system. These results demonstrate the possibility of reliable pressure monitoring under varying thermal conditions, which is important for accurately measuring pressure in complex power plants, human–machine interfaces, and compact wearable platforms. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
4. Attentive deep neural networks for legal document retrieval
- Author
-
Nguyen, Ha-Thanh, Phi, Manh-Kien, Ngo, Xuan-Bach, Tran, Vu, Nguyen, Le-Minh, and Tu, Minh-Phuong
- Published
- 2024
- Full Text
- View/download PDF
5. Generating FER models using ChatGPT.
- Author
-
BOLCAȘ, Radu-Daniel
- Subjects
CHATGPT ,GENERATIVE pre-trained transformers ,LANGUAGE models ,EMOTION recognition ,ARTIFICIAL neural networks ,DEBUGGING - Abstract
Copyright of Romanian Journal of Information Technology & Automatic Control / Revista Română de Informatică și Automatică is the property of National Institute for Research & Development in Informatics - ICI Bucharest and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2024
- Full Text
- View/download PDF
6. Comprehensive comparisons of gradient-based multi-label adversarial attacks
- Author
-
Chen, Zhijian, Luo, Wenjian, Naseem, Muhammad Luqman, Kong, Linghao, and Yang, Xiangkai
- Published
- 2024
- Full Text
- View/download PDF
7. GAN-Based Anomaly Detection Tailored for Classifiers.
- Author
-
Králik, Ľubomír, Kontšek, Martin, Škvarek, Ondrej, and Klimo, Martin
- Subjects
GENERATIVE adversarial networks ,ARTIFICIAL neural networks ,DATABASES - Abstract
Pattern recognition systems always misclassify anomalies, which can be dangerous for uninformed users. Therefore, anomalies must be filtered out from each classification. The main challenge for the anomaly filter design is the huge number of possible anomaly samples compared with the number of samples in the training set. Tailoring the filter for the given classifier is just the first step in this reduction. Paper tests the hypothesis that the filter trained in avoiding "near" anomalies will also refuse the "far" anomalies, and the anomaly detector is then just a classifier distinguishing between "far real" and "near anomaly" samples. As a "far real" samples generator was used, a Generative Adversarial Network (GAN) fake generator that transforms normally distributed random seeds into fakes similar to the training samples. The paper proves the assumption that seeds unused in fake training will generate anomalies. These seeds are distinguished according to their Chebyshev norms. While the fakes have seeds within the hypersphere with a given radius, the near anomalies have seeds within the sphere near cover. Experiments with various anomaly test sets have shown that GAN-based anomaly detectors create a reliable anti-anomaly shield using the abovementioned assumptions. The proposed anomaly detector is tailored to the given classifier, but its limitation is due to the need for the availability of the database on which the classifier was trained. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Synthetic Document Images with Diverse Shadows for Deep Shadow Removal Networks.
- Author
-
Matsuo, Yuhi and Aoki, Yoshimitsu
- Subjects
DOCUMENT imaging systems ,OPTICAL character recognition ,ARTIFICIAL neural networks ,NETWORK performance - Abstract
Shadow removal for document images is an essential task for digitized document applications. Recent shadow removal models have been trained on pairs of shadow images and shadow-free images. However, obtaining a large, diverse dataset for document shadow removal takes time and effort. Thus, only small real datasets are available. Graphic renderers have been used to synthesize shadows to create relatively large datasets. However, the limited number of unique documents and the limited lighting environments adversely affect the network performance. This paper presents a large-scale, diverse dataset called the Synthetic Document with Diverse Shadows (SynDocDS) dataset. The SynDocDS comprises rendered images with diverse shadows augmented by a physics-based illumination model, which can be utilized to obtain a more robust and high-performance deep shadow removal network. In this paper, we further propose a Dual Shadow Fusion Network (DSFN). Unlike natural images, document images often have constant background colors requiring a high understanding of global color features for training a deep shadow removal network. The DSFN has a high global color comprehension and understanding of shadow regions and merges shadow attentions and features efficiently. We conduct experiments on three publicly available datasets, the OSR, Kligler's, and Jung's datasets, to validate our proposed method's effectiveness. In comparison to training on existing synthetic datasets, our model training on the SynDocDS dataset achieves an enhancement in the PSNR and SSIM, increasing them from 23.00 dB to 25.70 dB and 0.959 to 0.971 on average. In addition, the experiments demonstrated that our DSFN clearly outperformed other networks across multiple metrics, including the PSNR, the SSIM, and its impact on OCR performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Chinese Fine-Grained Named Entity Recognition Based on BILTAR and GlobalPointer Modules.
- Author
-
Li, Weijun, Liu, Jintong, Gao, Yuxiao, Zhang, Xinyong, and Gu, Jianlai
- Subjects
CHINESE language ,LANGUAGE models ,ARTIFICIAL neural networks ,NATURAL language processing ,RECOGNITION (Psychology) - Abstract
The task of fine-grained named entity recognition is to locate entities in text and classify them into predefined fine-grained categories. At present, Chinese fine-grained NER only uses the pretrained language model to encode the characters in the sentence and lacks the ability to extract the deep semantic, sequence, and position information. The sequence annotation method is character-based and lacks the processing of entity boundaries. Fine-grained entity categories have a high degree of similarity, which makes it difficult to distinguish similar categories. To solve the above problems, this paper constructs the BILTAR deep semantic extraction module and adds the GlobalPointer module to improve the accuracy of Chinese fine-grained named entity recognition. The BILTAR module is used to extract deep semantic features from the coding information of pretrained language models and use higher-quality features to improve the model performance. In the GlobalPointer module, the model first adds the rotation position encoding information to the feature vector, using the position information to achieve data enhancement. Finally, the model considers all possible entity boundaries through the GlobalPointer module and calculates the scores for all possible entity boundaries in each category. In this paper, all possible entity boundaries in the text are considered by the above method, and the accuracy of entity recognition is improved. In this paper, the corresponding experiments were carried out on CLUENER 2020 and the micro Chinese fine-grained NER dataset, and the F1 scores of the model in this paper reached 80.848% and 75.751%, respectively. In ablation experiments, the proposed method outperforms the most advanced baseline model and improves the performance of the basic model. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Hierarchical ensemble deep learning for data-driven lead time prediction
- Author
-
Aslan, Ayse, Vasantha, Gokula, El-Raoui, Hanane, Quigley, John, Hanson, Jack, Corney, Jonathan, and Sherlock, Andrew
- Published
- 2023
- Full Text
- View/download PDF
11. Compression-resistant backdoor attack against deep neural networks
- Author
-
Xue, Mingfu, Wang, Xin, Sun, Shichang, Zhang, Yushu, Wang, Jian, and Liu, Weiqiang
- Published
- 2023
- Full Text
- View/download PDF
12. A Fault Diagnosis Method of Four-Mass Vibration MEMS Gyroscope Based on ResNeXt-50 with Attention Mechanism and Improved EWT Algorithm.
- Author
-
Gu, Yikuan, Wang, Yan, Li, Zhong, Zhang, Tiantian, Li, Yuanhao, Wang, Guodong, and Cao, Huiliang
- Subjects
GYROSCOPES ,FAULT diagnosis ,MACHINE learning ,ARTIFICIAL neural networks ,FEATURE extraction ,WAVELET transforms ,DIAGNOSIS methods ,WAVELETS (Mathematics) - Abstract
In this paper, a fault identification algorithm combining a signal processing algorithm and machine learning algorithm is proposed, using a four-mass vibration MEMS gyroscope (FMVMG) for signal acquisition work, constructing a gyroscope fault dataset, and performing the model training task based on this dataset. Combining the improved EWT algorithm with SEResNeXt-50 reduces the impact of white noise in the signal on the identification task and significantly improves the accuracy of fault identification. The EWT algorithm is a wavelet analysis algorithm with adaptive wavelet analysis, which can significantly reduce the impact of boundary effects, and has a good effect on decomposition of signal segments with short length, but a reconstruction method is needed to effectively separate the noise signal and effective signal, and so this paper uses multiscale permutation entropy for calculation. For the reason that the neural network has a better ability to characterize high-dimensional signals, the one-dimensional signal is reconstructed into a two-dimensional image signal and the signal features are extracted. Then, the constructed image signals are fed into the SEResNeXt-50 network, and the characterization ability of the model is further improved in the network with the addition of the Squeeze-and-Excitation module. Finally, the proposed model is applied to the FMVMG fault dataset and compared with other models. In terms of recognition accuracy, the proposed method improves about 30.25% over the BP neural network and about 1.85% over ResNeXt-50, proving the effectiveness of the proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. Two is better than one: digital siblings to improve autonomous driving testing
- Author
-
Biagiola, Matteo, Stocco, Andrea, Riccio, Vincenzo, and Tonella, Paolo
- Published
- 2024
- Full Text
- View/download PDF
14. Deep learning-based correlation analysis for probabilistic power flow considering renewable energy and energy storage.
- Author
-
Xia, Xiaotian, Xiao, Liye, Ye, Hua, Shi, Gang, and Li, Dayi
- Subjects
ELECTRICAL load ,DEEP learning ,RENEWABLE energy sources ,ARTIFICIAL neural networks ,WIND power ,STATISTICAL correlation ,ENERGY storage - Abstract
Developing photovoltaic (PV) and wind power is one of the most efficient approaches to reduce carbon emissions. Accumulating the PV and wind energy resources at different geographical locations can minimize total power output variance as injected into the power systems. To some extent, a low degree of the variance amplitude of the renewable resources can reduce the requirement of in-depth regulation and dispatch for the fossil fuel-based thermal power plants. Such an issue can alternatively reduce carbon emissions. Thus, the correlation problem by minimizing the variance of total PV and wind power plays a vital role in power system planning and operation. However, the synergistic effect of power output correlation is mainly considered on the generation side, and it is often neglected for the correlation relationship between the power grid components. To address this problem, this paper proposes a correlation coefficient analysis method for the power grid, which can quantify the relationship between energy storage and the probabilistic power flow (PPF) of the grid. Subsequently, to accelerate the mapping efficiency of power correlation coefficients, a novel deep neural network (DNN) optimized by multi-task learning and attention mechanism (MA-DNN) is developed to predict power flow fluctuations. Finally, the simulation results show that in IEEE 9-bus and IEEE14-bus systems, the strong correlation grouping percentage between the power correlation coefficients and power flow fluctuations reached 92% and 51%, respectively. The percentages of groups indicating weak correlation are 4% and 38%. In the modified IEEE 23-bus system, the computational accuracy of MA-DNN is improved by 37.35% compared to the PPF based on Latin hypercube sampling. Additionally, the MA-DNN regression prediction model exhibits a substantial improvement in assessing power flow fluctuations in the power grid, achieving a speed enhancement of 758.85 times compared to the conventional probability power flow algorithms. These findings provide the rapid selection of the grid access point with the minimum power flow fluctuations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Mixed‐decomposed convolutional network: A lightweight yet efficient convolutional neural network for ocular disease recognition.
- Author
-
Zhang, Xiaoqing, Wu, Xiao, Xiao, Zunjie, Hu, Lingxi, Qiu, Zhongxi, Sun, Qingyang, Higashita, Risa, and Liu, Jiang
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,OPTICAL coherence tomography ,IMAGE recognition (Computer vision) ,ARTIFICIAL intelligence ,EYE tracking - Abstract
Eye health has become a global health concern and attracted broad attention. Over the years, researchers have proposed many state‐of‐the‐art convolutional neural networks (CNNs) to assist ophthalmologists in diagnosing ocular diseases efficiently and precisely. However, most existing methods were dedicated to constructing sophisticated CNNs, inevitably ignoring the trade‐off between performance and model complexity. To alleviate this paradox, this paper proposes a lightweight yet efficient network architecture, mixed‐decomposed convolutional network (MDNet), to recognise ocular diseases. In MDNet, we introduce a novel mixed‐decomposed depthwise convolution method, which takes advantage of depthwise convolution and depthwise dilated convolution operations to capture low‐resolution and high‐resolution patterns by using fewer computations and fewer parameters. We conduct extensive experiments on the clinical anterior segment optical coherence tomography (AS‐OCT), LAG, University of California San Diego, and CIFAR‐100 datasets. The results show our MDNet achieves a better trade‐off between the performance and model complexity than efficient CNNs including MobileNets and MixNets. Specifically, our MDNet outperforms MobileNets by 2.5% of accuracy by using 22% fewer parameters and 30% fewer computations on the AS‐OCT dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Dynamic Traffic Management Using Reinforcement Learning.
- Author
-
Shaikh, Aryaan, Bhalekar, Babasaheb, and Futane, Pravin
- Subjects
REINFORCEMENT learning ,ARTIFICIAL neural networks ,TRAFFIC signs & signals ,TRAFFIC engineering ,TRAFFIC flow - Abstract
Traffic congestion has become a major problem in this rapidly growing world. Everyone operating a vehicle, as well as the traffic police in charge of managing the traffic, finds it difficult to become stuck in heavy traffic. For this a set, predetermined timing for traffic flow for each direction at the junction is utilized by traditional traffic light controllers. However, the concept of a fixed time traffic signal controller does not work well in places with uneven traffic. A dynamic traffic control system is therefore required, which regulates the traffic signals in accordance with the volume of traffic. This paper proposes a model that uses reinforcement learning (RL) along with deep neural networks (DNN) to manage discretions (signal status) for an environment with the help of Simulation of Urban MObility (SUMO). A simulation of real-world environment consisting a network of Four-way crossroad junction that contains 4 arriving lanes and 4 exiting lanes is used to train the agent. The main objective of this research study is to construct a model that can independently determine the best course of action and aims to provide better traffic management that will decrease the average waiting time, cause lower congestion, and provide a smooth flow of traffic. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Generating adversarial samples by manipulating image features with auto-encoder
- Author
-
Yang, Jianxin, Shao, Mingwen, Liu, Huan, and Zhuang, Xinkai
- Published
- 2023
- Full Text
- View/download PDF
18. Editorial: Deep learning approaches applied to spectral images for plant phenotyping.
- Author
-
Polder, Gerrit, Blasco, Jose, and Cen, Haiyan
- Subjects
SPECTRAL imaging ,DEEP learning ,ARTIFICIAL neural networks ,HEBBIAN memory ,PARTICLE swarm optimization ,PRECISION farming - Abstract
This document is an editorial published in the journal Frontiers in Plant Science. It discusses the application of deep learning approaches to spectral images for plant phenotyping. Spectral imaging is a sensor technology used in agriculture and plant science to quantify the composition of agricultural products and detect plant stresses and diseases. The editorial highlights the challenges of using deep learning networks for spectral image data, such as the complexity of the datasets and the lack of pre-trained networks. It also mentions research on reconstructing spectral images from RGB images and presents several research papers on disease detection, weed density, apple disease recognition, gray mold disease diagnosis, Raman spectroscopy, and chlorophyll estimation. The editorial concludes that deep learning approaches show promise for analyzing spectral images in plant phenotyping and can lead to new insights into plant biology and responses to environmental stressors. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
19. Deep limits of residual neural networks
- Author
-
Thorpe, Matthew and van Gennip, Yves
- Published
- 2023
- Full Text
- View/download PDF
20. Separating hard clean samples from noisy samples with samples’ learning risk for DNN when learning with noisy labels
- Author
-
Deng, Lihui, Yang, Bo, Kang, Zhongfeng, Wu, Jiajin, Li, Shaosong, and Xiang, Yanping
- Published
- 2024
- Full Text
- View/download PDF
21. Prediction and modeling of water quality using deep neural networks
- Author
-
El-Shebli, Marwa, Sharrab, Yousef, and Al-Fraihat, Dimah
- Published
- 2024
- Full Text
- View/download PDF
22. Early Diagnosis of Neurodegenerative Diseases Using CNN-LSTM and Wavelet Transform
- Author
-
Amooei, Elmira, Sharifi, Arash, and Manthouri, Mohammad
- Published
- 2023
- Full Text
- View/download PDF
23. Dataset authorization control: protect the intellectual property of dataset via reversible feature space adversarial examples
- Author
-
Xue, Mingfu, Wu, Yinghao, Zhang, Yushu, Wang, Jian, and Liu, Weiqiang
- Published
- 2023
- Full Text
- View/download PDF
24. A hierarchical evolution of neural architecture search method based on state transition algorithm
- Author
-
Du, Yangyi, Zhou, Xiaojun, Huang, Tingwen, and Yang, Chunhua
- Published
- 2023
- Full Text
- View/download PDF
25. Lightweight CNN-Based Image Recognition with Ecological IoT Framework for Management of Marine Fishes.
- Author
-
Jia, Lulu, Xie, Xikun, Yang, Junchao, Li, Fukun, Zhou, Yueming, Fan, Xingrong, Shen, Yu, and Guo, Zhiwei
- Subjects
MARINE fishes ,ARTIFICIAL neural networks ,IMAGE recognition (Computer vision) ,INFORMATION technology ,FISHERY management ,SMART cities - Abstract
With the development of emerging information technology, the traditional management methods of marine fishes are slowly replaced by new methods due to high cost, time-consumption and inaccurate management. The update of marine fishes management technology is also a great help for the creation of smart cities. However, some new methods have been studied that are too specific, which are not applicable for the other marine fishes, and the accuracy of identification is generally low. Therefore, this paper proposes an ecological Internet of Things (IoT) framework, in which a lightweight Deep Neural Networks model is implemented as a image recognition model for marine fishes, which is recorded as Fish-CNN. In this study, multi-training and evaluation of Fish-CNN is accomplished, and the accuracy of the final classification can be fixed to 89.89%–99.83%. Moreover, the final evaluation compared with Rem-CNN, Linear Regression and Multilayer Perceptron also verify the stability and advantage of our method. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
26. Measurement of Music Aesthetics Using Deep Neural Networks and Dissonances.
- Author
-
Paroiu, Razvan and Trausan-Matu, Stefan
- Subjects
ARTIFICIAL neural networks ,MUSICAL aesthetics ,SUPERVISED learning ,PEARSON correlation (Statistics) ,MUSICAL perception ,ARTIFICIAL intelligence - Abstract
In this paper, a new method that computes the aesthetics of a melody fragment is proposed, starting from dissonances. While music generated with artificial intelligence applications may be produced considerably more quickly than human-composed music, it has the drawback of not being appreciated like a human composition, being many times perceived by humans as artificial. For achieving supervised machine learning objectives of improving the quality of the great number of generated melodies, it is a challenge to ask humans to grade them. Therefore, it would be preferable if the aesthetics of artificial-intelligence-generated music is calculated by an algorithm. The proposed method in this paper is based on a neural network and a mathematical formula, which has been developed with the help of a study in which 108 students evaluated the aesthetics of several melodies. For evaluation, numerical values generated by this method were compared with ratings provided by human listeners from a second study in which 30 students participated and scores were generated by an existing different method developed by psychologists and three other methods developed by musicians. Our method achieved a Pearson correlation of 0.49 with human aesthetic scores, which is a much better result than other methods obtained. Additionally, our method made a distinction between human-composed melodies and artificial-intelligence-generated scores in the same way that human listeners did. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
27. An Efficient Document Retrieval for Korean Open-Domain Question Answering Based on ColBERT.
- Author
-
Kang, Byungha, Kim, Yeonghwa, and Shin, Youhyun
- Subjects
QUESTION answering systems ,INFORMATION retrieval ,KOREAN language ,LANGUAGE models ,ARTIFICIAL neural networks ,NATURAL language processing - Abstract
Open-domain question answering requires the task of retrieving documents with high relevance to the query from a large-scale corpus. Deep learning-based dense retrieval methods have become the primary approach for finding related documents. Although deep learning-based methods have improved search accuracy compared to traditional techniques, they simultaneously impose a considerable increase in computational burden. Consequently, research on efficient models and methods that optimize the trade-off between search accuracy and time to alleviate computational demands is required. In this paper, we propose a Korean document retrieval method utilizing ColBERT's late interaction paradigm to efficiently calculate the relevance between questions and documents. For open-domain Korean question answering document retrieval, we construct a Korean dataset using various corpora from AI-Hub. We conduct experiments comparing the search accuracy and inference time among the traditional IR (information retrieval) model BM25, the dense retrieval approach utilizing BERT-based models for Korean, and our proposed method. The experimental results demonstrate that our approach achieves a higher accuracy than BM25 and requires less search time than the dense retrieval method employing KoBERT. Moreover, the most outstanding performance is observed when using KoSBERT, a pre-trained Korean language model that learned to position semantically similar sentences closely in vector space. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
28. Multiple sparse spaces network pruning via a joint similarity criterion.
- Author
-
Li, Guoqiang, Chen, Anbang, and Liu, Bowen
- Abstract
In this paper, a simple and effective neural network pruning framework is proposed to solve the problems of low model acceleration efficiency and inaccurate identification of pruning channels in conventional methods. Therefore, this paper first proposes a multi-sparse space network pruning scheme, which reduces the impact of pruning on network performance by defining the pruning task as an optimisation task in two different sparse spaces to gradually remove redundant parameters from the network. In this paper, we focus on the distribution characteristics of network weights in different sparse spaces, and we show that a decision method combining distance and direction information between weights can better locate the redundant information in the network. Experimental results and analysis have shown that the method can effectively prune neural networks, obtaining better results at higher compression and acceleration rates compared to other state-of-the-art methods. For example, on CIFAR-10, it reduces FLOPs by 67.5% and 64.2% for ResNet56 and ResNet110, respectively, while improving accuracy by 0.10% and 0.55%, respectively. On the CIFAR-100 dataset, the FLOPs for ResNet32 were reduced by 40.3%, while the accuracy was improved by 0.06%. On the STL-10 dataset, it was able to reduce the FLOPs of the ResNet18 model by 71.5% and gain an accuracy improvement of 0.59%. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. Predicting anticancer synergistic drug combinations based on multi-task learning.
- Author
-
Chen, Danyi, Wang, Xiaowen, Zhu, Hongming, Jiang, Yizhi, Li, Yulong, Liu, Qi, and Liu, Qin
- Subjects
ANTINEOPLASTIC combined chemotherapy protocols ,ARTIFICIAL neural networks ,DRUG synergism ,DRUG delivery systems ,PEARSON correlation (Statistics) ,DEEP learning ,DRUG carriers - Abstract
Background: The discovery of anticancer drug combinations is a crucial work of anticancer treatment. In recent years, pre-screening drug combinations with synergistic effects in a large-scale search space adopting computational methods, especially deep learning methods, is increasingly popular with researchers. Although achievements have been made to predict anticancer synergistic drug combinations based on deep learning, the application of multi-task learning in this field is relatively rare. The successful practice of multi-task learning in various fields shows that it can effectively learn multiple tasks jointly and improve the performance of all the tasks. Methods: In this paper, we propose MTLSynergy which is based on multi-task learning and deep neural networks to predict synergistic anticancer drug combinations. It simultaneously learns two crucial prediction tasks in anticancer treatment, which are synergy prediction of drug combinations and sensitivity prediction of monotherapy. And MTLSynergy integrates the classification and regression of prediction tasks into the same model. Moreover, autoencoders are employed to reduce the dimensions of input features. Results: Compared with the previous methods listed in this paper, MTLSynergy achieves the lowest mean square error of 216.47 and the highest Pearson correlation coefficient of 0.76 on the drug synergy prediction task. On the corresponding classification task, the area under the receiver operator characteristics curve and the area under the precision–recall curve are 0.90 and 0.62, respectively, which are equivalent to the comparison methods. Through the ablation study, we verify that multi-task learning and autoencoder both have a positive effect on prediction performance. In addition, the prediction results of MTLSynergy in many cases are also consistent with previous studies. Conclusion: Our study suggests that multi-task learning is significantly beneficial for both drug synergy prediction and monotherapy sensitivity prediction when combining these two tasks into one model. The ability of MTLSynergy to discover new anticancer synergistic drug combinations noteworthily outperforms other state-of-the-art methods. MTLSynergy promises to be a powerful tool to pre-screen anticancer synergistic drug combinations. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
30. Detection of Glass Insulators Using Deep Neural Networks Based on Optical Imaging.
- Author
-
Wang, Jinyu, Li, Yingna, and Chen, Wenxiang
- Subjects
ARTIFICIAL neural networks ,EDGE detection (Image processing) ,OPTICAL images ,ELECTRIC lines ,IMAGE intensifiers ,BOX making - Abstract
As the pre-part of tasks such as fault detection and line inspection, insulator detection is a crucial task. However, considering the complex environment of high-voltage transmission lines, the traditional insulator detection accuracy is unsatisfactory, and manual inspection is dangerous and inefficient. To improve this situation, this paper proposes an insulator detection model Siamese ID-YOLO based on a deep neural network. The model achieves the best balance between speed and accuracy compared with traditional detection methods. In order to achieve the purpose of image enhancement, this paper adopts the canny-based edge detection operator to highlight the edges of insulators to obtain more semantic information. In this paper, based on the Darknet53 network and Siamese network, the insulator original image and the edge image are jointly input into the model. Siamese IN-YOLO model achieves more fine-grained extraction of insulators through weight sharing between Siamese networks, thereby improving the detection accuracy of insulators. This paper uses statistical clustering analysis on the area and aspect ratio of the insulator data set, then pre-set and adjusts the hyperparameters of the model anchor box to make it more suitable for the insulator detection task. In addition, this paper makes an insulator dataset named InsuDaSet based on UAV(Unmanned Aerial Vehicle) shoot insulator images for model training. The experiments show that the insulator detection can reach 92.72% detection accuracy and 84FPS detection speed, which can fully meet the online insulator detection requirements. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
31. Recent Advances in Large Margin Learning.
- Author
-
Guo, Yiwen and Zhang, Changshui
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,SUPPORT vector machines - Abstract
This paper serves as a survey of recent advances in large margin training and its theoretical foundations, mostly for (nonlinear) deep neural networks (DNNs) that are probably the most prominent machine learning models for large-scale data in the community over the past decade. We generalize the formulation of classification margins from classical research to latest DNNs, summarize theoretical connections between the margin, network generalization, and robustness, and introduce recent efforts in enlarging the margins for DNNs comprehensively. Since the viewpoint of different methods is discrepant, we categorize them into groups for ease of comparison and discussion in the paper. Hopefully, our discussions and overview inspire new research work in the community that aim to improve the performance of DNNs, and we also point to directions where the large margin principle can be verified to provide theoretical evidence why certain regularizations for DNNs function well in practice. We managed to shorten the paper such that the crucial spirit of large margin learning and related methods are better emphasized. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
32. The position-based compression techniques for DNN model
- Author
-
Tang, Minghua, Russo, Enrico, and Palesi, Maurizio
- Published
- 2023
- Full Text
- View/download PDF
33. Robust automatic accent identification based on the acoustic evidence
- Author
-
Alsharhan, Eiman and Ramsay, Allan
- Published
- 2023
- Full Text
- View/download PDF
34. Robust diagnosis and meta visualizations of plant diseases through deep neural architecture with explainable AI.
- Author
-
Natarajan, Sasikaladevi, Chakrabarti, Prasun, and Margala, Martin
- Abstract
Deep learning has emerged as a highly effective and precise method for classifying images. The presence of plant diseases poses a significant threat to food security. However, accurately identifying these diseases in plants is challenging due to limited infrastructure and techniques. Fortunately, the recent advancements in deep learning within the field of computer vision have opened up new possibilities for diagnosing plant pathology. Detecting plant diseases at an early stage is crucial, and this research paper proposes a deep convolutional neural network model that can rapidly and accurately identify plant diseases. Given the minimal variation in image texture and color, deep learning techniques are essential for robust recognition. In this study, we introduce a deep, explainable neural architecture specifically designed for recognizing plant diseases. Fine-tuned deep convolutional neural network is designed by freezing the layers and adjusting the weights of learnable layers. By extracting deep features from a down sampled feature map of a fine-tuned neural network, we are able to classify these features using a customized K-Nearest Neighbors Algorithm. To train and validate our model, we utilize the largest standard plant village dataset, which consists of 38 classes. To evaluate the performance of our proposed system, we estimate specificity, sensitivity, accuracy, and AUC. The results demonstrate that our system achieves an impressive maximum validation accuracy of 99.95% and an AUC of 1, making it the most ideal and highest-performing approach compared to current state-of-the-art deep learning methods for automatically identifying plant diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Prediction of Hippocampal Signals in Mice Using a Deep Learning Approach for Neurohybrid Technology Applications.
- Author
-
Lebedeva, Albina V., Samburova, Margarita I., Razin, Vyacheslav V., Gromov, Nikolay V., Gerasimova, Svetlana A., Levanova, Tatiana A., Smirnov, Lev A., and Pisarchik, Alexander N.
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,HIPPOCAMPUS (Brain) ,BRAIN-computer interfaces ,NERVOUS system - Abstract
The increasing growth in knowledge about the functioning of the nervous system of mammals and humans, as well as the significant neuromorphic technology developments in recent decades, has led to the emergence of a large number of brain–computer interfaces and neuroprosthetics for regenerative medicine tasks. Neurotechnologies have traditionally been developed for therapeutic purposes to help or replace motor, sensory or cognitive abilities damaged by injury or disease. They also have significant potential for memory enhancement. However, there are still no fully developed neurotechnologies and neural interfaces capable of restoring or expanding cognitive functions, in particular memory, in mammals or humans. In this regard, the search for new technologies in the field of the restoration of cognitive functions is an urgent task of modern neurophysiology, neurotechnology and artificial intelligence. The hippocampus is an important brain structure connected to memory and information processing in the brain. The aim of this paper is to propose an approach based on deep neural networks for the prediction of hippocampal signals in the CA1 region based on received biological input in the CA3 region. We compare the results of prediction for two widely used deep architectures: reservoir computing (RC) and long short-term memory (LSTM) networks. The proposed study can be viewed as a first step in the complex task of the development of a neurohybrid chip, which allows one to restore memory functions in the damaged rodent hippocampus. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Intelligent Learning Method for Capacity Estimation of Lithium-Ion Batteries Based on Partial Charging Curves.
- Author
-
Ding, Can, Guo, Qing, Zhang, Lulu, and Wang, Tao
- Subjects
LITHIUM-ion batteries ,LITHIUM cells ,ARTIFICIAL neural networks ,OPTIMIZATION algorithms - Abstract
Lithium-ion batteries are widely used in electric vehicles, energy storage power stations, and many other applications. Accurate and reliable monitoring of battery health status and remaining capacity is the key to establish a lithium-ion cell management system. In this paper, based on a Bayesian optimization algorithm, a deep neural network is structured to evaluate the whole charging curve of the battery using partial charging curve data as input. A 0.74 Ah battery is used for experiments, and the effect of different input data lengths is also investigated to check the high flexibility of the approach. The consequences show that using only 20 points of partial charging data as input, the whole charging profile of a cell can be exactly predicted with a root-mean-square error (RMSE) of less than 19.16 mAh (2.59% of the nominal capacity of 0.74 Ah), and its mean absolute percentage error (MAPE) is less than 1.84%. In addition, critical information including battery state-of-charge (SOC) and state-of-health (SOH) can be extracted in this way to provide a basis for safe and long-lasting battery operation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Multi-scale coupled attention for visual object detection.
- Author
-
Li, Fei, Yan, Hongping, and Shi, Linsu
- Subjects
OBJECT recognition (Computer vision) ,ARTIFICIAL neural networks ,OBJECT tracking (Computer vision) ,GEARING machinery ,ATTENTION - Abstract
The application of deep neural network has achieved remarkable success in object detection. However, the network structures should be still evolved consistently and tuned finely to acquire better performance. This gears to the continuous demands on high performance in those complex scenes, where multi-scale objects to be detected are located here and there. To this end, this paper proposes a network structure called Multi-Scale Coupled Attention (MSCA) under the framework of self-attention learning with methodologies of importance assessment. Architecturally, it consists of a Multi-Scale Coupled Channel Attention (MSCCA) module, and a Multi-Scale Coupled Spatial Attention (MSCSA) module. Specifically, the MSCCA module is developed to achieve the goal of self-attention learning linearly on the multi-scale channels. In parallel, the MSCSA module is constructed to achieve this goal nonlinearly on the multi-scale spatial grids. The MSCCA and MSSCA modules can be connected together into a sequence, which can be used as a plugin to develop end-to-end learning models for object detection. Finally, our proposed network is compared on two public datasets with 13 classical or state-of-the-art models, including the Faster R-CNN, Cascade R-CNN, RetinaNet, SSD, PP-YOLO, YOLO v3, YOLO v5, YOLO v7, YOLOX, DETR, conditional DETR, UP-DETR and FP-DETR. Comparative experimental results with numerical scores, the ablation study, and the performance behaviour all demonstrate the effectiveness of our proposed model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Sim2Real Neural Controllers for Physics-Based Robotic Deployment of Deformable Linear Objects.
- Author
-
Tong, Dezhong, Choi, Andrew, Qin, Longhui, Huang, Weicheng, Joo, Jungseock, and Jawed, Mohammad Khalid
- Subjects
ARTIFICIAL neural networks ,ROBOTICS ,GRAVITATIONAL energy ,FRICTION materials ,MACHINE learning ,HEURISTIC - Abstract
Deformable linear objects (DLOs), such as rods, cables, and ropes, play important roles in daily life. However, manipulation of DLOs is challenging as large geometrically nonlinear deformations may occur during the manipulation process. This problem is made even more difficult as the different deformation modes (e.g., stretching, bending, and twisting) may result in elastic instabilities during manipulation. In this paper, we formulate a physics-guided data-driven method to solve a challenging manipulation task—accurately deploying a DLO (an elastic rod) onto a rigid substrate along various prescribed patterns. Our framework combines machine learning, scaling analysis, and physical simulations to develop a physics-based neural controller for deployment. We explore the complex interplay between the gravitational and elastic energies of the manipulated DLO and obtain a control method for DLO deployment that is robust against friction and material properties. Out of the numerous geometrical and material properties of the rod and substrate, we show that only three non-dimensional parameters are needed to describe the deployment process with physical analysis. Therefore, the essence of the controlling law for the manipulation task can be constructed with a low-dimensional model, drastically increasing the computation speed. The effectiveness of our optimal control scheme is shown through a comprehensive robotic case study comparing against a heuristic control method for deploying rods for a wide variety of patterns. In addition to this, we also showcase the practicality of our control scheme by having a robot accomplish challenging high-level tasks such as mimicking human handwriting, cable placement, and tying knots. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Transfer Accent Identification Learning for Enhancing Speech Emotion Recognition
- Author
-
Priya Dharshini, G. and Sreenivasa Rao, K.
- Published
- 2024
- Full Text
- View/download PDF
40. A Practical Approach for Employing Tensor Train Decomposition in Edge Devices
- Author
-
Kokhazadeh, Milad, Keramidas, Georgios, Kelefouras, Vasilios, and Stamoulis, Iakovos
- Published
- 2024
- Full Text
- View/download PDF
41. EGFA-NAS: a neural architecture search method based on explosion gravitation field algorithm
- Author
-
Hu, Xuemei, Huang, Lan, Zeng, Jia, Wang, Kangping, and Wang, Yan
- Published
- 2024
- Full Text
- View/download PDF
42. Combination of Physics-Informed Neural Networks and Single-Relaxation-Time Lattice Boltzmann Method for Solving Inverse Problems in Fluid Mechanics.
- Author
-
Liu, Zhixiang, Chen, Yuanji, Song, Ge, Song, Wei, and Xu, Jingxiang
- Subjects
FLUID mechanics ,ARTIFICIAL neural networks ,PROBLEM solving ,PHYSICAL laws ,THERMODYNAMIC control ,LATTICE Boltzmann methods - Abstract
Physics-Informed Neural Networks (PINNs) improve the efficiency of data utilization by combining physical principles with neural network algorithms and thus ensure that their predictions are consistent and stable with the physical laws. PINNs open up a new approach to address inverse problems in fluid mechanics. Based on the single-relaxation-time lattice Boltzmann method (SRT-LBM) with the Bhatnagar–Gross–Krook (BGK) collision operator, the PINN-SRT-LBM model is proposed in this paper for solving the inverse problem in fluid mechanics. The PINN-SRT-LBM model consists of three components. The first component involves a deep neural network that predicts equilibrium control equations in different discrete velocity directions within the SRT-LBM. The second component employs another deep neural network to predict non-equilibrium control equations, enabling the inference of the fluid's non-equilibrium characteristics. The third component, a physics-informed function, translates the outputs of the first two networks into physical information. By minimizing the residuals of the physical partial differential equations (PDEs), the physics-informed function infers relevant macroscopic quantities of the flow. The model evolves two sub-models that are applicable to different dimensions, named the PINN-SRT-LBM-I and PINN-SRT-LBM-II models according to the construction of the physics-informed function. The innovation of this work is the introduction of SRT-LBM and discrete velocity models as physical drivers into a neural network through the interpretation function. Therefore, the PINN-SRT-LBM allows a given neural network to handle inverse problems of various dimensions and focus on problem-specific solving. Our experimental results confirm the accurate prediction by this model of flow information at different Reynolds numbers within the computational domain. Relying on the PINN-SRT-LBM models, inverse problems in fluid mechanics can be solved efficiently. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
43. Advancements in On-Device Deep Neural Networks.
- Author
-
Saravanan, Kavya and Kouzani, Abbas Z.
- Subjects
ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,DEEP learning - Abstract
In recent years, rapid advancements in both hardware and software technologies have resulted in the ability to execute artificial intelligence (AI) algorithms on low-resource devices. The combination of high-speed, low-power electronic hardware and efficient AI algorithms is driving the emergence of on-device AI. Deep neural networks (DNNs) are highly effective AI algorithms used for identifying patterns in complex data. DNNs, however, contain many parameters and operations that make them computationally intensive to execute. Accordingly, DNNs are usually executed on high-resource backend processors. This causes an increase in data processing latency and energy expenditure. Therefore, modern strategies are being developed to facilitate the implementation of DNNs on devices with limited resources. This paper presents a detailed review of the current methods and structures that have been developed to deploy DNNs on devices with limited resources. Firstly, an overview of DNNs is presented. Next, the methods used to implement DNNs on resource-constrained devices are explained. Following this, the existing works reported in the literature on the execution of DNNs on low-resource devices are reviewed. The reviewed works are classified into three categories: software, hardware, and hardware/software co-design. Then, a discussion on the reviewed approaches is given, followed by a list of challenges and future prospects of on-device AI, together with its emerging applications. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. DeepMC: DNN test sample optimization method jointly guided by misclassification and coverage
- Author
-
Sun, Jiaze, Li, Juan, and Wen, Sulei
- Published
- 2023
- Full Text
- View/download PDF
45. Classification methods for handwritten digit recognition: A survey.
- Author
-
Tuba, Ira M., Tuba, Una M., and Veinović, Mladen Đ.
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,DATA augmentation ,SWARM intelligence - Abstract
Copyright of Military Technical Courier / Vojnotehnicki Glasnik is the property of Military Technical Courier / Vojnotehnicki Glasnik and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2023
- Full Text
- View/download PDF
46. Short-Term Regional Temperature Prediction Based on Deep Spatial and Temporal Networks.
- Author
-
Wu, Shun, Fu, Fengchen, Wang, Lei, Yang, Minhang, Dong, Shi, He, Yongqing, Zhang, Qingqing, and Guo, Rong
- Subjects
TIME-varying networks ,GRIDS (Cartography) ,ARTIFICIAL neural networks ,DEEP learning ,CONVOLUTIONAL neural networks ,HUMAN information processing ,SPATIOTEMPORAL processes ,ATMOSPHERIC temperature - Abstract
Accurate prediction of air temperature is of great significance to outdoor activities and daily life. However, it is important and more challenging to predict air temperature in complex terrain areas because of prevailing mountain and valley winds and variable wind directions. The main innovation of this paper is to propose a regional temperature prediction method based on deep spatiotemporal networks, designing a spatiotemporal information processing module to align temperature data with regional grid points and further transforming temperature time series data into image sequences. Long Short-Term Memory network is constructed on the images to extract the depth features of the data to train the model. The experiments demonstrate that the deep learning prediction model containing the spatiotemporal information processing module and the deep learning prediction module is fully feasible in short-term regional temperature prediction. The comparison experiments show that the model proposed in this paper has better prediction results for classical models, such as convolutional neural networks and LSTM networks. The experimental conclusion shows that the method proposed in this paper can predict the distribution and change trend of temperature in the next 3 h and the next 6 h on a regional scale. The experimental result RMSE reached 0.63, showing high stability and accuracy. The model provides a new method for local regional temperature prediction, which can support the planning of production and life in advance and tend to save energy and reduce consumption. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
47. Twenty Significant Problems in AI Research, with Potential Solutions via the SP Theory of Intelligence and Its Realisation in the SP Computer Model.
- Author
-
Wolff, J. Gerard
- Subjects
ARTIFICIAL intelligence ,ARTIFICIAL neural networks ,MACHINE learning ,COGNITION ,SOCIAL support - Abstract
This paper highlights 20 significant problems in AI research, with potential solutions via the SP Theory of Intelligence (SPTI) and its realisation in the SP Computer Model. With other evidence referenced in the paper, this is strong evidence in support of the SPTI as a promising foundation for the development of human-level broad AI, aka artificial general intelligence. The 20 problems include: the tendency of deep neural networks to make major errors in recognition; the need for a coherent account of generalisation, over- and under-generalisation, and minimising the corrupting effect of 'dirty data'; how to achieve one-trial learning; how to achieve transfer learning; the need for transparency in the representation and processing of knowledge; and how to eliminate the problem of catastrophic forgetting. In addition to its promise as a foundation for the development of AGI, the SPTI has potential as a foundation for the study of human learning, perception, and cognition. And it has potential as a foundation for mathematics, logic, and computing. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
48. An Ontological Knowledge Base of Poisoning Attacks on Deep Neural Networks.
- Author
-
Altoub, Majed, AlQurashi, Fahad, Yigitcanlar, Tan, Corchado, Juan M., and Mehmood, Rashid
- Subjects
ARTIFICIAL neural networks ,POISONING ,KNOWLEDGE graphs ,KNOWLEDGE base ,DATA security ,DEEP learning - Abstract
Deep neural networks (DNNs) have successfully delivered cutting-edge performance in several fields. With the broader deployment of DNN models on critical applications, the security of DNNs has become an active and yet nascent area. Attacks against DNNs can have catastrophic results, according to recent studies. Poisoning attacks, including backdoor attacks and Trojan attacks, are one of the growing threats against DNNs. Having a wide-angle view of these evolving threats is essential to better understand the security issues. In this regard, creating a semantic model and a knowledge graph for poisoning attacks can reveal the relationships between attacks across intricate data to enhance the security knowledge landscape. In this paper, we propose a DNN poisoning attack ontology (DNNPAO) that would enhance knowledge sharing and enable further advancements in the field. To do so, we have performed a systematic review of the relevant literature to identify the current state. We collected 28,469 papers from the IEEE, ScienceDirect, Web of Science, and Scopus databases, and from these papers, 712 research papers were screened in a rigorous process, and 55 poisoning attacks in DNNs were identified and classified. We extracted a taxonomy of the poisoning attacks as a scheme to develop DNNPAO. Subsequently, we used DNNPAO as a framework by which to create a knowledge base. Our findings open new lines of research within the field of AI security. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
49. Classification of Problem and Solution Strings in Scientific Texts: Evaluation of the Effectiveness of Machine Learning Classifiers and Deep Neural Networks.
- Author
-
Mishra, Rohit Bhuvaneshwar and Jiang, Hongbing
- Subjects
MACHINE learning ,DEEP learning ,LINGUISTIC analysis ,CLASSIFICATION ,EMPIRICAL research - Abstract
One of the central aspects of science is systematic problem-solving. Therefore, problem and solution statements are an integral component of the scientific discourse. The scientific analysis would be more successful if the problem–solution claims in scientific texts were automatically classified. It would help in knowledge mining, idea generation, and information classification from scientific texts. It would also help to compare scientific papers and automatically generate review articles in a given field. However, computational research on problem–solution patterns has been scarce. The linguistic analysis, instructional-design research, theory, and empirical methods have not paid enough attention to the study of problem–solution patterns. This paper tries to solve this issue by applying the computational techniques of machine learning classifiers and neural networks to a set of features to intelligently classify a problem phrase from a non-problem phrase and a solution phrase from a non-solution phrase. Our analysis shows that deep learning networks outperform machine learning classifiers. Our best model was able to classify a problem phrase from a non-problem phrase with an accuracy of 90.0% and a solution phrase from a non-solution phrase with an accuracy of 86.0%. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
50. Exploiting Frequency Characteristics for Boosting the Invisibility of Adversarial Attacks.
- Author
-
Li, Chen, Liu, Yong, Zhang, Xinpeng, and Wu, Hanzhou
- Subjects
ARTIFICIAL neural networks ,INVISIBILITY ,K-means clustering ,DISTRIBUTION (Probability theory) - Abstract
Mainstream transferable adversarial attacks tend to introduce noticeable artifacts into the generated adversarial examples, which will impair the invisibility of adversarial perturbation and make these attacks less practical in real-world scenarios. To deal with this problem, in this paper, we propose a novel black-box adversarial attack method that can significantly improve the invisibility of adversarial examples. We analyze the sensitivity of a deep neural network in the frequency domain and take into account the characteristics of the human visual system in order to quantify the contribution of each frequency component in adversarial perturbation. Then, we collect a set of candidate frequency components that are insensitive to the human visual system by applying K-means clustering and we propose a joint loss function during the generation of adversarial examples, limiting the frequency distribution of perturbations during attacks. The experimental results show that the proposed method significantly outperforms existing transferable black-box adversarial attack methods in terms of invisibility, which verifies the superiority, applicability and potential of this work. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.