134 results
Search Results
2. A review of asset management using artificial intelligence‐based machine learning models: Applications for the electric power and energy system.
- Author
-
Rajora, Gopal Lal, Sanz‐Bobi, Miguel A., Tjernberg, Lina Bertling, and Urrea Cabus, José Eduardo
- Subjects
- *
ARTIFICIAL intelligence , *ASSET management , *ASSET protection , *MACHINE learning , *DEEP learning , *SUSTAINABILITY - Abstract
Power system protection and asset management present persistent technical challenges, particularly in the context of the smart grid and renewable energy sectors. This paper aims to address these challenges by providing a comprehensive assessment of machine learning applications for effective asset management in power systems. The study focuses on the increasing demand for energy production while maintaining environmental sustainability and efficiency. By harnessing the power of modern technologies such as artificial intelligence (AI), machine learning (ML), and deep learning (DL), this research explores how ML techniques can be leveraged as powerful tools for the power industry. By showcasing practical applications and success stories, this paper demonstrates the growing acceptance of machine learning as a significant technology for current and future business needs in the power sector. Additionally, the study examines the barriers and difficulties of large‐scale ML deployment in practical settings while exploring potential opportunities for these tactics. Through this overview, insights into the transformative potential of ML in shaping the future of power system asset management are provided. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Explainable artificial intelligence for medical imaging: Review and experiments with infrared breast images.
- Author
-
Raghavan, Kaushik, Balasubramanian, Sivaselvan, and Veezhinathan, Kamakoti
- Subjects
- *
BREAST , *ARTIFICIAL intelligence , *COMPUTER-assisted image analysis (Medicine) , *INFRARED imaging , *MACHINE learning , *BREAST imaging , *DEEP learning , *ASSISTIVE technology - Abstract
There is a growing trend of using artificial intelligence, particularly deep learning algorithms, in medical diagnostics, revolutionizing healthcare by improving efficiency, accuracy, and patient outcomes. However, the use of artificial intelligence in medical diagnostics comes with the critical need to explain the reasoning behind artificial intelligence‐based predictions and ensure transparency in decision‐making. Explainable artificial intelligence has emerged as a crucial research area to address the need for transparency and interpretability in medical diagnostics. Explainable artificial intelligence techniques aim to provide insights into the decision‐making process of artificial intelligence systems, enabling clinicians to understand the factors the algorithms consider in reaching their predictions. This paper presents a detailed review of saliency‐based (visual) methods, such as class activation methods, which have gained popularity in medical imaging as they provide visual explanations by highlighting the regions of an image most influential in the artificial intelligence's decision. We also present the literature on non‐visual methods, but the focus will be on visual methods. We also use the existing literature to experiment with infrared breast images for detecting breast cancer. Towards the end of this paper, we also propose an "attention guided Grad‐CAM" that enhances the visualizations for explainable artificial intelligence. The existing literature shows that explainable artificial intelligence techniques are not explored in the context of infrared medical images and opens up a wide range of opportunities for further research to make clinical thermography into assistive technology for the medical community. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. High‐resolution reservoir prediction method based on data‐driven and model‐based approaches.
- Author
-
ZeYang, Liu, Wei, Song, XiaoHong, Chen, WenJin, Li, Zhichao, Li, and GuoChang, Liu
- Subjects
- *
DEEP learning , *SHALE oils , *NATURAL gas prospecting , *PETROLEUM prospecting , *OIL fields , *DATA mining - Abstract
The Jiyang depression in the southeastern part of the Bohai Bay Basin has a relatively large scale set of shale oil in the Paleogene Shahejie Formation, but the complex internal components lead to narrow frequency bands, low resolution and difficulty in reservoir information extraction. Impedance is important information for reservoir characterization, and how to predict high‐resolution impedance using available information is particularly important. Deep learning, known for its effectiveness in addressing non‐linear problems, has found extensive applications in various fields of oil and gas exploration. However, the challenges of overfitting and poor generalization persist due to the limited availability of training datasets. Besides, existing methods often use networks to solve a single problem in fact, deep learning can deal with a series of problems intelligently. In order to partially solve the above problems, an intelligent storage prediction network framework is proposed in this paper. Physical information is introduced to realize data‐driven and model‐based approaches, thus solving the problem of difficult construction of training datasets. The processing part accomplishes the high‐resolution processing of seismic records, thus solving the problems of narrow bandwidth and low resolution. Initial model constraints are introduced so as to obtain more stable inversion results. Finally, the well data is compared and analysed to identify and predict the lithology and complete the intelligent prediction of unconventional reservoirs. The results are compared with the traditional model‐driven inversion method, revealing that the approach presented in this paper exhibits higher resolution in predicting dolomite. This contributes to the establishment of a robust data foundation for reservoir evaluation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Guest Editorial: Advanced image restoration and enhancement in the wild.
- Author
-
Wang, Longguang, Li, Juncheng, Yokoya, Naoto, Timofte, Radu, and Guo, Yulan
- Subjects
- *
IMAGE intensifiers , *IMAGE reconstruction , *COMPUTER vision , *SCHOLARSHIPS , *COMPUTER engineering , *IMAGE denoising , *DEEP learning , *VIDEO compression - Abstract
This document is a guest editorial from the journal IET Computer Vision, discussing the topic of advanced image restoration and enhancement. The editorial highlights the challenges faced in this field, such as the complexity of degradation models for real-world low-quality images and the difficulty of acquiring paired data. It also introduces a special issue of the journal that includes five accepted papers, which focus on video reconstruction and image super-resolution. The editorial concludes by providing brief summaries of each accepted paper. The guest editors of the special issue are also mentioned, along with their research interests and affiliations. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
6. Mango wine making process optimization based on artificial intelligence deep learning technology.
- Author
-
Xubin, Hua, Qiao, Lin, Fayong, Gong, Li, Cai, and Junhua, Liu
- Subjects
- *
ARTIFICIAL intelligence , *WINE making , *ELECTRONIC noses , *ELECTRONIC data processing , *DEEP learning , *PROCESS optimization , *MANGO - Abstract
This paper combines artificial intelligence deep learning technology to optimize the wine making process of mango wine. Moreover, in view of the shortcomings of traditional electronic nose data processing methods, a deep learning method based on SSAE‐BPNN is proposed for electronic nose data processing. In addition, according to the characteristics of automatic learning features, this paper uses a deep learning method based on SSAE‐BPNN to simplify the process of traditional data processing methods. Finally, this paper constructs an electronic nose system that can be used to identify mango wine making characteristics, and enhances the effect of electronic nose recognition through deep learning. Through the analysis, it can be seen that the mango wine making process optimization method based on artificial intelligence deep learning technology proposed in this paper has a certain effect, and it has optimized the traditional mango wine making process. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Prediction of mechanical properties of high‐performance concrete and ultrahigh‐performance concrete using soft computing techniques: A critical review.
- Author
-
Kumar, Rakesh, Rai, Baboo, and Samui, Pijush
- Abstract
A cement‐based material that meets the general goals of mechanical properties, workability, and durability as well as the ever‐increasing demands of environmental sustainability is produced by varying the type and quantity of individual constituents in high‐performance concrete (HPC) and ultrahigh‐performance concrete (UHPC). Expensive and time‐consuming laboratory experiments can be used to estimate the properties of concrete mixtures and elements. As an alternative, these attributes can be approximated by means of predictive models created through the application of artificial intelligence (AI) methodologies. AI approaches are among the most effective ways to solve engineering problems due to their capacity for pattern recognition and knowledge processing. Machine learning (ML) and deep learning (DL) are a subfield of AI that is gaining popularity across many scientific domains as a result of its many benefits over statistical and experimental models. These include, but are not limited to, better accuracy, faster performance, greater responsiveness in complex environments, and lower economic costs. In order to assess the critical features of the literature, a comprehensive review of ML and DL applications for HPC and UHPC was conducted in this study. This paper offers a thorough explanation of the fundamental terms and ideas of ML and DL algorithms that are frequently used to predict mechanical properties of HPC and UHPC. Engineers and researchers working with construction materials will find this paper useful in helping them choose accurate and appropriate methods for their needs. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Unsupervised remote sensing image thin cloud removal method based on contrastive learning.
- Author
-
Tan, Zhan Cong, Du, Xiao Feng, Man, Wang, Xie, Xiao Zhu, Wang, Gui Song, and Nie, Qin
- Subjects
- *
DEEP learning , *IMAGE reconstruction - Abstract
Cloud removal algorithm is a crucial step of remote sensing image preprocessing. The current mainstream remote sensing image cloud removal algorithms are implemented based on deep learning, and most of them are supervised. A large number of data pairs are required for training to achieve cloud removal. However, real with/without cloud image pairs datasets are difficult to obtain in the real world, and the models obtained by training on synthetic datasets often need to generalize better to natural scenes. And the existing unsupervised thin cloud removal methods based on Cycle‐GAN framework with considerable model complexity and unstable training are not an excellent solution to the problem of lack of paired datasets. Based on this, in this paper, the authors propose an unsupervised remote sensing image thin cloud removal method based on contrastive learning—GAN‐UD. It is a network consisting of a frequency‐spatial attention generator and a discriminator. In addition, the authors introduce local contrastive loss and global content loss to constrain the content of the generated images to ensure that the generated cloud‐free images are consistent with the input cloud images in terms of image content. Experimental results show that the proposed method in this paper can still effectively remove thin clouds from remote sensing images without paired training datasets, outperforms current unsupervised cloud removal methods, and achieves comparable performance to supervised methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Cognitive decline assessment using semantic linguistic content and transformer deep learning architecture.
- Author
-
PL, Rini and KS, Gayathri
- Subjects
- *
DIAGNOSIS of dementia , *COGNITION disorders diagnosis , *SPEECH evaluation , *CROSS-sectional method , *PREDICTION models , *TASK performance , *DESCRIPTIVE statistics , *NATURAL language processing , *LINGUISTICS , *EXPERIMENTAL design , *DEEP learning , *COMPUTER-aided diagnosis , *LATENT semantic analysis , *NEUROPSYCHOLOGICAL tests , *RESEARCH , *SEMANTIC memory , *EARLY diagnosis , *COMPARATIVE studies , *MACHINE learning , *FACTOR analysis , *ALGORITHMS , *DEMENTIA patients - Abstract
Background: Dementia is a cognitive decline that leads to the progressive deterioration of an individual's ability to perform daily activities independently. As a result, a considerable amount of time and resources are spent on caretaking. Early detection of dementia can significantly reduce the effort and resources needed for caretaking. Aims: This research proposes an approach for assessing cognitive decline by analysing speech data, specifically focusing on speech relevance as a crucial indicator for memory recall. Methods & Procedures: This is a cross‐sectional, online, self‐administered. The proposed method used deep learning architecture based on transformers, with BERT (Bidirectional Encoder Representations from Transformers) and Sentence‐Transformer to derive encoded representations of speech transcripts. These representations provide contextually descriptive information that is used to analyse the relevance of sentences in their respective contexts. The encoded information is then compared using cosine similarity metrics to measure the relevance of uttered sequences of sentences. The study uses the Pitt Corpus Dementia dataset for experimentation, which consists of speech data from individuals with and without dementia. The accuracy of the proposed multi‐QA‐MPNet (Multi‐Query Maximum Inner Product Search Pretraining) model is compared with other pretrained transformer models of Sentence‐Transformer. Outcomes & Results: The results show that the proposed approach outperforms the other models in capturing context level information, particularly semantic memory. Additionally, the study explores the suitability of different similarity measures to evaluate the relevance of uttered sequences of sentences. The experimentation reveals that cosine similarity is the most appropriate measure for this task. Conclusions & Implications: This finding has significant implications for the early warning signs of dementia, as it suggests that cosine similarity metrics can effectively capture the semantic relevance of spoken language. The persistent cognitive decline over time acts as one of the indicators for prevalence of dementia. Additionally early dementia could be recognised by analysis on other modalities like speech and brain images. WHAT THIS PAPER ADDS: What is already known on this subject: It is already known that speech‐ and language‐based detection methods can be useful for dementia diagnosis, as language difficulties are often early signs of the disease. Additionally, deep learning algorithms have shown promise in detecting and diagnosing dementia through analysing large datasets, particularly in speech‐ and language‐based detection methods. However, further research is needed to validate the performance of these algorithms on larger and more diverse datasets and to address potential biases and limitations. What this paper adds to existing knowledge: This study presents a unique and effective approach for cognitive decline assessment through analysing speech data. The study provides valuable insights into the importance of context and semantic memory in accurately detecting the potential in dementia and demonstrates the applicability of deep learning models for this purpose. The findings of this study have important clinical implications and can inform future research and development in the field of dementia detection and care. What are the potential or actual clinical implications of this work?: The proposed approach for cognitive decline assessment using speech data and deep learning models has significant clinical implications. It has the potential to improve the accuracy and efficiency of dementia diagnosis, leading to earlier detection and more effective treatments, which can improve patient outcomes and quality of life. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. Spectral CT image reconstruction using a constrained optimization approach—An algorithm for AAPM 2022 spectral CT grand challenge and beyond.
- Author
-
Hu, Xiaoyu and Jia, Xun
- Subjects
- *
IMAGE reconstruction , *CONSTRAINED optimization , *SPECTRAL imaging , *COMPUTED tomography , *STANDARD deviations , *ALGORITHMS - Abstract
Background: CT reconstruction is of essential importance in medical imaging. In 2022, the American Association of Physicists in Medicine (AAPM) sponsored a Grand Challenge to investigate the challenging inverse problem of spectral CT reconstruction, with the aim of achieving the most accurate reconstruction results. The authors of this paper participated in the challenge and won as a runner‐up team. Purpose: This paper reports details of our PROSPECT algorithm (Prior‐based Restricted‐variable Optimization for SPEctral CT) and follow‐up studies regarding the algorithm's accuracy and enhancement of its convergence speed. Methods: We formulated the reconstruction task as an optimization problem. PROSPECT employed a one‐step backward iterative scheme to solve this optimization problem by allowing estimation of and correction for the difference between the actual polychromatic projection model and the monochromatic model used in the optimization problem. PROSPECT incorporated various forms of prior information derived by analyzing training data provided by the Grand Challenge to reduce the number of unknown variables. We investigated the impact of projection data precision on the resulting solution accuracy and improved convergence speed of the PROSPECT algorithm by incorporating a beam‐hardening correction (BHC) step in the iterative process. We also studied the algorithm's performance under noisy projection data. Results: Prior knowledge allowed a reduction of the number of unknown variables by 85.9%$85.9\%$. PROSPECT algorithm achieved the average root of mean square error (RMSE) of 3.3×10−6$3.3\,\times \,10^{-6}$ in the test data set provided by the Grand Challenge. Performing the reconstruction with the same algorithm but using double‐precision projection data reduced RMSE to 1.2×10−11$1.2\,\times \,10^{-11}$. Including the BHC step in the PROSPECT algorithm accelerated the iteration process with a 40% reduction in computation time. Conclusions: PROSPECT algorithm achieved a high degree of accuracy and computational efficiency. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. The analysis of green advertisement communication strategy based on deep factorization machine deep learning model under supply chain management.
- Author
-
Yu, Xue, Zhu, Yunfei, Jia, Congcong, Lu, Wanqiu, and Xu, Hao
- Subjects
- *
DEEP learning , *MACHINE learning , *SUPPLY chain management , *COMMUNICATION strategies , *FACTORIZATION , *PERCEPTION (Philosophy) - Abstract
Artificial intelligence (AI) technology has brought new reconstruction opportunities for the intelligence of the advertisement industry through the help of AI technologies such as machine learning and deep learning. First, the relationship between AI and the attractiveness of green advertisements is investigated, and the influence of different AI technologies in green advertisements on consumers' perception of the attractiveness of green advertisements is summarized. Second, based on the green advertisement dissemination rate data set, the data visualization exploration is carried out, and the data deletion and coding processing are carried out aiming at different characteristic variables. Finally, according to the problems existing in the current green advertisement communication and the high‐dimensional and sparse characteristics of the communication rate data set. In this paper, based on Deep FM (Factorization Machine), Gradient Boost Decision Tree (GBDT) is added to assist the experiment, and the prediction performance of green advertising communication is tested. The results are as follows. (1) Different AI expressions in green advertisements will affect consumers' perception of the attractiveness of green advertisements. (2) The prediction ability of Deep FM model after feature engineering is better than that of data cleaning only. The prediction effect of the model is obviously improved. The purpose of this paper is to integrate green advertising media communication into the ecological concept of harmonious coexistence between man and nature, strengthen the political belief of ecological civilization construction, and conform to the communication trend of today's severe ecological situation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. A Deep Learning Approach to Extract Balanced Motions From Sea Surface Height Snapshot.
- Author
-
Gao, Zhanwen, Chapron, Bertrand, Ma, Chunyong, Fablet, Ronan, Febvre, Quentin, Zhao, Wenxia, and Chen, Ge
- Subjects
- *
DEEP learning , *GULF Stream , *MESOSCALE eddies , *SPATIAL resolution , *INTERNAL waves , *MOTION , *ALTIMETRY , *PROBLEM solving - Abstract
Extracting balanced geostrophic motions (BM) from sea surface height (SSH) observations obtained by wide‐swath altimetry holds great significance in enhancing our understanding of oceanic dynamic processes at submesoscale wavelength. However, SSH observations derived from wide‐swath altimetry are characterized by high spatial resolution while relatively low temporal resolution, thereby posing challenges to extract the BM from a single SSH snapshot. To address this issue, this paper proposes a deep learning model called the BM‐UBM Network, which takes an instantaneous SSH snapshot as input and outputs the projection corresponding to the BM. Training experiments are conducted both in the Gulf Stream and South China Sea, and three metrics are considered to diagnose model's outputs. The favorable results highlight the potential capability of the BM‐UBM Network to process SSH measurements obtained by wide‐swath altimetry. Plain Language Summary: Oceanic dynamic processes can be classified into two categories: balanced geostrophic motions (BM), including large‐scale circulation, mesoscale and submesoscale eddy turbulence, and unbalanced wave motions (UBM), including barotropic tides, and inertia–gravity waves (IGWs). Both types of motions coexist and have respective contributions to the sea surface height (SSH). How to extract the BM from the total SSH observations obtained by satellite altimetry is the crucial problem to be solved in this paper. To tackle this issue, we propose a deep learning model named the BM‐UBM Network to establish the relationship between the total SSH and the BM component. The BM‐UBM Network can generate SSH estimations for the BM when provided with a well‐resolved SSH snapshot. Key Points: A Deep learning model is developed to extract balanced motions from sea surface height snapshot based on a realistic simulationDiagnostics of three metrics reveal the effectiveness of the model in extracting balanced motionsThe model exhibits remarkable advantages over the Gaussian filter (baseline) in capturing the gradient and Laplacian information [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Few‐shot segmentation framework for lung nodules via an optimized active contour model.
- Author
-
Yang, Lin, Shao, Dan, Huang, Zhenxing, Geng, Mengxiao, Zhang, Na, Chen, Long, Wang, Xi, Liang, Dong, Pang, Zhi‐Feng, and Hu, Zhanli
- Subjects
- *
ARTIFICIAL neural networks , *PULMONARY nodules , *NONSMOOTH optimization , *DEEP learning , *ACTIVE learning - Abstract
Background: Accurate segmentation of lung nodules is crucial for the early diagnosis and treatment of lung cancer in clinical practice. However, the similarity between lung nodules and surrounding tissues has made their segmentation a longstanding challenge. Purpose: Existing deep learning and active contour models each have their limitations. This paper aims to integrate the strengths of both approaches while mitigating their respective shortcomings. Methods: In this paper, we propose a few‐shot segmentation framework that combines a deep neural network with an active contour model. We introduce heat kernel convolutions and high‐order total variation into the active contour model and solve the challenging nonsmooth optimization problem using the alternating direction method of multipliers. Additionally, we use the presegmentation results obtained from training a deep neural network on a small sample set as the initial contours for our optimized active contour model, addressing the difficulty of manually setting the initial contours. Results: We compared our proposed method with state‐of‐the‐art methods for segmentation effectiveness using clinical computed tomography (CT) images acquired from two different hospitals and the publicly available LIDC dataset. The results demonstrate that our proposed method achieved outstanding segmentation performance according to both visual and quantitative indicators. Conclusion: Our approach utilizes the output of few‐shot network training as prior information, avoiding the need to select the initial contour in the active contour model. Additionally, it provides mathematical interpretability to the deep learning, reducing its dependency on the quantity of training samples. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Pan‐cancer image segmentation based on feature pyramids and Mask R‐CNN framework.
- Author
-
Wang, Juan, Zhou, Jian, and Wang, Man
- Abstract
Background Purpose Objective Methods Results Conclusions Cancer, a disease with a high mortality rate, poses a great threat to patients' physical and mental health and can lead to huge medical costs and emotional damage. With the continuous development of artificial intelligence technologies, deep learning‐based cancer image segmentation techniques are becoming increasingly important in cancer detection and accurate diagnosis. However, in segmentation tasks, there are differences in efficiency between large and small objects and limited segmentation effects on objects of individual sizes. The previous segmentation frameworks still have room for improvement in multi‐scale collaboration when segmenting objects.This paper proposes a method to train a deep learning segmentation framework using a feature pyramid processing dataset to improve the average precision (AP) index, and realizes multi‐scale cooperation in target segmentation.Pan‐Cancer Histology Dataset for Nuclei Instance Segmentation and Classification (PanNuke) dataset was selected to include approximately 7500 pathology images with cells from 19 different types of tissues, including five classifications of cancer, non‐cancer, inflammation, death, and connective tissue.First, the method uses whole‐slide images in the pan‐cancer histology dataset for nuclei instance segmentation and classification (PanNuke) dataset, combined with the mask region convolutional neural network (Mask R‐CNN) segmentation framework and improved loss function to segment and detect each cellular tissue in cancerous sections. Second, to address the problem of non‐synergistic object segmentation at different scales in cancerous tissue segmentation, a scheme using feature pyramids to process the dataset was adopted as part of the feature extraction module.Extensive experimental results on this dataset show that the method in this paper yields 0.269 AP and a boost of about 4% compared to the original Mask R‐CNN framework.It is effective and feasible to use feature pyramid to process data set to improve the effect of medical image segmentation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. A hybrid prediction method for short‐term load based on temporal convolutional networks and attentional mechanisms.
- Author
-
Li, Min, Tian, Hangwei, Chen, Qinghui, Zhou, Mingle, and Li, Gang
- Subjects
- *
CONVOLUTIONAL neural networks , *DEEP learning , *TRANSFORMER models , *HYBRID power , *FORECASTING , *FEATURE extraction - Abstract
Accurate power load prediction is an important guide for power system planning and operation. High‐ or low‐load prediction results will affect the operation of the power system. In recent years, deep learning technology represented by convolution neural network (CNN) and transformer has been proved to be suitable for power load prediction. This paper proposes a new short‐term power load hybrid forecasting model, called channel enhanced attention (CEA) and temporal convolutional network (TCN)‐based transformer comprehensive forecasting model. This method combines the short‐term feature extraction ability of TCN with the long‐term dependent capture ability of transformer for short‐term load forecasting. And the CEA designed in this study is added to improve the prediction accuracy. On the same dataset, the designed model predicts power load mean square errors of 0.056 and 0.146 for the next 24 h and the next week, respectively, which is 0.002 to 0.073 and 0.012 to 0.024 lower than the baseline model. The experimental results show that the hybrid short‐term power load prediction model proposed in this paper is significantly better than the existing methods. The predicted curve is in agreement with the actual charge change, which provides a good guidance for short‐term power load prediction. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Doing feminist longitudinal research across the COVID‐19 crisis: Unheard impacts on researchers and garment workers in Cambodia.
- Author
-
Brickell, Katherine, Chhom, Theavy, Lawreniuk, Sabina, McCarthy, Lauren, Mony, Reach, and So, Hengvotey
- Subjects
- *
CLOTHING workers , *LONGITUDINAL method , *COVID-19 pandemic , *RESEARCH personnel , *CONSCIOUSNESS raising , *DEEP learning - Abstract
This paper is based on the ReFashion study which used mixed‐method longitudinal research to track and amplify the experiences and coping mechanisms of 200 women garment workers in Cambodia as they navigated the financial repercussions of the COVID‐19 pandemic. It develops the idea and practice of 'feminist longitudinal research' (FLR) through re‐centring the too often marginalised knowledges and ways of knowing of Cambodian researchers and research participants. Hearing and learning from their experiences reveal the labours and care‐work involved in the 'doing' of longitudinal research during a time of extraordinary crisis, and the potential for feminist consciousness raising and solidarity that can arise both within and beyond the confines of an academic study. The paper advocates for geographers and other social scientists to go beyond technically‐framed issues of participant 'attrition' and 'retention' in longitudinal studies to think more creatively and critically about the process of longitudinal research and what it means for those taking part in it. FLR not only evidences the temporally contingent gendered impacts of a phenomenon, but can be distinguished by its intentionality and/or potential to challenge the patriarchal status quo, both in the lives of researchers and participants. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Privacy preserved collaborative transfer learning model with heterogeneous distributed data for brain tumor classification.
- Author
-
Aggarwal, Meenakshi, Khullar, Vikas, Goyal, Nitin, Rastogi, Rashi, Singh, Aman, Torres, Vanessa Yelamos, and Albahar, Marwan Ali
- Subjects
- *
BRAIN tumors , *TUMOR classification , *MACHINE learning , *FEDERATED learning , *COLLABORATIVE learning - Abstract
Correct identification of tumor in brain images is critical for treatment. In the medical domain, class distributions of recorded data could differ with locations and require high levels of privacy while collaboratively training the deep learning (DL) models for classifications. The main aim of this paper is to propose a privacy‐preserving collaborative model for the classification of brain tumor in heterogeneously distributed magnetic resonance imaging (MRI) images. In this paper, initially, an open‐source dataset has been acquired and analyzed as per the required competencies. The acquired dataset has four types of MRI images: pituitary tumor, meningioma tumor, glioma tumor, and no tumor. First, the acquired dataset was analyzed using DL and transfer learning algorithms. By applying implementations of basic algorithms, better algorithms were identified for further implementations in a federated learning ecosystem. DenseNet201‐based transfer learning was identified as a better neural network and further utilized for collaborative transfer learning implementations. Here, the paper also focused on developing a suitable system for a heterogeneous distributed tumor database. Heterogeneous data were converted from the available data by applying nonidentical data distribution. The study discovered that the federated DL models, involving multiple clients, exhibited superior performance compared to conventional pretrained models. The proposed framework possesses distinctive characteristics that distinguish it from existing classification methods for brain tumor identification, particularly in terms of ensuring data privacy for edge devices with limited resources. Due to these additional features, the framework stands as the optimal alternative solution for early diagnosis of brain tumor. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Advancements in medical diagnosis and treatment through machine learning: A review.
- Author
-
Ahsan, Mohammad, Khan, Anam, Khan, Kaif Rehman, Sinha, Bam Bahadur, and Sharma, Anamika
- Subjects
- *
DIAGNOSIS , *THERAPEUTICS , *DEEP learning , *CONVOLUTIONAL neural networks , *MACHINE learning , *TREATMENT programs - Abstract
The aptness of machine learning (ML) to learn from large datasets, discover trends, and make predictions has demonstrated its potential to metamorphose the medical field. Medical data analysis with ML algorithms can improve patient outcomes in terms of both treatment and diagnosis. This paper investigates the numerous possibilities of ML in the medical industries, including radiology, pathology, genomics, and clinical decision‐making. It also goes over the benefits and drawbacks of ML in various sectors as well as the limitations that come with its application. It illustrates the potential advantages of ML, such as better accuracy and efficiency in diagnosis and individualized treatment programs, through a review of previous studies. Lastly, it provides perspectives on prospective advancements and prospects for the discipline. This study also intends to investigate the applications of deep learning (DL) in the medical field. DL algorithms have performed exceptionally satisfactorily in several healthcare‐related fields. The main conclusions of the study are summarized, and their ramifications for the healthcare sector are discussed in this paper's conclusion. This paper intends to contribute to a greater understanding of the prevailing state of the discipline and the possibility for future developments by emphasizing the prospects of these methodologies to alter medical study and clinical practice. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Hate speech detection: A comprehensive review of recent works.
- Author
-
Gandhi, Ankita, Ahir, Param, Adhvaryu, Kinjal, Shah, Pooja, Lohiya, Ritika, Cambria, Erik, Poria, Soujanya, and Hussain, Amir
- Abstract
There has been surge in the usage of Internet as well as social media platforms which has led to rise in online hate speech targeted on individual or group. In the recent years, hate speech has resulted in one of the challenging problems that can unfurl at a fast pace on digital platforms leading to various issues such as prejudice, violence and even genocide. Considering the acceptance of Artificial Intelligence (AI) and Natural Language Processing (NLP) techniques in varied application domains, it would be intriguing to consider these techniques for automated hate speech detection. In literature, there have been efforts to recognize and categorize hate speech using varied Machine Learning (ML) and Deep Learning (DL) techniques. Hence, considering the need and provocations for hate speech detection we aim to present a comprehensive review that discusses fundamental taxonomy as well as recent advances in the field of online hate speech identification. There is a significant amount of literature related to the initial phases of hate speech detection. The background section provides a detailed explanation of the previous research. The subsequent section that follows is dedicated to examining the recent literature published from the year 2020 onwards. The paper presents some of the hate speech datasets considered for hate speech detection. Furthermore, the paper discusses different data modalities, namely, textual hate speech detection, multi‐modal hate speech detection and multilingual hate speech detection. Apart from systematic review on hate speech detection, the paper also implement several multi‐label models to compare the performance of hate speech detection by employing classic ML technique namely, Logistic Regression and DL technique namely, Long Short‐Term Memory (LSTM) and a multiclass multi‐label architecture. In the implemented architecture, we have derived two new elements to quantify the hatefulness and intensity of hatred to improve the results for hate speech detection using Indonesian tweet dataset. Empirical Analysis of the model reveals that the implemented approach outperforms and is able to achieve improved results for the underlying dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. A robust watermarking algorithm against JPEG compression based on multiscale autoencoder.
- Author
-
Zhang, Wei, Chen, Rongrong, and Wang, Bin
- Subjects
- *
DIGITAL watermarking , *DEEP learning , *JPEG (Image coding standard) , *QUALITY factor , *SIGNAL-to-noise ratio , *FEATURE extraction , *DATA compression , *CHANNEL coding - Abstract
The network structure of digital watermarking algorithm based on deep learning is usually encoder‐noise layer‐decoder. Most of the existing encoders suffer from the problem of insufficient feature extraction, and the introduction of simulated differentiable joint photographic experts group (JPEG) compression in the noise layer cannot ensure the robustness under real JPEG. In this paper, a watermarking algorithm based on multi‐scale auto‐encoder is proposed, which can effectively extract the image feature information by combining with the channel attention mechanism. At the same time, some parameters of decoder and encoder are shared to reduce redundant feature embedding and improve extraction accuracy. This paper also proposes a robust training scheme against JPEG compression, which can guide the model to store the watermark in the low‐frequency region needed for decoding. Experimental results show that the peak signal‐to‐noise ratio (PSNR) of the proposed algorithm is above 48 and the decoding rate is above 99% under JPEG compression with quality factor Q = 50. Moreover, this scheme can effectively promote the combination of noise layer in training. In addition, the proposed algorithm is also robust to other common network noises. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. Collaborative operation optimization of distribution system and virtual power plants using multi‐agent deep reinforcement learning with parameter‐sharing mechanism.
- Author
-
Sun, Zhonghao and Lu, Tianguang
- Subjects
- *
DEEP reinforcement learning , *DATA privacy , *POWER plants , *MATHEMATICAL optimization , *INTELLIGENT agents , *REINFORCEMENT learning , *DEEP learning - Abstract
With the increasing integration of distributed energy resources (DERs) into distribution systems, the optimization of system operation has become complex, facing challenges such as inadequate consideration of market participants' benefits, poor computational efficiency, and data privacy concerns. This paper introduces the concept of a virtual power plant (VPP) as a solution for energy integration and management. To strike a balance between operational safety and the interests of market participants, a dual‐layer model is proposed. This model considers the benefits of both Distribution System Operators (DSO) and VPP, while also enhancing the consideration of distribution network constraints. The DSO considers AC optimal power flow and utilizes penalty functions to ensure network security in case of violations. To enhance computational efficiency and privacy, the paper presents the parameter‐sharing twin delayed deep deterministic policy gradient approach. This approach allows multiple intelligent agents to share a neural network model, effectively reducing the computational load. During the training process, only essential data is exchanged among the agents, ensuring the privacy of sensitive information. The effectiveness of the proposed model and the algorithm is validated through a case study on an IEEE 33‐node system. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Using gamification to support learning in K‐12 education: A systematic literature review.
- Author
-
Dehghanzadeh, Hojjat, Farrokhnia, Mohammadreza, Dehghanzadeh, Hossein, Taghipour, Kiumars, and Noroozi, Omid
- Subjects
- *
GAMIFICATION , *EDUCATIONAL literature , *LEARNING , *AFFECTIVE education , *EDUCATIONAL outcomes , *DEEP learning - Abstract
Using gamification to support learning in K‐12 education has received much attention from scholars in recent years. However, there is still a lack of comprehensive understanding of how gamification should be used to effectively enhance the learning experiences of K‐12 students. The purpose of this review was to synthesize research findings on the use of gamification in K‐12 education and to propose an evidence‐informed framework. This framework will guide teachers and scholars in developing gamified learning environments that are effective in improving K‐12 students' learning. In this regard, 54 empirical studies (out of 907 peer‐reviewed articles), dating from 2008 through 2021, were reviewed using the Preferred Reporting Items for Systematic Reviews and Meta‐Analysis (PRISMA) guideline. The findings were systematically categorized into four essential dimensions of learning environments inspired by Biggs' 3P teaching and learning model, ie, 'individual factors', 'environmental factors', 'learning process' and 'learning outcome'. The review yielded rich findings concerning each dimension, providing K‐12 teachers and scholars with a comprehensive overview of research findings on using gamification for educational purposes. Meanwhile, the findings indicated the lack of empirical studies regarding constructively aligned gamified courses, in which the different dimensions of the adopted framework are implemented and evaluated coherently. The paper concludes by presenting several suggestions and directions for future research to address this shortcoming. Practitioner notesWhat is already known about this topic Gamification has demonstrated the potential to enhance learning outcomes in K‐12 education.There are instances where the findings suggest neutral or negative effects of gamification on students' learning outcomes.There is a lack of a comprehensive overview of empirical findings concerning the effectiveness of gamification in K‐12 education.What this paper adds The study highlights the potential of gamification in enhancing cognitive, affective and behavioural learning outcomes in K‐12 education, mainly by increasing motivation, engagement and competitiveness.The study provides a comprehensive overview of empirical studies on using gamification in K‐12 education.The study proposes an evidence‐informed framework that can serve as a blueprint for developing constructively aligned gamified learning environments for K‐12 education.Implications for practice and/or policy The study presents several up‐to‐date and empirically rooted calls for future research on using gamification in K‐12 education.The study makes evidence‐based recommendations for the effective integration of gamification in K‐12 education. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. A review of detecting malware in android devices based on machine learning techniques.
- Author
-
Sharma, Monika and Kaul, Ajay
- Subjects
- *
MACHINE learning , *DEEP learning , *MALWARE , *LITERATURE reviews , *RESEARCH personnel - Abstract
Malware developers install malware on mobile users' devices and steal their personal information without their knowledge. According to recent studies, it has been observed that malware developers are now targeting Android mobile devices. Researchers have examined the issues of detecting malware in these devices and proposed different methods and techniques. This study's main goal is to aid researchers in gaining a basic understanding of Android malware and its numerous detection methods. Earlier experiments that used machine learning to detect Android malware will be carefully reviewed in this paper. This in‐depth review article thoroughly examines the origins, evolution, and sustainability of Android malware detection. It offers an in‐depth literature review that includes the most recent approaches and research trends for detecting malware, from static analysis to dynamic analysis, machine learning, and deep learning. Additionally, we review current approaches' shortcomings and difficulties and suggest possible paths for further investigation. The paper aims to stimulate further innovation in this essential field by providing researchers and practitioners with a comprehensive overview of the current status of Android malware detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Transformer Connections: Improving Segmentation in Blurred Near‐Infrared Blood Vessel Image in Different Depth.
- Author
-
Wang, Jiazhe, Shimizu, Koichi, and Yoshie, Osamu
- Subjects
- *
RETINAL blood vessels , *TRANSFORMER models , *BLOOD vessels , *CONVOLUTIONAL neural networks , *DEEP learning , *IMAGE segmentation - Abstract
High‐fidelity segmentation of blood vessels plays a pivotal role in numerous biomedical applications, such as injection assistance, cancer detection, various surgeries, and vein authentication. Near‐infrared (NIR) transillumination imaging is an effective and safe method to visualize the subcutaneous blood vessel network. However, such images are severely blurred because of the light scattering in body tissues. Inspired by the Vision Transformer model, this paper proposes a novel deep learning network known as transformer connection (TRC)‐Unet to capture global blurred and local clear correlations while using multi‐layer attention. Our method mainly consists of two blocks, thereby aiming to remap skip connection information flow and fuse different domain features. Specifically, the TRC extracts global blurred information from multiple layers and suppresses scattering to increase the clarity of vessel features. Transformer feature fusion eliminates the domain gap between the highly semantic feature maps of the convolutional neural network backbone and the adaptive self‐attention maps of TRCs. Benefiting from the long‐range dependencies of transformers, we achieved competitive results in relation to various competing methods on different data sets, including retinal vessel segmentation, simulated blur image segmentation, and real NIR blood vessel image segmentation. Moreover, our method remarkably improved the segmentation results of simulated blur image data sets and a real NIR vessel image data set. The quantitative results of ablation studies and visualizations are also reported to demonstrate the superiority of the TRC‐Unet design. © 2024 The Author(s).
IEEJ Transactions on Electrical and Electronic Engineering published by Institute of Electrical Engineers of Japan and Wiley Periodicals LLC. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
25. Artistic Style Transfer Based on Attention with Knowledge Distillation.
- Author
-
Al‐Mekhlafi, Hanadi and Liu, Shiguang
- Subjects
- *
ARTISTIC style , *GRAPHIC arts , *COMPUTER art , *GRAPHIC design , *DEEP learning - Abstract
Artistic style transfer involves the adaption of an input image to reflect the style of a reference image while maintaining its original content. This technique, now a prominent focus due to its prospective use in creative fields like digital art and graphic design, typically applies normalization techniques and attention mechanisms. While these methods yield decent results, they often fall short due to distortion of content image details and non‐artefact styles. In this paper, we introduce a novel approach that synergizes adaptive instance normalization (AdaIN), attention mechanisms, knowledge distillation (KD) and strategically placed internal layers, and new enhancements designed to preserve content details and provide a nuanced control over the style transfer process. We introduce a Detail Enhancement Module to amplify high‐frequency details in the content image, enhancing edge and texture preservation. A Multi‐scale Strategy is implemented to ensure uniform style application across various detail levels, leading to more coherent stylization. The Content Feature Refinement process refines content features, sharpening and emphasizing details to preserve structural and textural integrity. AdaIN's distinctive feature of efficiently collecting style data is exploited in our approach, coupled with attention mechanisms' inherent ability to conserve content information. We supplement this blend with KD for the enhancement of model accuracy and efficiency. Additionally, the introduction of internal layers acts as a conduit to further improve the style transfer process, increasing the transfer level of features and fostering better stylized results. The cornerstone of our technique lies in preserving the content structure amidst complex style transfers. Experimental results affirm the superior performance of our method over existing techniques in both quantitative and qualitative evaluations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Mechanical Field Guiding Structure Design Strategy for Meta‐Fiber Reinforced Hydrogel Composites by Deep Learning.
- Author
-
Liu, Chuanzhi, Zhang, Xingyu, Liu, Xia, and Yang, Qingsheng
- Subjects
- *
FIBROUS composites , *DEEP learning , *HYDROGELS , *MACHINE learning , *GENERATIVE adversarial networks , *FLEXIBLE electronics , *COMPOSITE structures - Abstract
Fiber‐reinforced hydrogel composites are widely employed in many engineering applications, such as drug release, and flexible electronics, with more flexible mechanical properties than pure hydrogel materials. Comparing to the hydrogel strengthened by continuous fiber, the meta‐fiber reinforced hydrogel provides stronger individualized design ability of deformation patterns and tunable stiffness, especially for the elaborate applications in joint, cartilage, and organ. In this paper, a novel structure design strategy based on deep learning algorithm is proposed for hydrogel reinforced by meta‐fiber to achieve targeted mechanical properties, such as stress and displacement fields. A solid mechanic model for meta‐fiber reinforced hydrogel is first developed to construct the dataset of fiber distribution and the corresponding mechanical properties of the composite. Generative adversarial network (GAN) is then trained to characterize the relationship between stress or displacement field, and meta‐fiber distribution. The well‐trained GAN is implemented to design meta‐fiber reinforced hydrogel composite structure under specific operation conditions. The results show that the deep learning method may efficiently predict the structure of the hydrogel composite with satisfied confidence, and has great potential for applications in drug delivery and flexible electronics. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. A novel fish individual recognition method for precision farming based on knowledge distillation strategy and the range of the receptive field.
- Author
-
Yin, Jianhao, Wu, Junfeng, Gao, Chunqi, Yu, Hong, Liu, Liang, and Guo, Shihao
- Abstract
With the continuous development of green and high‐quality aquaculture technology, the process of industrialized aquaculture has been promoted. Automation, intelligence, and precision have become the future development trend of the aquaculture industry. Fish individual recognition can further distinguish fish individuals based on the determination of fish categories, providing basic support for fish disease analysis, bait feeding, and precision aquaculture. However, the high similarity of fish individuals and the complexity of the underwater environment presents great challenges to fish individual recognition. To address these problems, we propose a novel fish individual recognition method for precision farming that rethinks the knowledge distillation strategy and the chunking method in the vision transformer. The method uses the traditional convolutional neural network model as the teacher model, introducing the teacher token to guide the student model to learn the fish texture features. We propose stride patch embedding to expand the range of the receptive field, thus enhancing the local continuity of the image, and self‐attention‐pruning to discard unimportant tokens and reduce the model computation. The experimental results on the DlouFish dataset show that the proposed method in this paper improves accuracy by 3.25% compared to ECA Resnet152, with an accuracy of 93.19%, and also outperforms other vision transformer models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Deep learning based brain tumour architecture for weight sharing optimization in federated learning.
- Author
-
Onaizah, Ameer N., Xia, Yuanqing, Obaid, Ahmed J., and Hussain, Khurram
- Abstract
Large amounts of data is necessary for deep learning models to semantically segment images. A major issue in the field of medical imaging is accumulating adequate data and then applying specialized skills to label those medical imaging data. Collaboration across institutions might be able to alleviate this problem, but sharing medical data to a centralized place is complicated due to legal, privacy, technical, and data ownership constraints, particularly among international institutions. By guaranteeing user privacy and preventing unauthorized access to raw data, Federated Learning plays a significant role especially in decentralized deep learning applications. Each client is given a unique learning process assignment. Clients first train a machine learning model locally using data from their area. Then, clients upload training data (local updates of model weights and biases) to a server. After that, the server compiles client‐provided updates to build a global learning model. Due to the numerous parameters (weights and biases) employed by deep learning models, the constant transmission between clients and the server raises communication costs and is inefficient from the standpoint of resource use. When there are more contributing clients and communication rounds, the cost of communication becomes a bigger concern. In this paper, a novel federated learning with weight sharing optimization compression architecture FedWSOcomp is proposed for cross institutional collaboration. In FedWSOcomp, the weights from deep learning models between clients and servers help in considerably reducing the amount of updates. Top‐z sparsification, quantization with clustering, and compression with three separate encoding techniques are all implemented by FedWSOcomp. Modern compression techniques are outperformed by FedWSOcomp, which achieves compression rates of up to 1085× while saving up to 99% of bandwidth and 99% of energy for clients during communication. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. Dynamic Band‐Alignment Modulation in MoTe2/SnSe2 Heterostructure for High Performance Photodetector.
- Author
-
Zhang, Fen, Shi, Hao, Yu, Yali, Liu, Shuo, Liu, Duanyang, Zhou, Xinyun, Yuan, Le, Shi, Jiaqi, Xia, Qinglin, Wei, Zhongming, He, Jun, and Zhong, Mianzeng
- Subjects
- *
PHOTODETECTORS , *IMAGE recognition (Computer vision) , *FIELD-effect transistors , *DEEP learning , *HETEROJUNCTIONS - Abstract
For two‐dimensional (2D) layered material heterojunctions, dynamic modulation of band alignments allows for the design of devices with flexible multi‐functional applications. In this paper, a device structure is presented based on a MoTe2/SnSe2 field‐effect transistor. By applying a bias voltage to the electrostatic gate, the gate voltage is adjusted from negative to positive, causing the heterojunction to transition from type‐III band alignment to type‐II band alignment. The working mechanism and device performance of the heterojunctions with different band alignments are investigated. The device exhibited outstanding detection range (from ultraviolet to infrared), detectivity (2.42 × 109 Jones), and speed (1.3 ms). Under a positive gate voltage, a higher ratio of light current to dark current (2×103) is achieved. To further demonstrate the potential of the high‐performance devices, their reliability is confirmed through their performance in image recognition using deep learning. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Robust PUF Label Authentication System Synergistically Constructed by Hierarchical Pattern of Self‐assembled Phase‐Separation Encrypted Wrinkle and Deep Learning Model.
- Author
-
Ma, Manping, Jiang, Zetian, Ma, Tianjiao, Gao, Xiaxin, Li, Jin, Liu, Manhua, Yan, Junchi, and Jiang, Xuesong
- Abstract
With the increasing demand for information security, current anti‐counterfeiting methods not only suffer from issues such as low information density and vulnerability to forgery, but also inherently involve a trade‐off between information capacity and readout methods. This paper reports a high‐security solution using hierarchical pattern with perfect combination of micro self‐wrinkle and nano phase‐separation as physical unclonable function (PUF) labels, which is generated through self‐organization of anthracene‐functionalized poly(styrene‐block‐butadiene‐block‐styrene) (SBS‐CAN) under UV exposure. The double‐layer morphologies formed by the wrinkle and phase separation are adjustable, independent, and stable. The obtained hierarchical PUF labels exhibit random and unique features similar to the minutiae of fingerprints at both micro and nano scales, ensuring a well‐balanced bit uniformity (>0.492), high uniqueness (>0.496), and outstanding reliability (>96%). As a consequence of the multi‐layered combination of morphologies, the designed PUF label possesses an information density about 1010 times higher than that of human fingerprints. The PUF labels can be quickly obtained through simple visual scanning and exhibit sufficient security. To cope with various application environments, the advanced authentication pipeline designed
LPLA guarantees robust label recognition capability in real‐world scenarios. A practical integrated anti‐counterfeiting authentication system is developed by combining hierarchical PUF labels and authentication pipeline. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
31. A Hybrid Deep Learning‐Based Power Management Strategy for PV‐Assisted Desalination Plant.
- Author
-
Almashnowi, Majed Y. A., AlGhamdi, Sami, Alamier, Waleed M., Ali, Syed Kashif, Bakather, Omer Y., Hassan, Mohamed, Qudsieh, Isam Y., Imran, Mohd, and Alam, Md Mottahir
- Abstract
The increasing global need for freshwater has led to the widespread implementation of photovoltaic (PV) assisted desalination facilities as a viable and environmentally friendly remedy. The necessity for these plants resides in executing an effective power management strategy to provide dependable and economically feasible water generation. This paper utilizes a mixed architecture consisting of a convolutional neural network (CNN) and a deep Q‐learning Network (DQN) to implement a hybrid deep learning‐based power management strategy. The presented approach is modeled and executed in matrix laboratory (MATLAB) language, and the experimental findings validated that this algorithm achieved a computational time of 0.2 s and an energy loss of 0.01 megawatts, which is lower than the conventional models. Furthermore, the proposed strategy achieved a remarkable fit with an accuracy rate of 0.99, demonstrating its effectiveness in handling diverse load power and solar profiles in power management. Implementing this hybrid approach holds promise for a substantial reduction in operational expenditures and the advancement of sustainability in freshwater generation, making a valuable contribution toward a more environmentally friendly and optimized future for the desalination sector. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. AttUnet_R_SFT: A Novel Network to Explore the Application of Complex Terrain Information in Satellite Precipitation Estimating.
- Author
-
Zhang, Lu, Zhou, Zeming, Guan, Jiping, Gao, Yanbo, Zhang, Lifeng, and Kader, Movlan
- Abstract
Accurate rainfall measurement with a precise spatial and temporal resolution is essential for making informed decisions during disasters and conducting scientific studies, particularly in regions characterized by intricate terrain and limited coverage of automated weather stations. Retrieval of precipitation with satellite is currently the most effective means to obtain precipitation over large‐scale areas. The key to enhancing the accuracy of precipitation estimation and forecasting in regions with complex terrain lies in effectively integrating satellite data with topographic information. This paper introduces a deep learning approach called AttUnet_R_SFT that utilizes high temporal, spatial, and spectral resolution data obtained from the Fengyun 4A satellite, and incorporates the Deep Spatial Feature Transform (SFT) layer to incorporate geographical data for estimating half‐hourly precipitation in northeastern China. We assess it by compared to operational near‐real‐time satellite precipitation products demonstrated to be successful in estimating precipitation and baseline deep learning models. According to the experimental findings, the AttUnet_R_SFT model outperforms practical precipitation products and baseline deep learning models in both identifying and estimating precipitation. The main enhancement of the model performance is shown in the windward slope of the Greater Khingan Mountains as a result of the successful incorporation of geographical data. Hence, the suggested framework holds the capability to function as a superior and dependable satellite‐derived precipitation estimation solution in regions characterized by intricate terrain and infrequent rainfall. The findings of this study indicate that the utilization of deep learning algorithms for satellite precipitation estimation shows potential as a fruitful avenue for further research. Plain Language Summary: A deep learning model named AttUnet_R_SFT is proposed, which use high temporal, spatial and spectral resolution data from the Fengyun 4A satellite, and combines with the Deep Spatial Feature Transform (SFT) layer to input geographic information for half‐hourly precipitation estimation in the complex terrain region represented by northeast China. The model can provide a reference for improving the performance of precipitation estimation in areas with complex topography. Key Points: A deep‐learning model is proposed to effectively fuse satellite multispectral data of the Fengyun 4A satellite, with topographic informationHalf‐hourly precipitation is estimated with higher temporal resolution, which is closer to the operational needs of weather forecastingAs precipitation in the study area is a non‐high‐frequency event, data enhancement is attempted to use and obtain effective results [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. An infrared and visible image fusion network based on multi‐scale feature cascades and non‐local attention.
- Author
-
Xu, Jing, Liu, Zhenjin, and Fang, Ming
- Subjects
- *
IMAGE fusion , *DEEP learning , *INFRARED imaging , *ARTIFICIAL neural networks , *FEATURE extraction , *IMAGE reconstruction - Abstract
In recent years, research on infrared and visible image fusion has mainly focused on deep learning‐based approaches, particularly deep neural networks with auto‐encoder architectures. However, these approaches suffer from problems such as insufficient feature extraction capability and inefficient fusion strategies. Therefore, this paper introduces a novel image fusion network to address the limitations of infrared and visible image fusion networks with auto‐encoder architectures. In the designed network, the encoder employs a multi‐branch cascade structure, and these convolution branches with different kernel sizes provide the encoder with an adaptive receptive field to extract multi‐scale features. In addition, the fusion layer incorporates a non‐local attention module that is inspired by the self‐attention mechanism. With its global receptive field, this module is used to build a non‐local attention fusion network, which works together with the l1${l}_1$‐norm spatial fusion strategy to extract, split, filter, and fuse global and local features. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state‐of‐the‐art fusion approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Detection of fetal facial anatomy in standard ultrasonographic sections based on real‐time target detection network.
- Author
-
Liu, Zhonghua, Yu, Weifeng, Wu, Xiuming, Yang, Tong, Lyu, Guorong, Liu, Peizhong, and Xue, Hao
- Subjects
- *
FETAL anatomy , *FETAL ultrasonic imaging , *MEDICAL screening , *RECOGNITION (Psychology) , *ULTRASONIC imaging , *FACIAL pain - Abstract
At present, prenatal ultrasound is one of the important means for screening fetal malformations. In the process of prenatal ultrasound diagnosis, the accurate recognition of fetal facial ultrasound standard plane is crucial for facial malformation detection and disease screening. Due to the dense distribution of fetal facial images, no obvious structure contour boundary, small structure area, and large area overlap in the middle of the structure detection frame, this paper regards the fetal facial standard plane and its structure recognition as a universal target detection task for the first time, and applies real‐time YOLO v5s to the fetal facial ultrasound standard plane structure detection and classification task. First, we detect the structure of a single slice, and take the structure of a slice class as the recognition object. Second, we carry out structural detection experiments on three standard planes; then, on the basis of the previous stage, the images of all parts included in the ultrasound examination of multiple fetuses were collected. In the single‐class structure detection experiment and the structure detection and classification experiment of three types of standard planes, the overall recognition effect of Precision and Recall index data is better, with Precision being 98.3% and 98.1%, and Recall being 99.3% and 98.2%, respectively. The experimental results show that the model has the ability to identify fetal facial anatomy and standard sections in different data, which can help the physician to automatically and quickly screen out the standard sections of each fetal facial ultrasound. Synopsis: By automatically identifying key anatomical structures within ultrasound images, it provides a theoretical basis for doctors to choose standard planes. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Model‐driven neural network based for HPO‐MIMO channel estimation.
- Author
-
Gong, Yi, Liu, Yujia, Meng, Fanke, and Xu, Zhan
- Subjects
- *
CHANNEL estimation , *REAL-time computing , *DEEP learning , *TELECOMMUNICATION , *WIRELESS communications - Abstract
Integrated Sensing and Communications (ISAC) need to process data streams in high‐speed sensor data acquisition or high‐speed wireless communications. To process the data can require more computing and communication resources, resulting in higher power consumption. Halved‐Phase Only Multiple Input Multiple Output (HPO‐MIMO) communication technology can solve this problem by using low‐power nonlinear detection devices. In ISAC, Channel Estimation (CE) technology can provide key channel characteristics and state information for sensing and collaborative work of perception and communication tasks. However, HPO‐MIMO system cannot realize CE using traditional receiver schemes because of the missing amplitude. In order to solve this problem, two HPO‐MIMO CE schemes based on model‐driven deep learning are proposed in this paper. The proposed schemes include a Densely Residual Network (DRN) and a Inception‐Resnet (IR), which is suitable for the case of sufficient data and insufficient data, respectively. The simulation results show that the performance of DRN based scheme is better than that of IR based scheme when the data amount is sufficient, and the performance of IR based scheme is better when the dataset is small. In addition, the proposed CE schemes work well with a range of antenna sizes and distances. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Recognition of Fiber Optic Vibration Signals Based on Laplace Wavelet Transform and Deep Learning.
- Author
-
Qi, Jinshui, Mo, Jiaqing, Niu, Yasen, and Cui, Yiteng
- Subjects
- *
WAVELET transforms , *DEEP learning , *CONVOLUTIONAL neural networks , *SIGNAL processing , *OPTICAL fibers - Abstract
Convolutional neural networks possess the capability of feature learning and nonlinear mapping, which has significant advantages in classifying and recognizing optical fiber vibration signals. In order to further enhance the recognition rate of vibration signals, this paper combines wavelet transform with convolutional neural networks and designs a convolutional layer based on parameterized wavelets. In this layer, the initial signal is convolved with parameterized Laplace wavelet dictionaries to complete the wavelet transform. Such customized filters make more sense than filters with randomly initialized parameters for traditional CNNs. Simultaneously, we introduce the channel attention mechanism to enhance the features of the filtered signals. Subsequently, standard CNNs are employed to extract and process signal features, ultimately utilizing a multi‐layer perceptron for recognition and classification. The experimental results show that the network model possesses better recognition refinement. © 2024 Institute of Electrical Engineer of Japan and Wiley Periodicals LLC. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Security Enhancement of mmWave MIMO Wireless Communication System Using Adversarial Training.
- Author
-
Saini, Mehak and Grewal, Surender K.
- Subjects
- *
MIMO systems , *WIRELESS communications , *DEEP learning , *5G networks , *NEXT generation networks , *MILLIMETER waves - Abstract
Millimeter wave MIMO wireless communication systems are deployed in 5G and next‐generation networks. The effectiveness of deep learning models for improving the performance of these systems has been proven in the literature. However, several deep learning models are vulnerable to security threats, such as adversarial attacks. Therefore, for the deployment of these systems, it is essential to make them resilient to such kinds of attacks for good quality secure communication. Adversarial training is a solution by which deep learning models are trained for adversarial attacks beforehand. Adversarial training for three types of adversarial attacks, that is, Fast Gradient Sign Method, Iterative Fast Gradient Sign Method, and Momentum Iterative Fast Gradient Sign Method is implemented in this paper. The simulation results depict a decrease in the error at the receiving end after adversarial training, even after an adversarial attack has been applied. © 2024 Institute of Electrical Engineer of Japan and Wiley Periodicals LLC. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Automating hyperparameter optimization in geophysics with Optuna: A comparative study.
- Author
-
Almarzooq, Hussain and bin Waheed, Umair
- Subjects
- *
DEEP learning , *SEISMIC wave velocity , *GEOPHYSICS , *COMPARATIVE studies , *GEOPHYSICISTS , *GRIDS (Cartography) - Abstract
Deep learning has gained attraction amongst geophysicists for solving complex longstanding problems. Nevertheless, proper hyperparameter optimization methodologies remain critically underexplored in geophysical deep learning research. This paper attempts to first highlight the importance of hyperparameter optimization and then showcase two geophysics‐related deep learning examples where a grid search and Optuna framework (an automated optimization approach) were used for hyperparameter optimization. We consider two geophysical problems related to denoising seismic traces and the inversion of seismic traces for velocity information. In both cases, models created based on Optuna hyperparameter optimization were able to perform better than those created through grid search. The most significant advantage of Optuna, however, is having quantifiable results to justify the choice of a neural network architecture, depth and other hyperparameters rather than relying on inefficient methods of exploring the hyperparameter space such as a trial‐and‐error or grid search. This study aims to stimulate further exploration and adoption of these frameworks, pushing the boundaries of current deep learning based geophysical problem‐solving methodologies towards full automation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Adapting low‐dose CT denoisers for texture preservation using zero‐shot local noise‐level matching.
- Author
-
Ko, Youngjun, Song, Seongjong, Baek, Jongduk, and Shim, Hyunjung
- Subjects
- *
IMAGE denoising , *COMPUTED tomography , *SUPERCONDUCTING quantum interference devices , *DEEP learning , *RADIOLOGISTS - Abstract
Background: On enhancing the image quality of low‐dose computed tomography (LDCT), various denoising methods have achieved meaningful improvements. However, they commonly produce over‐smoothed results; the denoised images tend to be more blurred than the normal‐dose targets (NDCTs). Furthermore, many recent denoising methods employ deep learning(DL)‐based models, which require a vast amount of CT images (or image pairs). Purpose: Our goal is to address the problem of over‐smoothed results and design an algorithm that works regardless of the need for a large amount of training dataset to achieve plausible denoising results. Over‐smoothed images negatively affect the diagnosis and treatment since radiologists had developed clinical experiences with NDCT. Besides, a large‐scale training dataset is often not available in clinical situations. To overcome these limitations, we propose locally‐adaptive noise‐level matching (LANCH), emphasizing the output should retain the same noise‐level and characteristics to that of the NDCT without additional training. Methods: We represent the NDCT image as the pixel‐wisely weighted sum of an over‐smoothed output from off‐the‐shelf denoiser (OSD) and the difference between the LDCT image and the OSD output. Herein, LANCH determines a 2D ratio map (i.e., pixel‐wise weight matrix) by locally matching the noise‐level of output and NDCT, where the LDCT‐to‐NDCT device flux (mAs) ratio reveals the NDCT noise‐level. Thereby, LANCH can preserve important details in LDCT, and enhance the sharpness of the noise‐free regions. Note that LANCH can enhance any LDCT denoisers without additional training data (i.e., zero‐shot). Results: The proposed method is applicable to any OSD denoisers, reporting significant texture plausibility development over the baseline denoisers in quantitative and qualitative manners. It is surprising that the denoising accuracy achieved by our method with zero‐shot denoiser was comparable or superior to that of the best training‐based denoisers; our result showed 1% and 33% gains in terms of SSIM and DISTS, respectively. Reader study with experienced radiologists shows significant image quality improvements, a gain of + 1.18 on a five‐point mean opinion score scale. Conclusions: In this paper, we propose a technique to enhance any low‐dose CT denoiser by leveraging the fundamental physical relationship between the x‐ray flux and noise variance. Our method is capable of operating in a zero‐shot condition, which means that only a single low‐dose CT image is required for the enhancement process. We demonstrate that our approach is comparable or even superior to supervised DL‐based denoisers that are trained using numerous CT images. Extensive experiments illustrate that our method consistently improves the performance of all tested LDCT denoisers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Deep learning‐based harmonization of trabecular bone microstructures between high‐ and low‐resolution CT imaging.
- Author
-
Guha, Indranil, Nadeem, Syed Ahmed, Zhang, Xiaoliu, DiCamillo, Paul A., Levy, Steven M., Wang, Ge, and Saha, Punam K.
- Subjects
- *
CANCELLOUS bone , *COMPUTED tomography , *GENERATIVE adversarial networks , *BONE density , *TRAINING of volunteers , *VOLUNTEER recruitment , *DEEP learning , *OPTICAL scanners - Abstract
Background: Osteoporosis is a bone disease related to increased bone loss and fracture‐risk. The variability in bone strength is partially explained by bone mineral density (BMD), and the remainder is contributed by bone microstructure. Recently, clinical CT has emerged as a viable option for in vivo bone microstructural imaging. Wide variations in spatial‐resolution and other imaging features among different CT scanners add inconsistency to derived bone microstructural metrics, urging the need for harmonization of image data from different scanners. Purpose: This paper presents a new deep learning (DL) method for the harmonization of bone microstructural images derived from low‐ and high‐resolution CT scanners and evaluates the method's performance at the levels of image data as well as derived microstructural metrics. Methods: We generalized a three‐dimensional (3D) version of GAN‐CIRCLE that applies two generative adversarial networks (GANs) constrained by the identical, residual, and cycle learning ensemble (CIRCLE). Two GAN modules simultaneously learn to map low‐resolution CT (LRCT) to high‐resolution CT (HRCT) and vice versa. Twenty volunteers were recruited. LRCT and HRCT scans of the distal tibia of their left legs were acquired. Five‐hundred pairs of LRCT and HRCT image blocks of 64×64×64$64 \times 64 \times 64 $ voxels were sampled for each of the twelve volunteers and used for training in supervised as well as unsupervised setups. LRCT and HRCT images of the remaining eight volunteers were used for evaluation. LRCT blocks were sampled at 32 voxel intervals in each coordinate direction and predicted HRCT blocks were stitched to generate a predicted HRCT image. Results: Mean ± standard deviation of structural similarity (SSIM) values between predicted and true HRCT using both 3DGAN‐CIRCLE‐based supervised (0.84 ± 0.03) and unsupervised (0.83 ± 0.04) methods were significantly (p < 0.001) higher than the mean SSIM value between LRCT and true HRCT (0.75 ± 0.03). All Tb measures derived from predicted HRCT by the supervised 3DGAN‐CIRCLE showed higher agreement (CCC ∈$ \in $ [0.956 0.991]) with the reference values from true HRCT as compared to LRCT‐derived values (CCC ∈$ \in $ [0.732 0.989]). For all Tb measures, except Tb plate‐width (CCC = 0.866), the unsupervised 3DGAN‐CIRCLE showed high agreement (CCC ∈$ \in $ [0.920 0.964]) with the true HRCT‐derived reference measures. Moreover, Bland‐Altman plots showed that supervised 3DGAN‐CIRCLE predicted HRCT reduces bias and variability in residual values of different Tb measures as compared to LRCT and unsupervised 3DGAN‐CIRCLE predicted HRCT. The supervised 3DGAN‐CIRCLE method produced significantly improved performance (p < 0.001) for all Tb measures as compared to the two DL‐based supervised methods available in the literature. Conclusions: 3DGAN‐CIRCLE, trained in either unsupervised or supervised fashion, generates HRCT images with high structural similarity to the reference true HRCT images. The supervised 3DGAN‐CIRCLE improves agreements of computed Tb microstructural measures with their reference values and outperforms the unsupervised 3DGAN‐CIRCLE. 3DGAN‐CIRCLE offers a viable DL solution to retrospectively improve image resolution, which may aid in data harmonization in multi‐site longitudinal studies where scanner mismatch is unavoidable. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. A non‐verbal teaching behaviour analysis for improving pointing out gestures: The case of asynchronous video lecture analysis using deep learning.
- Author
-
Yoon, Ho Young, Kang, Seokmin, and Kim, Sungyeun
- Subjects
- *
LECTURE method in teaching , *DATA analysis , *RESEARCH funding , *TEACHING methods , *EDUCATIONAL technology , *DESCRIPTIVE statistics , *NONVERBAL communication , *BODY language , *DEEP learning , *ONLINE education , *ANALYSIS of variance , *STATISTICS , *VIDEO recording - Abstract
Background: Research into enhancing the effectiveness of information delivery in asynchronous video lectures remains sparse. This study analyzes the nonverbal teaching behaviours in asynchronous online videos, drawing comparisons between pre‐service and in‐service teachers (ITs). Objectives: This research primarily aims to juxtapose the nonverbal teaching behaviours, such as arm extensions and body orientation, utilized by pre‐service teachers (PTs) and ITs within asynchronous online videos. Methods: Asynchronous video lectures from four pre‐service and four ITs across four diverse subject topics were scrutinized. Leveraging deep learning technology, teachers' poses during their instruction towards a video camera were quantified, with a particular focus on arm stretch range and body orientation in relation to the subject being taught. Results: The findings revealed that PTs were deficient in effectively employing pointing gestures. Their arm stretches and body orientation towards the board were not differentiated across subjects. Conversely, ITs demonstrated subject‐specific variations in their arm extension and body orientation, signalling their effective strategies for knowledge dissemination. Conclusions and Discussion: This study emphasizes the importance of assessing nonverbal teaching behaviours in the development of effective instructional training. It accentuates the need for nonverbal communication and subject‐specific teaching strategy training in PTs. Future investigations could broaden their scope to include larger sample sizes and expanded subject areas to discern more comprehensive trends in nonverbal teaching behaviours. Lay Description: What is already known about this topic: The proliferation of online and distance learning has been largely driven by the widespread utilization of asynchronous video lectures.However, the focus of video analysis regarding teaching practices has predominantly been on in‐class activities. This has resulted in the underdevelopment of methods for delivering knowledge through asynchronous video lectures.Research into enhancing the effectiveness of information delivery within asynchronous video lectures remains sparse. What this paper adds: This study contributes to the development of teaching techniques for asynchronous online lectures through a comparative analysis of nonverbal teaching behaviours between pre‐service and in‐service teachers within asynchronous lecture videos.The findings suggest that pre‐service teachers have a deficiency in knowledge regarding the effective use of demonstrative gestures. Implications for practice and/or policy: Considering the widespread use of asynchronous video lectures – including flipped and blend learning, the training of pre‐school teachers using video analysis is requiredThe creation of a robust metric for developing knowledge delivery methods is feasible and advisable, potentially by leveraging state‐of‐the‐art technology such as deep learning. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Deep learning method for predicting electromagnetic emission spectrum of aerospace equipment.
- Author
-
Zhang, Yuting
- Subjects
- *
ELECTROMAGNETIC spectrum , *DEEP learning , *MOLECULAR spectra , *ELECTRONIC equipment , *ELECTROMAGNETIC compatibility , *DECOMPOSITION method - Abstract
This paper proposes a deep learning method to predict the electromagnetic emission spectrum in the electromagnetic compatibility (EMC) test of aerospace products. A threshold‐based data decomposition method is used to propose the spike signal, reconstruct the original test data, and solve the contradiction between the overfitting and prediction accuracy of the deep learning method to deal with the EMC test spectrum. Using a long short‐term memory neural network architecture for predicting electromagnetic emission spectrum, the Bayesian optimization method is used to optimize the network hyperparameter, and the acquisition function is introduced into the loss function to improve the comprehensive training optimization of deep learning network. Apply the method to three numerical examples: signal line current conduction emission, power line voltage conduction emission, and electric field radiation emission. The analysis results indicate that at a 95% confidence level, the predicted electromagnetic emission spectrum is basically consistent with the test results. The prediction results can be used as the basis for EMC evaluation of aerospace electronic equipment. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. An efficient direction‐of‐arrival estimation of multipath signals with impulsive noise using satin bowerbird optimization‐based deep learning neural network.
- Author
-
Gantayat, Harikrushna, Panigrahi, Trilochan, and Patra, Pradyumna
- Subjects
- *
DIRECTION of arrival estimation , *STANDARD deviations , *HOPFIELD networks , *DEEP learning , *NOISE , *FEATURE extraction , *DEEP brain stimulation , *INDEPENDENT component analysis - Abstract
There exist numerous multi‐path signals with impulsive noise (IN) in a multi‐path propagation environment. The direction‐of‐arrival (DOA) is highly challenging to be assessed because of the strong IN. For various military along with civilian applications, DOA estimation has turned into a hopeful technology. But, estimating DOA for multipath signals with IN environments is extremely complicated. A desirable output is not attained by several existent techniques, namely subspace‐centric approaches, maximum likelihood techniques, along with sparse representation‐centric methods, for DOA assessment. Therefore, an effective satin bowerbird optimization‐based deep learning neural network centered DOA estimation upon single snapshots is proposed in this paper for attaining robust and precise DOA estimation in multipath and IN environments. An arbitrary array configuration is employed by utilizing the proposed work as an input signal for establishing the received signal's model. Pre‐processing, time‐frequency domain conversion, feature extraction, feature reduction and DOA estimation are the proposed method's 5 disparate stages. To evince the proposed model's efficacy, its outcomes are analogized with other prevailing techniques regarding some performance metrics. The proposed model attains the root mean square error of 0.6484366‐AZ and 0.6484366‐EA, root mean squared percentage error (RMPSE) of 4.636107‐AZ and 5.628599‐EA and probability of resolution of 0.967999‐AZ and 0.962841‐EA, which are extremely less when compared to the prevailing methods; thus, depicting the superiority of the proposed model for an effective DOA estimation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. Artificial intelligence of medical things for disease detection using ensemble deep learning and attention mechanism.
- Author
-
Djenouri, Youcef, Belhadi, Asma, Yazidi, Anis, Srivastava, Gautam, and Lin, Jerry Chun‐Wei
- Subjects
- *
ARTIFICIAL intelligence , *DEEP learning , *DISTRIBUTED sensors - Abstract
In this paper, we present a novel paradigm for disease detection. We build an artificial intelligence based system where various biomedical data are retrieved from distributed and homogeneous sensors. We use different deep learning architectures (VGG16, RESNET, and DenseNet) with ensemble learning and attention mechanisms to study the interactions between different biomedical data to detect and diagnose diseases. We conduct extensive testing on biomedical data. The results show the benefits of using deep learning technologies in the field of artificial intelligence of medical things to diagnose diseases in the healthcare decision‐making process. For example, the disease detection rate using the proposed methodology achieves 92%, which is greatly improved compared to the higher‐level disease detection models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. Efficient integration of perceptual variational autoencoder into dynamic latent scale generative adversarial network.
- Author
-
Cho, Jeongik and Krzyzak, Adam
- Abstract
Dynamic latent scale GAN is an architecture‐agnostic encoder‐based generative model inversion method. This paper introduces a method to efficiently integrate perceptual VAE into dynamic latent scale GAN to improve the performance of dynamic latent scale GAN. When dynamic latent scale GAN is trained with a normal i.i.d. latent random variable and the latent encoder is integrated into the discriminator, a sum of a predicted latent random variable of real data and a scaled normal noise follows the normal i.i.d. random variable. Since this random variable is paired with real data and follows the latent random variable, it can be used for both VAE and GAN training. Furthermore, by considering the intermediate layer output of the discriminator as the feature encoder output, the VAE can be trained to minimise the perceptual reconstruction loss. The forward propagation & backpropagation for minimising this perceptual reconstruction loss can be integrated with those of GAN training. Therefore, the proposed method does not require additional computations compared to typical GAN or dynamic latent scale GAN. Integrating perceptual VAE to dynamic latent scale GAN improved the generative and inversion performance of the model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. Philosophy of cognitive science in the age of deep learning.
- Author
-
Millière, Raphaël
- Abstract
Deep learning has enabled major advances across most areas of artificial intelligence research. This remarkable progress extends beyond mere engineering achievements and holds significant relevance for the philosophy of cognitive science. Deep neural networks have made significant strides in overcoming the limitations of older connectionist models that once occupied the center stage of philosophical debates about cognition. This development is directly relevant to long‐standing theoretical debates in the philosophy of cognitive science. Furthermore, ongoing methodological challenges related to the comparative evaluation of deep neural networks stand to benefit greatly from interdisciplinary collaboration with philosophy and cognitive science. The time is ripe for philosophers to explore foundational issues related to deep learning and cognition; this perspective paper surveys key areas where their contributions can be especially fruitful.This article is categorized under: Philosophy > Artificial Intelligence Computer Science and Robotics > Machine Learning [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Deep Learning‐Driven Robust Glucose Sensing and Fruit Brix Estimation Using a Single Microwave Split Ring Resonator.
- Author
-
Lee, Seokho, Kim, Kyungtae, Yang, Younghwan, Seong, Junhwa, Jung, Chunghwan, Lee, Hee‐Jo, and Rho, Junsuk
- Abstract
Extracting the desired information from sensor data with various internal and external effects is a significant challenge in sensor applications. Difficult‐to‐control factors such as temperature, humidity, and sample position can significantly affect the stability and reliability of sensor data. In this paper, a deep learning‐based glucose sensing method that is robust to variations in sample position is proposed. It is shown that the variations in sample position affect the sensor data measured by the designed split ring resonator‐based microwave sensor. Then, artificial neural network and 1D convolutional neural network (CNN) models are evaluated for extracting glucose concentration information from the sensor data measured at random sample positions. The concentration of the glucose solution ranged from 1% to 23% (2% increments). The 1D CNN with all frequencies (0.5–18 GHz) of the and datasets outperformed the other model, with a mean absolute error (MAE) of 0.695% and a mean squared error (MSE) of 0.876 evaluated via cross‐validation. The study demonstrated that the sensor system can be applied in real life by performing fruit Brix estimation based on transfer learning of the previous 1D CNN network, and the MAE and MSE are 0.450% and 0.305, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. Distributed MA‐IDDPG‐OLSR based stable routing protocol for unmanned aerial vehicle ad‐hoc network.
- Author
-
Zeng, Youjun, Zhou, Jie, Liu, Youjiang, Cao, Tao, Yang, Dalong, Liu, Yu, and Shi, Xianhua
- Subjects
- *
NETWORK routing protocols , *DEEP learning , *END-to-end delay , *REINFORCEMENT learning , *WIRELESS mesh networks - Abstract
In unmanned aerial vehicle ad‐hoc network (UANET), the node speed of unmanned aerial vehicles (UAVs) may reach up to 400 km/h. The fast or slow movement of UAV nodes leads to different speeds of topology change of the nodes. Traditional optimized link state routing (OLSR) protocol cannot adaptively adjust the routing update period when the network topology changes, which may lead to the nodes calculating incorrect routing tables. This increases the average end‐to‐end delay and packet loss rate for packet transmission. To enhance the adaptability of OLSR routing protocol to network topology changes, this paper proposes a multi‐agent independent deep deterministic policy gradient‐OLSR (MA‐IDDPG‐OLSR) routing protocol based on distributed multi‐agent reinforcement learning. The protocol deploys DDPG algorithm on each UAV node, and each UAV node adaptively adjusts the Hello and TC message sending intervals, according to the one‐hop neighbouring nodes as well as its own state. Simulation results show that the proposed protocol is able to improve the throughput and reduce the packet loss rate as compared to traditional AODV, GRP, OLSR, and distributed multiple‐agent independent proximal policy optimization‐OLSR (MA‐IPPO‐OLSR), distributed multiple‐agent independent twin delayed deep deterministic policy gradient‐OLSR (MA‐ITD3‐OLSR) routing protocols. Since MA‐IDDPG‐OLSR relies only on local information, there is a minor performance degradation in MA‐IDDPG‐OLSR compared to centralized single‐agent DQN‐OLSR routing protocol. But it is more suitable to a completely distributed UAV network without a centralized node. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Numerical approximation based on deep convolutional neural network for high-dimensional fully nonlinear merged PDEs and 2BSDEs.
- Author
-
Xu Xiao, Wenlin Qiu, and Nikan, Omid
- Subjects
- *
CONVOLUTIONAL neural networks , *STOCHASTIC differential equations , *NONLINEAR differential equations , *PARTIAL differential equations , *DEEP learning , *HIGH-dimensional model representation , *HAMILTON-Jacobi-Bellman equation , *TIME perception - Abstract
This paper proposes two efficient approximation methods to solve high-dimensional fully nonlinear partial differential equations (NPDEs) and second-order backward stochastic differential equations (2BSDEs), where such high-dimensional fully NPDEs are extremely difficult to solve because the computational cost of standard approximation methods grows exponentially with the number of dimensions. Therefore, we consider the following methods to overcome this difficulty. For the merged fully NPDEs and 2BSDEs system, combined with the time forward discretization and ReLU function, we use multiscale deep learning fusion and convolutional neural network (CNN) techniques to obtain two numerical approximation schemes, respectively. Finally, three practical high-dimensional test problems involving Allen-Cahn, Black-Scholes-Barenblatt, and Hamilton-Jacobi-Bellman equations are given so that the first proposed method exhibits higher efficiency and accuracy than the existing method, while the second proposed method can extend the dimensionality of the completely NPDEs-2BSDEs system over 400 dimensions, from which the numerical results highlight the effectiveness of proposed methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. Multi‐view stereo for weakly textured indoor 3D reconstruction.
- Author
-
Wang, Tao and Gan, Vincent J. L.
- Subjects
- *
DEEP learning , *NETWORK performance , *ALGORITHMS - Abstract
A 3D reconstruction enables an effective geometric representation to support various applications. Recently, learning‐based multi‐view stereo (MVS) algorithms have emerged, replacing conventional hand‐crafted features with convolutional neural network‐encoded deep representation to reduce feature matching ambiguity, leading to a more complete scene recovery from imagery data. However, the state‐of‐the‐art architectures are not designed for an indoor environment with abundant weakly textured or textureless objects. This paper proposes AttentionSPP‐PatchmatchNet, a deep learning‐based MVS algorithm designed for indoor 3D reconstruction. The algorithm integrates multi‐scale feature sampling to produce global‐context‐aware feature maps and recalibrates the weight of essential features to tackle challenges posed by indoor environments. A new dataset designed exclusively for indoor environments is presented to verify the performance of the proposed network. Experimental results show that AttentionSPP‐PatchmatchNet outperforms state‐of‐the‐art algorithms with relative 132.87% and 163.55% improvements at the 10 and 2 mm threshold, respectively, making it suitable for accurate and complete indoor 3D reconstruction. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.