Publication Year Range: Last 50 years / Search Limiters: Available in Library Collection and Peer Reviewed / Topic: deep learning - Searchworks@Jio Institute Digital Library Search Results

1. A deep learning-based approach for performance assessment and prediction: A case study of pulp and paper industries

Author: Jauhar, Sunil Kumar, Raj, Praveen Vijaya Raj Pushpa, Kamble, Sachin, Pratap, Saurabh, Gupta, Shivam, and Belhadi, Amine
Published: 2024
Full Text: View/download PDF

2. PRM-KGED: paper recommender model using knowledge graph embedding and deep neural network

Author: Thierry, Nimbeshaho, Bao, Bing-Kun, Ali, Zafar, Tan, Zhiyi, Christ Chatelain, Ingabire Batamira, and Kefalas, Pavlos
Published: 2023
Full Text: View/download PDF

3. Web-based diagnostic platform for microorganism-induced deterioration on paper-based cultural relics with iterative training from human feedback

Author: Chenshu Liu, Songbin Ben, Chongwen Liu, Xianchao Li, Qingxia Meng, Yilin Hao, Qian Jiao, and Pinyi Yang
Subjects: Paper-based cultural relics, Conservation, Computer vision, Deep learning, Strain classification, Web application, Fine Arts, Analytical chemistry, QD71-142
Abstract: Abstract Purpose Paper-based artifacts hold significant cultural and social values. However, paper is intrinsically fragile to microorganisms, such as mold, due to its cellulose composition, which can serve as a microorganisms’ nutrient source. Mold not only can damage papers’ structural integrity and pose significant challenges to conservation works but also may subject individuals attending the contaminated artifacts to health risks. Current approaches for strain identification usually require extensive training, prolonged time for analysis, expensive operation costs, and higher risks of secondary damage due to sampling. Thus, in current conservation practices with mold-contaminated artifacts, little pre-screening or strain identification was performed before mold removal, and the cleaning techniques are usually broad-spectrum rather than strain-specific. With deep learning showing promising applications across various domains, this study investigated the feasibility of using a convolutional neural network (CNN) for fast in-situ recognition and classification of mold on paper. Methods Molds were first non-invasively sampled from ancient Xuan Paper-based Chinese books from the Qing and Ming dynasties. Strains were identified using molecular biology methods and the four most prevalent strains were inoculated on Xuan paper to create mockups for image collection. Microscopic images of the molds as well as their stains situated on paper were collected using a compound microscope and commercial microscope lens for cell phone cameras, which were then used for training CNN models with a transfer learning scheme to perform the classification of mold. To enable involvement and contribution from the research community, a web interface that actuates the process while providing interactive features for users to learn about the information of the classified strain was constructed. Moreover, a feedback functionality in the web interface was embedded for catching potential classification errors, adding additional training images, or introducing new strains, all to refine the generalizability and robustness of the model. Results & Conclusion In the study, we have constructed a suite of high-confidence classification CNN models for the diagnostic process for mold contamination in conservation. At the same time, a web interface was constructed that allows recurrently refining the model with human feedback through engaging the research community. Overall, the proposed framework opens new avenues for effective and timely identification of mold, thus enabling proactive and targeted mold remediation strategies in conservation.
Published: 2024
Full Text: View/download PDF

4. NSTU-BDTAKA: An open dataset for Bangladeshi paper currency detection and recognition

Author: Md. Jubayar Alam Rafi, Mohammad Rony, and Nazia Majadi
Subjects: Computer vision, Deep learning, Image analysis, Taka detection, Taka recognition, YOLOv5 model, Computer applications to medicine. Medical informatics, R858-859.7, Science (General), Q1-390
Abstract: One of the most popular and well-established forms of payment in use today is paper money. Handling paper money might be challenging for those with vision impairments. Assistive technology has been reinventing itself throughout time to better serve the elderly and disabled people. To detect paper currency and extract other useful information from them, image processing techniques and other advanced technologies, such as Artificial Intelligence, Deep Learning, etc., can be used. In this paper, we present a meticulously curated and comprehensive dataset named ‘NSTU-BDTAKA’ tailored for the simultaneous detection and recognition of a specific object of cultural significance - the Bangladeshi paper currency (in Bengali it is called ‘Taka’). This research aims to facilitate the development and evaluation of models for both taka detection and recognition tasks, offering a rich resource for researchers and practitioners alike. The dataset is divided into two distinct components: (i) taka detection, and (ii) taka recognition. The taka detection subset comprises 3,111 high-resolution images, each meticulously annotated with rectangular bounding boxes that encompass instances of the taka. These annotations serve as ground truth for training and validating object detection models, and we adopt the state-of-the-art YOLOv5 architecture for this purpose. In the taka recognition subset, the dataset has been extended to include a vast collection of 28,875 images, each showcasing various instances of the taka captured in diverse contexts and environments. The recognition dataset is designed to address the nuanced task of taka recognition providing researchers with a comprehensive set of images to train, validate, and test recognition models. This subset encompasses challenges such as variations in lighting, scale, orientation, and occlusion, further enhancing the robustness of developed recognition algorithms. The dataset NSTU-BDTAKA not only serves as a benchmark for taka detection and recognition but also fosters advancements in object detection and recognition methods that can be extrapolated to other cultural artifacts and objects. We envision that the dataset will catalyze research efforts in the field of computer vision, enabling the development of more accurate, robust, and efficient models for both detection and recognition tasks.
Published: 2024
Full Text: View/download PDF

5. Automated Detection of COVID-19 Using Deep Learning Approaches with Paper-Based ECG Reports

Author: Bassiouni, Mahmoud M., Hegazy, Islam, Rizk, Nouhad, El-Dahshan, El-Sayed A., and Salem, Abdelbadeeh M.
Published: 2022
Full Text: View/download PDF

6. Research hotspots and trends of artificial intelligence in diabetic retinopathy based on bibliometrics and high-impact papers

Author: Ruo-Yu Wang, Wang-Ting Li, Shao-Chong Zhang, and Wei-Hua Yang
Subjects: artificial intelligence, diabetic retinopathy, bibliometrics, citespace, deep learning, hotspots, trends, Ophthalmology, RE1-994
Abstract: AIM: To analyze research hotspots and trends of artificial intelligence in diabetic retinopathy(DR)based on bibliometrics and high-impact papers.METHODS: Papers on artificial intelligence in DR research published in the Web of Science Core Collection(WoSCC)from January 1, 2012, to December 31, 2022 were retrieved. The data was analyzed by CiteSpace software to examine annual publication number, countries, institutions, source journal, research categories, keywords, and to perform an in-depth analysis of high-impact papers.RESULTS: A total of 1 009 papers on artificial intelligence in DR from 79 countries were included in the study, with 272 papers published in 2022. Notably, China and India contributed 287 and 234 papers, respectively. The United Kingdom exhibited a centrality score of 0.31, while the United States boasted an impressive H-index of 48. Three prominent institutions in the United Kingdom(University of London, Moorfields Eye Hospital, and University College London)and one institution in Egypt(Egyptian Knowledge Bank)all achieved a notable H-index of 14. The primary academic disciplines associated with this research field encompassed ophthalmology, computer science, and artificial intelligence. Burst keywords in the years 2021～2022 included transfer learning, vessel segmentation, and convolutional neural networks.CONCLUSION: China emerged as the leading contributor in terms of publication number in this field, while the United States stood out as a key player. Notably, Egyptian Knowledge Bank and University of London assumed leading roles among research institutions. Additionally, IEEE Access was identified as the most active journal within this domain. The research focus in the field of artificial intelligence in DR has transitioned from AI applications in disease detection and grading to a more concentrated exploration of AI-assisted diagnostic systems. Transfer learning, vessel segmentation, and convolutional neural networks hold substantial promise for widespread applications in this field.
Published: 2023
Full Text: View/download PDF

7. Web-based diagnostic platform for microorganism-induced deterioration on paper-based cultural relics with iterative training from human feedback.

Author: Liu, Chenshu, Ben, Songbin, Liu, Chongwen, Li, Xianchao, Meng, Qingxia, Hao, Yilin, Jiao, Qian, and Yang, Pinyi
Subjects: *CONVOLUTIONAL neural networks, *COMMUNITY involvement, *DEEP learning, *CLASSIFICATION, *CAMERA phones, *RELICS, *AUTOMATIC classification, *SECURITY classification (Government documents)
Abstract: Purpose: Paper-based artifacts hold significant cultural and social values. However, paper is intrinsically fragile to microorganisms, such as mold, due to its cellulose composition, which can serve as a microorganisms' nutrient source. Mold not only can damage papers' structural integrity and pose significant challenges to conservation works but also may subject individuals attending the contaminated artifacts to health risks. Current approaches for strain identification usually require extensive training, prolonged time for analysis, expensive operation costs, and higher risks of secondary damage due to sampling. Thus, in current conservation practices with mold-contaminated artifacts, little pre-screening or strain identification was performed before mold removal, and the cleaning techniques are usually broad-spectrum rather than strain-specific. With deep learning showing promising applications across various domains, this study investigated the feasibility of using a convolutional neural network (CNN) for fast in-situ recognition and classification of mold on paper. Methods: Molds were first non-invasively sampled from ancient Xuan Paper-based Chinese books from the Qing and Ming dynasties. Strains were identified using molecular biology methods and the four most prevalent strains were inoculated on Xuan paper to create mockups for image collection. Microscopic images of the molds as well as their stains situated on paper were collected using a compound microscope and commercial microscope lens for cell phone cameras, which were then used for training CNN models with a transfer learning scheme to perform the classification of mold. To enable involvement and contribution from the research community, a web interface that actuates the process while providing interactive features for users to learn about the information of the classified strain was constructed. Moreover, a feedback functionality in the web interface was embedded for catching potential classification errors, adding additional training images, or introducing new strains, all to refine the generalizability and robustness of the model. Results & Conclusion: In the study, we have constructed a suite of high-confidence classification CNN models for the diagnostic process for mold contamination in conservation. At the same time, a web interface was constructed that allows recurrently refining the model with human feedback through engaging the research community. Overall, the proposed framework opens new avenues for effective and timely identification of mold, thus enabling proactive and targeted mold remediation strategies in conservation. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

8. Speech recognition for Kazakh language: a research paper.

Author: Kapyshev, Galym, Nurtas, Marat, and Altaibek, Aizhan
Subjects: SPEECH perception, AUTOMATIC speech recognition, NATURAL language processing, LANGUAGE research, DEEP learning, MARKOV processes
Abstract: In recent years, the research pertaining to speech recognition technology in the Kazakh language has gained significant importance. This is due to the increasing demand for natural language processing applications in the region where Kazakh is predominantly spoken. Thus, there exists an urgent requirement for precise and dependable speech recognition systems. The research study examines the application of sophisticated deep learning methodologies, such as Natural Language Processing (NLP) and Hidden Markov Model (HMM), in facilitating speech recognition for the Kazakh language. Additionally, the investigation delves into how various techniques, including data preprocessing, acoustic modeling, and language modeling, can aid in devising effective speech recognition systems. The article deliberates on the feasible uses of speech recognition technology in the geographic area where Kazakh language is spoken and outlines its future research prospects. The investigation underscores the significance of persistent inquiry in this realm to confront distinctive obstacles encountered in creating speech recognition systems for languages with restricted resources. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

9. Improved GRU prediction of paper pulp press variables using different pre-processing methods

Author: Balduíno César Mateus, Mateus Mendes, José Torres Farinha, António Marques Cardoso, Rui Assis, and Hamzeh Soltanali
Subjects: Deep learning, LOWESS, forecasting failures, industrial press, recurrent neural network, predictive maintenance, Technology, Manufactures, TS1-2301, Business, HF5001-6182
Abstract: ABSTRACTPredictive maintenance strategies are becoming increasingly more important with the increased needs for automation and digitalization within pulp and paper manufacturing sector.Hence, this study contributes to examine the most efficient pre-processing approaches for predicting sensory data trends based on Gated Recurrent Unit (GRU) neural networks. To validate the model, the data from two paper pulp presses with several pre-processing methods are utilized for predicting the units’ conditions. The results of validation criteria show that pre-processing data using a LOWESS in combination with the Elimination of discrepant data filter achieves more stable results, the prediction error decreases, and the predicted values are easier to interpret. The model can anticipate future values with MAPE, RMSE and MAE of 1.2, 0.27 and 0.30 respectively. The errors are below the significance level. Moreover, it is identified that the best hyperparameters found for each paper pulp press must be different.
Published: 2023
Full Text: View/download PDF

10. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries.

Author: Folks, Ryan D., Naik, Bhiken I., Brown, Donald E., and Durieux, Marcel E.
Subjects: *MEDICAL records, *ARTIFICIAL neural networks, *COMPUTER vision, *DIASTOLIC blood pressure, *MEDICAL personnel, *DEEP learning, *SYSTOLIC blood pressure
Abstract: Background: In low-middle income countries, healthcare providers primarily use paper health records for capturing data. Paper health records are utilized predominately due to the prohibitive cost of acquisition and maintenance of automated data capture devices and electronic medical records. Data recorded on paper health records is not easily accessible in a digital format to healthcare providers. The lack of real time accessible digital data limits healthcare providers, researchers, and quality improvement champions to leverage data to improve patient outcomes. In this project, we demonstrate the novel use of computer vision software to digitize handwritten intraoperative data elements from smartphone photographs of paper anesthesia charts from the University Teaching Hospital of Kigali. We specifically report our approach to digitize checkbox data, symbol-denoted systolic and diastolic blood pressure, and physiological data. Methods: We implemented approaches for removing perspective distortions from smartphone photographs, removing shadows, and improving image readability through morphological operations. YOLOv8 models were used to deconstruct the anesthesia paper chart into specific data sections. Handwritten blood pressure symbols and physiological data were identified, and values were assigned using deep neural networks. Our work builds upon the contributions of previous research by improving upon their methods, updating the deep learning models to newer architectures, as well as consolidating them into a single piece of software. Results: The model for extracting the sections of the anesthesia paper chart achieved an average box precision of 0.99, an average box recall of 0.99, and an mAP0.5-95 of 0.97. Our software digitizes checkbox data with greater than 99% accuracy and digitizes blood pressure data with a mean average error of 1.0 and 1.36 mmHg for systolic and diastolic blood pressure respectively. Overall accuracy for physiological data which includes oxygen saturation, inspired oxygen concentration and end tidal carbon dioxide concentration was 85.2%. Conclusions: We demonstrate that under normal photography conditions we can digitize checkbox, blood pressure and physiological data to within human accuracy when provided legible handwriting. Our contributions provide improved access to digital data to healthcare practitioners in low-middle income countries. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. Ash Detection of Coal Slime Flotation Tailings Based on Chromatographic Filter Paper Sampling and Multi-Scale Residual Network.

Author: Wenbo Zhu, Neng Liu, Zhengjun Zhu, Haibing Li, Weijie Fu, Zhongbo Zhang, and Xinghao Zhang
Subjects: COAL ash, FILTER paper, FLOTATION, FEATURE extraction, DEEP learning, FOAM
Abstract: The detection of ash content in coal slime flotation tailings using deep learning can be hindered by various factors such as foam, impurities, and changing lighting conditions that disrupt the collection of tailings images. To address this challenge, we present a method for ash content detection in coal slime flotation tailings. This method utilizes chromatographic filter paper sampling and a multi-scale residual network, which we refer to as MRCN. Initially, tailings are sampled using chromatographic filter paper to obtain static tailings images, effectively isolating interference factors at the flotation site. Subsequently, the MRCN, consisting of a multi-scale residual network, is employed to extract image features and compute ash content. Within the MRCN structure, tailings images undergo convolution operations through two parallel branches that utilize convolution kernels of different sizes, enabling the extraction of image features at various scales and capturing a more comprehensive representation of the ash content information. Furthermore, a channel attention mechanism is integrated to enhance the performance of the model. The combination of the multi-scale residual structure and the channel attention mechanism within MRCN results in robust capabilities for image feature extraction and ash content detection. Comparative experiments demonstrate that this proposed approach, based on chromatographic filter paper sampling and the multi-scale residual network, exhibits significantly superior performance in the detection of ash content in coal slime flotation tailings. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

12. Generation of Highlights From Research Papers Using Pointer-Generator Networks and SciBERT Embeddings

Author: Tohida Rehman, Debarshi Kumar Sanyal, Samiran Chattopadhyay, Plaban Kumar Bhowmick, and Partha Pratim Das
Subjects: Deep learning, natural language generation, pointer-generator network, SciBERT, scientific data, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Nowadays many research articles are prefaced with research highlights to summarize the main findings of the paper. Highlights not only help researchers precisely and quickly identify the contributions of a paper, they also enhance the discoverability of the article via search engines. We aim to automatically construct research highlights given certain segments of a research paper. We use a pointer-generator network with coverage mechanism and a contextual embedding layer at the input that encodes the input tokens into SciBERT embeddings. We test our model on a benchmark dataset, CSPubSum, and also present MixSub, a new multi-disciplinary corpus of papers for automatic research highlight generation. For both CSPubSum and MixSub, we have observed that the proposed model achieves the best performance compared to related variants and other models proposed in the literature. On the CSPubSum dataset, our model achieves the best performance when the input is only the abstract of a paper as opposed to other segments of the paper. It produces ROUGE-1, ROUGE-2 and ROUGE-L F1-scores of 38.26, 14.26 and 35.51, respectively, METEOR score of 32.62, and BERTScore F1 of 86.65 which outperform all other baselines. On the new MixSub dataset, where only the abstract is the input, our proposed model (when trained on the whole training corpus without distinguishing between the subject categories) achieves ROUGE-1, ROUGE-2 and ROUGE-L F1-scores of 31.78, 9.76 and 29.3, respectively, METEOR score of 24.00, and BERTScore F1 of 85.25.
Published: 2023
Full Text: View/download PDF

13. Study design of deep learning based automatic detection of cerebrovascular diseases on medical imaging: a position paper from Chinese Association of Radiologists

Author: Longjiang Zhang, Zhao Shi, Min Chen, Yingmin Chen, Jingliang Cheng, Li Fan, Nan Hong, Wenxiao Jia, Guihua Jiang, Shenghong Ju, Xiaogang Li, Xiuli Li, Changhong Liang, Weihua Liao, Shiyuan Liu, Zaiming Lu, Lin Ma, Ke Ren, Pengfei Rong, Bin Song, Gang Sun, Rongpin Wang, Zhibo Wen, Haibo Xu, Kai Xu, Fuhua Yan, Yizhou Yu, Yunfei Zha, Fandong Zhang, Minwen Zheng, Zhen Zhou, Wenzhen Zhu, Guangming Lu, and Zhengyu Jin
Subjects: Cerebrovascular diseases, Deep learning, Study design, Medical imaging, Medical technology, R855-855.5
Abstract: In recent years, with the development of artificial intelligence, especially deep learning technology, researches on automatic detection of cerebrovascular diseases on medical images have made tremendous progress and these models are gradually entering into clinical practice. However, because of the complexity and flexibility of the deep learning algorithms, these researches have great variability on model building, validation process, performance description and results interpretation. The lack of a reliable, consistent, standardized design protocol has, to a certain extent, affected the progress of clinical translation and technology development of computer aided detection systems. After reviewing a large number of literatures and extensive discussion with domestic experts, this position paper put forward recommendations of standardized design on the key steps of deep learning-based automatic image detection models for cerebrovascular diseases. With further research and application expansion, this position paper would continue to be updated and gradually extended to evaluate the generalizability and clinical application efficacy of such tools.
Published: 2022
Full Text: View/download PDF

14. A personalized paper recommendation method based on knowledge graph and transformer encoder with a self-attention mechanism.

Author: Gao, Li, Lan, Yu, Yu, Zhen, and Zhu, Jian-min
Subjects: KNOWLEDGE graphs, DEEP learning, RECOMMENDER systems, TRANSFORMER models, DIGITAL Object Identifiers, ELECTRONIC publications
Abstract: Paper recommendation with personalized methods helps researchers to track the latest academic trends and master cutting-edge academic trends efficiently. Meanwhile, the methods of previous paper recommendation suffer from three problems: data sparsity of content-based and collaborative filtering methods; Graph-based recommendations do not fully consider the personalized information of authors and their articles; Cold start based on deep learning. To overcome those difficulties, we propose a personalized paper recommendation method based on a knowledge graph and Transformer encoder (KGTE) with a self-attention mechanism. Firstly, we add auxiliary information (article title, publication year, citation times, and abstract) as attributes to the nodes of knowledge graph(KG), which contain author, digital object unique identifier(DOI) and keywords. Secondly, BERT is used to represent the semantic information features of the article and Transformer is introduced to fully integrate the feature context. After that, by using RippleNet, we traverse the knowledge graph, filter the user preference distribution and form a set of pre recommended nodes with multi_hop nodes. Finally, the prediction layer sorts the set and gets a Top_n paper recommendation. In the experiments on the DBLP and Aminer datasets, the precision value of KGTE improved by an average of 2.59% over the existing baseline methods DER and 4.23% improvement in NDCG. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

15. A novel application of XAI in squinting models: A position paper

Author: Kenneth Wenger, Katayoun Hossein Abadi, Damian Fozard, Kayvan Tirdad, Alex Dela Cruz, and Alireza Sadeghian
Subjects: Artificial Intelligence, Deep learning, Pathology, Explainable AI, XAI, Safety critical AI, Cybernetics, Q300-390, Electronic computers. Computer science, QA75.5-76.95
Abstract: Artificial Intelligence, and Machine Learning especially, are becoming increasingly foundational to our collective future. Recent developments around generative models such as ChatGPT, and DALL-E represent just the tip of the iceberg in new gadgets that will change the way we live our lives. Convolutional Neural Networks (CNNs) and Transformer models are at the heart of advancements in the autonomous vehicles and health care industries as well. Yet these models, as impressive as they are, still make plenty of mistakes without justifying or explaining what aspects of the input or internal state, was responsible for the error. Often, the goal of automation is to increase throughput, processing as many tasks as possible in a short a period of time. For some use cases the cost of mistakes might be acceptable as long as production is increased above some set margin. However, in health care, autonomous vehicles, and financial applications, the cost of a mistake might have catastrophic consequences. For this reason, industries where single mistakes can be costly are less enthusiastic about early AI adoption. The field of eXplainable AI (XAI) has attracted significant attention in recent years with the goal of producing algorithms that shed light into the decision-making process of neural networks. In this paper we show how robust vision pipelines can be built using XAI algorithms with the goal of producing automated watchdogs that actively monitor the decision-making process of neural networks for signs of mistakes or ambiguous data. We call these robust vision pipelines, squinting pipelines.
Published: 2023
Full Text: View/download PDF

16. Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper

Author: Prasoon Kumar Vinodkumar, Dogus Karabulut, Egils Avots, Cagri Ozcinar, and Gholamreza Anbarjafari
Subjects: deep learning, 3D reconstruction, 3D augmentation, 3D registration, point cloud, voxel, Science, Astrophysics, QB460-466, Physics, QC1-999
Abstract: The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future.
Published: 2024
Full Text: View/download PDF

17. Increased Accuracy on Image Classification of Game Rock Paper Scissors using CNN

Author: Muhammad Nur Ichsan, Nur Armita, Agus Eko Minarno, Fauzi Dwi Setiawan Sumadi, and Hariyady
Subjects: cnn, deep learning, image classification, machine learning, neural network, Systems engineering, TA168, Information technology, T58.5-58.64
Abstract: Rock Paper Scissors is one of the most popular games in the world, because of their easy and simple way to play among young and elderly people. The point of this game is to do the draw or just to find out who loses or wins. The pandemic conditions made people unable to meet face-to-face and could only play this game virtually. To carry out this activity in a virtual way, this research facilitates a model in the form of image classification to distinguish the hand gestures s in the form of rock, paper, and scissors. This classification process utilizes the Convolutional Neural Network (CNN) method. This method is one type of artificial neural network in terms of image classification. CNN uses three stages, namely convolutional layer, pooling layer, and fully connected layer. The implementation of this method for hand gesture classification in the form of rock, scissors, and paper images in this study shows an increased average accuracy towards the previous study from 97.66% to 99%.
Published: 2022
Full Text: View/download PDF

18. Improved GRU prediction of paper pulp press variables using different pre-processing methods.

Author: Mateus, Balduíno César, Mendes, Mateus, Torres Farinha, José, Marques Cardoso, António, Assis, Rui, and Soltanali, Hamzeh
Subjects: PAPER pulp, RECURRENT neural networks
Abstract: Predictive maintenance strategies are becoming increasingly more important with the increased needs for automation and digitalization within pulp and paper manufacturing sector. Hence, this study contributes to examine the most efficient pre-processing approaches for predicting sensory data trends based on Gated Recurrent Unit (GRU) neural networks. To validate the model, the data from two paper pulp presses with several pre-processing methods are utilized for predicting the units' conditions. The results of validation criteria show that pre-processing data using a LOWESS in combination with the Elimination of discrepant data filter achieves more stable results, the prediction error decreases, and the predicted values are easier to interpret. The model can anticipate future values with MAPE, RMSE and MAE of 1.2, 0.27 and 0.30 respectively. The errors are below the significance level. Moreover, it is identified that the best hyperparameters found for each paper pulp press must be different. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

19. Deep Learning for Automated Visual Inspection in Manufacturing and Maintenance: A Survey of Open- Access Papers

Author: Nils Hütten, Miguel Alves Gomes, Florian Hölken, Karlo Andricevic, Richard Meyes, and Tobias Meisen
Subjects: automated visual inspection, industrial applications, deep learning, computer vision, convolutional neural network, vision transformer, Technology, Applied mathematics. Quantitative methods, T57-57.97
Abstract: Quality assessment in industrial applications is often carried out through visual inspection, usually performed or supported by human domain experts. However, the manual visual inspection of processes and products is error-prone and expensive. It is therefore not surprising that the automation of visual inspection in manufacturing and maintenance is heavily researched and discussed. The use of artificial intelligence as an approach to visual inspection in industrial applications has been considered for decades. Recent successes, driven by advances in deep learning, present a possible paradigm shift and have the potential to facilitate automated visual inspection, even under complex environmental conditions. For this reason, we explore the question of to what extent deep learning is already being used in the field of automated visual inspection and which potential improvements to the state of the art could be realized utilizing concepts from academic research. By conducting an extensive review of the openly accessible literature, we provide an overview of proposed and in-use deep-learning models presented in recent years. Our survey consists of 196 open-access publications, of which 31.7% are manufacturing use cases and 68.3% are maintenance use cases. Furthermore, the survey also shows that the majority of the models currently in use are based on convolutional neural networks, the current de facto standard for image classification, object recognition, or object segmentation tasks. Nevertheless, we see the emergence of vision transformer models that seem to outperform convolutional neural networks but require more resources, which also opens up new research opportunities for the future. Another finding is that in 97% of the publications, the authors use supervised learning techniques to train their models. However, with the median dataset size consisting of 2500 samples, deep-learning models cannot be trained from scratch, so it would be beneficial to use other training paradigms, such as self-supervised learning. In addition, we identified a gap of approximately three years between approaches from deep-learning-based computer vision being published and their introduction in industrial visual inspection applications. Based on our findings, we additionally discuss potential future developments in the area of automated visual inspection.
Published: 2024
Full Text: View/download PDF

20. A Graph-Based Topic Modeling Approach to Detection of Irrelevant Citations.

Author: Pham, Phu, Le, Hieu, Tam, Nguyen Thanh, and Tran, Quang-Dieu
Subjects: NATURAL language processing, DEEP learning, MACHINE learning, INFORMATION retrieval
Abstract: In the recent years, the academic paper influence analysis has been widely studied due to its potential applications in the multiple areas of science information metric and retrieval. By identifying the academic influence of papers, authors, etc., we can directly support researchers to easily reach academic papers. These recommended candidate papers are not only highly relevant with their desired research topics but also highly-attended by the research community within these topics. For very recent years, the rapid developments of academic networks, like Google Scholar, Research Gate, CiteSeerX, etc., have significantly boosted the number of new published papers annually. It also helps to strengthen the borderless cooperation between researchers who are interested on the same research topics. However, these current academic networks still lack the capabilities of provisioning researchers deeper into most-influenced papers. They also largely ignore quite/irrelevant papers, which are not fully related with their current interest topics. Moreover, the distributions of topics within these academic papers are considered as varying and it is difficult to extract the main concentrated topics in these papers. Thus, it leads to challenges for researchers to find their appropriated/high-qualified reference resources while doing researches. To overcome this limitation, in this paper, we proposed a novel approach of paper influence analysis through their content-based and citation relationship-based analyses within the biographical network. In order to effectively extract the topic-based relevance from papers, we apply the integrated graph-based citation relationship analysis with topic modeling approach to automatically learn the distributions of keyword-based labeled topics in forms of unsupervised learning approach, named as TopCite. Then, we base on the constructed graph-based paper–topic structure to identify their relevancy levels. Upon the identified relevancy levels between papers, we can support for improving the accuracy performance of other bibliographic network mining tasks, such as paper similarity measurement, recommendation, etc. Extensive experiments in real-world AMiner bibliographic dataset demonstrate the effectiveness of our proposed ideas in this paper. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

21. Reconfigurable intelligent surface based hybrid precoding for THz communications (invited paper)

Author: Yu Lu, Mo Hao, and Richard Mackenzie
Subjects: reconfigurable intelligent surface (ris), thz communication, massive multiple-input multiple-output (mimo), hybrid precoding, deep learning, Telecommunication, TK5101-6720
Abstract: Benefiting from the growth of the bandwidth, Terahertz (THz) communication can support the new application with explosive requirements of the ultra-high-speed rates for future 6G wireless systems. In order to compensate for the path loss of high frequency, massive Multiple-Input Multiple-Output (MIMO) can be utilized for high array gains by beamforming. However, the existing THz communication with massive MIMO has remarkably high energy consumption because a large number of analog phase shifters should be used to realize the analog beamforming. To solve this problem, a Reconfigurable Intelligent Surface (RIS) based hybrid precoding architecture for THz communication is developed in this paper, where the energy-hungry phased array is replaced by the energy-efficient RIS to realize the analog beamforming of the hybrid precoding. Then, based on the proposed RIS-based architecture, a sum-rate maximization problem for hybrid precoding is investigated. Since the phase shifts implemented by RIS in practice are often discrete, this sum-rate maximization problem with a non-convex constraint is challenging. Next, the sum-rate maximization problem is reformulated as a parallel Deep Neural Network (DNN) based classification problem, which can be solved by the proposed low-complexity Deep Learning based Multiple Discrete Classification (DL-MDC) hybrid precoding scheme. Finally, we provide numerous simulation results to show that the proposed DL-MDC scheme works well both in the theoretical Saleh-Valenzuela channel model and practical 3GPP channel model. Compared with existing iterative search algorithms, the proposed DL-MDC scheme significantly reduces the runtime with a negligible performance loss.
Published: 2022
Full Text: View/download PDF

22. Special Issue "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023".

Author: Kim, Byung-Seo, Afzal, Muhammad Khalil, and Ullah, Rehmat
Subjects: *MULTICASTING (Computer networks), *INFORMATION technology, *SENSOR networks, *ARTIFICIAL neural networks, *DEEP learning, *BEAM steering, *INTEGRATED circuit design, *COMPUTER network security
Abstract: This document is a summary of a special issue of the journal Sensors, titled "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023." The special issue features selected papers from the 10th and 11th International Conferences on Green and Human Information Technology (ICGHITs), which were held in Korea and Thailand. The conferences focused on the theme of "Emerging Artificial Intelligent (AI)+X technology" and "Hyper Automation + Human AI" respectively. The selected papers cover various topics such as network security, routing protocols, signal detection, and clustering mechanisms, all incorporating AI-based methods. The issue also includes papers on topics like secure authentication, distance estimation in RFID systems, energy optimization in smart homes, blockchain technology, and radar signal detection. The authors emphasize the importance of both technology and humanity in advancing green and information technologies. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

23. Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper.

Author: Vinodkumar, Prasoon Kumar, Karabulut, Dogus, Avots, Egils, Ozcinar, Cagri, and Anbarjafari, Gholamreza
Subjects: *DEEP learning, *COMPUTER vision, *GRAPH neural networks, *ARTIFICIAL intelligence, *MACHINE learning, *GENERATIVE adversarial networks
Abstract: The research groups in computer vision, graphics, and machine learning have dedicated a substantial amount of attention to the areas of 3D object reconstruction, augmentation, and registration. Deep learning is the predominant method used in artificial intelligence for addressing computer vision challenges. However, deep learning on three-dimensional data presents distinct obstacles and is now in its nascent phase. There have been significant advancements in deep learning specifically for three-dimensional data, offering a range of ways to address these issues. This study offers a comprehensive examination of the latest advancements in deep learning methodologies. We examine many benchmark models for the tasks of 3D object registration, augmentation, and reconstruction. We thoroughly analyse their architectures, advantages, and constraints. In summary, this report provides a comprehensive overview of recent advancements in three-dimensional deep learning and highlights unresolved research areas that will need to be addressed in the future. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. A fully-automated paper ECG digitisation algorithm using deep learning.

Author: Wu, Huiyi, Patel, Kiran Haresh Kumar, Li, Xinyang, Zhang, Bowen, Galazis, Christoforos, Bajaj, Nikesh, Sau, Arunashis, Shi, Xili, Sun, Lin, Tao, Yanda, Al-Qaysi, Harith, Tarusan, Lawrence, Yasmin, Najira, Grewal, Natasha, Kapoor, Gaurika, Waks, Jonathan W., Kramer, Daniel B., Peters, Nicholas S., and Ng, Fu Siong
Subjects: *DEEP learning, *ELECTROCARDIOGRAPHY, *ELECTRONIC paper, *ATRIAL fibrillation, *ALGORITHMS, *HEART failure, *HEART rate monitors
Abstract: There is increasing focus on applying deep learning methods to electrocardiograms (ECGs), with recent studies showing that neural networks (NNs) can predict future heart failure or atrial fibrillation from the ECG alone. However, large numbers of ECGs are needed to train NNs, and many ECGs are currently only in paper format, which are not suitable for NN training. We developed a fully-automated online ECG digitisation tool to convert scanned paper ECGs into digital signals. Using automated horizontal and vertical anchor point detection, the algorithm automatically segments the ECG image into separate images for the 12 leads and a dynamical morphological algorithm is then applied to extract the signal of interest. We then validated the performance of the algorithm on 515 digital ECGs, of which 45 were printed, scanned and redigitised. The automated digitisation tool achieved 99.0% correlation between the digitised signals and the ground truth ECG (n = 515 standard 3-by-4 ECGs) after excluding ECGs with overlap of lead signals. Without exclusion, the performance of average correlation was from 90 to 97% across the leads on all 3-by-4 ECGs. There was a 97% correlation for 12-by-1 and 3-by-1 ECG formats after excluding ECGs with overlap of lead signals. Without exclusion, the average correlation of some leads in 12-by-1 ECGs was 60–70% and the average correlation of 3-by-1 ECGs achieved 80–90%. ECGs that were printed, scanned, and redigitised, our tool achieved 96% correlation with the original signals. We have developed and validated a fully-automated, user-friendly, online ECG digitisation tool. Unlike other available tools, this does not require any manual segmentation of ECG signals. Our tool can facilitate the rapid and automated digitisation of large repositories of paper ECGs to allow them to be used for deep learning projects. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

25. Toward Interactive Music Generation: A Position Paper

Author: Shayan Dadman, Bernt Arild Bremdal, Borre Bang, and Rune Dalmo
Subjects: Deep learning, multi-agent systems, music composition, music creativity, music generation, music information retrieval, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Music generation using deep learning has received considerable attention in recent years. Researchers have developed various generative models capable of imitating musical conventions, comprehending the musical corpora, and generating new samples based on the learning outcome. Although the samples generated by these models are persuasive, they often lack musical structure and creativity. For instance, a vanilla end-to-end approach, which deals with all levels of music representation at once, does not offer human-level control and interaction during the learning process, leading to constrained results. Indeed, music creation is a recurrent process that follows some principles by a musician, where various musical features are reused or adapted. On the other hand, a musical piece adheres to a musical style, breaking down into precise concepts of timbre style, performance style, composition style, and the coherency between these aspects. Here, we study and analyze the current advances in music generation using deep learning models through different criteria. We discuss the shortcomings and limitations of these models regarding interactivity and adaptability. Finally, we draw the potential future research direction addressing multi-agent systems and reinforcement learning algorithms to alleviate these shortcomings and limitations.
Published: 2022
Full Text: View/download PDF

26. SMR–YOLO: Multi-Scale Detection of Concealed Suspicious Objects in Terahertz Images.

Author: Zhang, Yuan, Chen, Hao, Ge, Zihao, Jiang, Yuying, Ge, Hongyi, Zhao, Yang, and Xiong, Haotian
Subjects: OBJECT recognition (Computer vision), PUBLIC spaces, WRAPPING materials, KRAFT paper, DETECTION alarms
Abstract: The detection of concealed suspicious objects in public places is a critical issue and a popular research topic. Terahertz (THz) imaging technology, as an emerging detection method, can penetrate materials without emitting ionizing radiation, providing a new approach to detecting concealed suspicious objects. This study focuses on the detection of concealed suspicious objects wrapped in different materials such as polyethylene and kraft paper, including items like scissors, pistols, and blades, using THz imaging technology. To address issues such as the lack of texture details in THz images and the contour similarity of different objects, which can lead to missed detections and false alarms, we propose a THz concealed suspicious object detection model based on SMR–YOLO (SPD_Mobile + RFB + YOLO). This model, based on the MobileNext network, introduces the spatial-to-depth convolution (SPD-Conv) module to replace the backbone network, reducing computational and parameter load. The inclusion of the receptive field block (RFB) module, which uses a multi-branch structure of dilated convolutions, enhances the network's depth features. Using the EIOU loss function to assess the accuracy of predicted box localization further optimizes convergence speed and localization accuracy. Experimental results show that the improved model achieved mAP@0.5 and mAP@0.5:0.95 scores of 98.9% and 89.4%, respectively, representing improvements of 0.2% and 1.8% over the baseline model. Additionally, the detection speed reached 108.7 FPS, an improvement of 23.2 FPS over the baseline model. The model effectively identifies concealed suspicious objects within packages, offering a novel approach for detection in public places. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. fNIRS Signal Classification Based on Deep Learning in Rock-Paper-Scissors Imagery Task

Author: Tengfei Ma, Wentian Chen, Xin Li, Yuting Xia, Xinhua Zhu, and Sailing He
Subjects: BCI, fNIRS, CNN, TSC, rock–paper–scissors, deep learning, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: To explore whether the brain contains pattern differences in the rock–paper–scissors (RPS) imagery task, this paper attempts to classify this task using fNIRS and deep learning. In this study, we designed an RPS task with a total duration of 25 min and 40 s, and recruited 22 volunteers for the experiment. We used the fNIRS acquisition device (FOIRE-3000) to record the cerebral neural activities of these participants in the RPS task. The time series classification (TSC) algorithm was introduced into the time-domain fNIRS signal classification. Experiments show that CNN-based TSC methods can achieve 97% accuracy in RPS classification. CNN-based TSC method is suitable for the classification of fNIRS signals in RPS motor imagery tasks, and may find new application directions for the development of brain–computer interfaces (BCI).
Published: 2021
Full Text: View/download PDF

28. A Hybrid Model Based on LFM and BiGRU Toward Research Paper Recommendation

Author: Xu Zhao, Hui Kang, Tie Feng, Chenkun Meng, and Ziqing Nie
Subjects: Recommender systems, deep learning, LFM, BiGRU, user attention, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: To improve the accuracy of user implicit rating prediction, we combine the traditional latent factor model (LFM) and bidirectional gated recurrent unit neural network (BiGRU) model to propose a hybrid model that deeply mines the latent semantics in the unstructured content of the text and generates a more accurate rating matrix. First, we utilize the user’s historical behavior (favorites records) to build a user rating matrix and decompose the matrix to obtain the latent factor vectors of users and literature. We also apply the BERT model for word embedding of the research papers to obtain the sequence of word vectors. Then, we apply the BiGRU with the user attention mechanism to mine the research paper textual content and to generate the new literature latent feature vectors that are used to replace the original literature latent factor vectors decomposed from the rating matrix. Finally, a new rating matrix is generated to obtain users’ ratings of noninteractive research papers and to generate the recommendation list according to the user latent factor vector. We design experiments on the real datasets and verify that the research paper recommendation model is superior to traditional recommendation models in terms of precision, recall, F1-value, coverage, popularity and diversity.
Published: 2020
Full Text: View/download PDF

29. Object Detection Model, Image Data and Results from the 'When Computers Dream of Charcoal: Using Deep Learning, Open Tools and Open Data to Identify Relict Charcoal Hearths in and Around State Game Lands in Pennsylvania' Paper

Author: Jeff Blackadar, Benjamin Carter, and Weston Conner
Subjects: relict charcoal hearth, rch, mask r-cnn, pennsylvania, digital elevation model, iron production, deep learning, Archaeology, CC1-960
Abstract: These data were used to build an object detection model to locate Relict Charcoal Hearths (RCH) as described in the paper “When Computers Dream of Charcoal: Using Deep Learning, Open Tools and Open Data to Identify Relict Charcoal Hearths in and around State Game Lands in Pennsylvania” [1]. This is the second grouping of data for the paper above. The first grouping is also available in this journal, see “Geospatial and image data from the “When Computers Dream of Charcoal: Using Deep Learning, Open Tools and Open Data to Identify Relict Charcoal Hearths in and around State Game Lands in Pennsylvania” paper” [2]. These files consist of: JPEGs representing tiles of larger Slope TIFF files derived from LiDAR for the State Game Lands (SGL) of Pennsylvania, United States [3456]. A subset of these tiles was used to train the model. A Shapefile of points of known relict charcoal hearths (RCH). XML files representing the pixel points of known RCHs on JPEG files used for training. Jupyter notebooks of programs used to prepare data and train a Mask R-CNN model. The Mask R-CNN model H5 file. Shapefile and GeoJSON of object detection results from the model showing locations of possible RCH in all SGLs. XML files representing the pixel points of predicted RCH on JPEG files used for predictions. GeoJSON of results using cluster analysis. These data are stored onzenodo.org. The programs are stored on Github.com.
Published: 2021
Full Text: View/download PDF

30. Geospatial and Image Data from the 'When Computers Dream of Charcoal: Using Deep Learning, Open Tools and Open Data to Identify Relict Charcoal Hearths in and Around State Game Lands in Pennsylvania' Paper

Author: Weston Conner, Benjamin Carter, and Jeff Blackadar
Subjects: relict charcoal hearth, rch, mask r-cnn, pennsylvania, digital elevation model, iron production, deep learning, Archaeology, CC1-960
Abstract: These data were used to build an object detection model to locate Relict Charcoal Hearths (RCH) as described in the paper “When Computers Dream of Charcoal: Using Deep Learning, Open Tools and Open Data to Identify Relict Charcoal Hearths in and around State Game Lands in Pennsylvania” [1]. This is the first grouping of data for the paper above. The second grouping is also available in this journal, see “Object detection model, image data and results from the “When Computers Dream of Charcoal: Using Deep Learning, Open Tools and Open Data to Identify Relict Charcoal Hearths in and around State Game Lands in Pennsylvania” paper”. These files consist of georeferenced Digital Elevation Model, Hillshade and Slope files derived from LiDAR for the State Game Lands (SGL). Included is a Shapefile and GeoJSON of State Game Land borders as well as the program used for downloading the LiDAR files. These data are stored on 'Zenodo.org'.
Published: 2021
Full Text: View/download PDF

31. Critical appraisal of a machine learning paper: A guide for the neurologist

Author: Pulikottil W Vinny, Rahul Garg, M V Padma Srivastava, Vivek Lal, and Venugoapalan Y Vishnu
Subjects: critical appraisal, deep learning, machine learning, neural networks, Neurology. Diseases of the nervous system, RC346-429
Abstract: Machine learning (ML), a form of artificial intelligence (AI), is being increasingly employed in neurology. Reported performance metrics often match or exceed the efficiency of average clinicians. The neurologist is easily baffled by the underlying concepts and terminologies associated with ML studies. The superlative performance metrics of ML algorithms often hide the opaque nature of its inner workings. Questions regarding ML model's interpretability and reproducibility of its results in real-world scenarios, need emphasis. Given an abundance of time and information, the expert clinician should be able to deliver comparable predictions to ML models, a useful benchmark while evaluating its performance. Predictive performance metrics of ML models should not be confused with causal inference between its input and output. ML and clinical gestalt should compete in a randomized controlled trial before they can complement each other for screening, triaging, providing second opinions and modifying treatment.
Published: 2021
Full Text: View/download PDF

32. Deep Learning on Medical Imaging in Identifying Kidney Stones: Review Paper

Author: Sulaksono Nanang, Adi Kusworo, and Isnanto dan Rizal
Subjects: artificial intelligence, deep learning, medical imaging, kidney stones, Environmental sciences, GE1-350
Abstract: Medical imaging is currently using artificial intelligence-based technologies to aid evaluate diagnostic information images, particularly in enforcing kidney stones. Artificial intelligence technology continues to develop, many studies show that deep learning is more widely used compared to traditional machine learning, so an Artificial intelligence system is needed to assist the accuracy of health diagnoses, thus helping in the field of radiology health. The aim of the research is to use artificial intelligence with deep learning models to help detect abnormalities in the kidneys. This research method is a literature review of Scopus data related to deep learning in medical imaging in detecting kidney stones. The results of using Artificial Intelligence in medical imaging can be used in diagnosing diseases including detecting Covid-19, musculoskeletal, calcium scores on Cardiac CT, liver tumors, urinary tract lesions, examination of the abdomen and kidney stones. Utilization of Artificial Intelligence in detecting kidney stones can be done with various classification models including XResNet-50, ExDark19, CystoNet, CNN, ANN. Using the right model and having a high accuracy value can help radiologists to accurately detect kidney stones.
Published: 2023
Full Text: View/download PDF

33. Special issue on intelligent systems: ISMIS 2022 selected papers.

Author: Ceci, Michelangelo, Flesca, Sergio, Manco, Giuseppe, and Masciari, Elio
Subjects: MACHINE learning, ARTIFICIAL intelligence, DECISION support systems, KNOWLEDGE representation (Information theory), COMPUTER vision, DEEP learning
Abstract: This document is a special issue of the Journal of Intelligent Information Systems, focusing on the selected papers from the International Symposium on Methodologies for Intelligent Systems (ISMIS 2022). The symposium, held in Cosenza, Italy, showcased research on various topics related to artificial intelligence, including decision support, knowledge representation, machine learning, computer vision, and more. The special issue includes eleven papers that have undergone rigorous peer-reviewing and cover a wide range of research topics, such as deep learning, anomaly detection, malware detection, sentiment classification, and healthcare professionals' burnout. The authors express their gratitude to the contributors and reviewers for their valuable contributions. [Extracted from the article]
Published: 2024
Full Text: View/download PDF

34. A Review Paper about Deep Learning for Medical Image Analysis.

Author: Sistaninejhad, Bagher, Rasi, Habib, and Nayeri, Parisa
Subjects: *DEEP learning, *COMPUTER-assisted image analysis (Medicine), *IMAGE analysis, *DIAGNOSTIC imaging, *COMPUTER-aided diagnosis, *CONVOLUTIONAL neural networks
Abstract: Medical imaging refers to the process of obtaining images of internal organs for therapeutic purposes such as discovering or studying diseases. The primary objective of medical image analysis is to improve the efficacy of clinical research and treatment options. Deep learning has revamped medical image analysis, yielding excellent results in image processing tasks such as registration, segmentation, feature extraction, and classification. The prime motivations for this are the availability of computational resources and the resurgence of deep convolutional neural networks. Deep learning techniques are good at observing hidden patterns in images and supporting clinicians in achieving diagnostic perfection. It has proven to be the most effective method for organ segmentation, cancer detection, disease categorization, and computer-assisted diagnosis. Many deep learning approaches have been published to analyze medical images for various diagnostic purposes. In this paper, we review the work exploiting current state-of-the-art deep learning approaches in medical image processing. We begin the survey by providing a synopsis of research works in medical imaging based on convolutional neural networks. Second, we discuss popular pretrained models and general adversarial networks that aid in improving convolutional networks' performance. Finally, to ease direct evaluation, we compile the performance metrics of deep learning models focusing on COVID-19 detection and child bone age prediction. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

35. Short-term train arrival delay prediction: a data-driven approach

Author: Fu, Qingyun, Ding, Shuxin, Zhang, Tao, Wang, Rongsheng, Hu, Ping, and Pu, Cunlai
Published: 2024
Full Text: View/download PDF

36. An update on computational pathology tools for genitourinary pathology practice: A review paper from the Genitourinary Pathology Society (GUPS).

Author: Parwani, Anil V., Patel, Ankush, Ming Zhou, Cheville, John C., Tizhoosh, Hamid, Humphrey, Peter, Reuter, Victor E., and True, Lawrence D.
Subjects: *DEEP learning, *ITERATIVE learning control, *PATHOLOGY, *IMAGE analysis, *MACHINE learning
Abstract: Machine learning has been leveraged for image analysis applications throughout a multitude of subspecialties. This position paper provides a perspective on the evolutionary trajectory of practical deep learning tools for genitourinary pathology through evaluating the most recent iterations of such algorithmic devices. Deep learning tools for genitourinary pathology demonstrate potential to enhance prognostic and predictive capacity for tumor assessment including grading, staging, and subtype identification, yet limitations in data availability, regulation, and standardization have stymied their implementation. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

37. SiLK-SLAM: accurate, robust and versatile visual SLAM with simple learned keypoints

Author: Yao, Jianjun and Li, Yingzhao
Published: 2024
Full Text: View/download PDF

38. A Comparison between Digital-Game-Based and Paper-Based Learning for EFL Undergraduate Students' Vocabulary Learning †.

Author: Sianturi, Alex Dharmawan and Hung, Ruei-Tang
Subjects: ENGLISH as a foreign language, ARTIFICIAL intelligence, DEEP learning, DIGITAL technology, VOCABULARY
Abstract: This research aimed to compare two strategies for vocabulary learning (digital-game-based learning and paper-based learning). The research was conducted during the first semester of the academic year 2022/2023. A total of 40 EFL undergraduate students within the Applied English Program of a private university located in the middle part of Taiwan were selected and divided into two groups: digital-based (n = 20) and paper-based (n = 20). The instrument developed by the researcher was pre- and post-vocabulary tests for both groups. The pre-vocabulary test was implemented to identify the level of students' prior knowledge of vocabulary mastery. For the intervention, Kahoot! quiz exercises were conducted weekly for the digital-game-based group, while the paper-based group received the same quiz on paper every week. The post-vocabulary tests showed no significant difference between the students using digital-game-based quizzes and paper-based quizzes during the six-week intervention. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

39. CiteOpinion: Evidence-based Evaluation Tool for Academic Contributions of Research Papers Based on Citing Sentences.

Author: Le, Xiaoqiu, Chu, Jingdan, Deng, Siyi, Jiao, Qihang, Pei, Jingjing, Zhu, Liya, and Yao, Junliang
Subjects: UNIVERSITY research, SENTIMENT analysis, COLLEGE majors, DEEP learning
Abstract: To uncover the evaluation information on the academic contribution of research papers cited by peers based on the content cited by citing papers, and to provide an evidence-based tool for evaluating the academic value of cited papers. CiteOpinion uses a deep learning model to automatically extract citing sentences from representative citing papers; it starts with an analysis on the citing sentences, then it identifies major academic contribution points of the cited paper, positive/negative evaluations from citing authors and the changes in the subjects of subsequent citing authors by means of Recognizing Categories of Moves (problems, methods, conclusions, etc.), and sentiment analysis and topic clustering. Citing sentences in a citing paper contain substantial evidences useful for academic evaluation. They can also be used to objectively and authentically reveal the nature and degree of contribution of the cited paper reflected by citation, beyond simple citation statistics. The evidence-based evaluation tool CiteOpinion can provide an objective and in-depth academic value evaluation basis for the representative papers of scientific researchers, research teams, and institutions. No other similar practical tool is found in papers retrieved. There are difficulties in acquiring full text of citing papers. There is a need to refine the calculation based on the sentiment scores of citing sentences. Currently, the tool is only used for academic contribution evaluation, while its value in policy studies, technical application, and promotion of science is not yet tested. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

40. Research Trends in Artificial Intelligence and Security—Bibliometric Analysis.

Author: Ilić, Luka, Šijan, Aleksandar, Predić, Bratislav, Viduka, Dejan, and Karabašević, Darjan
Subjects: DEEP learning, BIBLIOMETRICS, ARTIFICIAL intelligence, WEB analytics, MACHINE learning, PUBLIC health infrastructure
Abstract: This paper provides a bibliometric analysis of current research trends in the field of artificial intelligence (AI), focusing on key topics such as deep learning, machine learning, and security in AI. Through the lens of bibliometric analysis, we explore publications published from 2020 to 2024, using primary data from the Clarivate Analytics Web of Science Core Collection. The analysis includes the distribution of studies by year, the number of studies and citation rankings in journals, and the identification of leading countries, institutions, and authors in the field of AI research. Additionally, we investigate the distribution of studies by Web of Science categories, authors, affiliations, publication years, countries/regions, publishers, research areas, and citations per year. Key findings indicate a continued growth of interest in topics such as deep learning, machine learning, and security in AI over the past few years. We also identify leading countries and institutions active in researching this area. Awareness of data security is essential for the responsible application of AI technologies. Robust security frameworks are important to mitigate risks associated with AI integration into critical infrastructure such as healthcare and finance. Ensuring the integrity and confidentiality of data managed by AI systems is not only a technical challenge but also a societal necessity, demanding interdisciplinary collaboration and policy development. This analysis provides a deeper understanding of the current state of research in the field of AI and identifies key areas for further research and innovation. Furthermore, these findings may be valuable to practitioners and decision-makers seeking to understand current trends and innovations in AI to enhance their business processes and practices. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

41. Blockchain-based deep learning in IoT, healthcare and cryptocurrency price prediction: a comprehensive review

Author: Arora, Shefali, Mittal, Ruchi, Shrivastava, Avinash K., and Bali, Shivani
Published: 2024
Full Text: View/download PDF

42. Research and design of an expert diagnosis system for rail vehicle driven by data mechanism models

Author: Li, Lin, Wang, Jiushan, and Xiao, Shilu
Published: 2024
Full Text: View/download PDF

43. Fault detection system for paper cup machine based on real-time image processing.

Author: Aydın, Alaaddin and Güney, Selda
Subjects: *PROGRAMMABLE controllers, *OBJECT recognition (Computer vision), *SERVOMECHANISMS, *ARTIFICIAL intelligence, *DEEP learning, *DIGITAL image processing, *PRODUCT image, *IMAGE processing
Abstract: In the production of paper cups in industrial factories, it is tried to print high quality cups with less waste loss with the help of sensors and heating resistances mounted on the paper cup machine. In this study, a system that detects faulty products based on image processing and removes it by controlling the machine with servo motors, asynchronous motors and programmable logic controller (PLC) is designed. For fault product detection, classification has been performed using real-time Haarcascade algorithm and You Only Look Once (YOLO) algorithm which is a deep learning methods, and real-time object detection has been carried out using the OpenCv library. With this study, an effective faulty product detection and removing hardware system was realized by adapting artificial intelligence algorithms to a machine used in industry. Based on the results, a whole system can be applied to systems that involve removing a faulty product from a band in any production, packaging etc. facility is proposed. A hardware consisting of servo motors, asynchronous motors and PLC was designed to separate faulty cups from the existing paper cup production machine in this study. Then, a data set composed of 1068 images was created with images taken from the camera for faulty and faultless paper cups. Using this dataset, the effect of different deep learning methods on performance in the real-time system has been examined and successful results have been obtained. The optimal outcome was achieved, yielding a real-time application accuracy rate of 90.8% through the utilization of the Yolov5x architecture. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. Joint torque prediction of industrial robots based on PSO-LSTM deep learning

Author: Xiao, Wei, Fu, Zhongtao, Wang, Shixian, and Chen, Xubing
Published: 2024
Full Text: View/download PDF

45. AI Machine Vision based Oven White Paper Color Classification and Label Position Real-time Monitoring System to Check Direction.

Author: Hee-Chul Kim, Youn-Saup Yoon, and Yong-Mo Kim
Subjects: COMPUTER vision, DEEP learning, JOB classification, MANUFACTURING process automation, ARTIFICIAL intelligence, COLOR image processing
Abstract: We develop a vision system for batch inspection by oven white paper model color by manufacturing a machine vision system for the oven manufacturing automation process. In the vision system, white paper object detection (spring), color clustering, and histogram extraction are performed. In addition, for the automated process of home appliances, we intend to develop an automatic mold combination detection algorithm that inspects the label position and direction (angle/coordinate) using deep learning. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

46. Image Processing Technique for Authentication of Indian Paper Currency.

Author: Colaco, Rencita Maria, V. G., Narendra, and B. V., Ravindra
Subjects: DEEP learning, IMAGE processing, IMAGE segmentation, REAL economy, EDGE detection (Image processing), GRAYSCALE model, HARD currencies, BUSINESSMEN
Abstract: As we all know day by day the technology is getting better and better, the production of counterfeit currency has been rapidly increasing. The counterfeit currency problem is faced by almost all countries. Since the real economy is affected, it has affected the economy of the country. Even when the drastic step of demonetization was taken in 2016 to overcome counterfeit currency, this problem did not end. The only one solution for this problem for a common man is to detect the fake currency, by using the fake currency detector machine. These machines are used in banks and large scale business, but for small scale businesses or for a common man these machines are not affordable. There are lot of researches taking place on this matter by using deep learning, image processing and machine learning techniques. This paper gives the complete methodology of fake note detector machine, which is affordable even for a common man. By implementing the applications of image processing techniques we can find out whether the currency notes are fake or not. Image processing technique consists of a number of operations that can be performed on an image, some of which include image segmentation, edge detection, gray scale conversion, preprocessing etc. The proposed system will detect the counterfeit currency of new denominations by distinguishing each denomination based on its size and depending on the features of each currency the comparison takes place. Based on the features matched, it detects whether the currency is counterfeit or not. The system have advantages like simplicity, reliability and cost effective. Which is affordable by a common man since the common man is the one who will be effected most, when the counterfeit currency are circulated in the market because he has to pay the real value of that currency. [ABSTRACT FROM AUTHOR]
Published: 2023

47. MLCAD: A Survey of Research in Machine Learning for CAD Keynote Paper.

Author: Rapp, Martin, Amrouch, Hussam, Lin, Yibo, Yu, Bei, Pan, David Z., Wolf, Marilyn, and Henkel, Jorg
Subjects: *MACHINE learning, *CIRCUIT complexity, *COMPUTER-aided design, *ARTIFICIAL neural networks, *INTEGRATED circuits, *CONFIGURATION space, *MULTICASTING (Computer networks)
Abstract: Due to the increasing size of integrated circuits (ICs), their design and optimization phases (i.e., computer-aided design, CAD) grow increasingly complex. At design time, a large design space needs to be explored to find an implementation that fulfills all specifications and then optimizes metrics like energy, area, delay, reliability, etc. At run time, a large configuration space needs to be searched to find the best set of parameters (e.g., voltage/frequency) to further optimize the system. Both spaces are infeasible for exhaustive search typically leading to heuristic optimization algorithms that find some tradeoff between design quality and computational overhead. Machine learning (ML) can build powerful models that have successfully been employed in related domains. In this survey, we categorize how ML may be used and is used for design-time and run-time optimization and exploration strategies of ICs. A metastudy of published techniques unveils areas in CAD that are well explored and underexplored with ML, as well as trends in the employed ML algorithms. We present a comprehensive categorization and summary of the state of the art on ML for CAD. Finally, we summarize the remaining challenges and promising open research directions. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

48. Integrated smart analytics of nucleic acid amplification tests via paper microfluidics and deep learning in cloud computing.

Author: Sun, Hao, Jiang, Qinghua, Huang, Yi, Mo, Jin, Xie, Wantao, Dong, Hui, and Jia, Yuan
Subjects: NUCLEIC acid amplification techniques, DEEP learning, CLOUD computing, MICROFLUIDICS, ARTIFICIAL intelligence, MESSENGER RNA, INTEGRATED learning systems
Abstract: [Display omitted] • Deep learning enabled predictive nucleic acid amplification tests analysis. • On-site fluorescence signal processing powered by cloud computing. • Accurate prediction can be obtained using the early 22.5% data. • Approach can be universally extended to other areas of biomedical research. Pandemics such as COVID-19 have exposed global inequalities in essential health care. Here, we proposed a novel analytics of nucleic acid amplification tests (NAATs) by combining paper microfluidics with deep learning and cloud computing. Real-time amplifications of synthesized SARS-CoV-2 RNA templates were performed in paper devices. Information pertained to on-chip reactions in time-series format were transmitted to cloud server on which deep learning (DL) models were preloaded for data analysis. DL models enable prediction of NAAT results using partly gathered real-time fluorescence data. Using information provided by the G-channel, accurate prediction can be made as early as 9 min, a 78% reduction from the conventional 40 min mark. Reaction dynamics hidden in amplification curves were effectively leveraged. Positive and negative samples can be unbiasedly and automatically distinguished. Practical utility of the approach was validated by cross-platform study using clinical datasets. Predicted clinical accuracy, sensitivity and specificity were 98.6%, 97.6% and 99.1%. Not only the approach reduced the need for the use of bulky apparatus, but also provided intelligent, distributable and robotic insights for NAAT analysis. It set a novel paradigm for analyzing NAATs, and can be combined with the most cutting-edge technologies in fields of biosensor, artificial intelligence and cloud computing to facilitate fundamental and clinical research. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

49. Deep learning based beamforming for MISO systems with dirty‐paper coding.

Author: Lou, Xingliang, Xia, Wenchao, Wen, Wanli, Zhao, Haitao, Li, Xiaohui, and Wang, Bin
Subjects: *DEEP learning, *BEAMFORMING, *MISO, *SIGNAL processing, *COMPUTATIONAL complexity
Abstract: Beamforming technique can effectively improve the spectrum utilization in the multi‐antenna systems, while the dirty‐paper coding (DPC) technique can reduce the inter‐user interference. In this letter, it is aimed to maximize the weighted sum‐rate under the total power constraint in the multiple‐input‐single‐output (MISO) system with the DPC technique. However, the existing methods of beamforming optimization mainly rely on customized iterative algorithms, which have high computational complexity. To address this issue, the beamforming neural network (BFNNet) is devised by utilizing the deep learning technique and the uplink‐downlink duality and exploring the optimal solution structure, which includes the deep neural network module and the signal processing module. Simulation results show that the BFNNet can achieve near‐optimal solutions and significantly reduce computational complexity. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

50. A Digital Simulation and Re-Editing Method for Clothing Patterns Based on Deep Learning and Somatosensory Interaction.

Author: Sun, Haiyan, Yao, Jiali, Zhang, Haoyu, Li, Zhijun, and Cai, Xingquan
Subjects: DEEP learning, DIGITAL computer simulation, ELECTRONIC paper, SIGNAL-to-noise ratio, ETHNIC costume
Abstract: To address the issues in clothing pattern style migration, this paper proposes a digital simulation and re-editing method for clothing patterns based on deep learning and somatosensory interaction. First, the proposed method encodes the black-and-white line drawing image, generating random noise images through a diffusion process, introducing color information for synthesis, and using a decoder to reconstruct a colored image. Afterwards, an improved VGG19 model is used to reconstruct content features and perform linear color transformation on style images, enabling pattern style migration through the construction of a Gram matrix and resulting in colored clothing texture patterns. Finally, a KinectV2 is utilized for fabric simulation, overlaying colorful clothing texture patterns to achieve 3D virtual dressing. The experimental results show that the proposed method improves the structural similarity index measure (SSIM) by 9–11% and the peak signal-to-noise ratio (PSNR) by 3–8% when compared to existing algorithms. The experiments provide evidence that the proposed method effectively mitigates color overflow, delivers precise image coloring, and accomplishes realistic restoration of clothing texture. Furthermore, the method offers an improved garment fit to fulfill the user's interaction requirements. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

79,469 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources