127 results on '"Image domain"'
Search Results
2. Intra Coding With Geometric Information for Urban Building Scenes
- Author
-
Hua Bao, Aibin Yan, and Qijun Wang
- Subjects
Reference software ,Image domain ,business.industry ,Computer science ,Image content ,02 engineering and technology ,Perspective transformation ,Urban building ,Coding block ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,Scale shift ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Coding (social sciences) - Abstract
Regarding the interior non-local correlations of image content, the intra predictions in image/video coding standards have still not exploited them efficiently. In this article, we propose an intra coding method that leverages the geometric information extracted from the image content to strengthen the coding performance. During the process of image capturing, if the imaging plane of the camera is not fronto-parallel to the surface of an object in reality, the repetitive patterns on the object’s surface in real world will appear with a scale shift in the image domain. Thus, the corresponding potential non-local redundancy cannot be removed through the direct utilization of intra block copy and its variants. To address this problem, we begin with a theoretic analysis into the perspective transformation matrix and derive the underlying geometric information that causes the scale shift in the image domain; afterwards, we propose an intra coding framework based on the geometric information to alleviate the scale shift. It is essential for the coding framework to realize block matching in the rectified domain. Our framework mainly consists of two components, i.e., the intra block copy in the rectified domain through planar perspective transformation to obtain a more accurate prediction for the current coding block and the non-local post-processing filtering in the rectified domain after decompression to achieve a better quality of the reconstructed image. The experimental results show that our proposed method can achieve as high as 18% and an average of 5.0% bit-rate saving on the common dataset Urban100 compared to the HEVC reference software HM-SCC-extension with intra block copy and non-local post-processing filtering.
- Published
- 2021
- Full Text
- View/download PDF
3. Review of Steganography Algorithms
- Author
-
Ibrahim Mahmood Rashid and Ali Mohammed Ahmed
- Subjects
Image domain ,Theoretical computer science ,Steganography ,Computer science ,business.industry ,0202 electrical engineering, electronic engineering, information engineering ,020206 networking & telecommunications ,020201 artificial intelligence & image processing ,02 engineering and technology ,Encryption ,business - Abstract
The ability to hide plays a vital role in effective secret communication. This is achieved by hiding information (Steganography). The science of concealing information is the science of concealing information in other information so that it appears that hidden information is not something to the human eye. There are many ways to hide information within an image, audio / video, document, etc. But hiding information in pictures has its own characteristics and is the most popular among others. This paper provides a review of several methods, such as image field and conversion field algorithms available to implement image information hiding (Steganography). In this paper, high-capacity information hidings schemes are analyzed for different file formats. Secret communication is done before Password encryption to protect information. The intended recipient will decrypt the information using this password.
- Published
- 2020
- Full Text
- View/download PDF
4. Geometric Processing and Enhancement: Image Domain Techniques
- Author
-
John A. Richards
- Subjects
Image domain ,business.industry ,Computer science ,Digital image processing ,Computer vision ,Image processing ,Artificial intelligence ,Spatial frequency ,Noise (video) ,business ,Texture (geology) ,Image (mathematics) ,Feature detection (computer vision) - Abstract
This chapter presents methods that allow us to analyse or modify the geometric properties of an image. Our attention, first, is on techniques for filtering images to remove noise or to enhance geometric detail. We will then look at means by which we can characterise geometric properties like texture, and processes that allow us to analyse objects and shapes that appear in imagery.
- Published
- 2022
- Full Text
- View/download PDF
5. Deep Transfer Learning Based Human Activity Recognition By Transforming IMU Data To Image Domain Using Novel Activity Image Creation Method
- Author
-
R. Amutha and Mohammed Hashim. B. A
- Subjects
Activity recognition ,Image domain ,Statistics and Probability ,business.industry ,Computer science ,Inertial measurement unit ,Artificial Intelligence ,General Engineering ,Computer vision ,Artificial intelligence ,business ,Transfer of learning ,Image (mathematics) - Abstract
Human Activity Recognition (HAR) is the most popular research area in the pervasive computing field in recent years. Sensor data plays a vital role in identifying several human actions. Convolutional Neural Networks (CNNs) have now become the most recent technique in the computer vision phenomenon, but still, it is premature to use CNN for sensor data, particularly in ubiquitous and wearable computing.Deep CNN requires huge dataset and models which increases the computational complexity. Transfer learning that uses the pre trained CNNwith fine tuning is the better alternative to reduce the training cost.In this paper, we have proposed the idea of transforming the raw accelerometer and gyroscope sensor data to the visual domain by using our novel activity image creation method (NAICM). Pre-trained CNN (AlexNet) has been used on the converted image domain information. The proposed method is evaluated on several online available human activity recognition dataset. The results show that the proposed novel activity image creation method (NAICM) has successfully created the activity images with a classification accuracy of 98.36% using pre trained CNN.
- Published
- 2021
- Full Text
- View/download PDF
6. Particle‐filter‐based human target tracking in image domain for through‐wall imaging radar
- Author
-
Chen Guohao, Cao Lingxiao, Shisheng Guo, Guolong Cui, and Lingjiang Kong
- Subjects
back-projection algorithm ,Computer science ,0211 other engineering and technologies ,Energy Engineering and Power Technology ,02 engineering and technology ,Through wall imaging ,Tracking (particle physics) ,ultra wideband radar ,particle-filter-based human target tracking ,image domain ,tracking problem ,phase coherence factor ,law.invention ,0203 mechanical engineering ,law ,Radar imaging ,Computer vision ,Radar ,efficient image-domain tracking algorithm ,021101 geological & geomatics engineering ,Image domain ,020301 aerospace & aeronautics ,business.industry ,multiframe high quality images ,General Engineering ,amplitude-distribution-based particle filter ,Ultra wideband radar ,radar imaging ,Phase coherence ,lcsh:TA1-2040 ,time-division multiple-input-multiple-output ,particle filtering (numerical methods) ,Artificial intelligence ,target tracking ,hidden target tracking ,lcsh:Engineering (General). Civil engineering (General) ,Particle filter ,business ,through-wall imaging radar ,Software ,hidden human targets - Abstract
This study deals with a tracking problem for hidden human targets using time-division multiple-input-multiple-output through-wall imaging radar (TWIR). An efficient image-domain tracking algorithm is proposed. Specifically, the authors first utilise back-projection algorithm and the phase coherence factor (PCF) to obtain multi-frame high quality images. Then a tracking algorithm via amplitude-distribution-based particle filter is proposed. Experimental data validates that this algorithm has a commendable effectiveness for hidden target tracking.
- Published
- 2019
- Full Text
- View/download PDF
7. Deep Transfer Learning for Image Emotion Analysis: Reducing Marginal and Joint Distribution Discrepancies Together
- Author
-
Guiguang Ding and Yuwei He
- Subjects
Image domain ,0209 industrial biotechnology ,Computer Networks and Communications ,Computer science ,business.industry ,General Neuroscience ,Complex system ,Pattern recognition ,Computational intelligence ,02 engineering and technology ,Convolutional neural network ,020901 industrial engineering & automation ,Artificial Intelligence ,Joint probability distribution ,0202 electrical engineering, electronic engineering, information engineering ,Leverage (statistics) ,Labeled data ,020201 artificial intelligence & image processing ,Artificial intelligence ,Transfer of learning ,business ,Software - Abstract
A lot of research attentions have been paid to image emotion analysis in recent years. Meanwhile, as convolutional neural networks (CNNs) have made great successful in computer vision, many researchers start to employ CNN to discriminate image emotions. However, the training procedure of CNNs depends on sufficient labeled data. Therefore, a CNN is hard to perform well in an image domain with scant labeled information. In this paper, we propose a deep transfer learning method for image emotion analysis. The method can leverage rich emotion knowledge from a source domain to the target domain. Our method reduces both marginal and joint domain distribution discrepancies at fully-connected layers. Through this way, we can effectively extract more transferable features and advance the performance of CNNs on poor-label emotion-image domains.
- Published
- 2019
- Full Text
- View/download PDF
8. Robbie: A batch processing work-flow for the detection of radio transients and variables
- Author
-
Natasha Hurley-Walker, Tim E. White, and Paul Hancock
- Subjects
Image domain ,010308 nuclear & particles physics ,business.industry ,Computer science ,Real-time computing ,Astronomy and Astrophysics ,Modular design ,Makefile ,01 natural sciences ,Computer Science Applications ,Open software ,Software portability ,Space and Planetary Science ,0103 physical sciences ,Batch processing ,Work flow ,Transient (computer programming) ,business ,010303 astronomy & astrophysics - Abstract
We present Robbie : a general work-flow for the detection and characterization of radio variability and transient events in the image domain. Robbie is designed to work in a batch processing paradigm with a modular design so that components can be swapped out or upgraded to adapt to different input data, whilst retaining a consistent and coherent methodological approach. Robbie is based on commonly used and open software, and is encapsulated in a Makefile to aid portability and reproducibility. In this paper we describe the methodology behind Robbie , and demonstrate its use on real and simulated data. Robbie is available on GitHub .
- Published
- 2019
- Full Text
- View/download PDF
9. Retraction Note: A deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell
- Author
-
Mohamed Loey, Nour Eldeen M. Khalifa, Gunasekaran Manogaran, and Mohamed Hamed N. Taha
- Subjects
Image domain ,Materials science ,Treatment classification ,business.industry ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,Deep learning ,Pooling ,Decision tree ,Bioengineering ,General Chemistry ,Human cell ,Condensed Matter Physics ,Machine learning ,computer.software_genre ,Atomic and Molecular Physics, and Optics ,Support vector machine ,Modeling and Simulation ,General Materials Science ,Artificial intelligence ,business ,computer - Abstract
Coronavirus pandemic is burdening healthcare systems around the world to the full capacity they can accommodate. There is an overwhelming need to find a treatment for this virus as early as possible. Computer algorithms and deep learning can participate positively by finding a potential treatment for SARS-CoV-2. In this paper, a deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell will be presented. The dataset selected in this work is a subset of the publicly online datasets available on RxRx.ai. The objective of this research is to automatically classify a single human cell according to the treatment type and the treatment concentration level. A DCNN model and a methodology are proposed throughout this work. The methodical idea is to convert the numerical features from the original dataset to the image domain and then fed them up into a DCNN model. The proposed DCNN model consists of three convolutional layers, three ReLU layers, three pooling layers, and two fully connected layers. The experimental results show that the proposed DCNN model for treatment classification (32 classes) achieved 98.05% in testing accuracy if it is compared with classical machine learning such as support vector machine, decision tree, and ensemble. In treatment concentration level prediction, the classical machine learning (ensemble) algorithm achieved 98.5% in testing accuracy while the proposed DCNN model achieved 98.2%. The performance metrics strengthen the obtained results from the conducted experiments for the accuracy of treatment classification and treatment concentration level prediction.
- Published
- 2021
- Full Text
- View/download PDF
10. Compressed Sensing in Parallel MRI: A Review
- Author
-
Muhammad Shahin Uddin, Shafiqul Islam, and Rafiqul Islam
- Subjects
Image domain ,medicine.diagnostic_test ,business.industry ,Computer science ,Magnetic resonance imaging ,Reconstruction algorithm ,Computer Graphics and Computer-Aided Design ,Computer Science Applications ,Compressed sensing ,Medical imaging ,medicine ,Computer vision ,Parallel magnetic resonance imaging ,Computer Vision and Pattern Recognition ,Imaging technique ,Artificial intelligence ,business - Abstract
Magnetic resonance imaging (MRI) is a dynamic and safe imaging technique in medical imaging. Recently, parallel MRI (pMRI) is widely used for accelerating conventional MRI. Both frequency and image domain-based reconstructions are the most attractive methods for generating the image from multi-channel k-space data. Compressed sensing (CS) is a recently used procedure to reduce the acquisition time of conventional MRI. This reduction is achieved by taking fewer measurements from the fully sampled k-space data. Therefore, applying the CS technique in pMRI is the most emerging way for further improving the acquisition time that is a tremendous research interest. However, as the phase encoding plane may be perpendicular or parallel to the coil elements plane, finding the exact domain for CS in pMRI reconstruction is a major challenging issue. In this work, the application of the CS technique in pMRI in both domains is investigated. Later some widely used methodologies are presented as the nonlinear reconstruction algorithm of CS in pMRI. Finally, a discussion is performed based on CS in pMRI to perceive the reality of different reconstruction algorithms at a glance for finding preferred methodologies.
- Published
- 2021
- Full Text
- View/download PDF
11. Image-domain material decomposition for single-energy CT images using cascaded network
- Author
-
Yizhong Wang, Xiaohuan Yu, Bin Yan, Zhiwei Feng, and Lei Li
- Subjects
Image domain ,Basis (linear algebra) ,business.industry ,Computer science ,Digital Enhanced Cordless Telecommunications ,Computer vision ,Tomography ,Artificial intelligence ,business ,Material decomposition ,Automation ,Convolutional neural network ,Energy (signal processing) - Abstract
Dual-energy computer tomography (DECT) has high application prospects in distinguishing and quantifying materials. However, DECT requires higher hardware cost and higher radiation dose than single-energy CT imaging. In this paper, we have developed a cascaded network method to realize DECT imaging through SECT images, then get the basis material decomposition images. Specifically, we design the mapping convolutional neural network and the material decomposition U-Net to realize the mapping of low- to high-energy images and material decomposition, respectively. To verify the feasibility of the proposed method, we extracted 1442 cranial cavity slice images of 5 patients for experiments. The qualitative and quantitative results show that the proposed method can achieve high-quality DECT imaging of single energy data and high-precision basis material decomposition.
- Published
- 2021
- Full Text
- View/download PDF
12. Using Adversarial Images to Assess the Stability of Deep Learning Models Trained on Diagnostic Images in Oncology
- Author
-
Enoch Chang, Rachel Choi, Harlan M. Krumholz, Sanjay Aneja, Daniel X. Yang, Sachin Umrao, James S. Duncan, Marina Joel, Roy S. Herbst, and Antonio Omuro
- Subjects
Image domain ,Contextual image classification ,Robustness (computer science) ,Computer science ,business.industry ,Deep learning ,Stability (learning theory) ,Pattern recognition ,Artificial intelligence ,business ,Mri model ,Imaging modalities ,Vulnerability (computing) - Abstract
PurposeDeep learning (DL) models have rapidly become a popular and cost-effective tool for image classification within oncology. A major limitation of DL models is output instability, as small perturbations in input data can dramatically alter model output. The purpose of the study is to investigate the robustness of DL models in the oncologic image domain through the application of adversarial images: manipulated images with small pixel-level perturbations designed to assess the stability of DL models.Experimental DesignWe examined the impact of adversarial images on the classification accuracies of DL models trained to classify cancerous lesions across three common oncologic imaging modalities (CT, mammogram, and MRI). The CT model was trained to classify malignant lung nodules using the LIDC dataset. The mammogram model was trained to classify malignant breast lesions using the DDSM dataset. The MRI model was trained to classify brain metastases using an institutional dataset. We also explored the utility of an iterative adversarial training approach to improve the stability of DL models to small pixel-level changes.ResultsOncologic images showed instability with small pixel-level changes. A pixel-level of perturbation of .004 resulted in a majority of oncologic images to be misclassified by their respective DL models (CT 25.64%, mammogram 23.93%, MRI 6.36%). Adversarial training mitigated improved the stability and robustness of DL models trained on oncologic images compared to naive models [(CT 67.72% vs 26.92%), mammogram (63.39% vs 27.68%), MRI (87.20% vs 24.32%)].ConclusionsDL models naively trained on oncologic images exhibited dramatic instability to small pixel-level changes resulting in substantial decreases in accuracy. Adversarial training techniques improved the stability and robustness of DL models to such pixel-level changes. Prior to clinical implementation, adversarial training should be considered to proposed DL models to improve overall performance and safety.
- Published
- 2021
- Full Text
- View/download PDF
13. A Benchmark and Baseline for Language-Driven Image Editing
- Author
-
Zheng Wen, Trung Bui, Franck Dernoncourt, Ning Xu, Chenliang Xu, and Jing Shi
- Subjects
Image domain ,Computer science ,business.industry ,Photography ,020207 software engineering ,02 engineering and technology ,Image editing ,010501 environmental sciences ,computer.software_genre ,Machine learning ,01 natural sciences ,Task (project management) ,Annotation ,0202 electrical engineering, electronic engineering, information engineering ,Benchmark (computing) ,Artificial intelligence ,business ,Baseline (configuration management) ,computer ,0105 earth and related environmental sciences - Abstract
Language-driven image editing can significantly save the laborious image editing work and be friendly to the photography novice. However, most similar work can only deal with a specific image domain or can only do global retouching. To solve this new task, we first present a new language-driven image editing dataset that supports both local and global editing with editing operation and mask annotations. Besides, we also propose a baseline method that fully utilizes the annotation to solve this problem. Our new method treats each editing operation as a submodule and can automatically predict operation parameters. Not only performing well on challenging user data, but such an approach is also highly interpretable. We believe our work, including both the benchmark and the baseline, will advance the image editing area towards a more general and free-form level.
- Published
- 2021
- Full Text
- View/download PDF
14. Unsupervised Representation Disentanglement of Text: An Evaluation on Synthetic Datasets
- Author
-
Lan Zhang, Ehsan Shareghi, and Victor Prokhorov
- Subjects
FOS: Computer and information sciences ,Image domain ,Computer Science - Computation and Language ,business.industry ,Inductive bias ,Intersection (set theory) ,Computer science ,Homotopy ,Machine learning ,computer.software_genre ,Domain (software engineering) ,Set (abstract data type) ,Artificial intelligence ,business ,Representation (mathematics) ,Computation and Language (cs.CL) ,computer ,Generative grammar - Abstract
To highlight the challenges of achieving representation disentanglement for text domain in an unsupervised setting, in this paper we select a representative set of successfully applied models from the image domain. We evaluate these models on 6 disentanglement metrics, as well as on downstream classification tasks and homotopy. To facilitate the evaluation, we propose two synthetic datasets with known generative factors. Our experiments highlight the existing gap in the text domain and illustrate that certain elements such as representation sparsity (as an inductive bias), or representation coupling with the decoder could impact disentanglement. To the best of our knowledge, our work is the first attempt on the intersection of unsupervised representation disentanglement and text, and provides the experimental framework and datasets for examining future developments in this direction., Comment: Accepted to RepL4NLP 2021
- Published
- 2021
- Full Text
- View/download PDF
15. Factors of Influence for Transfer Learning across Diverse Appearance Domains and Task Types
- Author
-
Jasper Uijlings, Alina Kuznetsova, Thomas Mensink, Vittorio Ferrari, and Michael Gygli
- Subjects
FOS: Computer and information sciences ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Computer Science - Computer Vision and Pattern Recognition ,Machine learning ,computer.software_genre ,Task (project management) ,Image (mathematics) ,Machine Learning ,Artificial Intelligence ,Learning ,Segmentation ,Image domain ,Contextual image classification ,business.industry ,Applied Mathematics ,Object detection ,Aerial imagery ,Computational Theory and Mathematics ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Neural Networks, Computer ,business ,Transfer of learning ,computer ,Software ,Algorithms - Abstract
Transfer learning enables to re-use knowledge learned on a source task to help learning a target task. A simple form of transfer learning is common in current state-of-the-art computer vision models, i.e. pre-training a model for image classification on the ILSVRC dataset, and then fine-tune on any target task. However, previous systematic studies of transfer learning have been limited and the circumstances in which it is expected to work are not fully understood. In this paper we carry out an extensive experimental exploration of transfer learning across vastly different image domains (consumer photos, autonomous driving, aerial imagery, underwater, indoor scenes, synthetic, close-ups) and task types (semantic segmentation, object detection, depth estimation, keypoint detection). Importantly, these are all complex, structured output tasks types relevant to modern computer vision applications. In total we carry out over 2000 transfer learning experiments, including many where the source and target come from different image domains, task types, or both. We systematically analyze these experiments to understand the impact of image domain, task type, and dataset size on transfer learning performance. Our study leads to several insights and concrete recommendations: (1) for most tasks there exists a source which significantly outperforms ILSVRC'12 pre-training; (2) the image domain is the most important factor for achieving positive transfer; (3) the source dataset should \emph{include} the image domain of the target dataset to achieve best results; (4) at the same time, we observe only small negative effects when the image domain of the source task is much broader than that of the target; (5) transfer across task types can be beneficial, but its success is heavily dependent on both the source and target task types., Comment: Accepted for future publication in TPAMI
- Published
- 2021
- Full Text
- View/download PDF
16. 2D+t track detection via relative persistent homology
- Author
-
Abbas Rammal, Rabih Assaf, Alban Goupil, Valeriu Vrabie, Mohammad Kacim, Université Saint-Esprit de Kaslik (USEK), Lebanese University [Beirut] (LU), Centre de Recherche en Sciences et Technologies de l'Information et de la Communication - EA 3804 (CRESTIC), and Université de Reims Champagne-Ardenne (URCA)
- Subjects
Computer science ,Computation ,02 engineering and technology ,Homology (mathematics) ,relative homology ,030218 nuclear medicine & medical imaging ,03 medical and health sciences ,algebraic topology ,0302 clinical medicine ,0202 electrical engineering, electronic engineering, information engineering ,Electrical and Electronic Engineering ,object tracking ,Image domain ,Persistent homology ,business.industry ,Pattern recognition ,object detection ,Object detection ,persistent homology ,Electronic, Optical and Magnetic Materials ,Video tracking ,[MATH.MATH-AT]Mathematics [math]/Algebraic Topology [math.AT] ,Image sequence ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing ,Software ,Relative homology - Abstract
International audience; In this paper, we demonstrate that algebraic topology can be used to perform 2D+t object detection. After the construction of a topological complex for a 2D+t image sequence, we build a nested sequence of cell complexes on which relative persistent homology is computed. The relative homology adds to “absolute” homology the computation of classes related to the first and last frames of the sequence. By identifying 2D chains with large life spans, the most persistent classes are extracted. This allows for the identification of the interesting parts in a sequence and for the detection of the movement of objects despite continuous deformations in the image domain. The results obtained on a synthetic image and on two real biomedical images with moving vesicles recorded by a quantitative phase time‐lapse technique show the potential of this method. Comparing the method with a newly developed tracking tool proves that the strength of this method is its independence from prior parameters.
- Published
- 2021
- Full Text
- View/download PDF
17. Automatic Detection of Landmarks for Fast Cardiac MR Image Registration
- Author
-
Mihaela Pop, Mia Mojica, and Mehran Ebrahimi
- Subjects
Image domain ,Similarity (geometry) ,Speedup ,Cardiac cycle ,Computer science ,business.industry ,Pipeline (computing) ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Task (computing) ,Prior probability ,Computer vision ,Artificial intelligence ,Mr images ,business - Abstract
Inter-subject registration of cardiac images is a vital yet challenging task due to the large deformations influenced by the cardiac cycle and respiration. Various intensity-based cardiac registration methods have already been proposed, but such methods utilize intensity information over the entire image domain and are thus computationally expensive. In this work, we propose a novel pipeline for fast registration of cardiac MR images that relies on shape priors and the strategic location of surface-approximating landmarks. Our holistic approach to cardiac registration requires minimal user input. It also reduces the computational runtime by \(60\%\) on average, which amounts to an 11-min speedup in runtime. Most importantly, the resulting Dice similarity coefficients are comparable to those from a widely used elastic registration method.
- Published
- 2021
- Full Text
- View/download PDF
18. RETRACTED ARTICLE: A deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell
- Author
-
Nour Eldeen M. Khalifa, Mohamed Hamed N. Taha, Mohamed Loey, and Gunasekaran Manogaran
- Subjects
Materials science ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,Pooling ,Decision tree ,Bioengineering ,02 engineering and technology ,010402 general chemistry ,Machine learning ,computer.software_genre ,01 natural sciences ,Modelling and Simulation ,General Materials Science ,Image domain ,Treatment classification ,business.industry ,Deep learning ,General Chemistry ,Human cell ,021001 nanoscience & nanotechnology ,Condensed Matter Physics ,Atomic and Molecular Physics, and Optics ,0104 chemical sciences ,Support vector machine ,Modeling and Simulation ,Artificial intelligence ,0210 nano-technology ,business ,computer - Abstract
Coronavirus pandemic is burdening healthcare systems around the world to the full capacity they can accommodate. There is an overwhelming need to find a treatment for this virus as early as possible. Computer algorithms and deep learning can participate positively by finding a potential treatment for SARS-CoV-2. In this paper, a deep learning model and machine learning methods for the classification of potential coronavirus treatments on a single human cell will be presented. The dataset selected in this work is a subset of the publicly online datasets available on RxRx.ai. The objective of this research is to automatically classify a single human cell according to the treatment type and the treatment concentration level. A DCNN model and a methodology are proposed throughout this work. The methodical idea is to convert the numerical features from the original dataset to the image domain and then fed them up into a DCNN model. The proposed DCNN model consists of three convolutional layers, three ReLU layers, three pooling layers, and two fully connected layers. The experimental results show that the proposed DCNN model for treatment classification (32 classes) achieved 98.05% in testing accuracy if it is compared with classical machine learning such as support vector machine, decision tree, and ensemble. In treatment concentration level prediction, the classical machine learning (ensemble) algorithm achieved 98.5% in testing accuracy while the proposed DCNN model achieved 98.2%. The performance metrics strengthen the obtained results from the conducted experiments for the accuracy of treatment classification and treatment concentration level prediction.
- Published
- 2020
- Full Text
- View/download PDF
19. A Novel SAR Image Domain-Ground Moving Target Imaging Method
- Author
-
Jun Wan, Zhiqiang Zeng, Li Li, Dong Li, Shuwei Zhou, Yan Huang, and Zhanye Chen
- Subjects
Synthetic aperture radar ,Image domain ,Computer science ,business.industry ,020208 electrical & electronic engineering ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,0211 other engineering and technologies ,02 engineering and technology ,Image (mathematics) ,Ground moving target ,Radar imaging ,0202 electrical engineering, electronic engineering, information engineering ,Chirp ,Computer vision ,Artificial intelligence ,business ,021101 geological & geomatics engineering - Abstract
This paper mainly focuses on synthetic aperture radar (SAR) ground moving target imaging. Although there exists many excellent SAR moving target imaging algorithms, two issues, the maneuverability of the SAR platform and the type of data used for moving target imaging, are not discussed by most of them. Thus, a novel SAR image domain-ground moving target imaging method is proposed to preliminarily handle the aforementioned two issues. The method proposed contains two main steps. The first one is the pre-imaging of the raw data, and the second one is focusing the ground moving target's image data by a proposed one-dimensional parameter traversal approach. Numerical experiments are finally presented to verify the effectiveness of the proposed ground moving target imaging method.
- Published
- 2020
- Full Text
- View/download PDF
20. Image domain adaption of simulated data for human pose estimation
- Author
-
Andreas Blattmann, Thomas Golda, Jürgen Beyerer, and Jürgen Metzler
- Subjects
Image domain ,Computer science ,business.industry ,Estimator ,Machine learning ,computer.software_genre ,Domain (software engineering) ,Computer game ,Simulated data ,Deep neural networks ,Artificial intelligence ,Graphics ,business ,computer ,Pose - Abstract
Leveraging the power of deep neural networks, single-person pose estimation has made substantial progress throughout the last years. More recently, multi-person pose estimation has also become of growing importance, mainly driven by the high demand for reliable video surveillance systems in public security. To keep up with these demands, certain efforts have been made to improve the performance of such systems, which is yet limited by the insufficient amount of available training data. This work addresses this lack of labeled data: by diminishing the often faced problem of domain shift between synthetic images from computer game graphics engines and real world data, annotated training data shall be provided at zero labeling-cost. To this end, generative adversarial networks are applied as domain adaption framework, adapting the data of a novel synthetic pose estimation dataset to several real world target domains. State-of-the-art domain adaption methods are extended to meet the important requirement of exact content preservation between synthetic and adapted images. Experiments, that are subsequently conducted, indicate the improved suitability of the adapted data as human pose estimators trained on this data outperform those which are trained on purely synthetic images.
- Published
- 2020
- Full Text
- View/download PDF
21. Parametric Synthetic Aperture Radar Image Recovery for Multiple Linear Structures: An Image Domain Approach
- Author
-
Yuhan Wen, Tao Zeng, Zegang Ding, Yan Wang, and Xinliang Chen
- Subjects
Synthetic aperture radar ,Computer science ,0211 other engineering and technologies ,02 engineering and technology ,linear structure ,law.invention ,linear structures recovery ,law ,Robustness (computer science) ,0202 electrical engineering, electronic engineering, information engineering ,Computer vision ,Radar ,lcsh:Science ,021101 geological & geomatics engineering ,Parametric statistics ,Image domain ,Scattering ,business.industry ,parametric scattering models ,020206 networking & telecommunications ,synthetic aperture radar (SAR) ,target recognition ,General Earth and Planetary Sciences ,lcsh:Q ,Artificial intelligence ,business - Abstract
The linear structures in synthetic aperture radar (SAR) images can provide important geometric information regarding the illuminated objects. However, the linear structures of various objects sometimes disappear in traditional SAR images, which severely affects the application in automated scene analysis and information extraction techniques. This paper proposes a parametric SAR image recovery method for linear structures of extended targets. By extracting the spatial phase distribution feature in image domain, the proposed method is used to identify the endpoints among all of the scattered points in SAR images and reconstruct the disappeared linear structures based on the scattering model. The method can be generally divided into three procedures: endpoints and point targets classification, linear structures recognition, and linear structures reconstruction. In the first step, the endpoints of linear structures and point targets are classified by setting a threshold related to the spatial phase distribution feature. Afterwards, the linear structures recognition method is used to determine which two endpoints can be formed into a linear structure. Finally, the parametric scattering models are used to reconstruct the disappeared linear structures. Experiments are conducted on both computer simulations and the data that were acquired by microwave anechoic chamber experiment, tower crane radar experiment, and unmanned vehicle radar experiment in order to validate the effectiveness and robustness of the proposed method.
- Published
- 2020
22. Image Domain Transfer by Deep Learning is Feasible in Multiple Sclerosis Clinical Practice
- Author
-
Akifumi Hagiwara, Koji Kamagata, and Shigeki Aoki
- Subjects
Image domain ,Multiple Sclerosis ,Artificial neural network ,business.industry ,Computer science ,Multiple sclerosis ,Deep learning ,MEDLINE ,General Medicine ,medicine.disease ,Machine learning ,computer.software_genre ,Clinical Practice ,Machine Learning ,Deep Learning ,medicine ,Humans ,Radiology, Nuclear Medicine and imaging ,Artificial intelligence ,Neural Networks, Computer ,business ,computer - Published
- 2020
23. Quality of life outcomes after minimally invasive repair of pectus excavatum utilizing a new set of metallic bars and stabilizers
- Author
-
Niura Noro Hamilton, Miguel Lia Tedde, Paulo Manuel Pêgo-Fernandes, Rafael Lucas Costa de Carvalho, Gustavo Falavigna Guilherme, Jose Ribas Milanese de Campos, Vanessa Moreira Sousa, Flavio Henrique Savazzi, and Vitor Floriano Salomao Junior
- Subjects
Adult ,medicine.medical_specialty ,Adolescent ,Intervention group ,Nuss procedure ,03 medical and health sciences ,Young Adult ,0302 clinical medicine ,Quality of life ,Pectus excavatum ,030225 pediatrics ,medicine ,Humans ,Minimally Invasive Surgical Procedures ,Prospective randomized study ,In patient ,Prospective Studies ,Retrospective Studies ,Image domain ,business.industry ,Significant difference ,General Medicine ,medicine.disease ,Surgery ,Treatment Outcome ,030220 oncology & carcinogenesis ,Funnel Chest ,Pediatrics, Perinatology and Child Health ,Quality of Life ,business - Abstract
The aim of the study was to evaluate the postoperative quality of life (QoL) of patients who underwent minimally invasive repair of pectus excavatum (MIRPE) with a newly designed bar and bar stabilizers.We conducted a prospective randomized study in which patients were operated either with standard perpendicular stabilizers (control group) or with the newly designed oblique stabilizers (intervention group). All patients were evaluated 6 months after the operation with the Pectus Excavatum Evaluation Questionnaire (PEEQ).There were 16 patients in the control group and 14 in the intervention group. Mean age was 17 (SD: 3.3, range 14-27) years. There were no demographic differences between groups. Two patients in the control group and one in the intervention group were repaired with two bars instead of one. There was one reoperation in each group. There was a significant difference between the pre- and postoperative scores, in both groups, in the patient body image domain (control group: 9.5 to 3; p 0.01; intervention group 10 to 3; p 0.01), as well as in the psychosocial domain (control group: 13.5 to 24, p 0.01; intervention group: 15 to 24, p 0.01). With regards to the patients' perception of physical difficulties before and after MIRPE, the difference between pre- and postoperative scores was greater in the intervention group (8 to 12, p 0.01) than in the control group (10 to 11, p = 0.04). The mean length of stay was 4.5 and 5 days in the intervention group and the control group, respectively.Our study showed that patients who underwent MIRPE with the newly designed bars and stabilizers had non-inferior outcomes than patients reported in the literature who underwent MIRPE with standard bars and stabilizers. We found slightly better outcomes in patients in the intervention group compared to the control group, but larger studies will be needed to confirm if those differences are statistically significant.II.
- Published
- 2020
24. Low-dose CT reconstruction with simultaneous sinogram and image domain denoising by deep neural network
- Author
-
Hairong Zheng, Jiongtao Zhu, Yongshuai Ge, Xiaolei Deng, Ting Su, Xindong Sun, and Dong Liang
- Subjects
Image domain ,Artificial neural network ,business.industry ,Computer science ,Noise reduction ,Low dose ct ,Computer vision ,Artificial intelligence ,business - Published
- 2020
- Full Text
- View/download PDF
25. Learning Shapes on Image Sampled Points with Dynamic Graph CNNs
- Author
-
Juan Castorena and Diane Oyen
- Subjects
Image domain ,Artificial neural network ,Computer science ,business.industry ,Feature extraction ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Inference ,020206 networking & telecommunications ,Pattern recognition ,02 engineering and technology ,Computer Science::Computer Vision and Pattern Recognition ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,MNIST database ,Shape analysis (digital geometry) - Abstract
This research focuses on the problem of shape analysis on the 2D image domain. State of the art methods like convolution neural networks typically rely on the richness of texture information in images for inference. However, when the underlying object is better described as a shape, these methods tend to suffer. Here, we aim to bridge this gap by proposing a method to analyze shapes on images. The driving idea is to learn features defined on just a few point samples extracted from image super-pixels. A dynamic graph CNN (i.e., a neural net producing a different graph at each layer) is trained and used as the learning engine in a classification task. Our first set of experiments is tested on the 10-digit class MNIST dataset benchmark where we find that the proposed method performs better than others on small datasets. This altogether shows a promising direction for the analysis of more complex shapes fundamental in classification and retrieval of documents of scientific nature.
- Published
- 2020
- Full Text
- View/download PDF
26. Memorability based image to image translation
- Author
-
Prasen Kumar Sharma, Sathisha Basavaraju, and Arijit Sur
- Subjects
Image domain ,Set (abstract data type) ,business.industry ,Computer science ,Deep learning ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Image translation ,Computer vision ,Artificial intelligence ,business ,Translation (geometry) ,Image (mathematics) ,Task (project management) - Abstract
This paper presents a memorability based image-to-image translation technique to make an image more memorable while retaining its high-level contents. Conventionally, the image-to-image translation task aims to learn the mapping between images of two different domains using a set of aligned image pairs. However, dataset having such one-to-one mapping is not available for memorability based image-to-image translation. Therefore, the aim of the proposed task is defined to learn the mapping F: I → I' between two image domains I and I'. Here, I corresponds to input image domain and I' is the unknown image domain containing the modified version of the input images. Also, every image in I' is more memorable than its corresponding image in I. Therefore, the proposed task is achieved by developing a deep learning based method to learn the mapping F: I→ I' using mean-squared error and memorability loss between I and F(I). The experimental results showed that the proposed approach increases the memorability of the given image better than the state-of-the-art image-to-image translation techniques.
- Published
- 2020
- Full Text
- View/download PDF
27. An InISAR Imaging Method Based on Dominant-Scatterer Phase Focusing
- Author
-
Yu Li, Xiao Dong, and Yunhua Zhang
- Subjects
Image domain ,Physics ,Interferometric phase ,Offset (computer science) ,business.industry ,Scattering ,Astrophysics::Instrumentation and Methods for Astrophysics ,law.invention ,Inverse synthetic aperture radar ,Interferometry ,Optics ,law ,Radar ,business ,3d coordinates - Abstract
A new interferometric inverse synthetic aperture radar (InISAR) imaging method based on a L shape three-antenna interferometric radar geometry and dominant-scatterer phase focusing is presented in this paper. Under this InISAR model, the distance of the target in the baseline direction introduces additional interferometric phase offset, which can cause phase ambiguity, so the interferometric phase must be corrected before interferometric processing. Firstly, a strong scattering point is searched for as a reference point in the image domain, and then the phase of the scattering point is used as the reference phase for phase focusing on each ISAR image respectively to unwrap the interferometric phase, so that the 3D coordinates of the target can be correctly retrieved. Simulation results confirm the effectiveness of the method.
- Published
- 2019
- Full Text
- View/download PDF
28. Towards Balancing the Complexity of Convolutional Neural Network with the Role of Optical Coherence Tomography in Retinal Conditions
- Author
-
Simona Delia Nicoară, Anca Marginean, George Adrian Muntean, Adrian Groza, Radu Razvan Slavescu, and Ioan Alfred Letia
- Subjects
Image domain ,0303 health sciences ,medicine.diagnostic_test ,Computer science ,business.industry ,Deep learning ,Pattern recognition ,Convolutional neural network ,03 medical and health sciences ,0302 clinical medicine ,Optical coherence tomography ,030221 ophthalmology & optometry ,medicine ,Artificial intelligence ,business ,030304 developmental biology - Abstract
Convolutional neural networks have shown impressive performance in the medical image domain, but medical experts are somewhat skeptical in their predictions since the features are not directly graspable. We are looking into one of the technical challenges, namely the explainability of the results, to try to find out some demonstration of the regions deemed abnormal by deep learning. Following the trend of heatmaps indicating which local morphology changes would modify the predictions, we are trying to verify the facilitation of the clinical understanding in the eyes of the ophthalmologist.
- Published
- 2019
- Full Text
- View/download PDF
29. Application of image-domain LSRTM for illumination study and optimal survey design
- Author
-
Zhuoquan Yuan, Jean Ji, Senren Liu, Yang He, and Vijay Singh
- Subjects
Image domain ,Computer science ,business.industry ,Survey research ,Computer vision ,Artificial intelligence ,business - Published
- 2019
- Full Text
- View/download PDF
30. Moving objects Path Tracking based on Entropy Background Subtraction and CAMShift
- Author
-
M R Sunitha and C N Arpitha
- Subjects
Image domain ,Background subtraction ,Pixel ,business.industry ,Computer science ,Path tracking ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Entropy (information theory) ,Video sequence ,Computer vision ,Artificial intelligence ,Tracing ,business - Abstract
For multiple object path tracking we integrate entropy background subtraction method and CAMShift algorithm. Initially claussius entropy theory is used to transform each pixel in the image domain into entropy domain and obtain its energy level. Later we use entropy background subtraction algorithm to detect moving object region in each frame. To improve robustness in the condition where objects are of different color, object colors are same as to background’s colors. localization of object is obtained by choosing each objective region. This approach can automatically calibrate moving vehicles in traffic video and achieve multi-target tracking by using a multi-tracker of CAMShift algorithm. The proposed system is also capable of tracing paths of moving vehicles. The results and analysis demonstrates that the methods used in the paper finds solution for automatic multi-object tracking problems in video sequence efficiently.
- Published
- 2019
- Full Text
- View/download PDF
31. A Multiversion Programming Inspired Approach to Detecting Audio Adversarial Examples
- Author
-
Qiang Zeng, Chiu C. Tan, Chenglong Fu, Xiaojiang Du, Jianhai Su, Jie Wu, Lannan Luo, and Golam Kayas
- Subjects
Image domain ,business.industry ,Computer science ,Transferability ,020206 networking & telecommunications ,020207 software engineering ,02 engineering and technology ,Machine learning ,computer.software_genre ,Adversarial system ,Trustworthiness ,0202 electrical engineering, electronic engineering, information engineering ,N-version programming ,Artificial intelligence ,Hardware_ARITHMETICANDLOGICSTRUCTURES ,business ,Classifier (UML) ,computer - Abstract
Adversarial examples (AEs) are crafted by adding human-imperceptible perturbations to inputs such that a machine-learning based classifier incorrectly labels them. They have become a severe threat to the trustworthiness of machine learning. While AEs in the image domain have been well studied, audio AEs are less investigated. Recently, multiple techniques are proposed to generate audio AEs, which makes countermeasures against them urgent. Our experiments show that, given an audio AE, the transcription results by Automatic Speech Recognition (ASR) systems differ significantly (that is, poor transferability), as different ASR systems use different architectures, parameters, and training datasets. Based on this fact and inspired by Multiversion Programming, we propose a novel audio AE detection approach MVP-Ears, which utilizes the diverse off-the-shelf ASRs to determine whether an audio is an AE. We build the largest audio AE dataset to our knowledge, and the evaluation shows that the detection accuracy reaches 99.88%. While transferable audio AEs are difficult to generate at this moment, they may become a reality in future. We further adapt the idea above to proactively train the detection system for coping with transferable audio AEs. Thus, the proactive detection system is one giant step ahead of attackers working on transferable AEs.
- Published
- 2019
- Full Text
- View/download PDF
32. CT Metal Artefacts Reduction Using Convolutional Neural Networks
- Author
-
Dusan Gleich, A. Trbalic, Emir Skejic, A. Serifovic Trbalic, and Damir Demirovic
- Subjects
Image domain ,medicine.diagnostic_test ,Computer science ,business.industry ,Image quality ,Metal artefact ,Anatomical structures ,Pattern recognition ,Computed tomography ,Convolutional neural network ,Streaking ,Reduction (complexity) ,medicine ,Artificial intelligence ,business - Abstract
Artefacts caused by the presence of metallic implants and prosthesis appear as dark and bright streaks in computed tomography (CT) images, that obscure the information about underlying anatomical structures. These phenomena can severely degrade the image quality and hinder the correct diagnostic interpretation. Although many techniques for the reduction of metal artefacts have been proposed in literature, their effectiveness is still limited. In this paper, an application of a convolutional neural networks (CNN) to the problem of metal artefact reduction (MAR) in the image domain is investigated. Experimental results show that image-domain CNN can substantially suppresses streaking artefacts in the reconstructed images.
- Published
- 2019
- Full Text
- View/download PDF
33. Implicit Pairs for Boosting Unpaired Image-to-Image Translation
- Author
-
Yiftach Ginger, Hadar Averbuch-Elor, Dov Danon, and Daniel Cohen-Or
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Boosting (machine learning) ,Generative adversarial networks ,Data augmentation ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,02 engineering and technology ,Machine Learning (cs.LG) ,Paired samples ,0202 electrical engineering, electronic engineering, information engineering ,0501 psychology and cognitive sciences ,Image-to-image translation ,050107 human factors ,Image domain ,lcsh:T58.5-58.64 ,business.industry ,lcsh:Information technology ,05 social sciences ,020207 software engineering ,Pattern recognition ,Computer Graphics and Computer-Aided Design ,Human-Computer Interaction ,Synthetic samples ,Image translation ,Artificial intelligence ,business ,Software - Abstract
In image-to-image translation the goal is to learn a mapping from one image domain to another. In the case of supervised approaches the mapping is learned from paired samples. However, collecting large sets of image pairs is often either prohibitively expensive or not possible. As a result, in recent years more attention has been given to techniques that learn the mapping from unpaired sets. In our work, we show that injecting implicit pairs into unpaired sets strengthens the mapping between the two domains, improves the compatibility of their distributions, and leads to performance boosting of unsupervised techniques by up to 12% across several measurements. The competence of the implicit pairs is further displayed with the use of pseudo-pairs, i.e., paired samples which only approximate a real pair. We demonstrate the effect of the approximated implicit samples on image-to-image translation problems, where such pseudo-pairs may be synthesized in one direction, but not in the other. We further show that pseudo-pairs are significantly more effective as implicit pairs in an unpaired setting, than directly using them explicitly in a paired setting.
- Published
- 2019
34. Vessel segmentation using multiscale vessel enhancement and a region based level set model
- Author
-
Chaolu Feng, Chunhui Lou, Jinzhu Yang, and Jie Fu
- Subjects
Fundus Oculi ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Health Informatics ,Vessel segmentation ,030218 nuclear medicine & medical imaging ,Image (mathematics) ,03 medical and health sciences ,0302 clinical medicine ,Level set ,Image Processing, Computer-Assisted ,Effective method ,Radiology, Nuclear Medicine and imaging ,Segmentation ,Computer vision ,Image domain ,Radiological and Ultrasound Technology ,business.industry ,Retinal Vessels ,Computer Graphics and Computer-Aided Design ,Varying thickness ,Image contrast ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Algorithms ,030217 neurology & neurosurgery - Abstract
Vessel segmentation has always been a considerable challenge task due to the presence of varying thickness of the vessels and weak contrasts of medical image intensities. In this paper, an effective method is proposed, which consists of four steps. Firstly, the input images are converted into gray ones with predetermined weightings aiming at increasing image contrast if they are colorful. Secondly, the image intensities are expanded from regions of interest to the whole image domain with a mirroring operation to avoid introducing undesired boundaries by image filtering operations existing in the next step. Thirdly, an improved multi-scale enhancement method inspired by the Frangi filtering is proposed to enhance image contrast between blood vessels and other objects in the image. Finally, an improved level set model is proposed to segment blood vessels from the enhance images and the original gray images. The proposed method has been evaluated on two retinal vessel image repositories, namely, DRIVE and STARE. Experimental results and comparison with 13 existing methods show that the proposed method produces similar segmentation accuracy at the same level with representative methods in the literature. Its effectiveness on segmentation of other type vessels is also discussed at the end of this paper.
- Published
- 2020
- Full Text
- View/download PDF
35. A Structural Oriented Training Method for GAN Based Fast Compressed Sensing MRI
- Author
-
Haotian An and Yu-Jin Zhang
- Subjects
Image domain ,Computer science ,business.industry ,Deep learning ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,02 engineering and technology ,Training methods ,030218 nuclear medicine & medical imaging ,03 medical and health sciences ,0302 clinical medicine ,Compressed sensing ,Frequency domain ,0202 electrical engineering, electronic engineering, information engineering ,Redundancy (engineering) ,020201 artificial intelligence & image processing ,Computer vision ,Spatial frequency ,Artificial intelligence ,business ,Image restoration - Abstract
Traditional strategies for reconstructing Compressed Sensing Magnetic Resonance Imaging (CS-MRI) may introduce computational redundancy, and deep learning-based methods can significantly reduce reconstruction time and improve restoration quality. However, many recent deep learning-based algorithms lay insufficient attention to spatial frequency information. In this paper, a Structural Oriented Generative Adversarial Network (SOGAN) is proposed aiming at restoring image domain information as well as refining frequency domain during the reconstruction of CS-MRI. Numerical Experiments showed our model’s efficiency and capability for diagnostic purpose.
- Published
- 2019
- Full Text
- View/download PDF
36. Experiences of image-domain least-squares migration for quantitative interpretation
- Author
-
Robin Fletcher, Maud Cavalca, and Robert Bloor
- Subjects
Image domain ,business.industry ,Pattern recognition ,Artificial intelligence ,business ,Least squares ,Geology ,Interpretation (model theory) - Published
- 2019
- Full Text
- View/download PDF
37. Accelerated MRI Reconstruction with Dual-Domain Generative Adversarial Network
- Author
-
Suchandrima Banerjee, Guanhua Wang, Enhao Gong, John M. Pauly, and Greg Zaharchuk
- Subjects
Image domain ,Dual domain ,business.industry ,Computer science ,Deep learning ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Pattern recognition ,Regularization (mathematics) ,symbols.namesake ,Fourier transform ,Discriminative model ,Frequency domain ,symbols ,Artificial intelligence ,business ,Generative adversarial network - Abstract
Fast reconstruction of under-sampled acquisitions has always been a central issue in MRI reconstruction. Recently years has seen multiple studies using deep learning as a de-aliasing framework to restore the aliased image. However, restoration of fine details is still problematic, especially when dealing with noisy image datasets. Sparked by the Fourier transform relationship, this work proposed and tested a new hypothesis: can regularization be directly added in the frequency domain to correct the high-frequency imperfection? To achieve this, discriminative networks are applied in both the image domain and the frequency domain (so-called dual-domain GAN). Evaluation on multiple datasets proved that the dual-domain GAN approach is an effective way to improve the quality of accelerated MR reconstruction.
- Published
- 2019
- Full Text
- View/download PDF
38. Multiple Instance Classification in the Image Domain
- Author
-
Pietro Pascarella, Ilaria Bartolini, Marco Patella, G. Amato, C. Gennaro, V. Oria, M. Radovanovic, Ilaria Bartolini, Pietro Pascarella, and Marco Patella
- Subjects
Image domain ,Contextual image classification ,Image Classification ,Computer science ,business.industry ,InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL ,Supervised learning ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Content-based image retrieval ,ComputingMethodologies_PATTERNRECOGNITION ,Content (measure theory) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Content-based Image Retrieval ,business ,Image retrieval - Abstract
Multiple instance classification (MIC) is a kind of supervised learning, where data are represented as bags and each bag contains many instances. Training bags are given a label and the system tries to learn how to label unknown bags, without necessarily learning how to label individually each of their instances. In particular, we apply concepts drawn from MIC to the realm of content-based image retrieval, where images are described as bags of visual local descriptors. We introduce several classifiers, according to the different MIC paradigms, and evaluate them experimentally on a real-world dataset, comparing their accuracy and efficiency. © 2019, Springer Nature Switzerland AG.
- Published
- 2019
39. Joint Multiresolution and Background Detection Reconstruction for Magnetic Particle Imaging
- Author
-
Alfred Mertins, Corbinian Englisch, Christine Droigk, and Marco Maass
- Subjects
Image domain ,business.industry ,Computer science ,Physics::Medical Physics ,Process (computing) ,Degrees of freedom (mechanics) ,Medical imaging technology ,Magnetic particle imaging ,Computer Science::Computer Vision and Pattern Recognition ,Computer vision ,Segmentation ,Artificial intelligence ,business ,Joint (audio engineering) - Abstract
Magnetic particle imaging is a tracer-based medical imaging technology that is quite promising for the task of imaging vessel structures or blood flows. From this possible application it can be deduced that significant areas of the image domain are related to background, because the tracer material is only inside the vessels and not in the surrounding tissue. From this fact alone it seems promising to detect the background of the image in early stages of the reconstruction process. This paper proposes a multiresolution and segmentation based reconstruction, where the background is detected on a coarse level of the reconstruction with only few degrees of freedom by a Gaussian-mixture model and transferred to finer reconstruction levels.
- Published
- 2019
- Full Text
- View/download PDF
40. Improved X-Ray Bone Segmentation by Normalization and Augmentation Strategies
- Author
-
Ruxandra Lasowski, Benedict Swartman, Florian Kordon, Peter Fischer, Jochen Franke, and Holger Kunze
- Subjects
Image domain ,Surface distance ,Computer science ,business.industry ,Femur bone ,Normalization (image processing) ,Pattern recognition ,Segmentation ,Overlay ,Artificial intelligence ,business ,Bone structure ,Bone segmentation - Abstract
X-ray images can show great variation in contrast and noise levels. In addition, important subject structures might be superimposed with surgical tools and implants. As medical image datasets tend to be of small size, these image characteristics are often under-represented. For the task of automated, learning-based segmentation of bone structures, this may lead to poor generalization towards unseen images and consequently limits practical application. In this work, we employ various data augmentation techniques that address X-ray-specific image characteristics and evaluate them on lateral projections of the femur bone. We combine those with data and feature normalization strategies that could prove beneficial to this domain. We show that instance normalization is a viable alternative to batch normalization and demonstrate that contrast scaling and the overlay of surgical tools and implants in the image domain can boost the representational capacity of available image data. By employing our best strategy, we can improve the average symmetric surface distance measure by 36:22 %.
- Published
- 2019
- Full Text
- View/download PDF
41. Target-Oriented Image Domain Q Tomography Using Ray-Based Method
- Author
-
Z. Yu and Y. Liu
- Subjects
Image domain ,Computer science ,business.industry ,Computer vision ,Artificial intelligence ,Tomography ,business - Published
- 2019
- Full Text
- View/download PDF
42. On Effectiveness of Adversarial Examples and Defenses for Malware Classification
- Author
-
Hassan Takabi and Robert Podschwadt
- Subjects
Image domain ,Artificial neural network ,business.industry ,Computer science ,Feature vector ,02 engineering and technology ,010501 environmental sciences ,Adversarial machine learning ,Machine learning ,computer.software_genre ,01 natural sciences ,Multiple data ,Adversarial system ,ComputingMethodologies_PATTERNRECOGNITION ,0202 electrical engineering, electronic engineering, information engineering ,Malware ,020201 artificial intelligence & image processing ,Artificial intelligence ,Android (operating system) ,business ,computer ,0105 earth and related environmental sciences - Abstract
Artificial neural networks have been successfully used for many different classification tasks including malware detection and distinguishing between malicious and non-malicious programs. Although artificial neural networks perform very well on these tasks, they are also vulnerable to adversarial examples. An adversarial example is a sample that has minor modifications made to it so that the neural network misclassifies it. Many techniques have been proposed, both for crafting adversarial examples and for hardening neural networks against them. Most previous work was done in the image domain. Some of the attacks have been adopted to work in the malware domain which typically deals with binary feature vectors. In order to better understand the space of adversarial examples in malware classification, we study different approaches of crafting adversarial examples and defense techniques in the malware domain and compare their effectiveness on multiple data sets.
- Published
- 2019
- Full Text
- View/download PDF
43. Traveltime tomography in the Image domain based on the Gaussian-beam-propagator
- Author
-
Huazhong Wang, Jin Chen, Ni Yao, Cai Jiexiong, and Wang Shoujin
- Subjects
Image domain ,Physics ,Optics ,business.industry ,Propagator ,Tomography ,business ,Gaussian beam - Published
- 2018
- Full Text
- View/download PDF
44. Reconstruction in Gabor Response Domain for Efficient Finger-knuckle-Print Verification
- Author
-
Hao Gao, Songsong Wu, Dong Yue, Pu Huang, and Guangwei Gao
- Subjects
Image domain ,Feature coding ,Pixel ,Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Binary number ,Pattern recognition ,Gabor filter ,Knuckle ,medicine.anatomical_structure ,medicine ,Image acquisition ,Artificial intelligence ,business ,Coding (social sciences) - Abstract
The finger knuckle print (FKP) based personal identification has achieved much interest. One of the disadvantages of the generally used Gabor filter based Competitive Coding method is that it is sensitive to variations caused in image acquisition stage. To handle this problem, researchers have proposed a reconstruction based scheme in original image pixel domain. However, the reconstruction in original image domain may lose much valuable directional information extracted by Gabor filters with different orientations. Compared with the previous methods, in this paper, we propose to perform reconstruction in Gabor filter response domain. For each orientation, we reconstruct the Gabor response of a query image using the responses in the template database. Thus, the directional information contained in the image is fully utilized. Then, we binarize each reconstructed responses for feature coding and matching with the constructed "mask", which considers not only the reconstructive error but also the Gabor response in a specific location. The mask can characterize the significance of a specific location along a specific orientation. Finally, by using a score level adaptive binary fusion rule, the matching distances before and after reconstruction are adaptively fused. The experimental results conducted on the benchmark PolyU FKP database show the performance advantage of the presented method.
- Published
- 2018
- Full Text
- View/download PDF
45. Material Decomposition of Energy Spectral CT by AUTOMAP
- Author
-
Zhengyang Chen and Liang Li
- Subjects
Image domain ,medicine.medical_specialty ,business.industry ,Computer science ,Pattern recognition ,Base (topology) ,030218 nuclear medicine & medical imaging ,Spectral imaging ,Image (mathematics) ,03 medical and health sciences ,0302 clinical medicine ,Computer Science::Computer Vision and Pattern Recognition ,030220 oncology & carcinogenesis ,Energy spectrum ,medicine ,Point (geometry) ,Artificial intelligence ,Material decomposition ,business ,Energy (signal processing) - Abstract
Energy spectrum CT can perform spectral imaging of objects. In the clinical and industrial fields, spectral images are often converted into a distribution image of the base material, ie, material decomposition. Traditional image domain material decomposition algorithms are often parsed point by point. This time we try to use the end-to-end AUTOMAP for fitting solution. AUTOMAP can perform image domain mapping and achieve certain effects on MRI image reconstruction, but our attempts at spectral CT material decomposition are not satisfactory.
- Published
- 2018
- Full Text
- View/download PDF
46. Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification
- Author
-
Weijian Deng, Qixiang Ye, Yi Yang, Liang Zheng, Guoliang Kang, and Jianbin Jiao
- Subjects
FOS: Computer and information sciences ,Image domain ,Similarity (geometry) ,Self-similarity ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Translation (geometry) ,Image (mathematics) ,Domain (software engineering) ,Discriminative model ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Adaptation (computer science) ,business - Abstract
Person re-identification (re-ID) models trained on one domain often fail to generalize well to another. In our attempt, we present a "learning via translation" framework. In the baseline, we translate the labeled images from source to target domain in an unsupervised manner. We then train re-ID models with the translated images by supervised methods. Yet, being an essential part of this framework, unsupervised image-image translation suffers from the information loss of source-domain labels during translation. Our motivation is two-fold. First, for each image, the discriminative cues contained in its ID label should be maintained after translation. Second, given the fact that two domains have entirely different persons, a translated image should be dissimilar to any of the target IDs. To this end, we propose to preserve two types of unsupervised similarities, 1) self-similarity of an image before and after translation, and 2) domain-dissimilarity of a translated source image and a target image. Both constraints are implemented in the similarity preserving generative adversarial network (SPGAN) which consists of an Siamese network and a CycleGAN. Through domain adaptation experiment, we show that images generated by SPGAN are more suitable for domain adaptation and yield consistent and competitive re-ID accuracy on two large-scale datasets., Comment: Accepted in CVPR 2018
- Published
- 2018
- Full Text
- View/download PDF
47. Going deeper with CNN in malicious crowd event classification
- Author
-
Heesung Kwon, Sungmin Eum, and Hyungtae Lee
- Subjects
Image domain ,Similarity (geometry) ,business.industry ,Event (computing) ,Computer science ,Event recognition ,Machine learning ,computer.software_genre ,Convolutional neural network ,Residual neural network ,Task (project management) ,Contextual information ,Artificial intelligence ,business ,computer - Abstract
Terror attacks are often targeted towards the civilians gathered in one location (e.g., Boston Marathon bombing). Distinguishing such ’malicious’ scenes from the ’normal’ ones, which are semantically different, is a difficult task as both scenes contain large groups of people with high visual similarity. To overcome the difficulty, previous methods exploited various contextual information, such as language-driven keywords or relevant objects. Although useful, they require additional human effort or dataset. In this paper, we show that using more sophisticated and deeper Convolutional Neural Networks (CNNs) can achieve better classification accuracy even without using any additional information outside the image domain. We have conducted a comparative study where we train and compare seven different CNN architectures (AlexNet, VGG-M, VGG16, GoogLeNet, ResNet- 50, ResNet-101, and ResNet-152). Based on the experimental analyses, we found out that deeper networks typically show better accuracy, and that GoogLeNet is the most favorable among the seven architectures for the task of malicious event classification.
- Published
- 2018
- Full Text
- View/download PDF
48. Elimination of Multi-Bounce Effect for Outdoor RCS Measurement via 3D Imaging
- Author
-
Xiaoling Zhang, Jun Shi, and Ling Pu
- Subjects
Image domain ,business.industry ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,ComputerApplications_COMPUTERSINOTHERSYSTEMS ,Image (mathematics) ,law.invention ,Interference (communication) ,Transmission (telecommunications) ,law ,Computer Science::Computer Vision and Pattern Recognition ,Face (geometry) ,Radar imaging ,Computer vision ,Artificial intelligence ,Radar ,business ,Microwave - Abstract
The outdoor RCS measurement technique is crucial for the radar and aeronautical design applications, especially in the face of large objects. Because of the background interference, the multi-bounce effect is unavoidable and will contaminate the measuring result greatly. In this paper, a novel method to eliminate the multi-bounce effect via 3D imaging technique is proposed. Compared with the traditional methods, high-resolution 3D microwave image can separate the desired image from the multi-bounce false images in the image domain by using the difference of the transmission paths. Simulation results show that this method can clearly eliminate the multi-bounce effect of the outdoor RCS measurement.
- Published
- 2018
- Full Text
- View/download PDF
49. Least-squares migration — Data domain versus image domain using point spread functions
- Author
-
Robert Bloor, Richard Coates, Robin Fletcher, and Dave Nichols
- Subjects
Image domain ,010504 meteorology & atmospheric sciences ,business.industry ,Attenuation ,Data domain ,Propagator ,Geology ,Inversion (meteorology) ,010502 geochemistry & geophysics ,01 natural sciences ,Geophysics ,Amplitude ,Wavelet ,Computer vision ,Data pre-processing ,Artificial intelligence ,business ,Algorithm ,0105 earth and related environmental sciences ,Mathematics - Abstract
Conventional amplitude inversion assumes that the migrated image preserves relative-amplitude information. However, illumination effects caused by complex geologic settings, undersampled acquisition geometry, and limited recording aperture pose a challenge to even the most advanced imaging algorithms. In addition, standard depth-migration images can suffer from lack of resolution caused by wavelet stretch, attenuation, and suboptimal deghosting. Least-squares migration (LSM) can mitigate many of these problems and produce better resolved migration images suitable for AVO inversion. However, whether formulated in the data domain or the image domain, LSM is an inversion algorithm and is sensitive to inaccuracies in the source wavelet, velocity model, data preprocessing, and the propagator used. Practical considerations to mitigate these problems under nonideal conditions and cost-reduction strategies differ between the data- and image-domain formulations. The relative merits of each approach are evaluated by using example inversions for complex synthetic models, including free-surface ghost and attenuation effects. When a data-domain implementation of LSM is considered necessary, the image-domain implementation should be considered at the same time, especially when targeting localized reservoir targets under complex overburdens. Application of image-domain least-squares migration on a Gulf of Mexico field data set produces significant improvements in resolution and event continuity in the subsalt target region.
- Published
- 2016
- Full Text
- View/download PDF
50. Visual Attention Retargeting
- Author
-
Ivan V. Bajic and Victor A. Mateescu
- Subjects
Image domain ,Context model ,business.industry ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020207 software engineering ,02 engineering and technology ,Image segmentation ,Computer Science Applications ,Visualization ,Hardware and Architecture ,Signal Processing ,Retargeting ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,Visual attention ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,Graphics ,business ,Software ,Visual saliency - Abstract
This article presents an introduction to visual attention retargeting, its connection to visual saliency, the challenges associated with it, and ideas for how it can be approached. The difficulty of attention retargeting as a saliency inversion problem lies in the lack of one-to-one mapping between saliency and the image domain, in addition to the possible negative impact of saliency alterations on image aesthetics. A few approaches from recent literature to solve this challenging problem are reviewed, and several suggestions for future development are presented.
- Published
- 2016
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.