9 results for "Sohn, Kwanghoon"
Search Results
2. Memory-Guided Image De-Raining Using Time-Lapse Data.
- Author
- Cho, Jaehoon, Kim, Seungryong, and Sohn, Kwanghoon
- Subjects
- BOOSTING algorithms, CONVOLUTIONAL neural networks
- Abstract
This paper addresses the problem of single-image de-raining, that is, the task of recovering a clean, rain-free background scene from a single image degraded by rain artifacts. Although recent advances adopt real-world time-lapse data to overcome the need for paired rain-clean images, they fail to fully exploit that data. The main cause is architectural: lacking memory components, previous networks cannot capture long-term rain streak information in the time-lapse data during training. To address this problem, we propose a novel network architecture that combines the time-lapse data with a memory network that explicitly helps capture long-term rain streak information. Our network comprises encoder-decoder networks and a memory network. The features extracted from the encoder are read and updated in the memory network, which contains several memory items that store rain streak-aware feature representations. Through the read/update operation, the memory network retrieves the memory items relevant to each query, enabling the items to represent the various rain streaks included in the time-lapse data. To boost the discriminative power of memory features, we also present a novel background selective whitening (BSW) loss that captures only rain streak information in the memory network by erasing the background information. Experimental results on standard benchmarks demonstrate the effectiveness and superiority of our approach. [ABSTRACT FROM AUTHOR] (An illustrative sketch of the memory read/update step follows this entry.)
- Published
- 2022
- Full Text
- View/download PDF
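The memory read/update described in the abstract of entry 2 is, at its core, an attention over a small set of memory items. Below is a minimal sketch of that idea, assuming cosine-similarity attention and a simple moving-average update; the shapes, the update rule, and the helpers `memory_read`/`memory_update` are illustrative assumptions, not the authors' exact design.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def memory_read(queries, memory):
    """queries: (N, C) encoder features; memory: (M, C) memory items."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    m = memory / np.linalg.norm(memory, axis=1, keepdims=True)
    attn = softmax(q @ m.T, axis=1)   # (N, M) read weights per query
    return attn @ memory              # retrieved rain-streak-aware features

def memory_update(queries, memory, lr=0.1):
    """Assumed update: move each item toward its attention-weighted queries."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    m = memory / np.linalg.norm(memory, axis=1, keepdims=True)
    attn = softmax(q @ m.T, axis=0)   # normalize over queries for each item
    return memory + lr * (attn.T @ queries - memory)

rng = np.random.default_rng(0)
feats = rng.normal(size=(16, 8))      # toy encoder features
mem = rng.normal(size=(4, 8))         # toy memory items
read = memory_read(feats, mem)        # features passed on to the decoder
mem = memory_update(feats, mem)
```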
3. Learning Deeply Aggregated Alternating Minimization for General Inverse Problems.
- Author
- Jung, Hyungjoo, Kim, Youngjung, Min, Dongbo, Jang, Hyunsung, Ha, Namkoo, and Sohn, Kwanghoon
- Subjects
- CONVOLUTIONAL neural networks, INVERSE problems, COMPUTER vision, IMAGE processing
- Abstract
Regularization-based image restoration is one of the most powerful tools in image processing and computer vision thanks to its flexibility in handling various inverse problems. However, designing an optimal regularization function remains unsolved because natural images and related scene types have complex structure. In this paper, we present a general and principled framework, called deeply aggregated alternating minimization (DeepAM). We design a convolutional neural network (CNN) to implicitly parameterize the regularizer of the alternating minimization (AM) algorithm. Contrary to the conventional AM algorithm based on a point-wise proximal mapping, DeepAM projects the intermediate estimate onto the set of natural images via deep aggregation. Since the CNN is fully integrated into the AM procedure, all parameters can be jointly optimized through end-to-end training. These properties enable DeepAM to converge in a small number of iterations while maintaining algorithmic simplicity. We show that DeepAM outperforms state-of-the-art methods, including nonlocal-based methods, Plug-and-Play regularization, and recent data-driven approaches. The effectiveness of our framework is demonstrated on a variety of image restoration tasks: Gaussian denoising, deraining, deblurring, super-resolution, color-guided depth upsampling, and RGB/NIR restoration. [ABSTRACT FROM AUTHOR] (A toy alternating-minimization loop follows this entry.)
- Published
- 2020
- Full Text
- View/download PDF
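DeepAM's key move, per the abstract of entry 3, is to replace the point-wise proximal step of alternating minimization with a learned projection. A minimal sketch of that alternation follows, assuming a quadratic data term with a closed-form update; a hand-rolled box blur stands in for the learned CNN projection, and `lam`, the iteration count, and the filter are assumptions for illustration.

```python
import numpy as np

def box_blur(x, k=3):
    """Crude separable box filter used as a stand-in for the CNN step."""
    pad = k // 2
    xp = np.pad(x, pad, mode="edge")
    out = sum(xp[i:i + x.shape[0], pad:pad + x.shape[1]] for i in range(k)) / k
    outp = np.pad(out, pad, mode="edge")
    return sum(outp[pad:pad + x.shape[0], j:j + x.shape[1]] for j in range(k)) / k

def deep_am(y, n_iters=5, lam=0.5):
    """Alternate a prior step with a closed-form data-fidelity step."""
    x = y.copy()
    for _ in range(n_iters):
        z = box_blur(x)                    # prior step (a CNN in DeepAM)
        x = (y + lam * z) / (1.0 + lam)    # minimizes ||x - y||^2 + lam||x - z||^2
    return x

rng = np.random.default_rng(0)
clean = np.tile(np.linspace(0, 1, 32), (32, 1))
noisy = clean + 0.1 * rng.normal(size=clean.shape)
restored = deep_am(noisy)
```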
4. Single Image Deraining Using Time-Lapse Data.
- Author
- Cho, Jaehoon, Kim, Seungryong, Min, Dongbo, and Sohn, Kwanghoon
- Subjects
- CONVOLUTIONAL neural networks, SUPERVISED learning, STIMULUS generalization
- Abstract
Leveraging recent advances in deep convolutional neural networks (CNNs), single-image deraining has been studied as a learning task, achieving outstanding performance over traditional hand-designed approaches. Current CNN-based deraining approaches adopt a supervised learning framework that uses massive training data generated with synthetic rain streaks, which limits their generalization to real rainy images. To address this problem, we propose a novel learning framework for single-image deraining that leverages time-lapse sequences instead of synthetic image pairs. The deraining networks are trained on time-lapse sequences in which both the camera and the scene are static except for time-varying rain streaks. Specifically, we formulate a background consistency loss such that the deraining networks consistently generate the same derained image from a time-lapse sequence. We additionally introduce two loss functions: a structure similarity loss that encourages the derained image to be similar to the input rainy image, and a directional gradient loss based on the assumption that the estimated rain streaks are likely to be sparse and have dominant directions. To handle various rain conditions, we leverage a dynamic fusion module that effectively fuses multi-scale features. We also build a novel large-scale time-lapse dataset providing real-world rainy images under various rain conditions. Experiments demonstrate that the proposed method outperforms state-of-the-art techniques on synthetic and real rainy images, both qualitatively and quantitatively. For high-level vision tasks under severe rainy conditions, the proposed method can also serve as a preprocessing step for subsequent tasks. [ABSTRACT FROM AUTHOR] (An illustrative background-consistency loss follows this entry.)
- Published
- 2020
- Full Text
- View/download PDF
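The background consistency loss in entry 4 rests on one observation: every frame of a static time-lapse sequence shares the same background, so a deraining network should map them all to the same output. A toy version of the loss follows, with an identity function standing in for the deraining CNN; `derain_net` is a placeholder and the synthetic "streaks" are assumptions.

```python
import numpy as np

def derain_net(frame):
    return frame  # placeholder for the actual deraining CNN

def background_consistency_loss(frames):
    """Mean L1 distance between each pair of derained time-lapse frames."""
    outs = [derain_net(f) for f in frames]
    loss, n = 0.0, 0
    for i in range(len(outs)):
        for j in range(i + 1, len(outs)):
            loss += np.abs(outs[i] - outs[j]).mean()
            n += 1
    return loss / max(n, 1)

rng = np.random.default_rng(0)
background = rng.random((16, 16))
# Same static scene, different sparse synthetic "streaks" per frame.
frames = [np.clip(background + 0.2 * (rng.random((16, 16)) > 0.95), 0, 1)
          for _ in range(4)]
print(background_consistency_loss(frames))
```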
5. Unsupervised Deep Image Fusion With Structure Tensor Representations.
- Author
- Jung, Hyungjoo, Kim, Youngjung, Jang, Hyunsung, Ha, Namkoo, and Sohn, Kwanghoon
- Subjects
- IMAGE fusion, DEEP learning, COMPUTER vision, IMAGE reconstruction, FEATURE extraction, CONVOLUTIONAL neural networks, SUPERVISED learning
- Abstract
Convolutional neural networks (CNNs) have facilitated substantial progress on various problems in computer vision and image processing. However, applying them to image fusion has remained challenging due to the lack of labelled data for supervised learning. This paper introduces a deep image fusion network (DIF-Net), an unsupervised deep learning framework for image fusion. DIF-Net parameterizes the entire image fusion process, comprising feature extraction, feature fusion, and image reconstruction, with a CNN. The purpose of DIF-Net is to generate an output image whose contrast is identical to that of the high-dimensional input images. To realize this, we propose an unsupervised loss function based on the structure tensor representation of multi-channel image contrast. Unlike traditional fusion methods that involve time-consuming optimization or iterative procedures, our loss function is minimized by a stochastic deep learning solver with large-scale examples. Consequently, the proposed method can produce fused images that preserve source image details through a single forward pass of a network trained without reference ground-truth labels. The proposed method has broad applicability to various image fusion problems, including multi-spectral, multi-focus, and multi-exposure image fusion. Quantitative and qualitative evaluations show that the proposed technique outperforms existing state-of-the-art approaches for various applications. [ABSTRACT FROM AUTHOR] (A structure tensor sketch follows this entry.)
- Published
- 2020
- Full Text
- View/download PDF
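The structure tensor mentioned in entry 5 summarizes multi-channel contrast per pixel as J = sum over channels of (grad I_c)(grad I_c)^T. The sketch below computes it for a toy multi-channel image; DIF-Net's actual loss, which compares the fused image's tensor against this one, is omitted, and the 4-channel input is an assumption for illustration.

```python
import numpy as np

def structure_tensor(image):
    """image: (H, W, C). Returns the 2x2 tensor field, shape (H, W, 2, 2)."""
    H, W, C = image.shape
    J = np.zeros((H, W, 2, 2))
    for c in range(C):
        gy, gx = np.gradient(image[:, :, c])   # per-channel image gradients
        J[..., 0, 0] += gx * gx
        J[..., 0, 1] += gx * gy
        J[..., 1, 0] += gx * gy
        J[..., 1, 1] += gy * gy
    return J

rng = np.random.default_rng(0)
multispectral = rng.random((32, 32, 4))        # e.g., RGB + NIR channels
J = structure_tensor(multispectral)
contrast = np.trace(J, axis1=2, axis2=3)       # scalar contrast per pixel
```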
6. Unsupervised Stereo Matching Using Confidential Correspondence Consistency.
- Author
- Joung, Sunghun, Kim, Seungryong, Park, Kihong, and Sohn, Kwanghoon
- Abstract
Stereo matching aims to perceive the 3D geometric configuration of scenes and facilitates a variety of computer vision applications in advanced driver assistance systems (ADAS). Recently, deep convolutional neural networks (CNNs) have shown dramatic performance improvements for computing the matching cost in stereo matching. However, the performance of CNN-based approaches relies heavily on datasets, requiring a large amount of ground-truth data whose collection demands tremendous effort. To overcome this limitation, we present a novel framework for learning CNNs for matching cost computation in an unsupervised manner. Our method combines image-domain learning with stereo epipolar constraints. By exploiting the correspondence consistency between stereo images, our method selects putative positive samples in each training iteration and uses them to train the networks. We further propose a positive sample propagation scheme to leverage additional training samples. Our unsupervised learning method is evaluated with two kinds of network architectures, a simple and a precise CNN, and shows performance comparable to state-of-the-art methods, including both supervised and unsupervised learning approaches, on the KITTI, Middlebury, HCI, and Yonsei datasets. This extensive evaluation demonstrates that the proposed learning framework can handle various real driving conditions. [ABSTRACT FROM AUTHOR] (An illustrative left-right consistency check follows this entry.)
- Published
- 2020
- Full Text
- View/download PDF
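One standard way to realize the correspondence consistency described in entry 6 is a left-right disparity check: a pixel is a putative positive sample when its left-to-right disparity agrees with the right-to-left disparity at the matched location. A minimal sketch, assuming integer-rounded warping and a 1-pixel threshold (both illustrative choices, with random disparity maps standing in for network output):

```python
import numpy as np

def consistent_mask(disp_left, disp_right, tau=1.0):
    """disp_*: (H, W) disparity maps. True where the LR check passes."""
    H, W = disp_left.shape
    xs = np.arange(W)[None, :].repeat(H, axis=0)
    ys = np.arange(H)[:, None].repeat(W, axis=1)
    # Warp each left pixel into the right image along the epipolar line.
    x_right = np.clip(np.rint(xs - disp_left).astype(int), 0, W - 1)
    d_back = disp_right[ys, x_right]           # disparity at the matched pixel
    return np.abs(disp_left - d_back) < tau

rng = np.random.default_rng(0)
dl = rng.uniform(0, 8, size=(4, 16))
dr = rng.uniform(0, 8, size=(4, 16))
mask = consistent_mask(dl, dr)                 # putative positives for training
print(mask.mean(), "of pixels pass the check")
```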
7. Structure-Texture Image Decomposition Using Deep Variational Priors.
- Author
- Kim, Youngjung, Ham, Bumsub, Do, Minh N., and Sohn, Kwanghoon
- Subjects
- CONVOLUTIONAL neural networks, COMPUTATIONAL photography, MODULAR construction, DEGREES of freedom, IMAGE processing, DEEP learning, HIGH dynamic range imaging
- Abstract
Most variational formulations for structure-texture image decomposition force the structure image to have a small norm in some functional space and to share a common notion of edges, i.e., large gradients or large intensity differences. However, such a definition makes it difficult to distinguish structure edges from oscillations that have fine spatial scale but high contrast. In this paper, we introduce a new model that learns deep variational priors for structure images without explicit training data. An alternating direction method of multipliers (ADMM) algorithm and its modular structure are adopted to plug deep variational priors into an iterative smoothing process. The central observations are that convolutional neural networks (CNNs) can replace the total variation prior and are powerful enough to capture the nature of structure and texture. We show that our priors learned with CNNs successfully differentiate high-amplitude details from structure edges and avoid halo artifacts. Unlike previous data-driven smoothing schemes, our formulation provides an additional degree of freedom to produce continuous smoothing effects. Experimental results demonstrate the effectiveness of our approach on various computational photography and image processing applications, including texture removal, detail manipulation, HDR tone-mapping, and non-photorealistic abstraction. [ABSTRACT FROM AUTHOR] (A toy ADMM decomposition loop follows this entry.)
- Published
- 2019
- Full Text
- View/download PDF
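The ADMM structure entry 7 refers to splits the objective so the prior only ever appears as a denoising-style subproblem, which is exactly where the learned deep prior is plugged in. Below is a toy version of that loop, assuming a quadratic data term; a local-mean filter (`smooth_prior`) stands in for the learned CNN prior, and `rho` and the iteration count are illustrative.

```python
import numpy as np

def smooth_prior(v, k=3):
    """Stand-in for the CNN prior: local mean over a k x k window."""
    pad = k // 2
    vp = np.pad(v, pad, mode="edge")
    acc = np.zeros_like(v)
    for dy in range(k):
        for dx in range(k):
            acc += vp[dy:dy + v.shape[0], dx:dx + v.shape[1]]
    return acc / (k * k)

def admm_decompose(y, n_iters=10, rho=1.0):
    """Split y into structure x (returned) and texture y - x."""
    x, z, u = y.copy(), y.copy(), np.zeros_like(y)
    for _ in range(n_iters):
        x = (y + rho * (z - u)) / (1.0 + rho)  # data-fidelity update
        z = smooth_prior(x + u)                # prior update (CNN in the paper)
        u = u + x - z                          # dual update
    return x

rng = np.random.default_rng(0)
img = np.tile(np.linspace(0, 1, 32), (32, 1)) + 0.05 * rng.random((32, 32))
structure = admm_decompose(img)
texture = img - structure
```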
8. FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence.
- Author
- Kim, Seungryong, Min, Dongbo, Ham, Bumsub, Lin, Stephen, and Sohn, Kwanghoon
- Subjects
- SELF-similar processes, SEMANTICS, COMPUTER vision, IMAGE registration, IMAGE processing
- Abstract
We present a descriptor, called fully convolutional self-similarity (FCSS), for dense semantic correspondence. Unlike traditional dense correspondence approaches for estimating depth or optical flow, semantic correspondence estimation poses additional challenges due to intra-class appearance and shape variations among different instances within the same object or scene category. To robustly match points across semantically similar images, we formulate FCSS using local self-similarity (LSS), which is inherently insensitive to intra-class appearance variations. LSS is incorporated through a proposed convolutional self-similarity (CSS) layer, in which the sampling patterns and the self-similarity measure are jointly learned in an end-to-end and multi-scale manner. Furthermore, to address shape variations among different object instances, we propose a convolutional affine transformer (CAT) layer that estimates explicit affine transformation fields at each pixel to transform the sampling patterns and corresponding receptive fields. As training data for semantic correspondence is rather limited, we propose to leverage object candidate priors provided in most existing datasets, together with correspondence consistency between object pairs, to enable weakly-supervised learning. Experiments demonstrate that FCSS significantly outperforms conventional handcrafted descriptors and CNN-based descriptors on various benchmarks. [ABSTRACT FROM AUTHOR] (An illustrative local self-similarity computation follows this entry.)
- Published
- 2019
- Full Text
- View/download PDF
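FCSS (entry 8) builds on local self-similarity: describe a pixel not by its appearance but by how similar its surrounding patches are to a small center patch. The sketch below hand-codes that idea with fixed offsets and an SSD-based similarity; in FCSS both the sampling pattern and the measure are learned inside the CNN, so everything here (patch size, offsets, the exponential) is an illustrative assumption.

```python
import numpy as np

def lss_descriptor(image, y, x, patch=3,
                   offsets=((-4, 0), (4, 0), (0, -4), (0, 4))):
    """Return one self-similarity value per sampling offset at pixel (y, x)."""
    r = patch // 2
    center = image[y - r:y + r + 1, x - r:x + r + 1]
    desc = []
    for dy, dx in offsets:
        neighbor = image[y + dy - r:y + dy + r + 1,
                         x + dx - r:x + dx + r + 1]
        ssd = np.sum((center - neighbor) ** 2)
        desc.append(np.exp(-ssd))               # high when patches look alike
    return np.array(desc)

rng = np.random.default_rng(0)
img = rng.random((32, 32))
print(lss_descriptor(img, 16, 16))              # 4-D descriptor at the center
```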
9. OCEAN: Object-centric arranging network for self-supervised visual representations learning.
- Author
- Oh, Changjae, Ham, Bumsub, Kim, Hansung, Hilton, Adrian, and Sohn, Kwanghoon
- Subjects
- CONVOLUTIONAL neural networks, VISUAL learning, SUPERVISED learning, VIRTUAL networks
- Abstract
Highlights • A self-supervised learning which does not require human annotations for training CNN. • Learning the correct arrangement of object proposals to represent an image by CNN. • Demonstrating the advantage of our model by applying it to PASCAL VOC datasets. • Application to other vision tasks including image retrieval and semantic matching. Abstract Learning visual representations plays an important role in computer vision and machine learning applications. It facilitates a model to understand and perform high-level tasks intelligently. A common approach for learning visual representations is supervised one which requires a huge amount of human annotations to train the model. This paper presents a self-supervised approach which learns visual representations from input images without human annotations. We learn the correct arrangement of object proposals to represent an image using a convolutional neural network (CNN) without any manual annotations. We hypothesize that the network trained for solving this problem requires the embedding of semantic visual representations. Unlike existing approaches that use uniformly sampled patches, we relate object proposals that contain prominent objects and object parts. More specifically, we discover the representation that considers overlap, inclusion, and exclusion relationship of proposals as well as their relative position. This allows focusing on potential objects and parts rather than on clutter. We demonstrate that our model outperforms existing self-supervised learning methods and can be used as a generic feature extractor by applying it to object detection, classification, action recognition, image retrieval, and semantic matching tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
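The pretext task in entry 9 amounts to a permutation puzzle over object proposals: shuffle the proposals, and train the network to recover their correct arrangement. The sketch below constructs only the training pairs for that task, with fixed image crops standing in for real object proposals; the whole setup is an illustrative assumption, not the paper's exact proposal-relation scheme (which also models overlap and inclusion).

```python
import numpy as np

rng = np.random.default_rng(0)
image = rng.random((64, 64, 3))

# Fake "object proposals": four fixed crops standing in for detector output.
boxes = [(0, 0), (0, 32), (32, 0), (32, 32)]
proposals = [image[y:y + 32, x:x + 32] for y, x in boxes]

# Pretext task: shuffle the proposals; the permutation is the training label.
perm = rng.permutation(len(proposals))
shuffled = [proposals[i] for i in perm]
label = np.argsort(perm)   # inverse permutation the CNN must predict

# A network would take `shuffled` as input and be trained to output `label`.
print("permutation:", perm, "target arrangement:", label)
```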