Author: "Kim IG" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Kim IG"' showing total 646 results

Start Over Author "Kim IG"

646 results on '"Kim IG"'

1. MAIR++: Improving Multi-view Attention Inverse Rendering with Implicit Lighting Representation

Author: Choi, JunYong, Lee, SeokYeong, Park, Haesol, Jung, Seung-Won, Kim, Ig-Jae, and Cho, Junghyun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we propose a scene-level inverse rendering framework that uses multi-view images to decompose the scene into geometry, SVBRDF, and 3D spatially-varying lighting. While multi-view images have been widely used for object-level inverse rendering, scene-level inverse rendering has primarily been studied using single-view images due to the lack of a dataset containing high dynamic range multi-view images with ground-truth geometry, material, and spatially-varying lighting. To improve the quality of scene-level inverse rendering, a novel framework called Multi-view Attention Inverse Rendering (MAIR) was recently introduced. MAIR performs scene-level multi-view inverse rendering by expanding the OpenRooms dataset, designing efficient pipelines to handle multi-view images, and splitting spatially-varying lighting. Although MAIR showed impressive results, its lighting representation is fixed to spherical Gaussians, which limits its ability to render images realistically. Consequently, MAIR cannot be directly used in applications such as material editing. Moreover, its multi-view aggregation networks have difficulties extracting rich features because they only focus on the mean and variance between multi-view features. In this paper, we propose its extended version, called MAIR++. MAIR++ addresses the aforementioned limitations by introducing an implicit lighting representation that accurately captures the lighting conditions of an image while facilitating realistic rendering. Furthermore, we design a directional attention-based multi-view aggregation network to infer more intricate relationships between views. Experimental results show that MAIR++ not only achieves better performance than MAIR and single-view-based methods, but also displays robust performance on unseen real-world scenes.
Published: 2024

2. VIGFace: Virtual Identity Generation Model for Face Image Synthesis

Author: Kim, Minsoo, Sagong, Min-Cheol, Nam, Gi Pyo, Cho, Junghyun, and Kim, Ig-Jae
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep learning-based face recognition continues to face challenges due to its reliance on huge datasets obtained from web crawling, which can be costly to gather and raise significant real-world privacy concerns. To address this issue, we propose VIGFace, a novel framework capable of generating synthetic facial images. Initially, we train the face recognition model using a real face dataset and create a feature space for both real and virtual IDs where virtual prototypes are orthogonal to other prototypes. Subsequently, we generate synthetic images by using the diffusion model based on the feature space. Our proposed framework provides two significant benefits. Firstly, it allows for creating virtual facial images without concerns about portrait rights, guaranteeing that the generated virtual face images are clearly differentiated from existing individuals. Secondly, it serves as an effective augmentation method by incorporating real existing images. Further experiments demonstrate the efficacy of our framework, achieving state-of-the-art results from both perspectives without any external data.
Published: 2024

3. IG-FIQA: Improving Face Image Quality Assessment through Intra-class Variance Guidance robust to Inaccurate Pseudo-Labels

Author: Kim, Minsoo, Nam, Gi Pyo, Kim, Haksub, Park, Haesol, and Kim, Ig-Jae
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In the realm of face image quality assesment (FIQA), method based on sample relative classification have shown impressive performance. However, the quality scores used as pseudo-labels assigned from images of classes with low intra-class variance could be unrelated to the actual quality in this method. To address this issue, we present IG-FIQA, a novel approach to guide FIQA training, introducing a weight parameter to alleviate the adverse impact of these classes. This method involves estimating sample intra-class variance at each iteration during training, ensuring minimal computational overhead and straightforward implementation. Furthermore, this paper proposes an on-the-fly data augmentation methodology for improved generalization performance in FIQA. On various benchmark datasets, our proposed method, IG-FIQA, achieved novel state-of-the-art (SOTA) performance.
Published: 2024

4. MAIR: Multi-view Attention Inverse Rendering with 3D Spatially-Varying Lighting Estimation

Author: Choi, JunYong, Lee, SeokYeong, Park, Haesol, Jung, Seung-Won, Kim, Ig-Jae, and Cho, Junghyun
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: We propose a scene-level inverse rendering framework that uses multi-view images to decompose the scene into geometry, a SVBRDF, and 3D spatially-varying lighting. Because multi-view images provide a variety of information about the scene, multi-view images in object-level inverse rendering have been taken for granted. However, owing to the absence of multi-view HDR synthetic dataset, scene-level inverse rendering has mainly been studied using single-view image. We were able to successfully perform scene-level inverse rendering using multi-view images by expanding OpenRooms dataset and designing efficient pipelines to handle multi-view images, and splitting spatially-varying lighting. Our experiments show that the proposed method not only achieves better performance than single-view-based methods, but also achieves robust performance on unseen real-world scene. Also, our sophisticated 3D spatially-varying lighting volume allows for photorealistic object insertion in any 3D location., Comment: Accepted by CVPR 2023; Project Page is https://bring728.github.io/mair.project/
Published: 2023

5. Few-shot Neural Radiance Fields Under Unconstrained Illumination

Author: Lee, SeokYeong, Choi, JunYong, Kim, Seungryong, Kim, Ig-Jae, and Cho, Junghyun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we introduce a new challenge for synthesizing novel view images in practical environments with limited input multi-view images and varying lighting conditions. Neural radiance fields (NeRF), one of the pioneering works for this task, demand an extensive set of multi-view images taken under constrained illumination, which is often unattainable in real-world settings. While some previous works have managed to synthesize novel views given images with different illumination, their performance still relies on a substantial number of input multi-view images. To address this problem, we suggest ExtremeNeRF, which utilizes multi-view albedo consistency, supported by geometric alignment. Specifically, we extract intrinsic image components that should be illumination-invariant across different views, enabling direct appearance comparison between the input and novel view under unconstrained illumination. We offer thorough experimental results for task evaluation, employing the newly created NeRF Extreme benchmark-the first in-the-wild benchmark for novel view synthesis under multiple viewing directions and varying illuminations., Comment: Project Page: https://seokyeong94.github.io/ExtremeNeRF/
Published: 2023

6. Dual Prototype Attention for Unsupervised Video Object Segmentation

Author: Cho, Suhwan, Lee, Minhyeok, Lee, Seunghoon, Lee, Dogyoon, Choi, Heeseung, Kim, Ig-Jae, and Lee, Sangyoun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Unsupervised video object segmentation (VOS) aims to detect and segment the most salient object in videos. The primary techniques used in unsupervised VOS are 1) the collaboration of appearance and motion information; and 2) temporal fusion between different frames. This paper proposes two novel prototype-based attention mechanisms, inter-modality attention (IMA) and inter-frame attention (IFA), to incorporate these techniques via dense propagation across different modalities and frames. IMA densely integrates context information from different modalities based on a mutual refinement. IFA injects global context of a video to the query frame, enabling a full utilization of useful properties from multiple frames. Experimental results on public benchmark datasets demonstrate that our proposed approach outperforms all existing methods by a substantial margin. The proposed two components are also thoroughly validated via ablative study., Comment: CVPR 2024
Published: 2022

7. DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network

Author: Thakare, Kamalakar, Raghuwanshi, Yash, Dogra, Debi Prosad, Choi, Heeseung, and Kim, Ig-Jae
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Unsupervised approaches for video anomaly detection may not perform as good as supervised approaches. However, learning unknown types of anomalies using an unsupervised approach is more practical than a supervised approach as annotation is an extra burden. In this paper, we use isolation tree-based unsupervised clustering to partition the deep feature space of the video segments. The RGB- stream generates a pseudo anomaly score and the flow stream generates a pseudo dynamicity score of a video segment. These scores are then fused using a majority voting scheme to generate preliminary bags of positive and negative segments. However, these bags may not be accurate as the scores are generated only using the current segment which does not represent the global behavior of a typical anomalous event. We then use a refinement strategy based on a cross-branch feed-forward network designed using a popular I3D network to refine both scores. The bags are then refined through a segment re-mapping strategy. The intuition of adding the dynamicity score of a segment with the anomaly score is to enhance the quality of the evidence. The method has been evaluated on three popular video anomaly datasets, i.e., UCF-Crime, CCTV-Fights, and UBI-Fights. Experimental results reveal that the proposed framework achieves competitive accuracy as compared to the state-of-the-art video anomaly detection methods., Comment: 10 pages, 8 figures, and 4 tables. (ACCEPTED AT WACV 2023)
Published: 2022

8. Probabilistic Representations for Video Contrastive Learning

Author: Park, Jungin, Lee, Jiyoung, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper presents Probabilistic Video Contrastive Learning, a self-supervised representation learning method that bridges contrastive learning with probabilistic representation. We hypothesize that the clips composing the video have different distributions in short-term duration, but can represent the complicated and sophisticated video distribution through combination in a common embedding space. Thus, the proposed method represents video clips as normal distributions and combines them into a Mixture of Gaussians to model the whole video distribution. By sampling embeddings from the whole video distribution, we can circumvent the careful sampling strategy or transformations to generate augmented views of the clips, unlike previous deterministic methods that have mainly focused on such sample generation strategies for contrastive learning. We further propose a stochastic contrastive loss to learn proper video distributions and handle the inherent uncertainty from the nature of the raw video. Experimental results verify that our probabilistic embedding stands as a state-of-the-art video representation learning for action recognition and video retrieval on the most popular benchmarks, including UCF101 and HMDB51., Comment: CVPR 2022
Published: 2022

9. Learning Canonical 3D Object Representation for Fine-Grained Recognition

Author: Joung, Sunghun, Kim, Seungryong, Kim, Minsu, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a novel framework for fine-grained object recognition that learns to recover object variation in 3D space from a single image, trained on an image collection without using any ground-truth 3D annotation. We accomplish this by representing an object as a composition of 3D shape and its appearance, while eliminating the effect of camera viewpoint, in a canonical configuration. Unlike conventional methods modeling spatial variation in 2D images only, our method is capable of reconfiguring the appearance feature in a canonical 3D space, thus enabling the subsequent object classifier to be invariant under 3D geometric variation. Our representation also allows us to go beyond existing methods, by incorporating 3D shape variation as an additional cue for object recognition. To learn the model without ground-truth 3D annotation, we deploy a differentiable renderer in an analysis-by-synthesis framework. By incorporating 3D shape and appearance jointly in a deep representation, our method learns the discriminative representation of the object and achieves competitive performance on fine-grained image recognition and vehicle re-identification. We also demonstrate that the performance of 3D shape reconstruction is improved by learning fine-grained shape deformation in a boosting manner., Comment: ICCV 2021
Published: 2021

10. K-FACE: A Large-Scale KIST Face Database in Consideration with Unconstrained Environments

Author: Choi, Yeji, Park, Hyunjung, Nam, Gi Pyo, Kim, Haksub, Choi, Heeseung, Cho, Junghyun, and Kim, Ig-Jae
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Databases
Abstract: In this paper, we introduce a new large-scale face database from KIST, denoted as K-FACE, and describe a novel capturing device specifically designed to obtain the data. The K-FACE database contains more than 1 million high-quality images of 1,000 subjects selected by considering the ratio of gender and age groups. It includes a variety of attributes, including 27 poses, 35 lighting conditions, three expressions, and occlusions by the combination of five types of accessories. As the K-FACE database is systematically constructed through a hemispherical capturing system with elaborate lighting control and multiple cameras, it is possible to accurately analyze the effects of factors that cause performance degradation, such as poses, lighting changes, and accessories. We consider not only the balance of external environmental factors, such as pose and lighting, but also the balance of personal characteristics such as gender and age group. The gender ratio is the same, while the age groups of subjects are uniformly distributed from the 20s to 50s for both genders. The K-FACE database can be extensively utilized in various vision tasks, such as face recognition, face frontalization, illumination normalization, face age estimation, and three-dimensional face model generation. We expect systematic diversity and uniformity of the K-FACE database to promote these research fields., Comment: 8 pages, 8 figures
Published: 2021

11. A 3D model-based approach for fitting masks to faces in the wild

Author: Hong, Je Hyeong, Kim, Hanjo, Kim, Minsoo, Nam, Gi Pyo, Cho, Junghyun, Ko, Hyeong-Seok, and Kim, Ig-Jae
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Face recognition now requires a large number of labelled masked face images in the era of this unprecedented COVID-19 pandemic. Unfortunately, the rapid spread of the virus has left us little time to prepare for such dataset in the wild. To circumvent this issue, we present a 3D model-based approach called WearMask3D for augmenting face images of various poses to the masked face counterparts. Our method proceeds by first fitting a 3D morphable model on the input image, second overlaying the mask surface onto the face model and warping the respective mask texture, and last projecting the 3D mask back to 2D. The mask texture is adapted based on the brightness and resolution of the input image. By working in 3D, our method can produce more natural masked faces of diverse poses from a single mask texture. To compare precisely between different augmentation approaches, we have constructed a dataset comprising masked and unmasked faces with labels called MFW-mini. Experimental results demonstrate WearMask3D produces more realistic masked faces, and utilizing these images for training leads to state-of-the-art recognition accuracy for masked faces., Comment: 5 pages, 5 figures and 2 tables; accepted to ICIP 2021
Published: 2021

12. Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

Author: Kim, Minsu, Joung, Sunghun, Kim, Seungryong, Park, JungIn, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing techniques to adapt semantic segmentation networks across the source and target domains within deep convolutional neural networks (CNNs) deal with all the samples from the two domains in a global or category-aware manner. They do not consider an inter-class variation within the target domain itself or estimated category, providing the limitation to encode the domains having a multi-modal data distribution. To overcome this limitation, we introduce a learnable clustering module, and a novel domain adaptation framework called cross-domain grouping and alignment. To cluster the samples across domains with an aim to maximize the domain alignment without forgetting precise segmentation ability on the source domain, we present two loss functions, in particular, for encouraging semantic consistency and orthogonality among the clusters. We also present a loss so as to solve a class imbalance problem, which is the other limitation of the previous methods. Our experiments show that our method consistently boosts the adaptation performance in semantic segmentation, outperforming the state-of-the-arts on various domain adaptation settings., Comment: AAAI 2021
Published: 2020

13. ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications

Author: Hwang, Hochul, Jang, Cheongjae, Park, Geonwoo, Cho, Junghyun, and Kim, Ig-Jae
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: To train deep learning models for vision-based action recognition of elders' daily activities, we need large-scale activity datasets acquired under various daily living environments and conditions. However, most public datasets used in human action recognition either differ from or have limited coverage of elders' activities in many aspects, making it challenging to recognize elders' daily activities well by only utilizing existing datasets. Recently, such limitations of available datasets have actively been compensated by generating synthetic data from realistic simulation environments and using those data to train deep learning models. In this paper, based on these ideas we develop ElderSim, an action simulation platform that can generate synthetic data on elders' daily activities. For 55 kinds of frequent daily activities of the elders, ElderSim generates realistic motions of synthetic characters with various adjustable data-generating options, and provides different output modalities including RGB videos, two- and three-dimensional skeleton trajectories. We then generate KIST SynADL, a large-scale synthetic dataset of elders' activities of daily living, from ElderSim and use the data in addition to real datasets to train three state-of the-art human action recognition models. From the experiments following several newly proposed scenarios that assume different real and synthetic dataset configurations for training, we observe a noticeable performance improvement by augmenting our synthetic data. We also offer guidance with insights for the effective utilization of synthetic data to help recognize elders' daily activities., Comment: 18 pages, 9 figures
Published: 2020

14. SumGraph: Video Summarization via Recursive Graph Modeling

Author: Park, Jungin, Lee, Jiyoung, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The goal of video summarization is to select keyframes that are visually diverse and can represent a whole story of an input video. State-of-the-art approaches for video summarization have mostly regarded the task as a frame-wise keyframe selection problem by aggregating all frames with equal weight. However, to find informative parts of the video, it is necessary to consider how all the frames of the video are related to each other. To this end, we cast video summarization as a graph modeling problem. We propose recursive graph modeling networks for video summarization, termed SumGraph, to represent a relation graph, where frames are regarded as nodes and nodes are connected by semantic relationships among frames. Our networks accomplish this through a recursive approach to refine an initially estimated graph to correctly classify each node as a keyframe by reasoning the graph representation via graph convolutional networks. To leverage SumGraph in a more practical environment, we also present a way to adapt our graph modeling in an unsupervised fashion. With SumGraph, we achieved state-of-the-art performance on several benchmarks for video summarization in both supervised and unsupervised manners., Comment: ECCV 2020
Published: 2020

15. Cylindrical Convolutional Networks for Joint Object Detection and Viewpoint Estimation

Author: Joung, Sunghun, Kim, Seungryong, Kim, Hanjae, Kim, Minsu, Kim, Ig-Jae, Cho, Junghyun, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing techniques to encode spatial invariance within deep convolutional neural networks only model 2D transformation fields. This does not account for the fact that objects in a 2D space are a projection of 3D ones, and thus they have limited ability to severe object viewpoint changes. To overcome this limitation, we introduce a learnable module, cylindrical convolutional networks (CCNs), that exploit cylindrical representation of a convolutional kernel defined in the 3D space. CCNs extract a view-specific feature through a view-specific convolutional kernel to predict object category scores at each viewpoint. With the view-specific feature, we simultaneously determine objective category and viewpoints using the proposed sinusoidal soft-argmax module. Our experiments demonstrate the effectiveness of the cylindrical convolutional networks on joint object detection and viewpoint estimation., Comment: CVPR 2020
Published: 2020

16. Relational Deep Feature Learning for Heterogeneous Face Recognition

Author: Cho, MyeongAh, Kim, Taeoh, Kim, Ig-Jae, Lee, Kyungjae, and Lee, Sangyoun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Heterogeneous Face Recognition (HFR) is a task that matches faces across two different domains such as visible light (VIS), near-infrared (NIR), or the sketch domain. Due to the lack of databases, HFR methods usually exploit the pre-trained features on a large-scale visual database that contain general facial information. However, these pre-trained features cause performance degradation due to the texture discrepancy with the visual domain. With this motivation, we propose a graph-structured module called Relational Graph Module (RGM) that extracts global relational information in addition to general facial features. Because each identity's relational information between intra-facial parts is similar in any modality, the modeling relationship between features can help cross-domain matching. Through the RGM, relation propagation diminishes texture dependency without losing its advantages from the pre-trained features. Furthermore, the RGM captures global facial geometrics from locally correlated convolutional features to identify long-range relationships. In addition, we propose a Node Attention Unit (NAU) that performs node-wise recalibration to concentrate on the more informative nodes arising from relation-based propagation. Furthermore, we suggest a novel conditional-margin loss function (C-softmax) for the efficient projection learning of the embedding vector in HFR. The proposed method outperforms other state-of-the-art methods on five HFR databases. Furthermore, we demonstrate performance improvement on three backbones because our module can be plugged into any pre-trained face recognition backbone to overcome the limitations of a small HFR database.
Published: 2020
Full Text: View/download PDF

17. RareAnom: A Benchmark Video Dataset for Rare Type Anomalies

Author: Thakare, Kamalakar Vijay, Dogra, Debi Prosad, Choi, Heeseung, Kim, Haksub, and Kim, Ig-Jae
Published: 2023
Full Text: View/download PDF

18. The Unconstrained Ear Recognition Challenge 2019 - ArXiv Version With Appendix

Author: Emeršič, Žiga, V., Aruna Kumar S., Harish, B. S., Gutfeter, Weronika, Khiarak, Jalil Nourmohammadi, Pacut, Andrzej, Hansley, Earnest, Segundo, Mauricio Pamplona, Sarkar, Sudeep, Park, Hyeonjung, Nam, Gi Pyo, Kim, Ig-Jae, Sangodkar, Sagar G., Kaçar, Ümit, Kirci, Murvet, Yuan, Li, Yuan, Jishou, Zhao, Haonan, Lu, Fei, Mao, Junying, Zhang, Xiaoshuang, Yaman, Dogucan, Eyiokur, Fevziye Irem, Özler, Kadir Bulut, Ekenel, Hazım Kemal, Chowdhury, Debbrota Paul, Bakshi, Sambit, Sa, Pankaj K., Majhi, Banshidhar, Peer, Peter, and Štruc, Vitomir
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper presents a summary of the 2019 Unconstrained Ear Recognition Challenge (UERC), the second in a series of group benchmarking efforts centered around the problem of person recognition from ear images captured in uncontrolled settings. The goal of the challenge is to assess the performance of existing ear recognition techniques on a challenging large-scale ear dataset and to analyze performance of the technology from various viewpoints, such as generalization abilities to unseen data characteristics, sensitivity to rotations, occlusions and image resolution and performance bias on sub-groups of subjects, selected based on demographic criteria, i.e. gender and ethnicity. Research groups from 12 institutions entered the competition and submitted a total of 13 recognition approaches ranging from descriptor-based methods to deep-learning models. The majority of submissions focused on ensemble based methods combining either representations from multiple deep models or hand-crafted with learned image descriptors. Our analysis shows that methods incorporating deep learning models clearly outperform techniques relying solely on hand-crafted descriptors, even though both groups of techniques exhibit similar behaviour when it comes to robustness to various covariates, such presence of occlusions, changes in (head) pose, or variability in image resolution. The results of the challenge also show that there has been considerable progress since the first UERC in 2017, but that there is still ample room for further research in this area., Comment: The content of this paper was published in ICB, 2019. This ArXiv version is from before the peer review
Published: 2019

19. Person Re-identification in Videos by Analyzing Spatio-Temporal Tubes

Author: Ahmed, Sk. Arif, Dogra, Debi Prosad, Choi, Heeseung, Chae, Seungho, and Kim, Ig-Jae
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Typical person re-identification frameworks search for k best matches in a gallery of images that are often collected in varying conditions. The gallery may contain image sequences when re-identification is done on videos. However, such a process is time consuming as re-identification has to be carried out multiple times. In this paper, we extract spatio-temporal sequences of frames (referred to as tubes) of moving persons and apply a multi-stage processing to match a given query tube with a gallery of stored tubes recorded through other cameras. Initially, we apply a binary classifier to remove noisy images from the input query tube. In the next step, we use a key-pose detection-based query minimization. This reduces the length of the query tube by removing redundant frames. Finally, a 3-stage hierarchical re-identification framework is used to rank the output tubes as per the matching scores. Experiments with publicly available video re-identification datasets reveal that our framework is better than state-of-the-art methods. It ranks the tubes with an increased CMC accuracy of 6-8% across multiple datasets. Also, our method significantly reduces the number of false positives. A new video re-identification dataset, named Tube-based Reidentification Video Dataset (TRiViD), has been prepared with an aim to help the re-identification research community
Published: 2019

20. Distance-GCN for Action Recognition

Author: Lee, Haetsal, Cho, Junghyun, Kim, Ig-jae, Park, Unsang, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Wallraven, Christian, editor, Liu, Qingshan, editor, and Nagahara, Hajime, editor
Published: 2022
Full Text: View/download PDF

21. Person re-identification in indoor videos by information fusion using Graph Convolutional Networks

Author: Soni, Komal, Dogra, Debi Prosad, Sekh, Arif Ahmed, Kar, Samarjit, Choi, Heeseung, and Kim, Ig-Jae
Published: 2022
Full Text: View/download PDF

22. A multi-stream deep neural network with late fuzzy fusion for real-world anomaly detection

Author: Thakare, Kamalakar Vijay, Sharma, Nitin, Dogra, Debi Prosad, Choi, Heeseung, and Kim, Ig-Jae
Published: 2022
Full Text: View/download PDF

23. Classification of early age facial growth pattern and identification of the genetic basis in two Korean populations

Author: Cha, Mi-Yeon, Hong, Yu-Jin, Choi, Ja-Eun, Kwon, Tae-Song, Kim, Ig-Jae, and Hong, Kyung-Won
Published: 2022
Full Text: View/download PDF

24. Memetic algorithm for multivariate time-series segmentation

Author: Lim, Hyunki, Choi, Heeseung, Choi, Yeji, and Kim, Ig-Jae
Published: 2020
Full Text: View/download PDF

25. Let’s Observe Them Over Time: An Improved Pedestrian Attribute Recognition Approach

Author: Thakare, Kamalakar Vijay, primary, Dogra, Debi Prosad, additional, Choi, Heeseung, additional, Kim, Haksub, additional, and Kim, Ig-Jae, additional
Published: 2024
Full Text: View/download PDF

26. Integrating Pretrained Encoders for Generalized Face Frontalization

Author: Choi, WonYoung, primary, Nam, Gi Pyo, additional, Cho, Junghyun, additional, Kim, Ig-Jae, additional, and Ko, Hyeong-Seok, additional
Published: 2024
Full Text: View/download PDF

27. Person Re-identification in Videos by Analyzing Spatio-temporal Tubes

Author: Sekh, Arif Ahmed, Dogra, Debi Prosad, Choi, Heeseung, Chae, Seungho, and Kim, Ig-Jae
Published: 2020
Full Text: View/download PDF

28. Cancelable fingerprint template design with randomized non-negative least squares

Author: Kho, Jun Beom, Kim, Jaihie, Kim, Ig-Jae, and Teoh, Andrew B.J.
Published: 2019
Full Text: View/download PDF

29. Face Photo-Sketch Synthesis Via Domain-Invariant Feature Embedding

Author: Choi, Yeji, primary, Sohn, Kwanghoon, additional, and Kim, Ig-Jae, additional
Published: 2023
Full Text: View/download PDF

30. SumGraph: Video Summarization via Recursive Graph Modeling

Author: Park, Jungin, primary, Lee, Jiyoung, additional, Kim, Ig-Jae, additional, and Sohn, Kwanghoon, additional
Published: 2020
Full Text: View/download PDF

31. Face alignment using a deep neural network with local feature learning and recurrent regression

Author: Park, Byung-Hwa, Oh, Se-Young, and Kim, Ig-Jae
Published: 2017
Full Text: View/download PDF

32. MAIR: Multi-View Attention Inverse Rendering with 3D Spatially-Varying Lighting Estimation

Author: Choi, JunYong, primary, Lee, SeokYeong, additional, Park, Haesol, additional, Jung, Seung-Won, additional, Kim, Ig-Jae, additional, and Cho, Junghyun, additional
Published: 2023
Full Text: View/download PDF

33. Parameterized Facial Modelling and Animation

Author: Cho, Junghyun, Choi, Heeseung, Ahn, Sang Chul, Kim, Ig-Jae, Tan, Desney, Editor-in-chief, Vanderdonckt, Jean, Editor-in-chief, Magnenat-Thalmann, Nadia, editor, Yuan, Junsong, editor, Thalmann, Daniel, editor, and You, Bum-Jae, editor
Published: 2016
Full Text: View/download PDF

34. 3-Dimensional Face from a Single Face Image with Various Expressions

Author: Hong, Yu-Jin, Nam, Gi Pyo, Choi, Heeseung, Cho, Junghyun, Kim, Ig-Jae, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Streitz, Norbert, editor, and Markopoulos, Panos, editor
Published: 2016
Full Text: View/download PDF

35. Age face simulation using aging functions on global and local features with residual images

Author: Choi, Sung Eun, Jo, Jaeik, Lee, Sanghak, Choi, Heeseung, Kim, Ig-Jae, and Kim, Jaihie
Published: 2017
Full Text: View/download PDF

36. Automatic Facial Recognition System Assisted-facial Asymmetry Scale Using Facial Landmarks

Author: Lee, Se A., Kim, Jin, Lee, Jeon Mi, Hong, Yu-Jin, Kim, Ig-Jae, and Lee, Jong Dae
Published: 2020
Full Text: View/download PDF

37. Enhancing Facial Impression for Video Conference

Author: Park, Sungyeon, Choi, Heeseung, Kim, Ig-Jae, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Streitz, Norbert, editor, and Markopoulos, Panos, editor
Published: 2015
Full Text: View/download PDF

38. Speculating on Risks of AI Clones to Selfhood and Relationships: Doppelganger-phobia, Identity Fragmentation, and Living Memories

Author: Lee, Patrick Yung Kang, primary, Ma, Ning F., additional, Kim, Ig-Jae, additional, and Yoon, Dongwook, additional
Published: 2023
Full Text: View/download PDF

39. Synchronizing Vision and Language: Bidirectional Token-Masking AutoEncoder for Referring Image Segmentation

Author: Lee, Minhyeok, Lee, Dogyoon, Lee, Jungho, Cho, Suhwan, Choi, Heeseung, Kim, Ig-Jae, Lee, Sangyoun, Lee, Minhyeok, Lee, Dogyoon, Lee, Jungho, Cho, Suhwan, Choi, Heeseung, Kim, Ig-Jae, and Lee, Sangyoun
Abstract: Referring Image Segmentation (RIS) aims to segment target objects expressed in natural language within a scene at the pixel level. Various recent RIS models have achieved state-of-the-art performance by generating contextual tokens to model multimodal features from pretrained encoders and effectively fusing them using transformer-based cross-modal attention. While these methods match language features with image features to effectively identify likely target objects, they often struggle to correctly understand contextual information in complex and ambiguous sentences and scenes. To address this issue, we propose a novel bidirectional token-masking autoencoder (BTMAE) inspired by the masked autoencoder (MAE). The proposed model learns the context of image-to-language and language-to-image by reconstructing missing features in both image and language features at the token level. In other words, this approach involves mutually complementing across the features of images and language, with a focus on enabling the network to understand interconnected deep contextual information between the two modalities. This learning method enhances the robustness of RIS performance in complex sentences and scenes. Our BTMAE achieves state-of-the-art performance on three popular datasets, and we demonstrate the effectiveness of the proposed method through various ablation studies.
Published: 2023

40. Expression Recognition Driven Virtual Human Animation

Author: Cho, Junghyun, Hong, Yu-Jin, Ahn, Sang C., Kim, Ig-Jae, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Kobsa, Alfred, editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Weikum, Gerhard, editor, Streitz, Norbert, editor, and Markopoulos, Panos, editor
Published: 2014
Full Text: View/download PDF

41. 3D Face-Aware Electronics with Low-Resolution Imaging

Author: Hong, Yu-Jin, Kim, Jaewon, Cho, Junghyun, Kim, Ig-Jae, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Kobsa, Alfred, editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Weikum, Gerhard, editor, and Kurosu, Masaaki, editor
Published: 2014
Full Text: View/download PDF

42. The age distribution of facial metrics in two large Korean populations

Author: Lee, Hae-Young, Cha, Seongwon, Ban, Hyo-Jeong, Kim, In-Young, Park, Bo-Reum, Kim, Ig-Jae, and Hong, Kyung-Won
Published: 2019
Full Text: View/download PDF

43. DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network

Author: Thakare, Kamalakar Vijay, primary, Raghuwanshi, Yash, additional, Dogra, Debi Prosad, additional, Choi, Heeseung, additional, and Kim, Ig-Jae, additional
Published: 2023
Full Text: View/download PDF

44. ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications

Author: Hwang, Hochul, primary, Jang, Cheongjae, additional, Park, Geonwoo, additional, Cho, Junghyun, additional, and Kim, Ig-Jae, additional
Published: 2023
Full Text: View/download PDF

45. Single-view-based 3D facial reconstruction method robust against pose variations

Author: Jo, Jaeik, Choi, Heeseung, Kim, Ig-Jae, and Kim, Jaihie
Published: 2015
Full Text: View/download PDF

46. Painting Alive: Handheld Augmented Reality System for Large Targets

Author: Hwang, Jae-In, Sung, Min-Hyuk, Kim, Ig-Jae, Ahn, Sang Chul, Kim, Hyoung-Gon, Ko, Heedong, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, and Shumaker, Randall, editor
Published: 2013
Full Text: View/download PDF

47. Long-Range Hand Gesture Interaction Based on Spatio-temporal Encoding

Author: Kim, Jaewon, Han, Gyuchull, Kim, Ig-Jae, Kim, Hyounggon, Ahn, Sang Chul, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Streitz, Norbert, editor, and Stephanidis, Constantine, editor
Published: 2013
Full Text: View/download PDF

48. Object Interaction-Based Localization and Description of Road Accident Events Using Deep Learning

Author: Thakare, Kamalakar Vijay, primary, Dogra, Debi Prosad, additional, Choi, Heeseung, additional, Kim, Haksub, additional, and Kim, Ig-Jae, additional
Published: 2022
Full Text: View/download PDF

49. Detecting driver drowsiness using feature-level fusion and user-specific classification

Author: Jo, Jaeik, Lee, Sung Joo, Park, Kang Ryoung, Kim, Ig-Jae, and Kim, Jaihie
Published: 2014
Full Text: View/download PDF

50. Personalized Life Log Media System in Ubiquitous Environment

Author: Kim, Ig-Jae, Ahn, Sang Chul, Kim, Hyoung-Gon, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Rangan, C. Pandu, editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Stajano, Frank, editor, Kim, Hyoung Joong, editor, Chae, Jong-Suk, editor, and Kim, Seong-Dong, editor
Published: 2007
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

646 results on '"Kim IG"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources