Author: "Lim, Hyesu" - Searchworks@Jio Institute Digital Library Search Results

Your search for "Lim, Hyesu" returned 8 results

1. Sparse autoencoders reveal selective remapping of visual concepts during adaptation

Author: Lim, Hyesu, Choi, Jinho, Choo, Jaegul, and Schneider, Steffen
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Adapting foundation models for specific purposes has become a standard approach to build machine learning systems for downstream applications. Yet, it is an open question which mechanisms take place during adaptation. Here we develop a new Sparse Autoencoder (SAE) for the CLIP vision transformer, named PatchSAE, to extract interpretable concepts at granular levels (e.g. shape, color, or semantics of an object) and their patch-wise spatial attributions. We explore how these concepts influence the model output in downstream image classification tasks and investigate how recent state-of-the-art prompt-based adaptation techniques change the association of model inputs to these concepts. While activations of concepts slightly change between adapted and non-adapted models, we find that the majority of gains on common adaptation tasks can be explained with the existing concepts already present in the non-adapted foundation model. This work provides a concrete framework to train and use SAEs for Vision Transformers and provides insights into explaining adaptation mechanisms. Comment: A demo is available at github.com/dynamical-inference/patchsae
Published: 2024

2. Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering

Author: Park, ChaeHun, Lee, Koanho, Lim, Hyesu, Kim, Jaeseok, Park, Junmo, Heo, Yu-Jung, Chang, Du-Seong, and Choo, Jaegul
Subjects: Computer Science - Computation and Language
Abstract: Building a reliable visual question answering (VQA) system across different languages is a challenging problem, primarily due to the lack of abundant samples for training. To address this challenge, recent studies have employed machine translation systems for the cross-lingual VQA task. This involves translating the evaluation samples into a source language (usually English) and using monolingual models (i.e., translate-test). However, our analysis reveals that translated texts contain unique characteristics distinct from human-written ones, referred to as translation artifacts. We find that these artifacts can significantly affect the models, confirmed by extensive experiments across diverse models, languages, and translation processes. In light of this, we present a simple data augmentation strategy that can alleviate the adverse impacts of translation artifacts. Comment: ACL 2024 Findings Accepted
Published: 2024

3. Towards Calibrated Robust Fine-Tuning of Vision-Language Models

Author: Oh, Changdae, Lim, Hyesu, Kim, Mijoo, Han, Dongyoon, Yun, Sangdoo, Choo, Jaegul, Hauptmann, Alexander, Cheng, Zhi-Qi, and Song, Kyungwoo
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Improving out-of-distribution (OOD) generalization during in-distribution (ID) adaptation is a primary goal of robust fine-tuning of zero-shot models beyond naive fine-tuning. However, despite decent OOD generalization performance from recent robust fine-tuning methods, confidence calibration for reliable model output has not been fully addressed. This work proposes a robust fine-tuning method that improves both OOD accuracy and confidence calibration simultaneously in vision language models. Firstly, we show that both OOD classification and OOD calibration errors have a shared upper bound consisting of two terms of ID data: 1) ID calibration error and 2) the smallest singular value of the ID input covariance matrix. Based on this insight, we design a novel framework that conducts fine-tuning with a constrained multimodal contrastive loss enforcing a larger smallest singular value, which is further guided by the self-distillation of a moving-averaged model to achieve calibrated prediction as well. Starting from empirical evidence supporting our theoretical statements, we provide extensive experimental results on ImageNet distribution shift benchmarks that demonstrate the effectiveness of our theorem and its practical implementation. Comment: NeurIPS 2024 (a short version was presented at the NeurIPS 2023 Workshop on Distribution Shifts); Major modification of (v7): Fixing the x-axis of Figure 3 and Pearson correlation, accordingly
Published: 2023

4. PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration

Author: Choi, Minseok, Lim, Hyesu, and Choo, Jaegul
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Document-level relation extraction (DocRE) aims to extract relations of all entity pairs in a document. A key challenge in DocRE is the cost of annotating such data which requires intensive human effort. Thus, we investigate the case of DocRE in a low-resource setting, and we find that existing models trained on low data overestimate the NA ("no relation") label, causing limited performance. In this work, we approach the problem from a calibration perspective and propose PRiSM, which learns to adapt logits based on relation semantic information. We evaluate our method on three DocRE datasets and demonstrate that integrating existing models with PRiSM improves performance by as much as 26.38 F1 score, while the calibration error drops as much as 36 times when trained with about 3% of data. The code is publicly available at https://github.com/brightjade/PRiSM. Comment: Accepted to Findings of IJCNLP-AACL 2023
Published: 2023

5. TTN: A Domain-Shift Aware Batch Normalization in Test-Time Adaptation

Author: Lim, Hyesu, Kim, Byeonggeun, Choo, Jaegul, and Choi, Sungha
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper proposes a novel batch normalization strategy for test-time adaptation. Recent test-time adaptation methods heavily rely on the modified batch normalization, i.e., transductive batch normalization (TBN), which calculates the mean and the variance from the current test batch rather than using the running mean and variance obtained from the source data, i.e., conventional batch normalization (CBN). Adopting TBN that employs test batch statistics mitigates the performance degradation caused by the domain shift. However, re-estimating normalization statistics using test data depends on impractical assumptions that a test batch should be large enough and be drawn from an i.i.d. stream, and we observed that the previous methods with TBN show a critical performance drop without the assumptions. In this paper, we identify that CBN and TBN are in a trade-off relationship and present a new test-time normalization (TTN) method that interpolates the statistics by adjusting the importance between CBN and TBN according to the domain-shift sensitivity of each BN layer. Our proposed TTN improves model robustness to shifted domains across a wide range of batch sizes and in various realistic evaluation scenarios. TTN is widely applicable to other test-time adaptation methods that rely on updating model parameters via backpropagation. We demonstrate that adopting TTN further improves their performance and achieves state-of-the-art performance in various standard benchmarks. Comment: ICLR2023 Accepted
Published: 2023

6. AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain

Author: Hong, Jimin, Kim, Taehee, Lim, Hyesu, and Choo, Jaegul
Subjects: Computer Science - Computation and Language
Abstract: During the fine-tuning phase of transfer learning, the pretrained vocabulary remains unchanged, while model parameters are updated. The vocabulary generated based on the pretrained data is suboptimal for downstream data when domain discrepancy exists. We propose to consider the vocabulary as an optimizable parameter, allowing us to update the vocabulary by expanding it with domain-specific vocabulary based on a tokenization statistic. Furthermore, we preserve the embeddings of the added words from overfitting to downstream data by utilizing knowledge learned from a pretrained language model with a regularization term. Our method achieved consistent performance improvements on diverse domains (i.e., biomedical, computer science, news, and reviews). Comment: EMNLP2021 Accepted
Published: 2021

7. Slice and Conquer: A Planar-to-3D Framework for Efficient Interactive Segmentation of Volumetric Images

Author: Cho, Wonwoo, primary, Choi, Dongmin, additional, Lim, Hyesu, additional, Choi, Jinho, additional, Choi, Saemee, additional, Min, Hyun-Seok, additional, Lim, Sungbin, additional, and Choo, Jaegul, additional
Published: 2024

8. AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain

Author: Hong, Jimin, primary, Kim, TaeHee, additional, Lim, Hyesu, additional, and Choo, Jaegul, additional
Published: 2021