Author: "Isonuma, Masaru" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Isonuma, Masaru"' showing total 22 results

1. What's New in My Data? Novelty Exploration via Contrastive Generation

Author: Isonuma, Masaru and Titov, Ivan
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Fine-tuning is widely used to adapt language models for specific goals, often leveraging real-world data such as patient records, customer-service interactions, or web content in languages not covered in pre-training. These datasets are typically massive, noisy, and often confidential, making their direct inspection challenging. However, understanding them is essential for guiding model deployment and informing decisions about data cleaning or suppressing any harmful behaviors learned during fine-tuning. In this study, we introduce the task of novelty discovery through generation, which aims to identify novel properties of a fine-tuning dataset by generating examples that illustrate these properties. Our approach, Contrastive Generative Exploration (CGE), assumes no direct access to the data but instead relies on a pre-trained model and the same model after fine-tuning. By contrasting the predictions of these two models, CGE can generate examples that highlight novel characteristics of the fine-tuning data. However, this simple approach may produce examples that are too similar to one another, failing to capture the full range of novel phenomena present in the dataset. We address this by introducing an iterative version of CGE, where the previously generated examples are used to update the pre-trained model, and this updated model is then contrasted with the fully fine-tuned model to generate the next example, promoting diversity in the generated outputs. Our experiments demonstrate the effectiveness of CGE in detecting novel content, such as toxic language, as well as new natural and programming languages. Furthermore, we show that CGE remains effective even when models are fine-tuned using differential privacy techniques.
Published: 2024

2. Towards Transfer Unlearning: Empirical Evidence of Cross-Domain Bias Mitigation

Author: Lu, Huimin, Isonuma, Masaru, Mori, Junichiro, and Sakata, Ichiro
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large language models (LLMs) often inherit biases from vast amounts of training corpora. Traditional debiasing methods, while effective to some extent, do not completely eliminate memorized biases and toxicity in LLMs. In this paper, we study an unlearning-based approach to debiasing in LLMs by performing gradient ascent on hate speech against minority groups, i.e., minimizing the likelihood of biased or toxic content. Specifically, we propose a mask language modeling unlearning technique, which unlearns the harmful part of the text. This method enables LLMs to selectively forget and disassociate from biased and harmful content. Experimental results demonstrate the effectiveness of our approach in diminishing bias while maintaining the language modeling abilities. Surprisingly, the results also unveil an unexpected potential for cross-domain transfer unlearning: debiasing in one bias form (e.g. gender) may contribute to mitigating others (e.g. race and religion).
Published: 2024

3. Comprehensive Evaluation of Large Language Models for Topic Modeling

Author: Doi, Tomoki, Isonuma, Masaru, and Yanaka, Hitomi
Subjects: Computer Science - Computation and Language
Abstract: Recent work utilizes Large Language Models (LLMs) for topic modeling, generating comprehensible topic labels for given documents. However, their performance has mainly been evaluated qualitatively, and there remains room for quantitative investigation of their capabilities. In this paper, we quantitatively evaluate LLMs from multiple perspectives: the quality of topics, the impact of LLM-specific concerns, such as hallucination and shortcuts for limited documents, and LLMs' controllability of topic categories via prompts. Our findings show that LLMs can identify coherent and diverse topics with few hallucinations but may take shortcuts by focusing only on parts of documents. We also found that their controllability is limited.
Published: 2024

4. Unlearning Traces the Influential Training Data of Language Models

Author: Isonuma, Masaru and Titov, Ivan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Identifying the training datasets that influence a language model's outputs is essential for minimizing the generation of harmful content and enhancing its performance. Ideally, we can measure the influence of each dataset by removing it from training; however, it is prohibitively expensive to retrain a model multiple times. This paper presents UnTrac: unlearning traces the influence of a training dataset on the model's performance. UnTrac is extremely simple; each training dataset is unlearned by gradient ascent, and we evaluate how much the model's predictions change after unlearning. Furthermore, we propose a more scalable approach, UnTrac-Inv, which unlearns a test dataset and evaluates the unlearned model on training datasets. UnTrac-Inv resembles UnTrac, while being efficient for massive training datasets. In the experiments, we examine if our methods can assess the influence of pretraining datasets on generating toxic, biased, and untruthful content. Our methods estimate their influence much more accurately than existing methods while requiring neither excessive memory space nor multiple checkpoints.
Comment: 14 pages, to appear in ACL2024 main conference (long paper)
Published: 2024

5. Differentiable Instruction Optimization for Cross-Task Generalization

Author: Isonuma, Masaru, Mori, Junichiro, and Sakata, Ichiro
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Instruction tuning has been attracting much attention to achieve generalization ability across a wide variety of tasks. Although various types of instructions have been manually created for instruction tuning, it is still unclear what kind of instruction is optimal to obtain cross-task generalization ability. This work presents instruction optimization, which optimizes training instructions with respect to generalization ability. Rather than manually tuning instructions, we introduce learnable instructions and optimize them with gradient descent by leveraging bilevel optimization. Experimental results show that the learned instruction enhances the diversity of instructions and improves the generalization ability compared to using only manually created instructions.
Comment: 14 pages, 6 figures, accepted for Findings of ACL2023
Published: 2023

6. SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation

Author: Kasanishi, Tetsu, Isonuma, Masaru, Mori, Junichiro, and Sakata, Ichiro
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Automatic literature review generation is one of the most challenging tasks in natural language processing. Although large language models have tackled literature review generation, the absence of large-scale datasets has been a stumbling block to the progress. We release SciReviewGen, consisting of over 10,000 literature reviews and 690,000 papers cited in the reviews. Based on the dataset, we evaluate recent transformer-based summarization models on the literature review generation task, including Fusion-in-Decoder extended for literature review generation. Human evaluation results show that some machine-generated summaries are comparable to human-written reviews, while revealing the challenges of automatic literature review generation such as hallucinations and a lack of detailed information. Our dataset and code are available at https://github.com/tetsu9923/SciReviewGen
Comment: ACL Findings 2023 (to appear). arXiv admin note: text overlap with arXiv:1810.04020 by other authors
Published: 2023

7. Unsupervised Abstractive Opinion Summarization by Generating Sentences with Tree-Structured Topic Guidance

Author: Isonuma, Masaru, Mori, Junichiro, Bollegala, Danushka, and Sakata, Ichiro
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: This paper presents a novel unsupervised abstractive summarization method for opinionated texts. While the basic variational autoencoder-based models assume a unimodal Gaussian prior for the latent code of sentences, we alternate it with a recursive Gaussian mixture, where each mixture component corresponds to the latent code of a topic sentence and is mixed by a tree-structured topic distribution. By decoding each Gaussian component, we generate sentences with tree-structured topic guidance, where the root sentence conveys generic content, and the leaf sentences describe specific topics. Experimental results demonstrate that the generated topic sentences are appropriate as a summary of opinionated texts, which are more informative and cover more input contents than those generated by the recent unsupervised summarization model (Bražinskas et al., 2020). Furthermore, we demonstrate that the variance of latent Gaussians represents the granularity of sentences, analogous to Gaussian word embedding (Vilnis and McCallum, 2015).
Comment: accepted to TACL, pre-MIT Press publication version
Published: 2021

8. Unsupervised Neural Single-Document Summarization of Reviews via Learning Latent Discourse Structure and its Ranking

Author: Isonuma, Masaru, Mori, Junichiro, and Sakata, Ichiro
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: This paper focuses on the end-to-end abstractive summarization of a single product review without supervision. We assume that a review can be described as a discourse tree, in which the summary is the root, and the child sentences explain their parent in detail. By recursively estimating a parent from its children, our model learns the latent discourse tree without an external parser and generates a concise summary. We also introduce an architecture that ranks the importance of each sentence on the tree to support summary generation focusing on the main review point. The experimental results demonstrate that our model is competitive with or outperforms other unsupervised approaches. In particular, for relatively long reviews, it achieves a competitive or better performance than supervised models. The induced tree shows that the child sentences provide additional information about their parent, and the generated summary abstracts the entire review.
Comment: 13 pages, ACL 2019 (long paper)
Published: 2019

9. Unsupervised Joint Learning for Headline Generation and Discourse Structure of Reviews

Author: Isonuma, Masaru, Mori, Junichiro, and Sakata, Ichiro
Series Editor: Kacprzyk, Janusz
Advisory Editors: Pal, Nikhil R., Bello Perez, Rafael, Corchado, Emilio S., Hagras, Hani, Kóczy, László T., Kreinovich, Vladik, Lin, Chin-Teng, Lu, Jie, Melin, Patricia, Nedjah, Nadia, Nguyen, Ngoc Thanh, and Wang, Jun
Editors: Ohsawa, Yukio, Yada, Katsutoshi, Ito, Takayuki, Takama, Yasufumi, Sato-Shimokawara, Eri, Abe, Akinori, Mori, Junichiro, and Matsumura, Naohiro
Published: 2020
Full Text: View/download PDF

10. Unlearning Reveals the Influential Training Data of Language Models

Author: Isonuma, Masaru and Titov, Ivan
Abstract: In order to enhance the performance of language models while mitigating the risks of generating harmful content, it is crucial to identify which training dataset affects the model's outputs. Ideally, we can measure the influence of each dataset by removing it from training; however, it is prohibitively expensive to retrain a model multiple times. This paper presents UnTrac, which estimates the influence of a training dataset by unlearning it from the trained model. UnTrac is extremely simple; each training dataset is unlearned by gradient ascent, and we evaluate how much the model's predictions change after unlearning. We empirically examine if our methods can assess the influence of pretraining datasets on generating toxic, biased, and untruthful content. Experimental results demonstrate that our method estimates their influence much more accurately than existing methods while requiring neither excessive memory space nor multiple model checkpoints.
Comment: 12 pages, under review
Published: 2024

11. Topic Modeling for Short Texts with Large Language Models

Author: Doi, Tomoki, Isonuma, Masaru, and Yanaka, Hitomi
Abstract: As conventional topic models rely on word co-occurrence to infer latent topics, topic modeling for short texts has been a long-standing challenge. Large Language Models (LLMs) can potentially overcome this challenge by contextually learning the semantics of words via pretraining. This paper studies two approaches, parallel prompting and sequential prompting, to use LLMs for topic modeling. Due to the input length limitations, LLMs cannot process many texts at once. By splitting the texts into smaller subsets and processing them parallelly or sequentially, an arbitrary number of texts can be handled by LLMs. Experimental results demonstrated that our methods can identify more coherent topics than existing ones while maintaining the diversity of the induced topics. Furthermore, we found that the inferred topics adequately covered the input texts, while hallucinated topics were hardly generated.
Published: 2024

12. SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation

Author: Kasanishi, Tetsu, primary, Isonuma, Masaru, additional, Mori, Junichiro, additional, and Sakata, Ichiro, additional
Published: 2023
Full Text: View/download PDF

13. Dynamic Structured Neural Topic Model with Self-Attention Mechanism

Author: Miyamoto, Nozomu, primary, Isonuma, Masaru, additional, Takase, Sho, additional, Mori, Junichiro, additional, and Sakata, Ichiro, additional
Published: 2023
Full Text: View/download PDF

14. Differentiable Instruction Optimization for Cross-Task Generalization

Author: Isonuma, Masaru, primary, Mori, Junichiro, additional, and Sakata, Ichiro, additional
Published: 2023
Full Text: View/download PDF

15. Lexical Entailment with Hierarchy Representations by Deep Metric Learning

Author: Sato, Naomi, primary, Isonuma, Masaru, additional, Asatani, Kimitaka, additional, Ishizuka, Shoya, additional, Shimizu, Aori, additional, and Sakata, Ichiro, additional
Published: 2022
Full Text: View/download PDF

16. Tree-Structured Neural Topic Model

Author: Isonuma, Masaru, Mori, Junichiro, Bollegala, Danushka, Sakata, Ichiro, and Association for Computational Linguistics (group author)
Published: 2020

17. Unsupervised Abstractive Opinion Summarization by Generating Sentences with Tree-Structured Topic Guidance

Author: Isonuma, Masaru, primary, Mori, Junichiro, additional, Bollegala, Danushka, additional, and Sakata, Ichiro, additional
Published: 2021
Full Text: View/download PDF

18. Discovering Interdisciplinarily Spread Knowledge in the Academic Literature

Author: Kamada, Maiko, primary, Asatani, Kimitaka, additional, Isonuma, Masaru, additional, and Sakata, Ichiro, additional
Published: 2021
Full Text: View/download PDF

19. Tree-Structured Neural Topic Model

Author: Isonuma, Masaru, primary, Mori, Junichiro, additional, Bollegala, Danushka, additional, and Sakata, Ichiro, additional
Published: 2020
Full Text: View/download PDF

20. Unsupervised Neural Single-Document Summarization of Reviews via Learning Latent Discourse Structure and its Ranking

Author: Isonuma, Masaru, primary, Mori, Junichiro, additional, and Sakata, Ichiro, additional
Published: 2019
Full Text: View/download PDF

21. Extractive Summarization Using Multi-Task Learning with Document Classification

Author: Isonuma, Masaru, primary, Fujino, Toru, additional, Mori, Junichiro, additional, Matsuo, Yutaka, additional, and Sakata, Ichiro, additional
Published: 2017
Full Text: View/download PDF

22. A study on designing task priority rule considering rework risk of system development project

Author: MITSUYUKI, Taiga, primary, YAMATO, Hiroyuki, additional, HIEKATA, Kazuo, additional, MOSER, Bryan, additional, ISONUMA, Masaru, additional, OKADA, Isaac, additional, and OIDA, Yoshiaki, additional
Published: 2016
Full Text: View/download PDF
