Author: "Han, William" / Publisher: arxiv - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Han, William"' showing total 2 results

Start Over Author "Han, William" Publisher arxiv

2 results on '"Han, William"'

1. MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

Author: Qiu, Jielin, Zhu, Jiacheng, Han, William, Kumar, Aditesh, Mittal, Karthik, Jin, Claire, Yang, Zhengyuan, Li, Linjie, Wang, Jianfeng, Li, Bo, Zhao, Ding, and Wang, Lijuan
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: Multimodal summarization with multimodal output (MSMO) has emerged as a promising research direction. Nonetheless, numerous limitations exist within existing public MSMO datasets, including insufficient upkeep, data inaccessibility, limited size, and the absence of proper categorization, which pose significant challenges to effective research. To address these challenges and provide a comprehensive dataset for this new direction, we have meticulously curated the MultiSum dataset. Our new dataset features (1) Human-validated summaries for both video and textual content, providing superior human instruction and labels for multimodal learning. (2) Comprehensively and meticulously arranged categorization, spanning 17 principal categories and 170 subcategories to encapsulate a diverse array of real-world scenarios. (3) Benchmark tests performed on the proposed dataset to assess varied tasks and methods, including video temporal segmentation, video summarization, text summarization, and multimodal summarization. To champion accessibility and collaboration, we release the MultiSum dataset and the data collection tool as fully open-source resources, fostering transparency and accelerating future developments. Our project website can be found at https://multisum-dataset.github.io/., Comment: Project website: https://multisum-dataset.github.io/
Published: 2023
Full Text: View/download PDF

2. Converting ECG Signals to Images for Efficient Image-text Retrieval via Encoding

Author: Qiu, Jielin, Zhu, Jiacheng, Liu, Shiqi, Han, William, Zhang, Jingqi, Duan, Chaojing, Rosenberg, Michael, Liu, Emerson, Weber, Douglas, and Zhao, Ding
Subjects: Signal Processing (eess.SP), FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, FOS: Electrical engineering, electronic engineering, information engineering, Electrical Engineering and Systems Science - Signal Processing
Abstract: Automated interpretation of electrocardiograms (ECG) has garnered significant attention with the advancements in machine learning methodologies. Despite the growing interest in automated ECG interpretation using machine learning, most current studies focus solely on classification or regression tasks and overlook a crucial aspect of clinical cardio-disease diagnosis: the diagnostic report generated by experienced human clinicians. In this paper, we introduce a novel approach to ECG interpretation, leveraging recent breakthroughs in Large Language Models (LLMs) and Vision-Transformer (ViT) models. Rather than treating ECG diagnosis as a classification or regression task, we propose an alternative method of automatically identifying the most similar clinical cases based on the input ECG data. Also, since interpreting ECG as images are more affordable and accessible, we process ECG as encoded images and adopt a vision-language learning paradigm to jointly learn vision-language alignment between encoded ECG images and ECG diagnosis reports. Encoding ECG into images can result in an efficient ECG retrieval system, which will be highly practical and useful in clinical applications. More importantly, our findings could serve as a crucial resource for providing diagnostic services in regions where only paper-printed ECG images are accessible due to past underdevelopment., Comment: 26 pages
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Han, William"'

1. MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

2. Converting ECG Signals to Images for Efficient Image-text Retrieval via Encoding

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

2 results on '"Han, William"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources