Author: "Feng, Yunhe" / Search Limiters: Full Text - Searchworks@Jio Institute Digital Library Search Results

26 results on '"Feng, Yunhe"'

1. HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making

Author: Anjum, Sumera, Zhang, Hanzhi, Zhou, Wenjun, Paek, Eun Jin, Zhao, Xiaopeng, and Feng, Yunhe
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have significantly advanced natural language processing tasks, yet they are susceptible to generating inaccurate or unreliable responses, a phenomenon known as hallucination. In critical domains such as health and medicine, these hallucinations can pose serious risks. This paper introduces HALO, a novel framework designed to enhance the accuracy and reliability of medical question-answering (QA) systems by focusing on the detection and mitigation of hallucinations. Our approach generates multiple variations of a given query using LLMs and retrieves relevant information from external open knowledge bases to enrich the context. We utilize maximum marginal relevance scoring to prioritize the retrieved context, which is then provided to LLMs for answer generation, thereby reducing the risk of hallucinations. The integration of LangChain further streamlines this process, resulting in a notable and robust increase in the accuracy of both open-source and commercial LLMs, such as Llama-3.1 (from 44% to 65%) and ChatGPT (from 56% to 70%). This framework underscores the critical importance of addressing hallucinations in medical QA systems, ultimately improving clinical decision-making and patient care. The open-source HALO is available at: https://github.com/ResponsibleAILab/HALO.
Comment: 10 pages, 4 figures
Published: 2024
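
Illustration (not part of the catalog record): the abstract above mentions maximum marginal relevance (MMR) scoring for prioritizing retrieved context. The following is a minimal sketch of generic MMR selection over embedding vectors, not the HALO implementation. The names `cosine`, `mmr_select`, and the trade-off weight `lam` are assumptions made for this example.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def mmr_select(query_vec, doc_vecs, k, lam=0.5):
    """Greedily pick k document indices, trading relevance against redundancy.

    lam is a hypothetical weight: 1.0 means pure relevance, lower values
    penalize similarity to documents that were already selected.
    """
    selected = []
    candidates = list(range(len(doc_vecs)))
    while candidates and len(selected) < k:
        def score(i):
            relevance = cosine(query_vec, doc_vecs[i])
            redundancy = max(
                (cosine(doc_vecs[i], doc_vecs[j]) for j in selected),
                default=0.0,
            )
            return lam * relevance - (1 - lam) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected

# Toy vectors: documents 0 and 1 are identical, document 2 is less relevant but different.
docs = [[1.0, 0.0], [1.0, 0.0], [0.6, 0.8]]
diverse = mmr_select([1.0, 0.0], docs, k=2, lam=0.3)
```

With a low `lam`, the duplicate document is skipped in favor of the less similar one; with `lam=1.0` the selection reduces to plain relevance ranking.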

2. RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent

Author: Xu, Huiyu, Zhang, Wenhui, Wang, Zhibo, Xiao, Feng, Zheng, Rui, Feng, Yunhe, Ba, Zhongjie, and Ren, Kui
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Recently, advanced Large Language Models (LLMs) such as GPT-4 have been integrated into many real-world applications like Code Copilot. These applications have significantly expanded the attack surface of LLMs, exposing them to a variety of threats. Among them, jailbreak attacks that induce toxic responses through jailbreak prompts have raised critical safety concerns. To identify these threats, a growing number of red teaming approaches simulate potential adversarial scenarios by crafting jailbreak prompts to test the target LLM. However, existing red teaming methods do not consider the unique vulnerabilities of LLMs in different scenarios, making it difficult to adjust the jailbreak prompts to find context-specific vulnerabilities. Meanwhile, these methods are limited to refining jailbreak templates using a few mutation operations, lacking the automation and scalability to adapt to different scenarios. To enable context-aware and efficient red teaming, we abstract and model existing attacks into a coherent concept called "jailbreak strategy" and propose a multi-agent LLM system named RedAgent that leverages these strategies to generate context-aware jailbreak prompts. By self-reflecting on contextual feedback in an additional memory buffer, RedAgent continuously learns how to leverage these strategies to achieve effective jailbreaks in specific contexts. Extensive experiments demonstrate that our system can jailbreak most black-box LLMs in just five queries, doubling the efficiency of existing red teaming methods. Additionally, RedAgent can jailbreak customized LLM applications more efficiently. By generating context-aware jailbreak prompts targeting applications on GPTs, we discover 60 severe vulnerabilities in these real-world applications with only two queries per vulnerability. We have reported all found issues and communicated with OpenAI and Meta for bug fixes.
Published: 2024

3. LoReTrack: Efficient and Accurate Low-Resolution Transformer Tracking

Author: Dong, Shaohua, Feng, Yunhe, Yang, Qing, Lin, Yuewei, and Fan, Heng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: High-performance Transformer trackers have shown excellent results, yet they often bear a heavy computational load. Observing that a smaller input can immediately and conveniently reduce computations without changing the model, an easy solution is to adopt low-resolution input for efficient Transformer tracking. Although faster, this substantially hurts tracking accuracy due to information loss in low-resolution tracking. In this paper, we aim to mitigate such information loss to boost the performance of low-resolution Transformer tracking via dual knowledge distillation from a frozen high-resolution (but not a larger) Transformer tracker. The core lies in two simple yet effective distillation modules, comprising query-key-value knowledge distillation (QKV-KD) and discrimination knowledge distillation (Disc-KD), across resolutions. The former, from the global view, allows the low-resolution tracker to inherit the features and interactions from the high-resolution tracker, while the latter, from the target-aware view, enhances the target-background distinguishing capacity by imitating discriminative regions from its high-resolution counterpart. With the dual knowledge distillation, our Low-Resolution Transformer Tracker (LoReTrack) enjoys not only high efficiency owing to reduced computation but also enhanced accuracy by distilling knowledge from the high-resolution tracker. In extensive experiments, LoReTrack with a 256x256 resolution consistently improves on the baseline with the same resolution, and shows competitive or even better results compared to a 384x384 high-resolution Transformer tracker, while running 52% faster and saving 56% MACs. Moreover, LoReTrack is resolution-scalable. With a 128x128 resolution, it runs at 25 fps on a CPU with 64.9%/46.4% SUC scores on LaSOT/LaSOText, surpassing all other CPU real-time trackers. Code will be released.
Published: 2024

4. Benchmarking the Robustness of UAV Tracking Against Common Corruptions

Author: Liu, Xiaoqiong, Feng, Yunhe, Hu, Shu, Yuan, Xiaohui, and Fan, Heng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The robustness of unmanned aerial vehicle (UAV) tracking is crucial in many tasks like surveillance and robotics. Despite its importance, little attention is paid to the performance of UAV trackers under common corruptions due to the lack of a dedicated platform. Addressing this, we propose UAV-C, a large-scale benchmark for assessing the robustness of UAV trackers under common corruptions. Specifically, UAV-C is built upon two popular UAV datasets by introducing 18 common corruptions from 4 representative categories, including adversarial, sensor, blur, and composite corruptions, at different levels. In total, UAV-C contains more than 10K sequences. To understand the robustness of existing UAV trackers against corruptions, we extensively evaluate 12 representative algorithms on UAV-C. Our study reveals several key findings: 1) Current trackers are vulnerable to corruptions, indicating that more attention is needed in enhancing the robustness of UAV trackers; 2) When corruptions occur together, the resulting composite corruptions cause more severe degradation to trackers; and 3) While each tracker has its unique performance profile, some trackers may be more sensitive to specific corruptions. By releasing UAV-C, we hope it, along with comprehensive analysis, serves as a valuable resource for advancing the robustness of UAV tracking against corruption. Our UAV-C will be available at https://github.com/Xiaoqiong-Liu/UAV-C.
Published: 2024

5. S3LLM: Large-Scale Scientific Software Understanding with LLMs using Source, Metadata, and Document

Author: Shaik, Kareem, Wang, Dali, Zheng, Weijian, Cao, Qinglei, Fan, Heng, Schwartz, Peter, and Feng, Yunhe
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: The understanding of large-scale scientific software poses significant challenges due to its diverse codebase, extensive code length, and target computing architectures. The emergence of generative AI, specifically large language models (LLMs), provides novel pathways for understanding such complex scientific codes. This paper presents S3LLM, an LLM-based framework designed to enable the examination of source code, code metadata, and summarized information in conjunction with textual technical reports in an interactive, conversational manner through a user-friendly interface. S3LLM leverages open-source LLaMA-2 models to enhance code analysis through the automatic transformation of natural language queries into domain-specific language (DSL) queries. Specifically, it translates these queries into Feature Query Language (FQL), enabling efficient scanning and parsing of entire code repositories. In addition, S3LLM is equipped to handle diverse metadata types, including DOT, SQL, and customized formats. Furthermore, S3LLM incorporates retrieval augmented generation (RAG) and LangChain technologies to directly query extensive documents. S3LLM demonstrates the potential of using locally deployed open-source LLMs for the rapid understanding of large-scale scientific computing software, eliminating the need for extensive coding expertise, and thereby making the process more efficient and effective. S3LLM is available at https://github.com/ResponsibleAILab/s3llm.
Published: 2024

6. Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning

Author: Dong, Shaohua, Feng, Yunhe, Yang, Qing, Huang, Yan, Liu, Dongfang, and Fan, Heng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Multimodal (e.g., RGB-Depth/RGB-Thermal) fusion has shown great potential for improving semantic segmentation in complex scenes (e.g., indoor/low-light conditions). Existing approaches often fully fine-tune a dual-branch encoder-decoder framework with a complicated feature fusion strategy for achieving multimodal semantic segmentation, which is training-costly due to the massive parameter updates in feature extraction and fusion. To address this issue, we propose a surprisingly simple yet effective dual-prompt learning network (dubbed DPLNet) for training-efficient multimodal (e.g., RGB-D/T) semantic segmentation. The core of DPLNet is to directly adapt a frozen pre-trained RGB model to multimodal semantic segmentation, reducing parameter updates. For this purpose, we present two prompt learning modules, comprising a multimodal prompt generator (MPG) and a multimodal feature adapter (MFA). MPG works to fuse the features from different modalities in a compact manner and is inserted from shallow to deep stages to generate the multi-level multimodal prompts that are injected into the frozen backbone, while MFA adapts prompted multimodal features in the frozen backbone for better multimodal semantic segmentation. Since both the MPG and MFA are lightweight, only a few trainable parameters (3.88M, 4.4% of the pre-trained backbone parameters) are introduced for multimodal feature fusion and learning. Using a simple decoder (3.27M parameters), DPLNet achieves new state-of-the-art performance or is on a par with other complex approaches on four RGB-D/T semantic segmentation datasets while satisfying parameter efficiency. Moreover, we show that DPLNet is general and applicable to other multimodal tasks such as salient object detection and video semantic segmentation. Without special design, DPLNet outperforms many complicated models. Our code will be available at github.com/ShaohuaDong2021/DPLNet.
Comment: 11 pages, 4 figures, 9 tables
Published: 2023

7. Addressing Weak Decision Boundaries in Image Classification by Leveraging Web Search and Generative Models

Author: Dammu, Preetam Prabhu Srikar, Feng, Yunhe, and Shah, Chirag
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Machine learning (ML) technologies are known to be riddled with ethical and operational problems; however, we are witnessing an increasing thrust by businesses to deploy them in sensitive applications. One major issue among many is that ML models do not perform equally well for underrepresented groups. This puts vulnerable populations in an even more disadvantaged and unfavorable position. We propose an approach that leverages the power of web search and generative models to alleviate some of the shortcomings of discriminative models. We demonstrate our method on an image classification problem using ImageNet's People Subtree subset, and show that it is effective in enhancing robustness and mitigating bias in certain classes that represent vulnerable populations (e.g., female doctor of color). Our new method is able to (1) identify weak decision boundaries for such classes; (2) construct search queries for Google as well as text for generating images through DALL-E 2 and Stable Diffusion; and (3) show how these newly captured training samples could alleviate the population bias issue. While still improving the model's overall performance considerably, we achieve a significant reduction (77.30%) in the model's gender accuracy disparity. In addition to these improvements, we observed a notable enhancement in the classifier's decision boundary, as it is characterized by fewer weakspots and an increased separation between classes. Although we showcase our method on vulnerable populations in this study, the proposed technique is extendable to a wide range of problems and domains.
Comment: This is a copy of the copyrighted version published in IJCAI 2023 (DOI: https://doi.org/10.24963/ijcai.2023/659)
Published: 2023

8. FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs

Author: Zhang, Boyuan, Tian, Jiannan, Di, Sheng, Yu, Xiaodong, Feng, Yunhe, Liang, Xin, Tao, Dingwen, and Cappello, Franck
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Today's large-scale scientific applications running on high-performance computing (HPC) systems generate vast data volumes. Thus, data compression is becoming a critical technique to mitigate the storage burden and data-movement cost. However, existing lossy compressors for scientific data cannot achieve a high compression ratio and throughput simultaneously, hindering their adoption in many applications requiring fast compression, such as in-memory compression. To this end, in this work, we develop a fast and high-ratio error-bounded lossy compressor on GPUs for scientific data (called FZ-GPU). Specifically, we first design a new compression pipeline that consists of fully parallelized quantization, bitshuffle, and our newly designed fast encoding. Then, we propose a series of deep architectural optimizations for each kernel in the pipeline to take full advantage of CUDA architectures. We propose a warp-level optimization to avoid data conflicts for bit-wise operations in bitshuffle, maximize shared memory utilization, and eliminate unnecessary data movements by fusing different compression kernels. Finally, we evaluate FZ-GPU on two NVIDIA GPUs (i.e., A100 and RTX A4000) using six representative scientific datasets from SDRBench. Results on the A100 GPU show that FZ-GPU achieves an average speedup of 4.2X over cuSZ and an average speedup of 37.0X over a multi-threaded CPU implementation of our algorithm under the same error bound. FZ-GPU also achieves an average speedup of 2.3X and an average compression ratio improvement of 2.0X over cuZFP under the same data distortion.
Comment: 14 pages, 12 figures, accepted by ACM HPDC '23
Published: 2023

9. Towards Generating Robust, Fair, and Emotion-Aware Explanations for Recommender Systems

Author: Wen, Bingbing, Feng, Yunhe, Zhang, Yongfeng, and Shah, Chirag
Subjects: Computer Science - Artificial Intelligence
Abstract: As recommender systems become increasingly sophisticated and complex, they often suffer from lack of fairness and transparency. Providing robust and unbiased explanations for recommendations has been drawing more and more attention as it can help address these issues and improve trustworthiness and informativeness of recommender systems. However, despite the fact that such explanations are generated for humans who respond more strongly to messages with appropriate emotions, there is a lack of consideration for emotions when generating explanations for recommendations. Current explanation generation models are found to exaggerate certain emotions without accurately capturing the underlying tone or the meaning. In this paper, we propose a novel method based on a multi-head transformer, called Emotion-aware Transformer for Explainable Recommendation (EmoTER), to generate more robust, fair, and emotion-enhanced explanations. To measure the linguistic quality and emotion fairness of the generated explanations, we adopt both automatic text metrics and human perceptions for evaluation. Experiments on three widely-used benchmark datasets with multiple evaluation metrics demonstrate that EmoTER consistently outperforms the existing state-of-the-art explanation generation models in terms of text quality, explainability, and consideration for fairness to emotion distribution. Implementation of EmoTER will be released as an open-source toolkit to support further research.
Published: 2022

10. COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression

Author: Jin, Sian, Zhang, Chengming, Jiang, Xintong, Feng, Yunhe, Guan, Hui, Li, Guanpeng, Song, Shuaiwen Leon, and Tao, Dingwen
Subjects: Computer Science - Artificial Intelligence, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Training wide and deep neural networks (DNNs) requires large amounts of storage resources such as memory because the intermediate activation data must be saved in the memory during forward propagation and then restored for backward propagation. However, state-of-the-art accelerators such as GPUs are only equipped with very limited memory capacities due to hardware design constraints, which significantly limits the maximum batch size and hence performance speedup when training large-scale DNNs. Traditional memory saving techniques either suffer from performance overhead or are constrained by limited interconnect bandwidth or specific interconnect technology. In this paper, we propose a novel memory-efficient CNN training framework (called COMET) that leverages error-bounded lossy compression to significantly reduce the memory requirement for training, to allow training larger models or to accelerate training. Different from the state-of-the-art solutions that adopt image-based lossy compressors (such as JPEG) to compress the activation data, our framework purposely adopts error-bounded lossy compression with a strict error-controlling mechanism. Specifically, we perform a theoretical analysis on the compression error propagation from the altered activation data to the gradients, and empirically investigate the impact of altered gradients over the training process. Based on these analyses, we optimize the error-bounded lossy compression and propose an adaptive error-bound control scheme for activation data compression. We evaluate our design against state-of-the-art solutions with five widely-adopted CNNs and the ImageNet dataset. Experiments demonstrate that our proposed framework can significantly reduce the training memory consumption by up to 13.5X over the baseline training and 1.8X over another state-of-the-art compression-based framework, respectively, with little or no accuracy loss.
Comment: 14 pages, 17 figures, accepted by VLDB 2022. arXiv admin note: substantial text overlap with arXiv:2011.09017
Published: 2021

11. Content-based quality evaluation of scientific papers using coarse feature and knowledge entity network

Author: Wang, Zhongyi, Zhang, Haoxuan, Chen, Haihua, Feng, Yunhe, and Ding, Junhua
Published: 2024

12. Optimization of cobalt-based MOFs for super-capacitor electrode materials of new energy vehicle

Author: Jin, Xinjun, Jiang, Zhiyu, Feng, Yunhe, and Fang, Xiaofen
Published: 2024

13. Optimizing Error-Bounded Lossy Compression for Scientific Data on GPUs

Author: Tian, Jiannan, Di, Sheng, Yu, Xiaodong, Rivera, Cody, Zhao, Kai, Jin, Sian, Feng, Yunhe, Liang, Xin, Tao, Dingwen, and Cappello, Franck
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Error-bounded lossy compression is a critical technique for significantly reducing scientific data volumes. With ever-emerging heterogeneous high-performance computing (HPC) architectures, GPU-accelerated error-bounded compressors (such as cuSZ and cuZFP) have been developed. However, they suffer from either low performance or low compression ratios. To this end, we propose cuSZ+ to target both high compression ratios and throughputs. We identify that data sparsity and data smoothness are key factors for high compression throughputs. Our key contributions in this work are fourfold: (1) We propose an efficient compression workflow to adaptively perform run-length encoding and/or variable-length encoding. (2) We derive Lorenzo reconstruction in decompression as multidimensional partial-sum computation and propose a fine-grained Lorenzo reconstruction algorithm for GPU architectures. (3) We carefully optimize each of the cuSZ+ kernels by leveraging state-of-the-art CUDA parallel primitives. (4) We evaluate cuSZ+ using seven real-world HPC application datasets on V100 and A100 GPUs. Experiments show cuSZ+ improves the compression throughputs and ratios by up to 18.4X and 5.3X, respectively, over cuSZ on the tested datasets.
Comment: 12 pages, 3 figures, 7 tables, accepted by IEEE Cluster'21
Published: 2021
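
Illustration (not part of the catalog record): the abstract above describes Lorenzo reconstruction in decompression as a partial-sum computation. The following is a minimal one-dimensional sketch of that idea under stated assumptions, not the cuSZ+ code: values are first quantized to integer multiples of twice the error bound, the previous-value (1D Lorenzo) residuals are stored, and decompression is a running sum. The function names `compress` and `decompress` and the sample data are hypothetical.

```python
from itertools import accumulate

def compress(data, eb):
    """Quantize to integer multiples of 2*eb, then keep previous-value residuals."""
    if not data:
        return []
    step = 2 * eb
    q = [round(x / step) for x in data]
    # The first entry is stored as-is; every later entry is the difference
    # from its predecessor (the 1D Lorenzo prediction).
    return [q[0]] + [q[i] - q[i - 1] for i in range(1, len(q))]

def decompress(residuals, eb):
    """Rebuild the quantized values with a prefix sum, then rescale."""
    step = 2 * eb
    return [step * c for c in accumulate(residuals)]

data = [1.00, 1.013, 1.027, 1.04, 1.038]
eb = 0.005
residuals = compress(data, eb)      # small integers for smooth data
restored = decompress(residuals, eb)
max_error = max(abs(a - b) for a, b in zip(data, restored))
```

Because rounding to the nearest multiple of `2 * eb` moves each value by at most `eb`, the reconstruction error stays within the bound (up to floating-point rounding), and smooth inputs yield small residuals that compress well with run-length or variable-length coding.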

14. Seed Stocking Via Multi-Task Learning

Author: Feng, Yunhe and Zhou, Wenjun
Subjects: Computer Science - Machine Learning
Abstract: Sellers of crop seeds need to plan for the variety and quantity of seeds to stock at least a year in advance. There are a large number of seed varieties of one crop, and each can perform best under different growing conditions. Given the unpredictability of weather, farmers need to make decisions that balance high yield and low risk. A seed vendor needs to be able to anticipate the needs of farmers and have them ready. In this study, we propose an analytical framework for estimating seed demand with three major steps. First, we will estimate the yield and risk of each variety as if they were planted at each location. Since past experiments performed with different seed varieties are highly unbalanced across varieties, and the combination of growing conditions is sparse, we employ multi-task learning to borrow information from similar varieties. Second, we will determine the best mix of seeds for each location by seeking a tradeoff between yield and risk. Third, we will aggregate such mix and pick the top five varieties to re-balance the yield and risk for each growing location. We find that multi-task learning provides a viable solution for yield prediction, and our overall analytical framework has resulted in a good performance.
Published: 2021

15. University of Washington at TREC 2020 Fairness Ranking Track

Author: Feng, Yunhe, Saelid, Daniel, Li, Ke, Gao, Ruoyuan, and Shah, Chirag
Subjects: Computer Science - Information Retrieval
Abstract: InfoSeeking Lab's FATE (Fairness Accountability Transparency Ethics) group at the University of Washington participated in the 2020 TREC Fairness Ranking Track. This report describes that track, the assigned data and tasks, our group definitions, and our results. Our approach to bringing fairness to retrieval and re-ranking tasks with Semantic Scholar data was to extract various dimensions of author identity. These dimensions included gender and location. We developed modules for these extractions in a way that allowed us to plug them in for either of the tasks as needed. After trying different combinations of relative weights assigned to relevance, gender, and location information, we chose five runs for the retrieval task and five runs for the re-ranking task. The results showed that our runs performed below par for the re-ranking task but above average for retrieval.
Published: 2020

16. Micromobility in Smart Cities: A Closer Look at Shared Dockless E-Scooters via Big Social Data

Author: Feng, Yunhe, Zhong, Dong, Sun, Peng, Zheng, Weijian, Cao, Qinglei, Luo, Xi, and Lu, Zheng
Subjects: Computer Science - Social and Information Networks, Computer Science - Computers and Society
Abstract: Micromobility is shaping first- and last-mile travel in urban areas. Recently, shared dockless electric scooters (e-scooters) have emerged as a daily alternative to driving for short-distance commuters in large cities due to their affordability, easy accessibility via an app, and zero emissions. Meanwhile, e-scooters come with challenges in city management, such as traffic rules, public safety, parking regulations, and liability issues. In this paper, we collected and investigated 5.8 million scooter-tagged tweets and 144,197 images, generated by 2.7 million users from October 2018 to March 2020, to take a closer look at shared e-scooters via crowdsourcing data analytics. We profiled e-scooter usage from spatial-temporal perspectives, explored different business roles (i.e., riders, gig workers, and ridesharing companies), examined operation patterns (e.g., injury types and parking behaviors), and conducted sentiment analysis. To the best of our knowledge, this paper is the first large-scale systematic study on shared e-scooters using big social data.
Published: 2020

17. Is Working From Home The New Norm? An Observational Study Based on a Large Geo-tagged COVID-19 Twitter Dataset

Author: Feng, Yunhe and Zhou, Wenjun
Subjects: Computer Science - Social and Information Networks
Abstract: As the COVID-19 pandemic swept over the world, people discussed facts, expressed opinions, and shared sentiments on social media. Since the reaction to COVID-19 in different locations may be tied to local cases, government regulations, healthcare resources and socioeconomic factors, we curated a large geo-tagged Twitter dataset and performed exploratory analysis by location. Specifically, we collected 650,563 unique geo-tagged tweets across the United States (50 states and Washington, D.C.) covering the date range from January 25 to May 10, 2020. Tweet locations enabled us to conduct region-specific studies such as tweeting volumes and sentiment, sometimes in response to local regulations and reported COVID-19 cases. During this period, many people started working from home. The gap between workdays and weekends in hourly tweet volumes inspired us to propose algorithms to estimate work engagement during the COVID-19 crisis. This paper also summarizes themes and topics of tweets in our dataset using both social media exclusive tools (i.e., #hashtags, @mentions) and the latent Dirichlet allocation model. We welcome requests for data sharing and conversations for more insights. Dataset link: http://covid19research.site/geo-tagged_twitter_datasets/
Published: 2020

18. Applications of Deep Learning Techniques.

Author: Ding, Junhua, Chen, Haihua, Feng, Yunhe, and Hossain, Tozammel
Subjects: ARTIFICIAL neural networks, MACHINE learning, DEEP reinforcement learning, REINFORCEMENT learning, NATURAL language processing, DEEP learning
Abstract: This document is a collection of research articles that explore the applications of deep learning techniques in various fields. The articles cover topics such as predicting car rental prices, speech recognition, highway visibility prediction, and more. Each article presents a specific application of deep learning algorithms and discusses their effectiveness in solving relevant problems. The document also mentions future directions for deep learning research, including legal intelligence, healthcare, and social media analysis. The authors express their gratitude to the researchers, reviewers, and editorial board involved in the publication of this special issue. [Extracted from the article]
Published: 2024

19. The Evaluation of the Impact of Meteorological and Marine Environment on the Combat Effectiveness of Surface Ships

Author: Li, Chenxin, Li, Lifan, Feng, Yunhe, and Wang, Duo
Published: 2024

20. Improving Text Classification with Large Language Model-Based Data Augmentation.

Author: Zhao, Huanhuan, Chen, Haihua, Ruggles, Thomas A., Feng, Yunhe, Singh, Debjani, and Yoon, Hong-Jun
Subjects: DATA augmentation, LANGUAGE models, CHATGPT, NATURAL language processing, CLASSIFICATION
Abstract: Large Language Models (LLMs) such as ChatGPT possess advanced capabilities in understanding and generating text. These capabilities enable ChatGPT to create text based on specific instructions, which can serve as augmented data for text classification tasks. Previous studies have approached data augmentation (DA) by either rewriting the existing dataset with ChatGPT or generating entirely new data from scratch. However, it is unclear which method is better without comparing their effectiveness. This study investigates the application of both methods to two datasets: a general-topic dataset (Reuters news data) and a domain-specific dataset (Mitigation dataset). Our findings indicate that: 1. New data generated by ChatGPT consistently enhanced the model's classification results for both datasets. 2. Generating new data generally outperforms rewriting existing data, though crafting the prompts carefully is crucial to extract the most valuable information from ChatGPT, particularly for domain-specific data. 3. The augmentation data size affects the effectiveness of DA; however, we observed a plateau after incorporating 10 samples. 4. Combining rewritten samples with newly generated samples can potentially further improve the model's performance. [ABSTRACT FROM AUTHOR]
Published: 2024

21. A multi-granularity perspective for spatial profiling of mobile apps

Author: Feng, Yunhe, Lu, Zheng, Zhou, Wenjun, Cao, Qing, and Li, Xiaolin
Published: 2018

22. Approximate Cardinality Estimation (ACE) in large-scale Internet of Things deployments

Author: Cao, Qing, Feng, Yunhe, Lu, Zheng, Qi, Hairong, Tolbert, Leon M., Wan, Lipeng, Wang, Zhibo, and Zhou, Wenjun
Published: 2017

23. Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing

Author: Dash, Swagatika and Feng, Yunhe
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Information Retrieval
Abstract: Multi-modal search engines have experienced significant growth and widespread use in recent years, making them the second most common internet use. While search engine systems offer a range of services, the image search field has recently become a focal point in the information retrieval community, as the adage goes, "a picture is worth a thousand words". Although popular search engines like Google excel at image search accuracy and agility, there is an ongoing debate over whether their search results can be biased in terms of gender, language, demographics, socio-cultural aspects, and stereotypes. This potential for bias can have a significant impact on individuals' perceptions and influence their perspectives. In this paper, we present our study on bias and fairness in web search, with a focus on keyword-based image search. We first discuss several kinds of biases that exist in search systems and why it is important to mitigate them. We narrow down our study to assessing and mitigating occupational stereotypes in image search, which is a prevalent fairness issue in image retrieval. For the assessment of stereotypes, we take gender as an indicator. We explore various open-source and proprietary APIs for gender identification from images. With these, we examine the extent of gender bias in top-ranked image search results obtained for several occupational keywords. To mitigate the bias, we then propose a fairness-aware re-ranking algorithm that optimizes (a) the relevance of the search result to the keyword and (b) fairness with respect to the genders identified. We experiment on 100 top-ranked images obtained for 10 occupational keywords and consider random re-ranking and re-ranking based on relevance as baselines. Our experimental results show that the fairness-aware re-ranking algorithm produces rankings with better fairness scores and competitive relevance scores compared with the baselines.
Comment: 20 pages; work uses proprietary search systems from the year 2021
Published: 2023

24. Has CEO Gender Bias Really Been Fixed? Adversarial Attacking and Improving Gender Fairness in Image Search

Author: Feng, Yunhe, primary and Shah, Chirag, additional
Published: 2022
Full Text: View/download PDF

25. Approximate and Sublinear Spatial Queries for Large-Scale Vehicle Networks

Author: Wan, Lipeng, primary, Wang, Zhibo, additional, Lu, Zheng, additional, Feng, Yunhe, additional, Qi, Hairong, additional, Zhou, Wenjun, additional, and Cao, Qing, additional
Published: 2018
Full Text: View/download PDF

26. Generative AI for Cell Type-Specific Fluorescence Image Generation of hPSC-derived Cardiac Organoid.

Author: Kandula AKR, Phamornratanakun T, Gomez AH, El-Mokahal M, Ma Z, Feng Y, and Yang H
Abstract: Human pluripotent stem cell (hPSC)-derived cardiac organoid is the most recent three-dimensional tissue structure that mimics the structure and functionality of the human heart and plays a pivotal role in modeling heart development and disease. The hPSC-derived cardiac organoids are commonly characterized by bright-field microscopic imaging for tracking daily organoid differentiation and morphology formation. Although the brightfield microscope provides essential information about hPSC-derived cardiac organoids, such as morphology, size, and general structure, it does not extend our understanding of cardiac organoids on cell type-specific distribution and structure. Then, fluorescence microscopic imaging is required to identify the specific cardiovascular cell types in the hPSC-derived cardiac organoids by fluorescence immunostaining fixed organoid samples or fluorescence reporter imaging of live organoids. Both approaches require extra steps of experiments and techniques and do not provide general information on hPSC-derived cardiac organoids from different batches of differentiation and characterization, which limits the biomedical applications of hPSC-derived cardiac organoids. This research addresses this limitation by proposing a comprehensive workflow for colorizing phase contrast images of cardiac organoids from brightfield microscopic imaging using conditional Generative Adversarial Networks (GANs) to provide cardiovascular cell type-specific information in hPSC-derived cardiac organoids. By infusing these phase contrast images with accurate fluorescence colorization, our approach aims to unlock the hidden wealth of cell type, structure, and further quantifications of fluorescence intensity and area, for better characterizing hPSC-derived cardiac organoids., Competing Interests: Conflicts of Interest: None
Published: 2024
Full Text: View/download PDF
