Author: "Cheng, Hongrong" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Cheng, Hongrong"' showing total 28 results

Start Over Author "Cheng, Hongrong" Publication Year Range Last 10 years

28 results on '"Cheng, Hongrong"'

1. MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models

Author: Cheng, Hongrong, Zhang, Miao, and Shi, Javen Qinfeng
Subjects: Computer Science - Computation and Language
Abstract: As Large Language Models (LLMs) grow dramatically in size, there is an increasing trend in compressing and speeding up these models. Previous studies have highlighted the usefulness of gradients for importance scoring in neural network compressing, especially in pruning medium-size networks. However, the substantial memory requirements involved in calculating gradients with backpropagation impede the utilization of gradients in guiding LLM pruning. As a result, most pruning strategies for LLMs rely on gradient-free criteria, such as weight magnitudes or a mix of magnitudes and activations. In this paper, we devise a hybrid pruning criterion, which appropriately integrates magnitude, activation, and gradient to capitalize on feature map sensitivity for pruning LLMs. To overcome memory requirement barriers, we estimate gradients using only forward passes. Based on this, we propose a Memory-effIcieNt structured prunIng procedure for LLMs (MINI-LLM) to remove no-critical channels and multi-attention heads. Experimental results demonstrate the superior performance of MINI-LLM over existing gradient-free methods on three LLMs: LLaMA, BLOOM, and OPT across various downstream tasks (classification, multiple-choice, and generation), while MINI-LLM maintains a GPU memory footprint akin to gradient-free methods., Comment: 13 pages
Published: 2024

2. Long-short interest network with graph-based method for sequential recommendation

Author: Mu, Wangdong, Liu, Qihe, Cheng, Hongrong, and Zhuo, Ming
Published: 2024
Full Text: View/download PDF

3. A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations

Author: Cheng, Hongrong, Zhang, Miao, and Shi, Javen Qinfeng
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage resources. To enable the deployment of modern models on resource-constrained environments and accelerate inference time, researchers have increasingly explored pruning techniques as a popular research direction in neural network compression. However, there is a dearth of up-to-date comprehensive review papers on pruning. To address this issue, in this survey, we provide a comprehensive review of existing research works on deep neural network pruning in a taxonomy of 1) universal/specific speedup, 2) when to prune, 3) how to prune, and 4) fusion of pruning and other compression techniques. We then provide a thorough comparative analysis of eight pairs of contrast settings for pruning and explore emerging topics, including pruning for large language models, large multimodal models, post-training pruning, and different supervision levels for pruning to shed light on the commonalities and differences of existing methods and lay the foundation for further method development. To facilitate future research, we build a curated collection of datasets, networks, and evaluations on different applications. Finally, we provide valuable recommendations on selecting pruning methods and prospect several promising research directions. We build a repository at https://github.com/hrcheng1066/awesome-pruning., Comment: IEEE TPAMI major revision
Published: 2023

4. Influence Function Based Second-Order Channel Pruning-Evaluating True Loss Changes For Pruning Is Possible Without Retraining

Author: Cheng, Hongrong, Zhang, Miao, and Shi, Javen Qinfeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: A challenge of channel pruning is designing efficient and effective criteria to select channels to prune. A widely used criterion is minimal performance degeneration. To accurately evaluate the truth performance degeneration requires retraining the survived weights to convergence, which is prohibitively slow. Hence existing pruning methods use previous weights (without retraining) to evaluate the performance degeneration. However, we observe the loss changes differ significantly with and without retraining. It motivates us to develop a technique to evaluate true loss changes without retraining, with which channels to prune can be selected more reliably and confidently. We first derive a closed-form estimator of the true loss change per pruning mask change, using influence functions without retraining. Influence function which is from robust statistics reveals the impacts of a training sample on the model's prediction and is repurposed by us to assess impacts on true loss changes. We then show how to assess the importance of all channels simultaneously and develop a novel global channel pruning algorithm accordingly. We conduct extensive experiments to verify the effectiveness of the proposed algorithm. To the best of our knowledge, we are the first that shows evaluating true loss changes for pruning without retraining is possible. This finding will open up opportunities for a series of new paradigms to emerge that differ from existing pruning methods. The code is available at https://github.com/hrcheng1066/IFSO., Comment: chrome-extension://ogjibjphoadhljaoicdnjnmgokohngcc/assets/icon-50207e67.png
Published: 2023

5. Pan-cancer analysis and experimental validation reveal FAM72D as a potential novel biomarker and therapeutic target in lung adenocarcinoma

Author: Zhou, Zonglang, Fan, Bingfu, Qiu, Qinming, Cheng, Hongrong, Wang, Li, Wu, Yibo, Xie, Jun, Ni, Cheng, and Li, Ning
Published: 2024
Full Text: View/download PDF

6. Increased iron deposition in nucleus accumbens associated with disease progression and chronicity in migraine

Author: Xu, Xiaopei, Zhou, Mengting, Wu, Xiao, Zhao, Fangling, Luo, Xiao, Li, Kaicheng, Zeng, Qingze, He, Jiahui, Cheng, Hongrong, Guan, Xiaojun, Huang, Peiyu, Zhang, Minming, and Liu, Kaiming
Published: 2023
Full Text: View/download PDF

7. FGF-21 and GDF-15 are increased in migraine and associated with the severity of migraine-related disability

Author: He, Jiahui, Zhou, Mengting, Zhao, Fanglin, Cheng, Hongrong, Huang, Hao, Xu, Xiaopei, Han, Jian, Hong, Wenwu, Wang, Faming, Xiao, Yujin, Xia, Jinjin, and Liu, Kaiming
Published: 2023
Full Text: View/download PDF

8. The Mechanistic Role of Bridging Integrator 1 (BIN1) in Alzheimer’s Disease

Author: Gao, Peirong, Ye, Lingqi, Cheng, Hongrong, and Li, Honglei
Published: 2021
Full Text: View/download PDF

9. An Efficient and Simple Graph Model for Scientific Article Cold Start Recommendation

Author: Cai, Tengyuan, Cheng, Hongrong, Luo, Jiaqing, Zhou, Shijie, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Comyn-Wattiau, Isabelle, editor, Tanaka, Katsumi, editor, Song, Il-Yeol, editor, Yamamoto, Shuichiro, editor, and Saeki, Motoshi, editor
Published: 2016
Full Text: View/download PDF

10. A Systematic Analysis of the Role of Unc-5 Netrin Receptor A (UNC5A) in Human Cancers

Author: Zhou, Zonglang, primary, Fan, Bingfu, additional, Cheng, Hongrong, additional, Wang, Ming, additional, Xie, Jun, additional, Zou, Mingyuan, additional, and Yang, Yi, additional
Published: 2022
Full Text: View/download PDF

11. An Efficient and Simple Graph Model for Scientific Article Cold Start Recommendation

Author: Cai, Tengyuan, primary, Cheng, Hongrong, additional, Luo, Jiaqing, additional, and Zhou, Shijie, additional
Published: 2016
Full Text: View/download PDF

12. DeepComp: A Deep Comparator for Improving Facial Age-Group Estimation

Author: Shijie Zhou, Ebenezer Nii Ayi Hammond, Cheng Hongrong, and Qihe Liu
Subjects: Similarity (network science), Feature (computer vision), Computer science, business.industry, Classifier (linguistics), Feature extraction, Process (computing), Benchmark (computing), Pattern recognition, Artificial intelligence, Layer (object-oriented design), business, Image (mathematics)
Abstract: We introduce an age-group estimation scheme known as DeepComp. It is a combination of an Early Information-Sharing Feature Aggregation (EISFA) mechanism and a ternary classifier. The EISFA part is a feature extractor that applies a siamese layer to input images and an aggregation module that sums up all the images. The ternary process compares the image representations into three possible outcomes corresponding to younger, similar, or older. From the comparisons, we arrive at a score indicating the similarity between an input and reference images: the higher the score, the closer the similarity. Experimentation shows that our DeepComp scheme achieves an impressive 94.9% accuracy on the Adience benchmark dataset using a minimum number of reference images per age group. Moreover, we demonstrate the generality of our method on the MORPH II dataset, and the result is equally impressive. Altogether, we show that, among other schemes, our method exemplifies facial age-group estimation.
Published: 2021

13. DeepComp: A Deep Comparator for Improving Facial Age-Group Estimation

Author: Hammond, Ebenezer Nii Ayi, primary, Zhou, Shijie, additional, Cheng, Hongrong, additional, and Liu, Qihe, additional
Published: 2021
Full Text: View/download PDF

14. Improving Juvenile Age Estimation Based on Facial Landmark Points and Gravity Moment

Author: Hammond, Ebenezer Nii Ayi, primary, Zhou, Shijie, additional, Cheng, Hongrong, additional, and Liu, Qihe, additional
Published: 2020
Full Text: View/download PDF

15. The Mechanistic Role of Bridging Integrator 1 (BIN1) in Alzheimer’s Disease

Author: Gao, Peirong, primary, Ye, Lingqi, additional, Cheng, Hongrong, additional, and Li, Honglei, additional
Published: 2020
Full Text: View/download PDF

16. Improving Juvenile Age Estimation Based on Facial Landmark Points and Gravity Moment

Author: Shijie Zhou, Ebenezer Nii Ayi Hammond, Qihe Liu, and Cheng Hongrong
Subjects: juvenile detection, Computer science, 02 engineering and technology, lcsh:Technology, Image (mathematics), lcsh:Chemistry, 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, information retrieval, Projection (set theory), lcsh:QH301-705.5, Instrumentation, AAM, Fluid Flow and Transfer Processes, Landmark, lcsh:T, business.industry, Process Chemistry and Technology, ordinal relationship, General Engineering, 020207 software engineering, Pattern recognition, lcsh:QC1-999, age estimation, Computer Science Applications, Term (time), Moment (mathematics), Range (mathematics), lcsh:Biology (General), lcsh:QD1-999, lcsh:TA1-2040, Gesture recognition, Face (geometry), 020201 artificial intelligence & image processing, Artificial intelligence, lcsh:Engineering (General). Civil engineering (General), business, lcsh:Physics
Abstract: Facial age estimation is of interest due to its potential to be applied in many real-life situations. However, recent age estimation efforts do not consider juveniles. Consequently, we introduce a juvenile age detection scheme called LaGMO, which focuses on the juvenile aging cues of facial shape and appearance. LaGMO is a combination of facial landmark points and Term Frequency Inverse Gravity Moment (TF-IGM). Inspired by the formation of words from morphemes, we obtained facial appearance features comprising facial shape and wrinkle texture and represented them as terms that described the age of the face. By leveraging the implicit ordinal relationship between the frequencies of the terms in the face, TF-IGM was used to compute the weights of the terms. From these weights, we built a matrix that corresponds to the possibilities of the face belonging to the age. Next, we reduced the reference matrix according to the juvenile age range (0&ndash, 17 years) and avoided the exhaustive search through the entire training set. LaGMO detects the age by the projection of an unlabeled face image onto the reference matrix, the value of the projection depicts the higher probability of the image belonging to the age. With Mean Absolute Error (MAE) of 89% on the Face and Gesture Recognition Research Network (FG-NET) dataset, our proposal demonstrated superior performance in juvenile age estimation.
Published: 2020

17. Gas Sensor Drift Compensation by an Optimal Linear Transformation

Author: Shijie Zhou, Cheng Hongrong, Qihe Liu, Huabo Zhang, and Xianqiong Cheng
Subjects: Linear map, ComputingMethodologies_PATTERNRECOGNITION, Control theory, Computer science, 010401 analytical chemistry, Process (computing), 02 engineering and technology, 021001 nanoscience & nanotechnology, 0210 nano-technology, 01 natural sciences, 0104 chemical sciences, Task (project management), Compensation (engineering)
Abstract: Drift compensation plays an important role in electronic nose systems. Traditional methods compensate drift often by using a reference gas. However, practically it is very expensive to use an extra reference gas. Motivated by the fact that the goal of drift compensation is to improve the classification performance, in this paper, we propose a novel linear approach for drift compensation, which incorporates drift compensation into the classification process. In our method, the data drift is modeled by a linear transformation and a k-nearest neighbor classifier is used on the compensated data. An optimal linear transformation will be obtained by optimizing the classification performance. Our main contributions are: 1) the extra reference gas is not needed any more except some labeled gas data since the data drift is compensated by the optimal linear transformation; 2) our approach integrates drift compensation into the classification task which can guarantee the classification performance when compensating the drift data. The results of our experiments show that our approach significantly outperforms other methods based on both synthesized and real data.
Published: 2017

18. Demographic Information Prediction: A Portrait of Smartphone Application Users

Author: Qin, Zhen, primary, Wang, Yilei, additional, Cheng, Hongrong, additional, Zhou, Yingjie, additional, Sheng, Zhengguo, additional, and Leung, Victor C. M., additional
Published: 2018
Full Text: View/download PDF

19. Gas Sensor Drift Compensation by an Optimal Linear Transformation

Author: Liu, Qihe, primary, Zhou, Shijie, additional, Cheng, Xianqiong, additional, Cheng, Hongrong, additional, and Zhang, Huabo, additional
Published: 2017
Full Text: View/download PDF

20. A Novel Approach of Extracting Opinion Targets and Opinion Words From Online Review

Author: Tang, Mingshuang, primary, Cheng, Hongrong, primary, Guo, Yanwei, primary, Luo, Jiaqing, primary, and Zhou, Shijie, primary
Published: 2016
Full Text: View/download PDF

21. Kernel based Collaborative Topic Regression for Tag Recommendation

Author: Guo, Yanwei, primary, Cheng, Hongrong, primary, Tang, Mingshuang, primary, Luo, Jiaqing, primary, and Zhou, Shijie, primary
Published: 2016
Full Text: View/download PDF

22. The Design and Implementation of A Cost-effective RFID Indoor Localization System

Author: Luo, Jiaqing, primary, Zhou, Shijie, additional, Cheng, Hongrong, additional, Liao, Yongjian, additional, and Guo, Bin, additional
Published: 2015
Full Text: View/download PDF

23. A Lightweight Detection of the RFID Unauthorized Reading Using RF Scanners

Author: Zhang, Wenqing, primary, Zhou, Shijie, additional, Luo, Jiaqing, additional, Cheng, Hongrong, additional, and Liao, Yongjian, additional
Published: 2015
Full Text: View/download PDF

24. A Range-Free Localization of Passive RFID Tags Using Mobile Readers

Author: Luo, Jiaqing, primary, Zhou, Shijie, additional, Cheng, Hongrong, additional, Liao, Yongjian, additional, and Bu, Kai, additional
Published: 2015
Full Text: View/download PDF

25. Demographic information prediction based on smartphone application usage

Author: Qin, Zhen, primary, Wang, Yilei, additional, Xia, Yong, additional, Cheng, Hongrong, additional, Zhou, Yingjie, additional, Sheng, Zhengguo, additional, and Leung, Victor C.M., additional
Published: 2014
Full Text: View/download PDF

26. A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations.

Author: Cheng H, Zhang M, and Shi JQ
Abstract: Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage resources. To enable the deployment of modern models on resource-constrained environments and to accelerate inference time, researchers have increasingly explored pruning techniques as a popular research direction in neural network compression. More than three thousand pruning papers have been published from 2020 to 2024. However, there is a dearth of up-to-date comprehensive review papers on pruning. To address this issue, in this survey, we provide a comprehensive review of existing research works on deep neural network pruning in a taxonomy of 1) universal/specific speedup, 2) when to prune, 3) how to prune, and 4) fusion of pruning and other compression techniques. We then provide a thorough comparative analysis of eight pairs of contrast settings for pruning (e.g., unstructured/structured, one-shot/iterative, data-free/data-driven, initialized/pre-trained weights, etc.) and explore several emerging topics, including pruning for large language models, vision transformers, diffusion models, and large multimodal models, post-training pruning, and different levels of supervision for pruning to shed light on the commonalities and differences of existing methods and lay the foundation for further method development. Finally, we provide some valuable recommendations on selecting pruning methods and prospect several promising research directions for neural network pruning. To facilitate future research on deep neural network pruning, we summarize broad pruning applications (e.g., adversarial robustness, natural language understanding, etc.) and build a curated collection of datasets, networks, and evaluations on different applications. We maintain a repository on https://github.com/hrcheng1066/awesome-pruning that serves as a comprehensive resource for neural network pruning papers and corresponding open-source codes. We will keep updating this repository to include the latest advancements in the field.
Published: 2024
Full Text: View/download PDF

27. The genetic spectrum of NF1 variants in 10 unrelated Chinese families with neurofibromatosis type 1.

Author: Chen S, Cheng H, and Zhao G
Subjects: Adolescent, Adult, Child, Child, Preschool, Female, Humans, Male, Middle Aged, Young Adult, China, East Asian People, Family, Mutation, Neurofibromin 1 genetics, Pedigree, Retrospective Studies, Neurofibromatosis 1 genetics
Abstract: Objectives: To investigate the clinical and genetic features in a cohort of Chinese families with neurofibromatosis type 1 (NF1)., Methods: The clinical information of 21 patients with NF1 in 10 families was retrospectively analyzed. To broaden the genetic spectrum of NF1 , multiplex ligation-dependent probe amplification analysis was performed first, followed by the whole-exome sequencing, in order to identify pathogenic or potentially pathogenic variants of NF1 gene in 10 unrelated Chinese families., Results: Nine different NF1 variants were identified in all 10 families. Of these, 7 were known pathogenic variants and included the exon 1 deletion, exons 1-58 deletion, c.5401C>T (p.Q1801*), c.2291-2A>C, c.484C>T (p.Q162*), c.4922G>A (p.W1641*) and c.1019_1020del (p.S340Cfs*25). The 2 novel variants were c.5197T>C (p.S1733P) and c.783_797delinsC (p.K261Nfs*25). The p.S1733P variant was classified as a variant of uncertain significance, while p.K261Nfs*25 was classified as pathogenic. Hence, the positive detection rate of NF1 variants was 100% (10/10). While the truncating variants were responsible for 60.0% (6/10) of the cases, the splicing variant was responsible for 10% (1/10) of the cases., Conclusion: We identified 2 novel heterozygous variants (c.5197T>C and c.783_797delinsC) in the NF1 gene, which broadens the genetic spectrum of the NF1 gene., (Copyright: © Neurosciences.)
Published: 2024
Full Text: View/download PDF

28. Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes For Pruning Is Possible Without Retraining.

Author: Cheng H, Zhang M, and Shi JQ
Abstract: Channel pruning is attracting increasing attention in the deep model compression community due to its capability of significantly reducing computation and memory footprints without special support from specific software and hardware. A challenge of channel pruning is designing efficient and effective criteria to select channels to prune. A widely used criterion is minimal performance degeneration, e.g., loss changes before and after pruning being the smallest. To accurately evaluate the truth performance degeneration requires retraining the survived weights to convergence, which is prohibitively slow. Hence existing pruning methods settle to use previous weights (without retraining) to evaluate the performance degeneration. However, we observe that the loss changes differ significantly with and without retraining. It motivates us to develop a technique to evaluate true loss changes without retraining, using which to select channels to prune with more reliability and confidence. We first derive a closed-form estimator of the true loss change per mask change, using influence functions without retraining. Influence function is a classic technique from robust statistics that reveals the impacts of a training sample on the model's prediction and is repurposed by us to assess impacts on true loss changes. We then show how to assess the importance of all channels simultaneously and develop a novel global channel pruning algorithm accordingly. We conduct extensive experiments to verify the effectiveness of the proposed algorithm, which significantly outperforms the competing channel pruning methods on both image classification and object detection tasks. One of the attractive properties of our algorithm is that it automatically obtains the prune percentage without the cumbersome yet commonly used sensitivity analysis by local pruning. To the best of our knowledge, we are the first that shows evaluating true loss changes for pruning without retraining is possible. This finding will open up opportunities for a series of new paradigms to emerge that differ from existing pruning methods. The code is available at https://github.com/hrcheng1066/IFSO.
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

28 results on '"Cheng, Hongrong"'

1. MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models

2. Long-short interest network with graph-based method for sequential recommendation

3. A Survey on Deep Neural Network Pruning-Taxonomy, Comparison, Analysis, and Recommendations

4. Influence Function Based Second-Order Channel Pruning-Evaluating True Loss Changes For Pruning Is Possible Without Retraining

5. Pan-cancer analysis and experimental validation reveal FAM72D as a potential novel biomarker and therapeutic target in lung adenocarcinoma

6. Increased iron deposition in nucleus accumbens associated with disease progression and chronicity in migraine

7. FGF-21 and GDF-15 are increased in migraine and associated with the severity of migraine-related disability

8. The Mechanistic Role of Bridging Integrator 1 (BIN1) in Alzheimer’s Disease

9. An Efficient and Simple Graph Model for Scientific Article Cold Start Recommendation

10. A Systematic Analysis of the Role of Unc-5 Netrin Receptor A (UNC5A) in Human Cancers

11. An Efficient and Simple Graph Model for Scientific Article Cold Start Recommendation

12. DeepComp: A Deep Comparator for Improving Facial Age-Group Estimation

13. DeepComp: A Deep Comparator for Improving Facial Age-Group Estimation

14. Improving Juvenile Age Estimation Based on Facial Landmark Points and Gravity Moment

15. The Mechanistic Role of Bridging Integrator 1 (BIN1) in Alzheimer’s Disease

16. Improving Juvenile Age Estimation Based on Facial Landmark Points and Gravity Moment

17. Gas Sensor Drift Compensation by an Optimal Linear Transformation

18. Demographic Information Prediction: A Portrait of Smartphone Application Users

19. Gas Sensor Drift Compensation by an Optimal Linear Transformation

20. A Novel Approach of Extracting Opinion Targets and Opinion Words From Online Review

21. Kernel based Collaborative Topic Regression for Tag Recommendation

22. The Design and Implementation of A Cost-effective RFID Indoor Localization System

23. A Lightweight Detection of the RFID Unauthorized Reading Using RF Scanners

24. A Range-Free Localization of Passive RFID Tags Using Mobile Readers

25. Demographic information prediction based on smartphone application usage

26. A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations.

27. The genetic spectrum of NF1 variants in 10 unrelated Chinese families with neurofibromatosis type 1.

28. Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes For Pruning Is Possible Without Retraining.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

28 results on '"Cheng, Hongrong"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources