Author: "Xu, Nuo" / Search Limiters: Full Text - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Xu, Nuo"' showing total 916 results

Start Over Author "Xu, Nuo" Search Limiters Full Text

916 results on '"Xu, Nuo"'

1. CHBench: A Chinese Dataset for Evaluating Health in Large Language Models

Author: Guo, Chenlu, Xu, Nuo, Chang, Yi, and Wu, Yuan
Subjects: Computer Science - Computation and Language
Abstract: With the rapid development of large language models (LLMs), assessing their performance on health-related inquiries has become increasingly essential. It is critical that these models provide accurate and trustworthy health information, as their application in real-world contexts--where misinformation can have serious consequences for individuals seeking medical advice and support--depends on their reliability. In this work, we present CHBench, the first comprehensive Chinese Health-related Benchmark designed to evaluate LLMs' capabilities in understanding physical and mental health across diverse scenarios. CHBench includes 6,493 entries related to mental health and 2,999 entries focused on physical health, covering a broad spectrum of topics. This dataset serves as a foundation for evaluating Chinese LLMs' capacity to comprehend and generate accurate health-related information. Our extensive evaluations of four popular Chinese LLMs demonstrate that there remains considerable room for improvement in their understanding of health-related information. The code is available at https://github.com/TracyGuo2001/CHBench., Comment: 11 pages
Published: 2024

2. Distinguish Confusion in Legal Judgment Prediction via Revised Relation Knowledge

Author: Xu, Nuo, Wang, Pinghui, Zhao, Junzhou, Sun, Feiyang, Lan, Lin, Tao, Jing, Pan, Li, and Guan, Xiaohong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Legal Judgment Prediction (LJP) aims to automatically predict a law case's judgment results based on the text description of its facts. In practice, the confusing law articles (or charges) problem frequently occurs, reflecting that the law cases applicable to similar articles (or charges) tend to be misjudged. Although some recent works based on prior knowledge solve this issue well, they ignore that confusion also occurs between law articles with a high posterior semantic similarity due to the data imbalance problem instead of only between the prior highly similar ones, which is this work's further finding. This paper proposes an end-to-end model named \textit{D-LADAN} to solve the above challenges. On the one hand, D-LADAN constructs a graph among law articles based on their text definition and proposes a graph distillation operation (GDO) to distinguish the ones with a high prior semantic similarity. On the other hand, D-LADAN presents a novel momentum-updated memory mechanism to dynamically sense the posterior similarity between law articles (or charges) and a weighted GDO to adaptively capture the distinctions for revising the inductive bias caused by the data imbalance problem. We perform extensive experiments to demonstrate that D-LADAN significantly outperforms state-of-the-art methods in accuracy and robustness., Comment: Accepted by ACM TOIS
Published: 2024

3. Unified End-to-End V2X Cooperative Autonomous Driving

Author: Li, Zhiwei, Zhang, Bozhen, Yang, Lei, Shen, Tianyu, Xu, Nuo, Hao, Ruosen, Li, Weiting, Yan, Tao, and Liu, Huaping
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multiagent Systems
Abstract: V2X cooperation, through the integration of sensor data from both vehicles and infrastructure, is considered a pivotal approach to advancing autonomous driving technology. Current research primarily focuses on enhancing perception accuracy, often overlooking the systematic improvement of accident prediction accuracy through end-to-end learning, leading to insufficient attention to the safety issues of autonomous driving. To address this challenge, this paper introduces the UniE2EV2X framework, a V2X-integrated end-to-end autonomous driving system that consolidates key driving modules within a unified network. The framework employs a deformable attention-based data fusion strategy, effectively facilitating cooperation between vehicles and infrastructure. The main advantages include: 1) significantly enhancing agents' perception and motion prediction capabilities, thereby improving the accuracy of accident predictions; 2) ensuring high reliability in the data fusion process; 3) superior end-to-end perception compared to modular approaches. Furthermore, We implement the UniE2EV2X framework on the challenging DeepAccident, a simulation dataset designed for V2X cooperative driving.
Published: 2024

4. Garbage Segmentation and Attribute Analysis by Robotic Dogs

Author: Xu, Nuo, Liao, Jianfeng, Meng, Qiwei, and Song, Wei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics
Abstract: Efficient waste management and recycling heavily rely on garbage exploration and identification. In this study, we propose GSA2Seg (Garbage Segmentation and Attribute Analysis), a novel visual approach that utilizes quadruped robotic dogs as autonomous agents to address waste management and recycling challenges in diverse indoor and outdoor environments. Equipped with advanced visual perception system, including visual sensors and instance segmentators, the robotic dogs adeptly navigate their surroundings, diligently searching for common garbage items. Inspired by open-vocabulary algorithms, we introduce an innovative method for object attribute analysis. By combining garbage segmentation and attribute analysis techniques, the robotic dogs accurately determine the state of the trash, including its position and placement properties. This information enhances the robotic arm's grasping capabilities, facilitating successful garbage retrieval. Additionally, we contribute an image dataset, named GSA2D, to support evaluation. Through extensive experiments on GSA2D, this paper provides a comprehensive analysis of GSA2Seg's effectiveness. Dataset available: \href{https://www.kaggle.com/datasets/hellob/gsa2d-2024}{https://www.kaggle.com/datasets/hellob/gsa2d-2024}.
Published: 2024

5. Android in the Zoo: Chain-of-Action-Thought for GUI Agents

Author: Zhang, Jiwen, Wu, Jihao, Teng, Yihua, Liao, Minghui, Xu, Nuo, Xiao, Xiao, Wei, Zhongyu, and Tang, Duyu
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction, Computer Science - Machine Learning
Abstract: Large language model (LLM) leads to a surge of autonomous GUI agents for smartphone, which completes a task triggered by natural language through predicting a sequence of actions of API. Even though the task highly relies on past actions and visual observations, existing studies typically consider little semantic information carried out by intermediate screenshots and screen operations. To address this, this work presents Chain-of-Action-Thought (dubbed CoAT), which takes the description of the previous actions, the current screen, and more importantly the action thinking of what actions should be performed and the outcomes led by the chosen action. We demonstrate that, in a zero-shot setting upon three off-the-shelf LMMs, CoAT significantly improves the action prediction compared to previous proposed context modeling. To further facilitate the research in this line, we construct a dataset Android-In-The-Zoo (AitZ), which contains 18,643 screen-action pairs together with chain-of-action-thought annotations. Experiments show that fine-tuning a 1B model (i.e. AUTO-UI-base) on our AitZ dataset achieves on-par performance with CogAgent-Chat-18B., Comment: Dataset could be found in https://github.com/IMNearth/CoAT
Published: 2024

6. Soil Seed Bank Density Enhanced at Shrub Patches Due to Grazing in a Shrub-Encroached Grassland

Author: Liu, Jiahui, Li, Le, Chen, Jiquan, Zhang, Jingmin, Zhu, Na, Wang, Chu, Luo, Yuhong, Xu, Nuo, Bao, Yufan, and Yan, Yuchun
Published: 2024
Full Text: View/download PDF

7. Recursive Windowed Variational Mode Decomposition

Author: Zhou, Zhaoheng, Ling, Bingo Wing-Kuen, and Xu, Nuo
Published: 2024
Full Text: View/download PDF

8. Aligning Knowledge Graph with Visual Perception for Object-goal Navigation

Author: Xu, Nuo, Wang, Wen, Yang, Rong, Qin, Mengjie, Lin, Zheyuan, Song, Wei, Zhang, Chunlong, Gu, Jason, and Li, Chao
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics
Abstract: Object-goal navigation is a challenging task that requires guiding an agent to specific objects based on first-person visual observations. The ability of agent to comprehend its surroundings plays a crucial role in achieving successful object finding. However, existing knowledge-graph-based navigators often rely on discrete categorical one-hot vectors and vote counting strategy to construct graph representation of the scenes, which results in misalignment with visual images. To provide more accurate and coherent scene descriptions and address this misalignment issue, we propose the Aligning Knowledge Graph with Visual Perception (AKGVP) method for object-goal navigation. Technically, our approach introduces continuous modeling of the hierarchical scene architecture and leverages visual-language pre-training to align natural language description with visual perception. The integration of a continuous knowledge graph architecture and multimodal feature alignment empowers the navigator with a remarkable zero-shot navigation capability. We extensively evaluate our method using the AI2-THOR simulator and conduct a series of experiments to demonstrate the effectiveness and efficiency of our navigator. Code available: https://github.com/nuoxu/AKGVP., Comment: Accepted to ICRA 2024
Published: 2024

9. LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition

Author: Ye, Junjie, Xu, Nuo, Wang, Yikun, Zhou, Jie, Zhang, Qi, Gui, Tao, and Huang, Xuanjing
Subjects: Computer Science - Computation and Language
Abstract: Despite the impressive capabilities of large language models (LLMs), their performance on information extraction tasks is still not entirely satisfactory. However, their remarkable rewriting capabilities and extensive world knowledge offer valuable insights to improve these tasks. In this paper, we propose $LLM-DA$, a novel data augmentation technique based on LLMs for the few-shot NER task. To overcome the limitations of existing data augmentation methods that compromise semantic integrity and address the uncertainty inherent in LLM-generated text, we leverage the distinctive characteristics of the NER task by augmenting the original data at both the contextual and entity levels. Our approach involves employing 14 contextual rewriting strategies, designing entity replacements of the same type, and incorporating noise injection to enhance robustness. Extensive experiments demonstrate the effectiveness of our approach in enhancing NER model performance with limited data. Furthermore, additional analyses provide further evidence supporting the assertion that the quality of the data we generate surpasses that of other existing methods.
Published: 2024

10. Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution

Author: Xu, Nuo, Zhao, Jun, Zu, Can, Li, Sixian, Chen, Lu, Zhang, Zhihao, Zheng, Rui, Dou, Shihan, Qin, Wenjuan, Gui, Tao, Zhang, Qi, and Huang, Xuanjing
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Faithfulness, expressiveness, and elegance is the constant pursuit in machine translation. However, traditional metrics like \textit{BLEU} do not strictly align with human preference of translation quality. In this paper, we explore leveraging reinforcement learning with human feedback (\textit{RLHF}) to improve translation quality. It is non-trivial to collect a large high-quality dataset of human comparisons between translations, especially for low-resource languages. To address this issue, we propose a cost-effective preference learning strategy, optimizing reward models by distinguishing between human and machine translations. In this manner, the reward model learns the deficiencies of machine translation compared to human and guides subsequent improvements in machine translation. Experimental results demonstrate that \textit{RLHF} can effectively enhance translation quality and this improvement benefits other translation directions not trained with \textit{RLHF}. Further analysis indicates that the model's language capabilities play a crucial role in preference learning. A reward model with strong language capabilities can more sensitively learn the subtle differences in translation quality and align better with real human translation preferences.
Published: 2024

11. Secrets of RLHF in Large Language Models Part II: Reward Modeling

Author: Wang, Binghai, Zheng, Rui, Chen, Lu, Liu, Yan, Dou, Shihan, Huang, Caishuang, Shen, Wei, Jin, Senjie, Zhou, Enyu, Shi, Chenyu, Gao, Songyang, Xu, Nuo, Zhou, Yuhao, Fan, Xiaoran, Xi, Zhiheng, Zhao, Jun, Wang, Xiao, Ji, Tao, Yan, Hang, Shen, Lixing, Chen, Zhan, Gui, Tao, Zhang, Qi, Qiu, Xipeng, Huang, Xuanjing, Wu, Zuxuan, and Jiang, Yu-Gang
Subjects: Computer Science - Artificial Intelligence
Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a crucial technology for aligning language models with human values and intentions, enabling models to produce more helpful and harmless responses. Reward models are trained as proxies for human preferences to drive reinforcement learning optimization. While reward models are often considered central to achieving high performance, they face the following challenges in practical applications: (1) Incorrect and ambiguous preference pairs in the dataset may hinder the reward model from accurately capturing human intent. (2) Reward models trained on data from a specific distribution often struggle to generalize to examples outside that distribution and are not suitable for iterative RLHF training. In this report, we attempt to address these two issues. (1) From a data perspective, we propose a method to measure the strength of preferences within the data, based on a voting mechanism of multiple reward models. Experimental results confirm that data with varying preference strengths have different impacts on reward model performance. We introduce a series of novel methods to mitigate the influence of incorrect and ambiguous preferences in the dataset and fully leverage high-quality preference data. (2) From an algorithmic standpoint, we introduce contrastive learning to enhance the ability of reward models to distinguish between chosen and rejected responses, thereby improving model generalization. Furthermore, we employ meta-learning to enable the reward model to maintain the ability to differentiate subtle differences in out-of-distribution samples, and this approach can be utilized for iterative RLHF optimization.
Published: 2024

12. Disordered hyperuniformity signals functioning and resilience of self-organized vegetation patterns

Author: Hu, Wensi, Liu, Quan-Xing, Wang, Bo, Xu, Nuo, Cui, Lijuan, and Xu, Chi
Subjects: Quantitative Biology - Populations and Evolution, Statistics - Applications
Abstract: In harsh environments, organisms may self-organize into spatially patterned systems in various ways. So far, studies of ecosystem spatial self-organization have primarily focused on apparent orders reflected by regular patterns. However, self-organized ecosystems may also have cryptic orders that can be unveiled only through certain quantitative analyses. Here we show that disordered hyperuniformity as a striking class of hidden orders can exist in spatially self-organized vegetation landscapes. By analyzing the high-resolution remotely sensed images across the American drylands, we demonstrate that it is not uncommon to find disordered hyperuniform vegetation states characterized by suppressed density fluctuations at long range. Such long-range hyperuniformity has been documented in a wide range of microscopic systems. Our finding contributes to expanding this domain to accommodate natural landscape ecological systems. We use theoretical modeling to propose that disordered hyperuniform vegetation patterning can arise from three generalized mechanisms prevalent in dryland ecosystems, including (1) critical absorbing states driven by an ecological legacy effect, (2) scale-dependent feedbacks driven by plant-plant facilitation and competition, and (3) density-dependent aggregation driven by plant-sediment feedbacks. Our modeling results also show that disordered hyperuniform patterns can help ecosystems cope with arid conditions with enhanced functioning of soil moisture acquisition. However, this advantage may come at the cost of slower recovery of ecosystem structure upon perturbations. Our work highlights that disordered hyperuniformity as a distinguishable but underexplored ecosystem self-organization state merits systematic studies to better understand its underlying mechanisms, functioning, and resilience., Comment: 34 pages, 6 figures; Supplementary Materials, 19 pages, 10 figures, 2 tables
Published: 2023

13. DocStormer: Revitalizing Multi-Degraded Colored Document Images to Pristine PDF

Author: Liu, Chaowei, Li, Jichun, Teng, Yihua, Wang, Chaoqun, Xu, Nuo, Wu, Jihao, and Tu, Dandan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: For capturing colored document images, e.g. posters and magazines, it is common that multiple degradations such as shadows, wrinkles, etc., are simultaneously introduced due to external factors. Restoring multi-degraded colored document images is a great challenge, yet overlooked, as most existing algorithms focus on enhancing color-ignored document images via binarization. Thus, we propose DocStormer, a novel algorithm designed to restore multi-degraded colored documents to their potential pristine PDF. The contributions are: firstly, we propose a "Perceive-then-Restore" paradigm with a reinforced transformer block, which more effectively encodes and utilizes the distribution of degradations. Secondly, we are the first to utilize GAN and pristine PDF magazine images to narrow the distribution gap between the enhanced results and PDF images, in pursuit of less degradation and better visual quality. Thirdly, we propose a non-parametric strategy, PFILI, which enables a smaller training scale and larger testing resolutions with acceptable detail trade-off, while saving memory and inference time. Fourthly, we are the first to propose a novel Multi-Degraded Colored Document image Enhancing dataset, named MD-CDE, for both training and evaluation. Experimental results show that the DocStormer exhibits superior performance, capable of revitalizing multi-degraded colored documents into their potential pristine digital versions, which fills the current academic gap from the perspective of method, data, and task.
Published: 2023

14. Assessing size shifts amidst a warming climate in lakes recharged by the Asian Water Tower through satellite imagery

Author: Xu, Nuo, Zhang, Jiahua, Daccache, Andre, Liu, Chong, Ahmadi, Arman, Zhou, Tianyu, and Gou, Peng
Subjects: Earth Sciences, Physical Geography and Environmental Geoscience, Biological Sciences, Climate Action, Climate change, Lake size, Remote sensing, Asian Water Tower, Basin recharge, Environmental Sciences
Abstract: Recent studies indicate that the Asian Water Tower (AWT) is at risk due to climate change, which can negatively impact water and food security in Asia. However, there is a lack of comprehensive information on lakes' spatial and temporal changes in this region. This information is crucial for understanding the risk magnitude and designing strategies. To fill this research gap, we analyzed 89,480 Landsat images from 1977 ± 2 to 2020 ± 2 to investigate the changes in the size of lakes recharged by the AWT. Our findings showed that out of the 209 lakes larger than 50 km2, 176 (84 %) grew during the wet season and 167 (81 %) during the dry season. 74 % of expanded lakes are located in the Inner Tibetan Plateau (TP) and Tarim basins. The lakes that shrank are found mainly in the Helmand, Indus, and Yangtze basins. Over the entire period, the area of shrinkage (55,077.028 km2 in wet season, 53,986.796 km2 in dry) markedly exceeded expansion (13,000.267 km2 in wet, 11,038.805 km2 in dry), with the drastic decline of the Aral Sea being a major contributor to shrinkage, accounting for 90 % of the total loss. From 1990 ± 2 to 2020 ± 2, alpine lakes mostly expanded, plain lakes mostly shrank, with the opposite trend from 1977 ± 2 to 1990 ± 2. Glacial loss and permafrost thawing under global warming in the Inner TP, Tarim Interior, Syr Darya, and Mekong basins were strongly correlated with lake expansion. However, permafrost discontinuities may prevent significant growth of lakes in the Indus and Ganges basins despite increased recharge. Our findings point to the prominence of the risk the lakes recharged by AWT face. Taking immediate action to manage these risks and adaptation is crucial as the AWT retreats and lake recharges are slowed.
Published: 2024

15. Spectral-DP: Differentially Private Deep Learning through Spectral Perturbation and Filtering

Author: Feng, Ce, Xu, Nuo, Wen, Wujie, Venkitasubramaniam, Parv, and Ding, Caiwen
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Computers and Society
Abstract: Differential privacy is a widely accepted measure of privacy in the context of deep learning algorithms, and achieving it relies on a noisy training approach known as differentially private stochastic gradient descent (DP-SGD). DP-SGD requires direct noise addition to every gradient in a dense neural network, the privacy is achieved at a significant utility cost. In this work, we present Spectral-DP, a new differentially private learning approach which combines gradient perturbation in the spectral domain with spectral filtering to achieve a desired privacy guarantee with a lower noise scale and thus better utility. We develop differentially private deep learning methods based on Spectral-DP for architectures that contain both convolution and fully connected layers. In particular, for fully connected layers, we combine a block-circulant based spatial restructuring with Spectral-DP to achieve better utility. Through comprehensive experiments, we study and provide guidelines to implement Spectral-DP deep learning on benchmark datasets. In comparison with state-of-the-art DP-SGD based approaches, Spectral-DP is shown to have uniformly better utility performance in both training from scratch and transfer learning settings., Comment: Accepted in 2023 IEEE Symposium on Security and Privacy (SP)
Published: 2023
Full Text: View/download PDF

16. Secrets of RLHF in Large Language Models Part I: PPO

Author: Zheng, Rui, Dou, Shihan, Gao, Songyang, Hua, Yuan, Shen, Wei, Wang, Binghai, Liu, Yan, Jin, Senjie, Liu, Qin, Zhou, Yuhao, Xiong, Limao, Chen, Lu, Xi, Zhiheng, Xu, Nuo, Lai, Wenbin, Zhu, Minghao, Chang, Cheng, Yin, Zhangyue, Weng, Rongxiang, Cheng, Wensen, Huang, Haoran, Sun, Tianxiang, Yan, Hang, Gui, Tao, Zhang, Qi, Qiu, Xipeng, and Huang, Xuanjing
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Large language models (LLMs) have formulated a blueprint for the advancement of artificial general intelligence. Its primary objective is to function as a human-centric (helpful, honest, and harmless) assistant. Alignment with humans assumes paramount significance, and reinforcement learning with human feedback (RLHF) emerges as the pivotal technological paradigm underpinning this pursuit. Current technical routes usually include \textbf{reward models} to measure human preferences, \textbf{Proximal Policy Optimization} (PPO) to optimize policy model outputs, and \textbf{process supervision} to improve step-by-step reasoning capabilities. However, due to the challenges of reward design, environment interaction, and agent training, coupled with huge trial and error cost of large language models, there is a significant barrier for AI researchers to motivate the development of technical alignment and safe landing of LLMs. The stable training of RLHF has still been a puzzle. In the first report, we dissect the framework of RLHF, re-evaluate the inner workings of PPO, and explore how the parts comprising PPO algorithms impact policy agent training. We identify policy constraints being the key factor for the effective implementation of the PPO algorithm. Therefore, we explore the PPO-max, an advanced version of PPO algorithm, to efficiently improve the training stability of the policy model. Based on our main results, we perform a comprehensive analysis of RLHF abilities compared with SFT models and ChatGPT. The absence of open-source implementations has posed significant challenges to the investigation of LLMs alignment. Therefore, we are eager to release technical reports, reward models and PPO codes, aiming to make modest contributions to the advancement of LLMs.
Published: 2023

17. PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment

Author: Peng, Hongwu, Zhou, Shanglin, Luo, Yukui, Xu, Nuo, Duan, Shijin, Ran, Ran, Zhao, Jiahui, Wang, Chenghong, Geng, Tong, Wen, Wujie, Xu, Xiaolin, and Ding, Caiwen
Subjects: Computer Science - Cryptography and Security, E.3, I.2, B.0
Abstract: Two-party computation (2PC) is promising to enable privacy-preserving deep learning (DL). However, the 2PC-based privacy-preserving DL implementation comes with high comparison protocol overhead from the non-linear operators. This work presents PASNet, a novel systematic framework that enables low latency, high energy efficiency & accuracy, and security-guaranteed 2PC-DL by integrating the hardware latency of the cryptographic building block into the neural architecture search loss function. We develop a cryptographic hardware scheduler and the corresponding performance model for Field Programmable Gate Arrays (FPGA) as a case study. The experimental results demonstrate that our light-weighted model PASNet-A and heavily-weighted model PASNet-B achieve 63 ms and 228 ms latency on private inference on ImageNet, which are 147 and 40 times faster than the SOTA CryptGPU system, and achieve 70.54% & 78.79% accuracy and more than 1000 times higher energy efficiency., Comment: DAC 2023 accepeted publication, short version was published on AAAI 2023 workshop on DL-Hardware Co-Design for AI Acceleration: RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference
Published: 2023

18. Effect of survodutide, a glucagon and GLP-1 receptor dual agonist, on weight loss: a meta-analysis of randomized controlled trials

Author: Wan, Haijun, Xu, Nuo, Wang, Lijuan, Liu, Yaping, Fatahi, Somaye, Sohouli, Mohammad Hassan, and Guimarães, Nathalia Sernizon
Published: 2024
Full Text: View/download PDF

19. VEGFB ameliorates insulin resistance in NAFLD via the PI3K/AKT signal pathway

Author: Li, Yuqi, Li, Wenhao, Zhu, Xiaonan, Xu, Nuo, Meng, Qinyu, Jiang, Wenguo, Zhang, Lei, Yang, Meizi, Xu, Fang, and Li, Yana
Published: 2024
Full Text: View/download PDF

20. A three-minute solid phase-based plant RNA extraction method

Author: Liu, Guiling, Shi, Gongfa, Liu, Huijun, Xu, Nuo, Fan, Lijuan, and Wang, Ling
Published: 2024
Full Text: View/download PDF

21. A novel missense mutation (FGG c.1168G > T) in the gamma chain of fibrinogen causing congenital hypodysfibrinogenemia with bleeding phenotype

Author: Xu, Nuo, Zheng, Liping, Dai, Zhehao, Zhu, Jun, Xie, Peng, Yang, Shun, and Chen, Fei
Published: 2024
Full Text: View/download PDF

22. OsUGE2 Regulates Plant Growth through Affecting ROS Homeostasis and Iron Level in Rice

Author: Yang, Shuaiqi, Chen, Nana, Qi, Jiaxuan, Salam, Abdul, Khan, Ali Raza, Azhar, Wardah, Yang, Chunyan, Xu, Nuo, Wu, Junyu, Liu, Yihua, Liu, Bohan, and Gan, Yinbo
Published: 2024
Full Text: View/download PDF

23. MERGE: Fast Private Text Generation

Author: Liang, Zi, Wang, Pinghui, Zhang, Ruofei, Xu, Nuo, Xing, Lifeng, and Zhang, Shuo
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The drastic increase in language models' parameters has led to a new trend of deploying models in cloud servers, raising growing concerns about private inference for Transformer-based models. Existing two-party privacy-preserving techniques, however, only take into account natural language understanding (NLU) scenarios. Private inference in natural language generation (NLG), crucial for applications like translation and code completion, remains underexplored.In addition, previous privacy-preserving techniques suffer from convergence issues during model training and exhibit poor inference speed when used with NLG models due to the neglect of time-consuming operations in auto-regressive generations. To address these issues, we propose a fast private text generation framework for Transformer-based language models, namely MERGE.MERGE reuses the output hidden state as the word embedding to bypass the embedding computation and reorganize the linear operations in the Transformer module to accelerate the forward procedure. Extensive experiments show that MERGE achieves a 26.5x speedup to the vanilla encrypted model under the sequence length 512, and reduces 80\% communication cost, with an up to 10x speedup to state-of-the-art approximated models., Comment: Accepted by AAAI 2024
Published: 2023

24. Neurogenesis Dynamics-inspired Spiking Neural Network Training Acceleration

Author: Huang, Shaoyi, Fang, Haowen, Mahmood, Kaleel, Lei, Bowen, Xu, Nuo, Lei, Bin, Sun, Yue, Xu, Dongkuan, Wen, Wujie, and Ding, Caiwen
Subjects: Computer Science - Neural and Evolutionary Computing
Abstract: Biologically inspired Spiking Neural Networks (SNNs) have attracted significant attention for their ability to provide extremely energy-efficient machine intelligence through event-driven operation and sparse activities. As artificial intelligence (AI) becomes ever more democratized, there is an increasing need to execute SNN models on edge devices. Existing works adopt weight pruning to reduce SNN model size and accelerate inference. However, these methods mainly focus on how to obtain a sparse model for efficient inference, rather than training efficiency. To overcome these drawbacks, in this paper, we propose a Neurogenesis Dynamics-inspired Spiking Neural Network training acceleration framework, NDSNN. Our framework is computational efficient and trains a model from scratch with dynamic sparsity without sacrificing model fidelity. Specifically, we design a new drop-and-grow strategy with decreasing number of non-zero weights, to maintain extreme high sparsity and high accuracy. We evaluate NDSNN using VGG-16 and ResNet-19 on CIFAR-10, CIFAR-100 and TinyImageNet. Experimental results show that NDSNN achieves up to 20.52\% improvement in accuracy on Tiny-ImageNet using ResNet-19 (with a sparsity of 99\%) as compared to other SOTA methods (e.g., Lottery Ticket Hypothesis (LTH), SET-SNN, RigL-SNN). In addition, the training cost of NDSNN is only 40.89\% of the LTH training cost on ResNet-19 and 31.35\% of the LTH training cost on VGG-16 on CIFAR-10.
Published: 2023

25. PARAGRAPH2GRAPH: A GNN-based framework for layout paragraph analysis

Author: Wei, Shu and Xu, Nuo
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Document layout analysis has a wide range of requirements across various domains, languages, and business scenarios. However, most current state-of-the-art algorithms are language-dependent, with architectures that rely on transformer encoders or language-specific text encoders, such as BERT, for feature extraction. These approaches are limited in their ability to handle very long documents due to input sequence length constraints and are closely tied to language-specific tokenizers. Additionally, training a cross-language text encoder can be challenging due to the lack of labeled multilingual document datasets that consider privacy. Furthermore, some layout tasks require a clean separation between different layout components without overlap, which can be difficult for image segmentation-based algorithms to achieve. In this paper, we present Paragraph2Graph, a language-independent graph neural network (GNN)-based model that achieves competitive results on common document layout datasets while being adaptable to business scenarios with strict separation. With only 19.95 million parameters, our model is suitable for industrial applications, particularly in multi-language scenarios.
Published: 2023

26. A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models

Author: Ye, Junjie, Chen, Xuanting, Xu, Nuo, Zu, Can, Shao, Zekai, Liu, Shichun, Cui, Yuhan, Zhou, Zeyang, Gong, Chao, Shen, Yang, Zhou, Jie, Chen, Siming, Gui, Tao, Zhang, Qi, and Huang, Xuanjing
Subjects: Computer Science - Computation and Language
Abstract: GPT series models, such as GPT-3, CodeX, InstructGPT, ChatGPT, and so on, have gained considerable attention due to their exceptional natural language processing capabilities. However, despite the abundance of research on the difference in capabilities between GPT series models and fine-tuned models, there has been limited attention given to the evolution of GPT series models' capabilities over time. To conduct a comprehensive analysis of the capabilities of GPT series models, we select six representative models, comprising two GPT-3 series models (i.e., davinci and text-davinci-001) and four GPT-3.5 series models (i.e., code-davinci-002, text-davinci-002, text-davinci-003, and gpt-3.5-turbo). We evaluate their performance on nine natural language understanding (NLU) tasks using 21 datasets. In particular, we compare the performance and robustness of different models for each task under zero-shot and few-shot scenarios. Our extensive experiments reveal that the overall ability of GPT series models on NLU tasks does not increase gradually as the models evolve, especially with the introduction of the RLHF training strategy. While this strategy enhances the models' ability to generate human-like responses, it also compromises their ability to solve some tasks. Furthermore, our findings indicate that there is still room for improvement in areas such as model robustness.
Published: 2023

27. How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks

Author: Chen, Xuanting, Ye, Junjie, Zu, Can, Xu, Nuo, Zheng, Rui, Peng, Minlong, Zhou, Jie, Gui, Tao, Zhang, Qi, and Huang, Xuanjing
Subjects: Computer Science - Computation and Language, I.2
Abstract: The GPT-3.5 models have demonstrated impressive performance in various Natural Language Processing (NLP) tasks, showcasing their strong understanding and reasoning capabilities. However, their robustness and abilities to handle various complexities of the open world have yet to be explored, which is especially crucial in assessing the stability of models and is a key aspect of trustworthy AI. In this study, we perform a comprehensive experimental analysis of GPT-3.5, exploring its robustness using 21 datasets (about 116K test samples) with 66 text transformations from TextFlint that cover 9 popular Natural Language Understanding (NLU) tasks. Our findings indicate that while GPT-3.5 outperforms existing fine-tuned models on some tasks, it still encounters significant robustness degradation, such as its average performance dropping by up to 35.74\% and 43.59\% in natural language inference and sentiment analysis tasks, respectively. We also show that GPT-3.5 faces some specific robustness challenges, including robustness instability, prompt sensitivity, and number sensitivity. These insights are valuable for understanding its limitations and guiding future research in addressing these challenges to enhance GPT-3.5's overall performance and generalization abilities.
Published: 2023

28. RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference

Author: Peng, Hongwu, Zhou, Shanglin, Luo, Yukui, Xu, Nuo, Duan, Shijin, Ran, Ran, Zhao, Jiahui, Huang, Shaoyi, Xie, Xi, Wang, Chenghong, Geng, Tong, Wen, Wujie, Xu, Xiaolin, and Ding, Caiwen
Subjects: Computer Science - Cryptography and Security, Computer Science - Machine Learning, I.2
Abstract: The proliferation of deep learning (DL) has led to the emergence of privacy and security concerns. To address these issues, secure Two-party computation (2PC) has been proposed as a means of enabling privacy-preserving DL computation. However, in practice, 2PC methods often incur high computation and communication overhead, which can impede their use in large-scale systems. To address this challenge, we introduce RRNet, a systematic framework that aims to jointly reduce the overhead of MPC comparison protocols and accelerate computation through hardware acceleration. Our approach integrates the hardware latency of cryptographic building blocks into the DNN loss function, resulting in improved energy efficiency, accuracy, and security guarantees. Furthermore, we propose a cryptographic hardware scheduler and corresponding performance model for Field Programmable Gate Arrays (FPGAs) to further enhance the efficiency of our framework. Experiments show RRNet achieved a much higher ReLU reduction performance than all SOTA works on CIFAR-10 dataset., Comment: This is work is a updated version of arXiv:2209.09424, the original version has been withdrawn
Published: 2023

29. CryptoGCN: Fast and Scalable Homomorphically Encrypted Graph Convolutional Network Inference

Author: Ran, Ran, Xu, Nuo, Wang, Wei, Quan, Gang, Yin, Jieming, and Wen, Wujie
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Recently cloud-based graph convolutional network (GCN) has demonstrated great success and potential in many privacy-sensitive applications such as personal healthcare and financial systems. Despite its high inference accuracy and performance on cloud, maintaining data privacy in GCN inference, which is of paramount importance to these practical applications, remains largely unexplored. In this paper, we take an initial attempt towards this and develop $\textit{CryptoGCN}$--a homomorphic encryption (HE) based GCN inference framework. A key to the success of our approach is to reduce the tremendous computational overhead for HE operations, which can be orders of magnitude higher than its counterparts in the plaintext space. To this end, we develop an approach that can effectively take advantage of the sparsity of matrix operations in GCN inference to significantly reduce the computational overhead. Specifically, we propose a novel AMA data formatting method and associated spatial convolution methods, which can exploit the complex graph structure and perform efficient matrix-matrix multiplication in HE computation and thus greatly reduce the HE operations. We also develop a co-optimization framework that can explore the trade offs among the accuracy, security level, and computational overhead by judicious pruning and polynomial approximation of activation module in GCNs. Based on the NTU-XVIEW skeleton joint dataset, i.e., the largest dataset evaluated homomorphically by far as we are aware of, our experimental results demonstrate that $\textit{CryptoGCN}$ outperforms state-of-the-art solutions in terms of the latency and number of homomorphic operations, i.e., achieving as much as a 3.10$\times$ speedup on latency and reduces the total Homomorphic Operation Count by 77.4\% with a small accuracy loss of 1-1.5$\%$., Comment: Accepted in Conference on Neural Information Processing Systems (NeurIPS 2022)
Published: 2022

30. PolyMPCNet: Towards ReLU-free Neural Architecture Search in Two-party Computation Based Private Inference

Author: Peng, Hongwu, Zhou, Shanglin, Luo, Yukui, Duan, Shijin, Xu, Nuo, Ran, Ran, Huang, Shaoyi, Wang, Chenghong, Geng, Tong, Li, Ang, Wen, Wujie, Xu, Xiaolin, and Ding, Caiwen
Subjects: Computer Science - Cryptography and Security, Computer Science - Machine Learning, I.2, E.3, C.3
Abstract: The rapid growth and deployment of deep learning (DL) has witnessed emerging privacy and security concerns. To mitigate these issues, secure multi-party computation (MPC) has been discussed, to enable the privacy-preserving DL computation. In practice, they often come at very high computation and communication overhead, and potentially prohibit their popularity in large scale systems. Two orthogonal research trends have attracted enormous interests in addressing the energy efficiency in secure deep learning, i.e., overhead reduction of MPC comparison protocol, and hardware acceleration. However, they either achieve a low reduction ratio and suffer from high latency due to limited computation and communication saving, or are power-hungry as existing works mainly focus on general computing platforms such as CPUs and GPUs. In this work, as the first attempt, we develop a systematic framework, PolyMPCNet, of joint overhead reduction of MPC comparison protocol and hardware acceleration, by integrating hardware latency of the cryptographic building block into the DNN loss function to achieve high energy efficiency, accuracy, and security guarantee. Instead of heuristically checking the model sensitivity after a DNN is well-trained (through deleting or dropping some non-polynomial operators), our key design principle is to em enforce exactly what is assumed in the DNN design -- training a DNN that is both hardware efficient and secure, while escaping the local minima and saddle points and maintaining high accuracy. More specifically, we propose a straight through polynomial activation initialization method for cryptographic hardware friendly trainable polynomial activation function to replace the expensive 2P-ReLU operator. We develop a cryptographic hardware scheduler and the corresponding performance model for Field Programmable Gate Arrays (FPGA) platform., Comment: Uploaded a new version of the paper in another new submission: RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference [arXiv:2302.02292]
Published: 2022

31. Attacking the Spike: On the Transferability and Security of Spiking Neural Networks to Adversarial Examples

Author: Xu, Nuo, Mahmood, Kaleel, Fang, Haowen, Rathbun, Ethan, Ding, Caiwen, and Wen, Wujie
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Spiking neural networks (SNNs) have attracted much attention for their high energy efficiency and for recent advances in their classification performance. However, unlike traditional deep learning approaches, the analysis and study of the robustness of SNNs to adversarial examples remain relatively underdeveloped. In this work, we focus on advancing the adversarial attack side of SNNs and make three major contributions. First, we show that successful white-box adversarial attacks on SNNs are highly dependent on the underlying surrogate gradient technique, even in the case of adversarially trained SNNs. Second, using the best surrogate gradient technique, we analyze the transferability of adversarial attacks on SNNs and other state-of-the-art architectures like Vision Transformers (ViTs) and Big Transfer Convolutional Neural Networks (CNNs). We demonstrate that the adversarial examples created by non-SNN architectures are not misclassified often by SNNs. Third, due to the lack of an ubiquitous white-box attack that is effective across both the SNN and CNN/ViT domains, we develop a new white-box attack, the Auto Self-Attention Gradient Attack (Auto-SAGA). Our novel attack generates adversarial examples capable of fooling both SNN and non-SNN models simultaneously. Auto-SAGA is as much as $91.1\%$ more effective on SNN/ViT model ensembles and provides a $3\times$ boost in attack effectiveness on adversarially trained SNN ensembles compared to conventional white-box attacks like Auto-PGD. Our experiments and analyses are broad and rigorous covering three datasets (CIFAR-10, CIFAR-100 and ImageNet), five different white-box attacks and nineteen classifier models (seven for each CIFAR dataset and five models for ImageNet).
Published: 2022

32. Achieving High Bonding Quality between AlSi12 and TC4 Alloys by Laser Deposition Melting

Author: Jing, Zhicheng, Liu, Xiangyu, Wang, Wenbo, Xu, Nuo, Xu, Guojian, and Xing, Fei
Published: 2023
Full Text: View/download PDF

33. Microstructure and Mechanical Properties of Ti6Al4V/Inconel625 Bimetallic Structures Fabricated by Laser Melting Deposition

Author: Wang, Wenbo, Xu, Nuo, Liu, Xiangyu, Jing, Zhicheng, Xu, Guojian, and Xing, Fei
Published: 2023
Full Text: View/download PDF

34. NeuGuard: Lightweight Neuron-Guided Defense against Membership Inference Attacks

Author: Xu, Nuo, Wang, Binghui, Ran, Ran, Wen, Wujie, and Venkitasubramaniam, Parv
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Membership inference attacks (MIAs) against machine learning models can lead to serious privacy risks for the training dataset used in the model training. In this paper, we propose a novel and effective Neuron-Guided Defense method named NeuGuard against membership inference attacks (MIAs). We identify a key weakness in existing defense mechanisms against MIAs wherein they cannot simultaneously defend against two commonly used neural network based MIAs, indicating that these two attacks should be separately evaluated to assure the defense effectiveness. We propose NeuGuard, a new defense approach that jointly controls the output and inner neurons' activation with the object to guide the model output of training set and testing set to have close distributions. NeuGuard consists of class-wise variance minimization targeting restricting the final output neurons and layer-wise balanced output control aiming to constrain the inner neurons in each layer. We evaluate NeuGuard and compare it with state-of-the-art defenses against two neural network based MIAs, five strongest metric based MIAs including the newly proposed label-only MIA on three benchmark datasets. Results show that NeuGuard outperforms the state-of-the-art defenses by offering much improved utility-privacy trade-off, generality, and overhead.
Published: 2022
Full Text: View/download PDF

35. Measurement of carbon finance level and exploration of its influencing factors

Author: Zhang, Peng, Zhang, Yuwei, and Xu, Nuo
Subjects: Economics - General Economics
Abstract: Faced with increasingly severe environmental problems, carbon trading markets and related financial activities aiming at limiting carbon dioxide emissions are booming. Considering the complexity and urgency of carbon market, it is necessary to construct an effective evaluation index system. This paper selected carbon finance index as a composite indicator. Taking Beijing, Shanghai, and Guangdong as examples, we adopted the classic method of multiple criteria decision analysis (MCDA) to analyze the composite indicator. Potential impact factors were screened extensively and calculated through normalization, weighting by coefficient of variation and different aggregation methods. Under the measurement of Shannon-Spearman Measure, the method with the least loss of information was used to obtain the carbon finance index (CFI) of the pilot areas. Through panel model analysis, we found that company size, the number of patents per 10,000 people and the proportion of new energy generation were the factors with significant influence. Based on the research, corresponding suggestions were put forward for different market entities. Hopefully, this research will contribute to the steady development of the national carbon market., Comment: 8pages, 3 figures, 14 tables
Published: 2022

36. Enhanced insights into the neutrophil-driven immune mechanisms during Mycoplasma pneumoniae infection

Author: Fan, Lu, Xu, Nuo, Guo, Yun, and Li, Ling
Published: 2024
Full Text: View/download PDF

37. Soybean isoflavones protect dopaminergic neurons from atrazine damage by inhibiting VPS13A to increase autophagy

Author: Li, Peng, Song, Weiyi, Xu, Nuo, Wang, Zijie, Pang, Haoying, and Wang, Dandan
Published: 2024
Full Text: View/download PDF

38. Breastfeeding in infancy and cardiovascular disease in middle-aged and older adulthood: a prospective study of 0.36 million UK Biobank participants

Author: Li, Shanshan, Wang, Xiaoyan, Li, Xinmei, Zhang, Weiwei, Guo, Yingying, Xu, Nuo, Luo, Junkai, Zhu, Shankuan, and He, Wei
Published: 2024
Full Text: View/download PDF

39. Effects of low temperature on postharvest ripening and starchiness in ‘Cuixiang’ kiwifruit

Author: Chai, Jiaxin, Yang, Bin, Xu, Nuo, Jiang, Qinqin, Gao, Zhixiong, Ren, Xiaolin, and Liu, Zhande
Published: 2024
Full Text: View/download PDF

40. Promoting chondrogenesis by targeted delivery to the degenerating cartilage in early treatment of osteoarthritis

Author: Fei, Yuxiang, Li, Xiaojing, Lv, Zhongyang, Liu, Zizheng, Xie, Ya, Chen, Jiaqi, Li, Weitong, Liu, Xiyu, Guo, Hu, Liu, Huan, Zhang, Zhaofeng, Wang, Xunhao, Fan, Jingjing, Hu, Chunqing, Jin, Xiaoyu, Jiang, Ruiyang, Xu, Nuo, Xia, Jiang, Li, Yang, and Shi, Dongquan
Published: 2024
Full Text: View/download PDF

41. Electrocatalysis coupled super-stable mineralization for the efficient treatment of phosphorus containing plating wastewater

Author: Li, Zilong, Xu, Nuo, Liu, Shihua, Wang, Yawen, Rajput, Vishnu D., Minkina, Tatiana, Fan, Faying, Gao, Wa, and Zhao, Yufei
Published: 2025
Full Text: View/download PDF

42. Emerging role of PES1 in disease: A promising therapeutic target?

Author: Yuan, Siyu, Xu, Nuo, Yang, Jing, and Yuan, Bin
Published: 2025
Full Text: View/download PDF

43. Super-stable mineralization of arsenic contaminated water using industrialized layered double hydroxides and derivatives

Author: Xu, Nuo, Li, Zilong, Liu, Shihua, Li, Zixian, Liu, Huijie, Cao, Wenjing, Wang, Yawen, Gao, Wa, Tian, Qiang, Hao, Haigang, Oyuntsetseg, Dolgorjav, Rajput, Vishnu D., Minkina, Tatiana, and Zhao, Yufei
Published: 2024
Full Text: View/download PDF

44. Reparative homing of bone mesenchymal stem cells induced by iMSCs via the SDF-1/CXCR4 axis for articular cartilage defect restoration

Author: Cheng, Gang, Wang, Xulei, Zhang, Feng, Wang, Kang, Li, Ying, Guo, Tingting, Xu, Nuo, Wei, Wei, and Yan, Shangxue
Published: 2024
Full Text: View/download PDF

45. Fair Competition Review System and cross-regional capital flow: Evidence from China

Author: Li, Shuqi and Xu, Nuo
Published: 2024
Full Text: View/download PDF

46. Epidemiology and laboratory detection of non-tuberculous mycobacteria

Author: Xu, Nuo, Li, Lihong, and Wu, Shenghai
Published: 2024
Full Text: View/download PDF

47. Interval Value Z-probabilistic double hierarchy linguistic multi-criteria group decision making method based on ratio system-peference point-full multiplicative form its application in selection of habitable city

Author: Xian, Sidong, Xu, Nuo, Hu, Shuang, and Yin, Longjun
Published: 2024
Full Text: View/download PDF

48. Anomalous attenuation phenomenon of terahertz wave through the micron air gap defects in XLPE insulation material

Author: Xu, Nuo, Liu, Yang, Wang, Zixuan, Wu, Ming, Ahmed, Muneeb, Liu, Yueting, Gao, Jinghui, and Zhong, Lisheng
Published: 2024
Full Text: View/download PDF

49. Nonlinear impacts of climate change on dengue transmission in mainland China: Underlying mechanisms and future projection

Author: Zheng, Zhoumin, Xu, Nuo, Khan, Mohsin, Pedersen, Michael, Abdalgader, Tarteel, and Zhang, Lai
Published: 2024
Full Text: View/download PDF

50. Ubiquitination-specific protease 7 enhances stemness of hepatocellular carcinoma by stabilizing basic transcription factor 3

Author: Hu, Mingchao, Dai, Chengchen, Sun, Xieyin, Chen, Yinqi, Xu, Nuo, Lin, Zhaoyi, Xu, Shiyu, Cheng, Chun, Tan, Zhonghua, Bian, Saiyan, and Zheng, Wenjie
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

916 results on '"Xu, Nuo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources