Author: "Hu, Wei" / Publication Year Range: This year - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Hu, Wei"' showing total 1,974 results

Start Over Author "Hu, Wei" Publication Year Range This year

1,974 results on '"Hu, Wei"'

1. Abrupt Learning in Transformers: A Case Study on Matrix Completion

Author: Gopalani, Pulkit, Lubana, Ekdeep Singh, and Hu, Wei
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Recent analysis on the training dynamics of Transformers has unveiled an interesting characteristic: the training loss plateaus for a significant number of training steps, and then suddenly (and sharply) drops to near--optimal values. To understand this phenomenon in depth, we formulate the low-rank matrix completion problem as a masked language modeling (MLM) task, and show that it is possible to train a BERT model to solve this task to low error. Furthermore, the loss curve shows a plateau early in training followed by a sudden drop to near-optimal values, despite no changes in the training procedure or hyper-parameters. To gain interpretability insights into this sudden drop, we examine the model's predictions, attention heads, and hidden states before and after this transition. Concretely, we observe that (a) the model transitions from simply copying the masked input to accurately predicting the masked entries; (b) the attention heads transition to interpretable patterns relevant to the task; and (c) the embeddings and hidden states encode information relevant to the problem. We also analyze the training dynamics of individual model components to understand the sudden drop in loss., Comment: NeurIPS 2024 Poster
Published: 2024

2. Thermodynamics of Classical One-dimensional Klein-Gordon Lattice Model

Author: Jia, Hu-Wei and Tong, Ning-Hua
Subjects: Condensed Matter - Statistical Mechanics
Abstract: In this paper, we study the thermodynamical properties of the classical one-dimensional Klein-Gordan lattice model ($n \ge 2$) by using the cluster variation method with linear response theory. The results of this method are exact in the thermodynamical limit. We present the single site reduced density matrix $\rho^{(1)}(z)$, averages such as $\langle z^2 \rangle$, $\langle |z^n|\rangle$, and $\langle (z_1-z_2)^2\rangle$, the specific heat $C_v$, and the static correlation functions. We analyzed the scaling behavior and obtained the exact scaling powers of these quantities in the low and high temperaures. Using these results, we gauge the accuracy of the projective truncation approximation for $\phi^{4}$ lattice model., Comment: 18 pages, 14 figures
Published: 2024

3. Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding

Author: Liu, Yang, Liu, Daizong, and Hu, Wei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper tackles the challenging task of 3D visual grounding-locating a specific object in a 3D point cloud scene based on text descriptions. Existing methods fall into two categories: top-down and bottom-up methods. Top-down methods rely on a pre-trained 3D detector to generate and select the best bounding box, resulting in time-consuming processes. Bottom-up methods directly regress object bounding boxes with coarse-grained features, producing worse results. To combine their strengths while addressing their limitations, we propose a joint top-down and bottom-up framework, aiming to enhance the performance while improving the efficiency. Specifically, in the first stage, we propose a bottom-up based proposal generation module, which utilizes lightweight neural layers to efficiently regress and cluster several coarse object proposals instead of using a complex 3D detector. Then, in the second stage, we introduce a top-down based proposal consolidation module, which utilizes graph design to effectively aggregate and propagate the query-related object contexts among the generated proposals for further refinement. By jointly training these two modules, we can avoid the inherent drawbacks of the complex proposals in the top-down framework and the coarse proposals in the bottom-up framework. Experimental results on the ScanRefer benchmark show that our framework is able to achieve the state-of-the-art performance., Comment: Accepted by ICPR2024
Published: 2024

4. A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning

Author: Cui, Yuanning, Sun, Zequn, and Hu, Wei
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Extensive knowledge graphs (KGs) have been constructed to facilitate knowledge-driven tasks across various scenarios. However, existing work usually develops separate reasoning models for different KGs, lacking the ability to generalize and transfer knowledge across diverse KGs and reasoning settings. In this paper, we propose a prompt-based KG foundation model via in-context learning, namely KG-ICL, to achieve a universal reasoning ability. Specifically, we introduce a prompt graph centered with a query-related example fact as context to understand the query relation. To encode prompt graphs with the generalization ability to unseen entities and relations in queries, we first propose a unified tokenizer that maps entities and relations in prompt graphs to predefined tokens. Then, we propose two message passing neural networks to perform prompt encoding and KG reasoning, respectively. We conduct evaluation on 43 different KGs in both transductive and inductive settings. Results indicate that the proposed KG-ICL outperforms baselines on most datasets, showcasing its outstanding generalization and universal reasoning capabilities. The source code is accessible on GitHub: https://github.com/nju-websoft/KG-ICL., Comment: Accepted in the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Published: 2024

5. Dynamics of Concept Learning and Compositional Generalization

Author: Yang, Yongyi, Park, Core Francisco, Lubana, Ekdeep Singh, Okawa, Maya, Hu, Wei, and Tanaka, Hidenori
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Prior work has shown that text-conditioned diffusion models can learn to identify and manipulate primitive concepts underlying a compositional data-generating process, enabling generalization to entirely novel, out-of-distribution compositions. Beyond performance evaluations, these studies develop a rich empirical phenomenology of learning dynamics, showing that models generalize sequentially, respecting the compositional hierarchy of the data-generating process. Moreover, concept-centric structures within the data significantly influence a model's speed of learning the ability to manipulate a concept. In this paper, we aim to better characterize these empirical results from a theoretical standpoint. Specifically, we propose an abstraction of prior work's compositional generalization problem by introducing a structured identity mapping (SIM) task, where a model is trained to learn the identity mapping on a Gaussian mixture with structurally organized centroids. We mathematically analyze the learning dynamics of neural networks trained on this SIM task and show that, despite its simplicity, SIM's learning dynamics capture and help explain key empirical observations on compositional generalization with diffusion models identified in prior work. Our theory also offers several new insights -- e.g., we find a novel mechanism for non-monotonic learning dynamics of test loss in early phases of training. We validate our new predictions by training a text-conditioned diffusion model, bridging our simplified framework and complex generative models. Overall, this work establishes the SIM task as a meaningful theoretical abstraction of concept learning dynamics in modern generative models.
Published: 2024

6. Benign Overfitting in Single-Head Attention

Author: Magen, Roey, Shang, Shuning, Xu, Zhiwei, Frei, Spencer, Hu, Wei, and Vardi, Gal
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: The phenomenon of benign overfitting, where a trained neural network perfectly fits noisy training data but still achieves near-optimal test performance, has been extensively studied in recent years for linear models and fully-connected/convolutional networks. In this work, we study benign overfitting in a single-head softmax attention model, which is the fundamental building block of Transformers. We prove that under appropriate conditions, the model exhibits benign overfitting in a classification setting already after two steps of gradient descent. Moreover, we show conditions where a minimum-norm/maximum-margin interpolator exhibits benign overfitting. We study how the overfitting behavior depends on the signal-to-noise ratio (SNR) of the data distribution, namely, the ratio between norms of signal and noise tokens, and prove that a sufficiently large SNR is both necessary and sufficient for benign overfitting.
Published: 2024

7. Linear Projections of Teacher Embeddings for Few-Class Distillation

Author: Loo, Noel, Iliopoulos, Fotis, Hu, Wei, and Vee, Erik
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Knowledge Distillation (KD) has emerged as a promising approach for transferring knowledge from a larger, more complex teacher model to a smaller student model. Traditionally, KD involves training the student to mimic the teacher's output probabilities, while more advanced techniques have explored guiding the student to adopt the teacher's internal representations. Despite its widespread success, the performance of KD in binary classification and few-class problems has been less satisfactory. This is because the information about the teacher model's generalization patterns scales directly with the number of classes. Moreover, several sophisticated distillation methods may not be universally applicable or effective for data types beyond Computer Vision. Consequently, effective distillation techniques remain elusive for a range of key real-world applications, such as sentiment analysis, search query understanding, and advertisement-query relevance assessment. Taking these observations into account, we introduce a novel method for distilling knowledge from the teacher's model representations, which we term Learning Embedding Linear Projections (LELP). Inspired by recent findings about the structure of final-layer representations, LELP works by identifying informative linear subspaces in the teacher's embedding space, and splitting them into pseudo-subclasses. The student model is then trained to replicate these pseudo-classes. Our experimental evaluation on large-scale NLP benchmarks like Amazon Reviews and Sentiment140 demonstrate the LELP is consistently competitive with, and typically superior to, existing state-of-the-art distillation algorithms for binary and few-class problems, where most KD methods suffer.
Published: 2024

8. GRB 240529A: A Tale of Two Shocks

Author: Sun, Tian-Rui, Geng, Jin-Jun, Yan, Jing-Zhi, Hu, You-Dong, Wu, Xue-Feng, Castro-Tirado, Alberto J., Yang, Chao, Ping, Yi-Ding, Hu, Chen-Ran, Xu, Fan, Gao, Hao-Xuan, Jiang, Ji-An, Zhu, Yan-Tian, Xue, Yongquan, Pérez-García, Ignacio, Wu, Si-Yu, Fernández-García, Emilio, Caballero-García, María D., Sánchez-Ramírez, Rubén, Guziy, Sergiy, Olivares, Ignacio, del Pulgar, Carlos Jesus Pérez, Castellón, A., Castillo, Sebastián, Xiong, Ding-Rong, Pandey, Shashi B., Hiriart, David, García-Segura, Guillermo, Lee, William H., Carrasco-García, I. M., Park, Il H., Meintjes, Petrus J., van Heerden, Hendrik J., Martín-Carrillo, Antonio, Hanlon, Lorraine, Zhang, Bin-Bin, Maury, Alain, Hernández-García, L., Gritsevich, Maria, Rossi, Andrea, Maiorano, Elisabetta, Cusano, Felice, D'Avanzo, Paolo, Ferro, Matteo, Melandri, Andrea, De Pasquale, Massimiliano, Brivio, Riccardo, Fang, Min, Fan, Lu-Lu, Hu, Wei-Da, Wan, Zhen, Hu, Lei, Zuo, Ying-Xi, Tang, Jin-Long, Zhang, Xiao-Ling, Zheng, Xian-Zhong, Li, Bin, Luo, Wen-Tao, Liu, Wei, Wang, Jian, Zhang, Hong-Fei, Liu, Hao, Gao, Jie, Liang, Ming, Wang, Hai-Ren, Yao, Da-Zhi, Cheng, Jing-Quan, Zhao, Wen, and Dai, Zi-Gao
Subjects: Astrophysics - High Energy Astrophysical Phenomena
Abstract: Thanks to the rapidly increasing time-domain facilities, we are entering a golden era of research on gamma-ray bursts (GRBs). In this Letter, we report our observations of GRB 240529A with the Burst Optical Observer and Transient Exploring System, the 1.5-meter telescope at Observatorio Sierra Nevada, the 2.5-meter Wide Field Survey Telescope of China, the Large Binocular Telescope, and the Telescopio Nazionale Galileo. The prompt emission of GRB 240529A shows two comparable energetic episodes separated by a quiescence time of roughly 400 s. Combining all available data on the GRB Coordinates Network, we reveal the simultaneous apparent X-ray plateau and optical re-brightening around $10^3-10^4$ s after the burst. Rather than the energy injection from the magnetar as widely invoked for similar GRBs, the multi-wavelength emissions could be better explained as two shocks launched from the central engine separately. The optical peak time and our numerical modeling suggest that the initial bulk Lorentz factor of the later shock is roughly 50, which indicates that the later jet should be accretion-driven and have a higher mass loading than a typical one. The quiescence time between the two prompt emission episodes may be caused by the transition between different accretion states of a central magnetar or black hole, or the fall-back accretion process. A sample of similar bursts with multiple emission episodes in the prompt phase and sufficient follow-up could help to probe the underlying physics of GRB central engines., Comment: Resubmitted to ApJL after addressing the referee's comments; comments are welcome
Published: 2024

9. Deep Picard Iteration for High-Dimensional Nonlinear PDEs

Author: Han, Jiequn, Hu, Wei, Long, Jihao, and Zhao, Yue
Subjects: Mathematics - Numerical Analysis
Abstract: We present the Deep Picard Iteration (DPI) method, a new deep learning approach for solving high-dimensional partial differential equations (PDEs). The core innovation of DPI lies in its use of Picard iteration to reformulate the typically complex training objectives of neural network-based PDE solutions into much simpler, standard regression tasks based on function values and gradients. This design not only greatly simplifies the optimization process but also offers the potential for further scalability through parallel data generation. Crucially, to fully realize the benefits of regressing on both function values and gradients in the DPI method, we address the issue of infinite variance in the estimators of gradients by incorporating a control variate, supported by our theoretical analysis. Our experiments on problems up to 100 dimensions demonstrate that DPI consistently outperforms existing state-of-the-art methods, with greater robustness to hyperparameters, particularly in challenging scenarios with long time horizons and strong nonlinearity.
Published: 2024

10. Analytical approach for pure high, even-order dispersion solitons

Author: Liao, Xing, Huang, Jiahan, Lu, Daquan, and Hu, Wei
Subjects: Physics - Optics, Nonlinear Sciences - Pattern Formation and Solitons
Abstract: We theoretically solve the nonlinear Schr\"{o}dinger equation describing the propagation of pure high, even order dispersion (PHEODs) solitons by variational approach. The Lagrangian for nonlinear pulse transmission systems with each dispersion order are given and the analytical solutions of PHEOD soltions are obtained and compared with the numerical results. It is shown that the variational results approximate very well for lower orders of dispersion ($\le 8$) and get worst as the order increasing. In addition, using the linear stability analysis, we demonstrate that all PHEOD solitons are stable and obtain the soliton internal modes that accompany soliton transmission. These results are helpful for the application of PHEOD solitons in high energy lasers., Comment: 4 figures
Published: 2024

11. Dual Gravitational Wave Signatures of Instant Preheating

Author: Hu, Wei-Yu, Nakayama, Kazunori, Takhistov, Volodymyr, and Tang, Yong
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, General Relativity and Quantum Cosmology, High Energy Physics - Phenomenology
Abstract: In the instant preheating scenario efficient particle production occurs immediately following the period of inflationary expansion in the early Universe. We demonstrate that instant preheating predicts unique gravitational wave (GW) signals arising from two distinct origins. One source is the bremsstrahlung GWs produced through the decay of superheavy particles, an inevitable consequence of instant preheating. The other is GWs generated from the nonlinear dynamics of the inflaton and coupled scalar fields. Using numerical simulations, we show that the peak of the GW spectrum shifts depending on the coupling constants of the theory. The detection of these dual GW signatures, characteristic of instant preheating, provides novel opportunities for probing the dynamics of the early Universe., Comment: 28 pages, 8 figures
Published: 2024

12. A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement

Author: Zhang, Huan, Cheng, Wei, Wu, Yuhan, and Hu, Wei
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have achieved impressive performance on code generation. Although prior studies enhanced LLMs with prompting techniques and code refinement, they still struggle with complex programming problems due to rigid solution plans. In this paper, we draw on pair programming practices to propose PairCoder, a novel LLM-based framework for code generation. PairCoder incorporates two collaborative LLM agents, namely a Navigator agent for high-level planning and a Driver agent for specific implementation. The Navigator is responsible for proposing promising solution plans, selecting the current optimal plan, and directing the next iteration round based on execution feedback. The Driver follows the guidance of Navigator to undertake initial code generation, code testing, and refinement. This interleaved and iterative workflow involves multi-plan exploration and feedback-based refinement, which mimics the collaboration of pair programmers. We evaluate PairCoder with both open-source and closed-source LLMs on various code generation benchmarks. Extensive experimental results demonstrate the superior accuracy of PairCoder, achieving relative pass@1 improvements of 12.00%-162.43% compared to prompting LLMs directly., Comment: Accepted in the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE 2024)
Published: 2024

13. Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction

Author: Meng, Lingbei, Du, Bi'an, and Hu, Wei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Sparse-view 3D reconstruction stands as a formidable challenge in computer vision, aiming to build complete three-dimensional models from a limited array of viewing perspectives. This task confronts several difficulties: 1) the limited number of input images that lack consistent information; 2) dependence on the quality of input images; and 3) the substantial size of model parameters. To address these challenges, we propose a self-augmented coarse-to-fine Gaussian splatting paradigm, enhanced with a structure-aware mask, for sparse-view 3D reconstruction. In particular, our method initially employs a coarse Gaussian model to obtain a basic 3D representation from sparse-view inputs. Subsequently, we develop a fine Gaussian network to enhance consistent and detailed representation of the output with both 3D geometry augmentation and perceptual view augmentation. During training, we design a structure-aware masking strategy to further improve the model's robustness against sparse inputs and noise.Experimental results on the MipNeRF360 and OmniObject3D datasets demonstrate that the proposed method achieves state-of-the-art performances for sparse input views in both perceptual quality and efficiency.
Published: 2024

14. Using high-fidelity discrete element simulation to calibrate an expeditious terramechanics model in a multibody dynamics framework

Author: Zhang, Yuemin, Dai, Junpeng, Hu, Wei, and Negrut, Dan
Subjects: Computer Science - Computational Engineering, Finance, and Science
Abstract: The wheel-soil interaction has great impact on the dynamics of off-road vehicles in terramechanics applications. The Soil Contact Model (SCM), which anchors an empirical method to characterize the frictional contact between a wheel and soil, has been widely used in off-road vehicle dynamics simulations because it quickly produces adequate results for many terramechanics applications. The SCM approach calls for a set of model parameters that are obtained via a bevameter test. This test is expensive and time consuming to carry out, and in some cases difficult to set up, e.g., in extraterrestrial applications. We propose an approach to address these concerns by conducting the bevameter test in simulation, using a model that captures the physics of the actual experiment with high fidelity. To that end, we model the bevameter test rig as a multibody system, while the dynamics of the soil is captured using a discrete element model (DEM). The multibody dynamics--soil dynamics co-simulation is used to replicate the bevameter test, producing high-fidelity ground truth test data that is subsequently used to calibrate the SCM parameters within a Bayesian inference framework. To test the accuracy of the resulting SCM terramechanics, we run single wheel and full rover simulations using both DEM and SCM terrains. The SCM results match well with those produced by the DEM solution, and the simulation time for SCM is two to three orders of magnitude lower than that of DEM. All simulations in this work are performed using Chrono, an open-source, publicly available simulator. The scripts and models used are available in a public repository for reproducibility studies and further research., Comment: version has Appendix
Published: 2024

15. Finetuning Generative Large Language Models with Discrimination Instructions for Knowledge Graph Completion

Author: Liu, Yang, Tian, Xiaobin, Sun, Zequn, and Hu, Wei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Traditional knowledge graph (KG) completion models learn embeddings to predict missing facts. Recent works attempt to complete KGs in a text-generation manner with large language models (LLMs). However, they need to ground the output of LLMs to KG entities, which inevitably brings errors. In this paper, we present a finetuning framework, DIFT, aiming to unleash the KG completion ability of LLMs and avoid grounding errors. Given an incomplete fact, DIFT employs a lightweight model to obtain candidate entities and finetunes an LLM with discrimination instructions to select the correct one from the given candidates. To improve performance while reducing instruction data, DIFT uses a truncated sampling method to select useful facts for finetuning and injects KG embeddings into the LLM. Extensive experiments on benchmark datasets demonstrate the effectiveness of our proposed framework., Comment: Accepted in the 23rd International Semantic Web Conference (ISWC 2024)
Published: 2024

16. LTRL: Boosting Long-tail Recognition via Reflective Learning

Author: Zhao, Qihao, Dai, Yalun, Lin, Shen, Hu, Wei, Zhang, Fan, and Liu, Jun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In real-world scenarios, where knowledge distributions exhibit long-tail. Humans manage to master knowledge uniformly across imbalanced distributions, a feat attributed to their diligent practices of reviewing, summarizing, and correcting errors. Motivated by this learning process, we propose a novel learning paradigm, called reflecting learning, in handling long-tail recognition. Our method integrates three processes for reviewing past predictions during training, summarizing and leveraging the feature relation across classes, and correcting gradient conflict for loss functions. These designs are lightweight enough to plug and play with existing long-tail learning methods, achieving state-of-the-art performance in popular long-tail visual benchmarks. The experimental results highlight the great potential of reflecting learning in dealing with long-tail recognition., Comment: ECCV2024, Oral
Published: 2024

17. Expanding the Scope: Inductive Knowledge Graph Reasoning with Multi-Starting Progressive Propagation

Author: Shao, Zhoutian, Cui, Yuanning, and Hu, Wei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Knowledge graphs (KGs) are widely acknowledged as incomplete, and new entities are constantly emerging in the real world. Inductive KG reasoning aims to predict missing facts for these new entities. Among existing models, graph neural networks (GNNs) based ones have shown promising performance for this task. However, they are still challenged by inefficient message propagation due to the distance and scalability issues. In this paper, we propose a new inductive KG reasoning model, MStar, by leveraging conditional message passing neural networks (C-MPNNs). Our key insight is to select multiple query-specific starting entities to expand the scope of progressive propagation. To propagate query-related messages to a farther area within limited steps, we subsequently design a highway layer to propagate information toward these selected starting entities. Moreover, we introduce a training strategy called LinkVerify to mitigate the impact of noisy training samples. Experimental results validate that MStar achieves superior performance compared with state-of-the-art models, especially for distant entities., Comment: Accepted in the 23rd International Semantic Web Conference (ISWC 2024)
Published: 2024

18. A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends

Author: Liu, Daizong, Yang, Mingyu, Qu, Xiaoye, Zhou, Pan, Cheng, Yu, and Hu, Wei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: With the significant development of large models in recent years, Large Vision-Language Models (LVLMs) have demonstrated remarkable capabilities across a wide range of multimodal understanding and reasoning tasks. Compared to traditional Large Language Models (LLMs), LVLMs present great potential and challenges due to its closer proximity to the multi-resource real-world applications and the complexity of multi-modal processing. However, the vulnerability of LVLMs is relatively underexplored, posing potential security risks in daily usage. In this paper, we provide a comprehensive review of the various forms of existing LVLM attacks. Specifically, we first introduce the background of attacks targeting LVLMs, including the attack preliminary, attack challenges, and attack resources. Then, we systematically review the development of LVLM attack methods, such as adversarial attacks that manipulate model outputs, jailbreak attacks that exploit model vulnerabilities for unauthorized actions, prompt injection attacks that engineer the prompt type and pattern, and data poisoning that affects model training. Finally, we discuss promising research directions in the future. We believe that our survey provides insights into the current landscape of LVLM vulnerabilities, inspiring more researchers to explore and mitigate potential safety issues in LVLM developments. The latest papers on LVLM attacks are continuously collected in https://github.com/liudaizong/Awesome-LVLM-Attack.
Published: 2024

19. Planning with Large Language Models for Conversational Agents

Author: Li, Zhigen, Peng, Jianxiang, Wang, Yanmeng, Shen, Tianhao, Zhang, Minghui, Su, Linxi, Wu, Shang, Wu, Yihang, Wang, Yuqian, Wang, Ye, Hu, Wei, Li, Jianfeng, Wang, Shaojun, Xiao, Jing, and Xiong, Deyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Controllability and proactivity are crucial properties of autonomous conversational agents (CAs). Controllability requires the CAs to follow the standard operating procedures (SOPs), such as verifying identity before activating credit cards. Proactivity requires the CAs to guide the conversation towards the goal during user uncooperation, such as persuasive dialogue. Existing research cannot be unified with controllability, proactivity, and low manual annotation. To bridge this gap, we propose a new framework for planning-based conversational agents (PCA) powered by large language models (LLMs), which only requires humans to define tasks and goals for the LLMs. Before conversation, LLM plans the core and necessary SOP for dialogue offline. During the conversation, LLM plans the best action path online referring to the SOP, and generates responses to achieve process controllability. Subsequently, we propose a semi-automatic dialogue data creation framework and curate a high-quality dialogue dataset (PCA-D). Meanwhile, we develop multiple variants and evaluation metrics for PCA, e.g., planning with Monte Carlo Tree Search (PCA-M), which searches for the optimal dialogue action while satisfying SOP constraints and achieving the proactive of the dialogue. Experiment results show that LLMs finetuned on PCA-D can significantly improve the performance and generalize to unseen domains. PCA-M outperforms other CoT and ToT baselines in terms of conversation controllability, proactivity, task success rate, and overall logical coherence, and is applicable in industry dialogue scenarios. The dataset and codes are available at XXXX.
Published: 2024

20. PWDFT-SW: Extending the Limit of Plane-Wave DFT Calculations to 16K Atoms on the New Sunway Supercomputer

Author: Jiang, Qingcai, Cao, Zhenwei, Chen, Junshi, Qin, Xinming, Hu, Wei, An, Hong, and Yang, Jinlong
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: First-principles density functional theory (DFT) with plane wave (PW) basis set is the most widely used method in quantum mechanical material simulations due to its advantages in accuracy and universality. However, a perceived drawback of PW-based DFT calculations is their substantial computational cost and memory usage, which currently limits their ability to simulate large-scale complex systems containing thousands of atoms. This situation is exacerbated in the new Sunway supercomputer, where each process is limited to a mere 16 GB of memory. Herein, we present a novel parallel implementation of plane wave density functional theory on the new Sunway supercomputer (PWDFT-SW). PWDFT-SW fully extracts the benefits of Sunway supercomputer by extensively refactoring and calibrating our algorithms to align with the system characteristics of the Sunway system. Through extensive numerical experiments, we demonstrate that our methods can substantially decrease both computational costs and memory usage. Our optimizations translate to a speedup of 64.8x for a physical system containing 4,096 silicon atoms, enabling us to push the limit of PW-based DFT calculations to large-scale systems containing 16,384 carbon atoms.
Published: 2024

21. EffectiveASR: A Single-Step Non-Autoregressive Mandarin Speech Recognition Architecture with High Accuracy and Inference Speed

Author: Zhuang, Ziyang, Miao, Chenfeng, Zou, Kun, Fang, Ming, Wei, Tao, Li, Zijian, Cheng, Ning, Hu, Wei, Wang, Shaojun, and Xiao, Jing
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Non-autoregressive (NAR) automatic speech recognition (ASR) models predict tokens independently and simultaneously, bringing high inference speed. However, there is still a gap in the accuracy of the NAR models compared to the autoregressive (AR) models. In this paper, we propose a single-step NAR ASR architecture with high accuracy and inference speed, called EffectiveASR. It uses an Index Mapping Vector (IMV) based alignment generator to generate alignments during training, and an alignment predictor to learn the alignments for inference. It can be trained end-to-end (E2E) with cross-entropy loss combined with alignment loss. The proposed EffectiveASR achieves competitive results on the AISHELL-1 and AISHELL-2 Mandarin benchmarks compared to the leading models. Specifically, it achieves character error rates (CER) of 4.26%/4.62% on the AISHELL-1 dev/test dataset, which outperforms the AR Conformer with about 30x inference speedup., Comment: Submitted to ICASSP 2025
Published: 2024

22. A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions

Author: Liu, Daizong, Liu, Yang, Huang, Wencan, and Hu, Wei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text-guided 3D visual grounding (T-3DVG), which aims to locate a specific object that semantically corresponds to a language query from a complicated 3D scene, has drawn increasing attention in the 3D research community over the past few years. Compared to 2D visual grounding, this task presents great potential and challenges due to its closer proximity to the real world and the complexity of data collection and 3D point cloud source processing. In this survey, we attempt to provide a comprehensive overview of the T-3DVG progress, including its fundamental elements, recent research advances, and future research directions. To the best of our knowledge, this is the first systematic survey on the T-3DVG task. Specifically, we first provide a general structure of the T-3DVG pipeline with detailed components in a tutorial style, presenting a complete background overview. Then, we summarize the existing T-3DVG approaches into different categories and analyze their strengths and weaknesses. We also present the benchmark datasets and evaluation metrics to assess their performances. Finally, we discuss the potential limitations of existing T-3DVG and share some insights on several promising research directions. The latest papers are continually collected at https://github.com/liudaizong/Awesome-3D-Visual-Grounding.
Published: 2024

23. A cytosol-tethered YHB variant of phytochrome B retains photomorphogenic signaling activity

Author: Hu, Wei and Lagarias, J Clark
Subjects: Plant Biology, Biological Sciences, Genetics, Aetiology, 2.1 Biological and endogenous factors, Phytochrome B, Arabidopsis, Cytosol, Arabidopsis Proteins, Signal Transduction, Basic Helix-Loop-Helix Transcription Factors, Hypocotyl, Plants, Genetically Modified, Light, Mutation, Gene Expression Regulation, Plant, Seedlings, Phenotype, Light-independent phyB signaling, Cytoplasmic phytochrome signaling, Photomorphogenesis, Subcellular localization, Plant photoreceptors, Biochemistry and Cell Biology, Plant Biology & Botany, Plant biology
Abstract: The red and far-red light photoreceptor phytochrome B (phyB) transmits light signals following cytosol-to-nuclear translocation to regulate transcriptional networks therein. This necessitates changes in protein-protein interactions of phyB in the cytosol, about which little is presently known. Via introduction of a nucleus-excluding G767R mutation into the dominant, constitutively active phyBY276H (YHB) allele, we explore the functional consequences of expressing a cytosol-localized YHBG767R variant in transgenic Arabidopsis seedlings. We show that YHBG767R elicits selective constitutive photomorphogenic phenotypes in dark-grown phyABCDE null mutants, wild type and other phy-deficient genotypes. These responses include light-independent apical hook opening, cotyledon unfolding, seed germination and agravitropic hypocotyl growth with minimal suppression of hypocotyl elongation. Such phenotypes correlate with reduced PIF3 levels, which implicates cytosolic targeting of PIF3 turnover or PIF3 translational inhibition by YHBG767R. However, as expected for a cytoplasm-tethered phyB, YHBG767R elicits reduced light-mediated signaling activity compared with similarly expressed wild-type phyB in phyABCDE mutant backgrounds. YHBG767R also interferes with wild-type phyB light signaling, presumably by formation of cytosol-retained and/or otherwise inactivated heterodimers. Our results suggest that cytosolic interactions with PIFs play an important role in phyB signaling even under physiological conditions.
Published: 2024

24. System-level time computation and representation in the suprachiasmatic nucleus revealed by large-scale calcium imaging and machine learning.

Author: Wang, Zichen, Yu, Jing, Zhai, Muyue, Wang, Zehua, Sheng, Kaiwen, Zhu, Yu, Wang, Tianyu, Liu, Mianzhi, Wang, Lu, Yan, Miao, Zhang, Jue, Xu, Ying, Wang, Xianhua, Ma, Lei, Hu, Wei, and Cheng, Heping
Subjects: Machine Learning, Suprachiasmatic Nucleus, Animals, Calcium, Mice, Male, Calcium Signaling, Circadian Rhythm, Mice, Inbred C57BL, GABAergic Neurons, Circadian Clocks, Neurons
Abstract: The suprachiasmatic nucleus (SCN) is the mammalian central circadian pacemaker with heterogeneous neurons acting in concert while each neuron harbors a self-sustained molecular clockwork. Nevertheless, how system-level SCN signals encode time of the day remains enigmatic. Here we show that population-level Ca2+ signals predict hourly time, via a group decision-making mechanism coupled with a spatially modular time feature representation in the SCN. Specifically, we developed a high-speed dual-view two-photon microscope for volumetric Ca2+ imaging of up to 9000 GABAergic neurons in adult SCN slices, and leveraged machine learning methods to capture emergent properties from multiscale Ca2+ signals as a whole. We achieved hourly time prediction by polling random cohorts of SCN neurons, reaching 99.0% accuracy at a cohort size of 900. Further, we revealed that functional neuron subtypes identified by contrastive learning tend to aggregate separately in the SCN space, giving rise to bilaterally symmetrical ripple-like modular patterns. Individual modules represent distinctive time features, such that a module-specifically learned time predictor can also accurately decode hourly time from random polling of the same module. These findings open a new paradigm in deciphering the design principle of the biological clock at the system level.
Published: 2024

25. Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation

Author: Liu, Yi, Liu, Xiangyu, Zhu, Xiangrong, and Hu, Wei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Multi-aspect controllable text generation aims to control the generated texts in attributes from multiple aspects (e.g., "positive" from sentiment and "sport" from topic). For ease of obtaining training samples, existing works neglect attribute correlations formed by the intertwining of different attributes. Particularly, the stereotype formed by imbalanced attribute correlations significantly affects multi-aspect control. In this paper, we propose MAGIC, a new multi-aspect controllable text generation method with disentangled counterfactual augmentation. We alleviate the issue of imbalanced attribute correlations during training using counterfactual feature vectors in the attribute latent space by disentanglement. During inference, we enhance attribute correlations by target-guided counterfactual augmentation to further improve multi-aspect control. Experiments show that MAGIC outperforms state-of-the-art baselines in both imbalanced and balanced attribute correlation scenarios. Our source code and data are available at https://github.com/nju-websoft/MAGIC., Comment: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Published: 2024

26. Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion

Author: Cheng, Wei, Wu, Yuhan, and Hu, Wei
Subjects: Computer Science - Software Engineering, Computer Science - Computation and Language
Abstract: Recent years have witnessed the deployment of code language models (LMs) in various code intelligence tasks such as code completion. Yet, it is challenging for pre-trained LMs to generate correct completions in private repositories. Previous studies retrieve cross-file context based on import relations or text similarity, which is insufficiently relevant to completion targets. In this paper, we propose a dataflow-guided retrieval augmentation approach, called DraCo, for repository-level code completion. DraCo parses a private repository into code entities and establishes their relations through an extended dataflow analysis, forming a repo-specific context graph. Whenever triggering code completion, DraCo precisely retrieves relevant background knowledge from the repo-specific context graph and generates well-formed prompts to query code LMs. Furthermore, we construct a large Python dataset, ReccEval, with more diverse completion targets. Our experiments demonstrate the superior accuracy and applicable efficiency of DraCo, improving code exact match by 3.43% and identifier F1-score by 3.27% on average compared to the state-of-the-art approach., Comment: Accepted in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)
Published: 2024

27. BO4IO: A Bayesian optimization approach to inverse optimization with uncertainty quantification

Author: Lu, Yen-An, Hu, Wei-Shou, Paulson, Joel A., and Zhang, Qi
Subjects: Mathematics - Optimization and Control, Computer Science - Machine Learning
Abstract: This work addresses data-driven inverse optimization (IO), where the goal is to estimate unknown parameters in an optimization model from observed decisions that can be assumed to be optimal or near-optimal solutions to the optimization problem. The IO problem is commonly formulated as a large-scale bilevel program that is notoriously difficult to solve. Deviating from traditional exact solution methods, we propose a derivative-free optimization approach based on Bayesian optimization, which we call BO4IO, to solve general IO problems. We treat the IO loss function as a black box and approximate it with a Gaussian process model. Using the predicted posterior function, an acquisition function is minimized at each iteration to query new candidate solutions and sequentially converge to the optimal parameter estimates. The main advantages of using Bayesian optimization for IO are two-fold: (i) it circumvents the need of complex reformulations of the bilevel program or specialized algorithms and can hence enable computational tractability even when the underlying optimization problem is nonconvex or involves discrete variables, and (ii) it allows approximations of the profile likelihood, which provide uncertainty quantification on the IO parameter estimates. We apply the proposed method to three computational case studies, covering different classes of forward optimization problems ranging from convex nonlinear to nonconvex mixed-integer nonlinear programs. Our extensive computational results demonstrate the efficacy and robustness of BO4IO to accurately estimate unknown model parameters from small and noisy datasets. In addition, the proposed profile likelihood analysis has proven to be effective in providing good approximations of the confidence intervals on the parameter estimates and assessing the identifiability of the unknown parameters.
Published: 2024

28. Using physics-based simulation towards eliminating empiricism in extraterrestrial terramechanics applications

Author: Hu, Wei, Li, Pei, Rogg, Arno, Schepelmann, Alexander, Creager, Colin, Chandler, Samuel, Kamrin, Ken, and Negrut, Dan
Subjects: Astrophysics - Instrumentation and Methods for Astrophysics, Astrophysics - Earth and Planetary Astrophysics, Computer Science - Robotics
Abstract: Recently, there has been a surge of international interest in extraterrestrial exploration targeting the Moon, Mars, the moons of Mars, and various asteroids. This contribution discusses how current state-of-the-art Earth-based testing for designing rovers and landers for these missions currently leads to overly optimistic conclusions about the behavior of these devices upon deployment on the targeted celestial bodies. The key misconception is that gravitational offset is necessary during the \textit{terramechanics} testing of rover and lander prototypes on Earth. The body of evidence supporting our argument is tied to a small number of studies conducted during parabolic flights and insights derived from newly revised scaling laws. We argue that what has prevented the community from fully diagnosing the problem at hand is the absence of effective physics-based models capable of simulating terramechanics under low gravity conditions. We developed such a physics-based simulator and utilized it to gauge the mobility of early prototypes of the Volatiles Investigating Polar Exploration Rover (VIPER), which is slated to depart for the Moon in November 2024. This contribution discusses the results generated by this simulator, how they correlate with physical test results from the NASA-Glenn SLOPE lab, and the fallacy of the gravitational offset in rover and lander testing. The simulator developed is open sourced and made publicly available for unfettered use; it can support principled studies that extend beyond trafficability analysis to provide insights into in-situ resource utilization activities, e.g., digging, bulldozing, and berming in low gravity.
Published: 2024

29. Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction

Author: Chen, Jianhao, Ouyang, Haoyuan, Ren, Junyang, Ding, Wentao, Hu, Wei, and Qu, Yuzhong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Facts extraction is pivotal for constructing knowledge graphs. Recently, the increasing demand for temporal facts in downstream tasks has led to the emergence of the task of temporal fact extraction. In this paper, we specifically address the extraction of temporal facts from natural language text. Previous studies fail to handle the challenge of establishing time-to-fact correspondences in complex sentences. To overcome this hurdle, we propose a timeline-based sentence decomposition strategy using large language models (LLMs) with in-context learning, ensuring a fine-grained understanding of the timeline associated with various facts. In addition, we evaluate the performance of LLMs for direct temporal fact extraction and get unsatisfactory results. To this end, we introduce TSDRE, a method that incorporates the decomposition capabilities of LLMs into the traditional fine-tuning of smaller pre-trained language models (PLMs). To support the evaluation, we construct ComplexTRED, a complex temporal fact extraction dataset. Our experiments show that TSDRE achieves state-of-the-art results on both HyperRED-Temporal and ComplexTRED datasets., Comment: Accepted to ACL2024 main conference
Published: 2024

30. Design and Implementation of Ultra-Wideband Dual-Polarized Conformal Phased Array

Author: Chen, Zhan, Hu, Wei, Gao, Yuchen, Wang, Xiangbo, and Luo, Qi
Subjects: Computer Science - Information Theory
Abstract: This letter presents an innovative design and implementation method for an ultra-wideband dual-polarized conformal phased array. The performance of single-layer continuous dipoles under bending conditions is evaluated through characteristic mode analysis to understand the impact of curvature on operational stability. This analysis provides design insights into the maximum allowable curvature while maintaining stable dipole performance. Subsequently, a single-layer metal radiation structure is designed as the array element, and simulations indicate that it supports ultra-wideband operation with dual polarizations. Using this element, an 8 * 8 ultra-wideband dual-polarized cylindrical-conformal array (UDCA) is developed, demonstrating stable performance even at a curvature radius as small as 100 mm. A physical prototype is fabricated cost-effectively using novel manufacturing techniques that stack a three-layer conformal substrate. Experimental result demonstrates that the proposed UDCA with a 1.2{\lambda} curvature radius operates at 3.6~9.6 GHz (90.9%) and achieves 60{\deg} wide-scanning in E-/H-planes, which provides a practical and promising solution for conformal array applications., Comment: The paper is submitted to lEEE Antennas and Wireless Propagation Letters
Published: 2024

31. HERTA: A High-Efficiency and Rigorous Training Algorithm for Unfolded Graph Neural Networks

Author: Yang, Yongyi, Yang, Jiaming, Hu, Wei, and Dereziński, Michał
Subjects: Computer Science - Machine Learning
Abstract: As a variant of Graph Neural Networks (GNNs), Unfolded GNNs offer enhanced interpretability and flexibility over traditional designs. Nevertheless, they still suffer from scalability challenges when it comes to the training cost. Although many methods have been proposed to address the scalability issues, they mostly focus on per-iteration efficiency, without worst-case convergence guarantees. Moreover, those methods typically add components to or modify the original model, thus possibly breaking the interpretability of Unfolded GNNs. In this paper, we propose HERTA: a High-Efficiency and Rigorous Training Algorithm for Unfolded GNNs that accelerates the whole training process, achieving a nearly-linear time worst-case training guarantee. Crucially, HERTA converges to the optimum of the original model, thus preserving the interpretability of Unfolded GNNs. Additionally, as a byproduct of HERTA, we propose a new spectral sparsification method applicable to normalized and regularized graph Laplacians that ensures tighter bounds for our algorithm than existing spectral sparsifiers do. Experiments on real-world datasets verify the superiority of HERTA as well as its adaptability to various loss functions and optimizers.
Published: 2024

32. KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation

Author: Luo, Xindi, Sun, Zequn, Zhao, Jing, Zhao, Zhe, and Hu, Wei
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Parameter-efficient finetuning (PEFT) is a key technique for adapting large language models (LLMs) to downstream tasks. In this paper, we study leveraging knowledge graph embeddings to improve the effectiveness of PEFT. We propose a knowledgeable adaptation method called KnowLA. It inserts an adaptation layer into an LLM to integrate the embeddings of entities appearing in the input text. The adaptation layer is trained in combination with LoRA on instruction data. Experiments on six benchmarks with two popular LLMs and three knowledge graphs demonstrate the effectiveness and robustness of KnowLA. We show that \modelname can help activate the relevant parameterized knowledge in an LLM to answer a question without changing its parameters or input prompts., Comment: Accepted in the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)
Published: 2024

33. A fast low-rank inversion algorithm of dielectric matrix in GW approximation

Author: Zhou, Zhengbang, Ma, Huanhuan, Wu, Wentiao, Gao, Weiguo, Yang, Jinlong, Shao, Meiyue, and Hu, Wei
Subjects: Mathematics - Numerical Analysis, G.1.3, J.2
Abstract: The dielectric response function and its inverse are crucial physical quantities in materials science. We propose an accurate and efficient strategy to invert the dielectric function matrix. The GW approximation, a powerful approach to accurately describe many-body excited states, is taken as an application to demonstrate accuracy and efficiency. We incorporate the interpolative separable density fitting (ISDF) algorithm with Sherman--Morrison--Woodbury (SMW) formula to accelerate the inversion process by exploiting low-rank properties of dielectric function in plane-wave GW calculations. Our ISDF--SMW strategy produces accurate quasiparticle energies with $O(N_{\mathrm{r}}N_{\mathrm{e}}^2)$ computational cost $(N_{\mathrm{e}}$ is the number of electrons and $N_{\mathrm{r}}=100$--$1000N_{\mathrm{e}}$ is the number of grid points) with negligible small error of $0.03$ eV for both complex molecules and solids. This new strategy for inverting the dielectric matrix can be $50\times$ faster than the current state-of-the-art implementation in BerkeleyGW, resulting in two orders of magnitude speedup for total GW calculations.
Published: 2024

34. CasSR: Activating Image Power for Real-World Image Super-Resolution

Author: Chen, Haolan, Hao, Jinhua, Zhao, Kai, Yuan, Kun, Sun, Ming, Zhou, Chao, and Hu, Wei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The objective of image super-resolution is to generate clean and high-resolution images from degraded versions. Recent advancements in diffusion modeling have led to the emergence of various image super-resolution techniques that leverage pretrained text-to-image (T2I) models. Nevertheless, due to the prevalent severe degradation in low-resolution images and the inherent characteristics of diffusion models, achieving high-fidelity image restoration remains challenging. Existing methods often exhibit issues including semantic loss, artifacts, and the introduction of spurious content not present in the original image. To tackle this challenge, we propose Cascaded diffusion for Super-Resolution, CasSR , a novel method designed to produce highly detailed and realistic images. In particular, we develop a cascaded controllable diffusion model that aims to optimize the extraction of information from low-resolution images. This model generates a preliminary reference image to facilitate initial information extraction and degradation mitigation. Furthermore, we propose a multi-attention mechanism to enhance the T2I model's capability in maximizing the restoration of the original image content. Through a comprehensive blend of qualitative and quantitative analyses, we substantiate the efficacy and superiority of our approach.
Published: 2024

35. RangeLDM: Fast Realistic LiDAR Point Cloud Generation

Author: Hu, Qianjiang, Zhang, Zhimin, and Hu, Wei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Autonomous driving demands high-quality LiDAR data, yet the cost of physical LiDAR sensors presents a significant scaling-up challenge. While recent efforts have explored deep generative models to address this issue, they often consume substantial computational resources with slow generation speeds while suffering from a lack of realism. To address these limitations, we introduce RangeLDM, a novel approach for rapidly generating high-quality range-view LiDAR point clouds via latent diffusion models. We achieve this by correcting range-view data distribution for accurate projection from point clouds to range images via Hough voting, which has a critical impact on generative learning. We then compress the range images into a latent space with a variational autoencoder, and leverage a diffusion model to enhance expressivity. Additionally, we instruct the model to preserve 3D structural fidelity by devising a range-guided discriminator. Experimental results on KITTI-360 and nuScenes datasets demonstrate both the robust expressiveness and fast speed of our LiDAR point cloud generation.
Published: 2024

36. Near-Interpolators: Rapid Norm Growth and the Trade-Off between Interpolation and Generalization

Author: Wang, Yutong, Sonthalia, Rishi, and Hu, Wei
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: We study the generalization capability of nearly-interpolating linear regressors: $\boldsymbol{\beta}$'s whose training error $\tau$ is positive but small, i.e., below the noise floor. Under a random matrix theoretic assumption on the data distribution and an eigendecay assumption on the data covariance matrix $\boldsymbol{\Sigma}$, we demonstrate that any near-interpolator exhibits rapid norm growth: for $\tau$ fixed, $\boldsymbol{\beta}$ has squared $\ell_2$-norm $\mathbb{E}[\|{\boldsymbol{\beta}}\|_{2}^{2}] = \Omega(n^{\alpha})$ where $n$ is the number of samples and $\alpha >1$ is the exponent of the eigendecay, i.e., $\lambda_i(\boldsymbol{\Sigma}) \sim i^{-\alpha}$. This implies that existing data-independent norm-based bounds are necessarily loose. On the other hand, in the same regime we precisely characterize the asymptotic trade-off between interpolation and generalization. Our characterization reveals that larger norm scaling exponents $\alpha$ correspond to worse trade-offs between interpolation and generalization. We verify empirically that a similar phenomenon holds for nearly-interpolating shallow neural networks., Comment: AISTATS 2024
Published: 2024

37. LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

Author: Zhao, Qihao, Dai, Yalun, Li, Hao, Hu, Wei, Zhang, Fan, and Liu, Jun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Long-tail recognition is challenging because it requires the model to learn good representations from tail categories and address imbalances across all categories. In this paper, we propose a novel generative and fine-tuning framework, LTGC, to handle long-tail recognition via leveraging generated content. Firstly, inspired by the rich implicit knowledge in large-scale models (e.g., large language models, LLMs), LTGC leverages the power of these models to parse and reason over the original tail data to produce diverse tail-class content. We then propose several novel designs for LTGC to ensure the quality of the generated data and to efficiently fine-tune the model using both the generated and original data. The visualization demonstrates the effectiveness of the generation module in LTGC, which produces accurate and diverse tail data. Additionally, the experimental results demonstrate that our LTGC outperforms existing state-of-the-art methods on popular long-tailed benchmarks., Comment: CVPR 2024, Oral
Published: 2024

38. Simultaneously blocking ANGPTL3 and IL-1β for the treatment of atherosclerosis through lipid-lowering and anti-inflammation

Author: Wang, Hanqi, Hu, Xiaozhi, Zhang, Yuting, Zhu, An, Fan, Jiajun, Wu, Zhengyu, Wang, Xuebin, Hu, Wei, and Ju, Dianwen
Published: 2024
Full Text: View/download PDF

39. MeFD-Net: multi-expert fusion diagnostic network for generating radiology image reports

Author: Ran, Ruisheng, Pan, Renjie, Yang, Wen, Deng, Yan, Zhang, Wenfeng, Hu, Wei, and Qing, Qibing
Published: 2024
Full Text: View/download PDF

40. Beneficial effect of sodium-glucose cotransporter-2 inhibitors on mortality among patients with cancer and diabetes mellitus

Author: Hu, Wei-Syun and Lin, Cheng-Li
Published: 2024
Full Text: View/download PDF

41. NASICON Li1.3Al0.3Ti1.7(PO4)3 electrolyte coating enables stable cycling of Li-rich manganese-based cathode

Author: Hu, Wei, Li, Xiao-Yan, Huang, Jing-Biao, and Zhong, Sheng-Wen
Published: 2024
Full Text: View/download PDF

42. Total tubeless percutaneous nephrolithotomy without retrograde insertion of a ureteral catheter for the treatment of kidney stone patients without hydronephrosis: a randomized controlled trial

Author: Fu, Xiaowen, Hu, Wei, Deng, Weiming, Jin, Wei, Zu, Xiongbing, Zhu, Guoqiang, and Li, Mingyong
Published: 2024
Full Text: View/download PDF

43. Comparative study on non-isothermal dehydroxylation kinetics of talc based on multi-scan thermogravimetry and thermodilatometry methods

Author: Zhang, Xianghui, Yang, Huan, Cheng, Wenchong, Tang, Chong, Hu, Wei, Dai, Yanqiu, Kou, Yuanyu, Lei, Shengjun, Yang, Wenling, Liu, Qin, Wang, Ling, and Feng, Qian
Published: 2024
Full Text: View/download PDF

44. Soil Quality Assessment of the Cultivated Land Around the Tailing Ponds in the Qinling Mountains of China

Author: Weige Yang, Hu, Wei, Ye, Yuanyuan, and Zhuang, Danya
Published: 2024
Full Text: View/download PDF

45. Linking servant leadership to employees’ knowledge sharing: the role of thriving at work and organizational identification

Author: Xu, Yan and Hu, Wei
Published: 2024
Full Text: View/download PDF

46. The high cost of competition: how and when trait competitiveness triggers work-family conflict

Author: Xu, Yan, Liu, Doudou, and Hu, Wei
Published: 2024
Full Text: View/download PDF

47. A Thermo-Poro-Mechanics Model Predicts the Transition from Creep to Rapid Movement of Large Landslides

Author: Zhang, Huanhuan, Liu, Wei, He, Siming, and Hu, Wei
Published: 2024
Full Text: View/download PDF

48. A key regulator of tumor-associated neutrophils: the CXCR2 chemokine receptor

Author: Kang, Wenyan, Wang, Chengkun, Wang, Minhui, Liu, Meiqi, Hu, Wei, Liang, Xiaoqiu, Yang, Juanli, and Zhang, Yang
Published: 2024
Full Text: View/download PDF

49. Evaluation of Snowmelt and Rainfall Erosion in the Total Soil Losses in a Typical Small Watershed in Black Soil Region of Northeast China

Author: Ren, Zhongzheng, Hu, Wei, Chen, Yuan, Ding, Guihui, Fan, Xu, and Zhang, Xingyi
Published: 2024
Full Text: View/download PDF

50. Effect of Magnesium Substitution on Electrochemical Performances of Layered LiNiO2 Cathode Materials

Author: He, Huihui, Wen, Huanming, Zhang, Huaxin, Xu, Huihui, Cheng, Jinming, and Hu, Wei
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

1,974 results on '"Hu, Wei"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources