4,242 results on '"Hu, Yue"'
Search Results
2. Differentiable SVD based on Moore-Penrose Pseudoinverse for Inverse Imaging Problems
- Author
-
Zhang, Yinghao and Hu, Yue
- Subjects
Mathematics - Numerical Analysis ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,G.1.4 ,I.2.0 ,I.4.4 ,I.4.5 - Abstract
Low-rank regularization-based deep unrolling networks have achieved remarkable success in various inverse imaging problems (IIPs). However, the singular value decomposition (SVD) is non-differentiable when duplicated singular values occur, leading to severe numerical instability during training. In this paper, we propose a differentiable SVD based on the Moore-Penrose pseudoinverse to address this issue. To the best of our knowledge, this is the first work to provide a comprehensive analysis of the differentiability of the trivial SVD. Specifically, we show that the non-differentiability of SVD is essentially due to an underdetermined system of linear equations arising in the derivation process. We utilize the Moore-Penrose pseudoinverse to solve the system, thereby proposing a differentiable SVD. A numerical stability analysis in the context of IIPs is provided. Experimental results in color image compressed sensing and dynamic MRI reconstruction show that our proposed differentiable SVD can effectively address the numerical instability issue while ensuring computational precision. Code is available at https://github.com/yhao-z/SVD-inv., Comment: 11 pages
- Published
- 2024
3. Estimate Sonic Mach Number in the Interstellar Medium with Convolutional Neural Network
- Author
-
Schmaltz, Tyler, Hu, Yue, and Lazarian, Alex
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Understanding the role of turbulence in shaping the interstellar medium (ISM) is crucial for studying star formation, molecular cloud evolution, and cosmic ray propagation. Central to this is the measurement of the sonic Mach number ($M_s$), which quantifies the ratio of turbulent velocity to the sound speed. In this work, we introduce a convolutional neural network (CNN)-based approach for estimating $M_s$ directly from spectroscopic observations. The approach leverages the physical correlation between increasing $M_s$ and the shock-induced small-scale fluctuations that alter the morphological features in intensity, velocity centroid, and velocity channel maps. These maps, derived from 3D magnetohydrodynamic (MHD) turbulence simulations, serve as inputs for the CNN training. By learning the relationship between these structural features and the underlying turbulence properties, CNN can predict $M_s$ under various conditions, including different magnetic fields and levels of observational noise. The median uncertainty of the CNN-predicted $M_s$ ranges from 0.5 to 1.5 depending on the noise level. While intensity maps offer lower uncertainty, channel maps have the advantage of predicting the 3D $M_s$ distribution, which is crucial in estimating 3D magnetic field strength. Our results demonstrate that machine-learning-based tools can effectively characterize complex turbulence properties in the ISM., Comment: 16 pages, 8 figures, submitted to ApJ
- Published
- 2024
4. Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
- Author
-
Liu, Ting, Shi, Liangtao, Hong, Richang, Hu, Yue, Yin, Quanjun, and Zhang, Linfeng
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The vision tokens in multimodal large language models usually exhibit significant spatial and temporal redundancy and take up most of the input tokens, which harms their inference efficiency. To solve this problem, some recent works were introduced to drop the unimportant tokens during inference where the importance of each token is decided only by the information in either the vision encoding stage or the prefilling stage. In this paper, we propose Multi-stage Token Dropping (MustDrop) to measure the importance of each token from the whole lifecycle, including the vision encoding stage, prefilling stage, and decoding stage. Concretely, in the visual encoding stage, MustDrop merges spatially adjacent tokens with high similarity, and establishes a key token set to retain the most vision-critical tokens, preventing them from being discarded in later stages. In the prefilling stage, MustDrop further compresses vision tokens by the guidance of text semantics, with a dual-attention filtering strategy. In the decoding stage, an output-aware cache policy is proposed to further reduce the size of the KV cache. By leveraging tailored strategies in the multi-stage process, MustDrop can more precisely recognize the important and redundant tokens, thus achieving an optimal balance between performance and efficiency. For instance, MustDrop reduces about 88.5\% FLOPs on LLaVA with a compression ratio of 92.2\% while maintaining comparable accuracy. Our codes are available at \url{https://github.com/liuting20/MustDrop}., Comment: 8 pages, 4figures
- Published
- 2024
5. Machine Learning Approach for Estimating Magnetic Field Strength in Galaxy Clusters from Synchrotron Emission
- Author
-
Zhang, Jiyao, Hu, Yue, and Lazarian, A.
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Magnetic fields play a crucial role in various astrophysical processes within the intracluster medium, including heat conduction, cosmic ray acceleration, and the generation of synchrotron radiation. However, measuring magnetic field strength is typically challenging due to the limited availability of Faraday Rotation Measure sources. To address the challenge, we propose a novel method that employs Convolutional Neural Networks (CNNs) alongside synchrotron emission observations to estimate magnetic field strengths in galaxy clusters. Our CNN model is trained on either Magnetohydrodynamic (MHD) turbulence simulations or MHD galaxy cluster simulations, which incorporate complex dynamics such as cluster mergers and sloshing motions. The results demonstrate that CNNs can effectively estimate magnetic field strengths with median uncertainties of approximately $0.22\mu$G, $0.01\mu$G, and $0.1\mu$G for $\beta = 100$, 200, and 500 conditions, respectively. Additionally, we have confirmed that our CNN model remains robust against noise and variations in viewing angles with sufficient training, ensuring reliable performance under a wide range of observational conditions. We compare the CNN approach with the traditional magnetic field strength estimates method that assumes equipartition between cosmic ray electron energy and magnetic field energy. Unlike the equipartition method, this CNN approach does not rely on the equipartition assumption, offering a new perspective for comparing traditional estimates and enhancing our understanding of cosmic ray acceleration mechanisms., Comment: 13 pages, 9 figures, submitted to ApJ
- Published
- 2024
6. Unlocking high hole mobility in diamond over a wide temperature range via efficient shear strain
- Author
-
Sun, Jianshi, Li, Shouhang, Shao, Cheng, Tong, Zhen, An, Meng, Yao, Yuhang, Hu, Yue, Zhu, Xiongfei, Liu, Yifan, Wang, Renzong, Liu, Xiangjun, and Frauenheim, Thomas
- Subjects
Condensed Matter - Materials Science - Abstract
As a wide bandgap semiconductor, diamond holds both excellent electrical and thermal properties, making it highly promising in the electrical industry. However, its hole mobility is relatively low and dramatically decreases with increasing temperature, which severely limits further applications. Herein, we proposed that the hole mobility can be efficiently enhanced via slight compressive shear strain along the [100] direction, while the improvement via shear strain along the [111] direction is marginal. This impressive distinction is attributed to the deformation potential and the elastic compliance matrix. The shear strain breaks the symmetry of the crystalline structure and lifts the band degeneracy near the valence band edge, resulting in a significant suppression of interband electron-phonon scattering. Moreover, the hole mobility becomes less temperature-dependent due to the decrease of electron scatterings from high-frequency acoustic phonons. Remarkably, the in-plane hole mobility of diamond is increased by approximately 800% at 800 K with a 2% compressive shear strain along the [100] direction. The efficient shear strain strategy can be further extended to other semiconductors with face-centered cubic geometry., Comment: 7 pages, 4 figures
- Published
- 2024
7. Self-Evolving Multi-Agent Collaboration Networks for Software Development
- Author
-
Hu, Yue, Cai, Yuzhu, Du, Yaxin, Zhu, Xinyu, Liu, Xiangrui, Yu, Zijie, Hou, Yuchen, Tang, Shuo, and Chen, Siheng
- Subjects
Computer Science - Software Engineering ,Computer Science - Artificial Intelligence ,Computer Science - Multiagent Systems - Abstract
LLM-driven multi-agent collaboration (MAC) systems have demonstrated impressive capabilities in automatic software development at the function level. However, their heavy reliance on human design limits their adaptability to the diverse demands of real-world software development. To address this limitation, we introduce EvoMAC, a novel self-evolving paradigm for MAC networks. Inspired by traditional neural network training, EvoMAC obtains text-based environmental feedback by verifying the MAC network's output against a target proxy and leverages a novel textual backpropagation to update the network. To extend coding capabilities beyond function-level tasks to more challenging software-level development, we further propose rSDE-Bench, a requirement-oriented software development benchmark, which features complex and diverse software requirements along with automatic evaluation of requirement correctness. Our experiments show that: i) The automatic requirement-aware evaluation in rSDE-Bench closely aligns with human evaluations, validating its reliability as a software-level coding benchmark. ii) EvoMAC outperforms previous SOTA methods on both the software-level rSDE-Bench and the function-level HumanEval benchmarks, reflecting its superior coding capabilities. The benchmark can be downloaded at https://yuzhu-cai.github.io/rSDE-Bench/., Comment: 25 pages
- Published
- 2024
8. Probing Three-Dimensional Magnetic Fields: IV -- Synchrotron Polarization Derivative and Vision Transformer
- Author
-
Hu, Yue and Lazarian, A.
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Measuring the 3D spatial distribution of magnetic fields in the interstellar medium and the intracluster medium is crucial yet challenging. The probing of 3D magnetic field's 3D distribution, including the field plane-of-sky orientation ($\psi$), the magnetic field's inclination angle ($\gamma$) relative to the line of sight, and magnetization ($\sim$ the inverse Alfv\'en Mach number $M_A^{-1}$), at different distances from the observer makes the task even more formidable. However, the anisotropy and Faraday decorrelation effect in polarized synchrotron emission offer a unique solution. We show that due to the Faraday decorrelation, only regions up to a certain effective path length along the line of sight contribute to the measured polarization. The 3D spatial information can be consequently derived from synchrotron polarization derivatives (SPDs), which are calculated from the difference in synchrotron polarization across two wavelengths. We find that the 3D magnetic field can be estimated from the anisotropy observed in SPD: the elongation direction of the SPD structures probes $\psi$ and the degree of SPD anisotropy, along with its morphological curvature, provides insights into $M_A^{-1}$ and $\gamma$. To extract these anisotropic features and their correlation with the 3D magnetic field, we propose utilizing a machine learning approach, specifically the Vision Transformer (ViT) architecture, which was exemplified by the success of the ChatGPT. We train the ViT using synthetic synchrotron observations generated from MHD turbulence simulations in sub-Alfv\'enic and super-Alfv\'enic conditions. We show that ViT's application to multi-wavelength SPDs can successfully reconstruct the 3D magnetic fields' 3D spatial distribution., Comment: 15 pages, 9 figures, submitted to ApJ
- Published
- 2024
9. Anisotropic Velocity Fluctuations in Galaxy Mergers: A Probe of the Magnetic Field
- Author
-
Hu, Yue, Whittingham, Joseph, Lazarian, A., Pfrommer, Christoph, Xu, Siyao, and Berlok, Thomas
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Magnetic fields and turbulence are fundamental to the evolution of galaxies, yet their precise measurement and analysis present significant challenges. The recently developed Velocity Gradient Technique (VGT), which capitalizes on the anisotropy inherent in magnetohydrodynamic (MHD) turbulence, represents a new method for mapping magnetic fields in galaxies using spectroscopic observations. Most validations of VGT thus far, however, have relied upon idealized MHD turbulence simulations, which lack the more complex dynamics found in galaxies and galaxy mergers. In this study, we scrutinize VGT using an AREPO-based cosmological galaxy merger simulation, testing its effectiveness across pre-merger, merging, and post-merger stages. We examine the underlying assumptions of VGT and probe the statistics of gas density, velocity, and magnetic fields over time. We find that velocity fluctuations are indeed anisotropic at each stage, being larger in the direction perpendicular to the local magnetic field, as required by VGT. We find, additionally, that galaxy mergers substantially intensify velocity and density fluctuations and amplify magnetic fields at all scales. The observed scaling behavior of the velocity fluctuations corresponds to $r^{1/2}$ up to 0.4~kpc, shifting to a steeper trend between 0.6 and 3~kpc, and to a shallower trend thereafter. The scaling of the magnetic field and density fluctuations at scales $\lesssim$ 1.0 kpc also predominantly aligns with $r^{1/2}$. Finally, we compare results from VGT to those derived from polarization-based magnetic field measurements, finding consistent and statistically significant global agreement in all cases. This opens the way to applying VGT to external galaxies., Comment: 19 pages, 10 figures, submitted to ApJ
- Published
- 2024
10. Optimized Magnetic Resonance Fingerprinting Using Ziv-Zakai Bound
- Author
-
Gong, Chaoguang, Hu, Yue, Li, Peng, Zou, Lixian, Liu, Congcong, Zhou, Yihang, Zhu, Yanjie, Liang, Dong, and Wang, Haifeng
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Quantitative Biology - Quantitative Methods ,Statistics - Applications - Abstract
Magnetic Resonance Fingerprinting (MRF) has emerged as a promising quantitative imaging technique within the field of Magnetic Resonance Imaging (MRI), offers comprehensive insights into tissue properties by simultaneously acquiring multiple tissue parameter maps in a single acquisition. Sequence optimization is crucial for improving the accuracy and efficiency of MRF. In this work, a novel framework for MRF sequence optimization is proposed based on the Ziv-Zakai bound (ZZB). Unlike the Cram\'er-Rao bound (CRB), which aims to enhance the quality of a single fingerprint signal with deterministic parameters, ZZB provides insights into evaluating the minimum mismatch probability for pairs of fingerprint signals within the specified parameter range in MRF. Specifically, the explicit ZZB is derived to establish a lower bound for the discrimination error in the fingerprint signal matching process within MRF. This bound illuminates the intrinsic limitations of MRF sequences, thereby fostering a deeper understanding of existing sequence performance. Subsequently, an optimal experiment design problem based on ZZB was formulated to ascertain the optimal scheme of acquisition parameters, maximizing discrimination power of MRF between different tissue types. Preliminary numerical experiments show that the optimized ZZB scheme outperforms both the conventional and CRB schemes in terms of the reconstruction accuracy of multiple parameter maps., Comment: Accepted at 2024 IEEE International Conference on Imaging Systems and Techniques (IST 2024)
- Published
- 2024
11. Pathfinding pulsar observations with the CVN incorporating the FAST
- Author
-
Yan, Zhen, Shen, Zhiqiang, Jiang, Peng, Zhang, Bo, Zhang, Haiyan, Cui, Lang, Luo, Jintao, Chen, Rurong, Jiang, Wu, Zhang, Hua, Wu, De, Zhao, Rongbing, Yuan, Jianping, Hu, Yue, Wu, Yajun, Xia, Bo, Li, Guanghui, Rao, Yongnan, Chen, Chenyu, Wang, Xiaowei, Ding, Hao, Liu, Yongpeng, Zhang, Fuchen, and Jiang, Yongbin
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Instrumentation and Methods for Astrophysics - Abstract
The importance of Very Long Baseline Interferometry (VLBI) for pulsar research is becoming increasingly prominent and receiving more and more attention. In this paper, we present pathfinding pulsar observation results with the Chinese VLBI Network (CVN) incorporating the Five-hundred-meter Aperture Spherical radio Telescope (FAST). On MJD 60045 (April 11th, 2023), PSRs B0919+06 and B1133+16 were observed with the phase-referencing mode in the L-band using four radio telescopes (FAST, TianMa, Haoping and Nanshan) and correlated with the pulsar binning mode of the distributed FX-style software correlator in Shanghai. After further data processing with the NRAO Astronomical Image Processing System (AIPS), we detected these two pulsars and fitted their current positions with accuracy at the milliarcsecond level. By comparison, our results show significantly better agreement with predicted values based on historical VLBI observations than that with previous timing observations, as pulsar astrometry with the VLBI provides a more direct and model-independent method for accurately obtaining related parameters., Comment: Accepted by the Chinese Physics Letters
- Published
- 2024
12. MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension
- Author
-
Liu, Ting, Xu, Zunnan, Hu, Yue, Shi, Liangtao, Wang, Zhiqiang, and Yin, Quanjun
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
Referring Expression Comprehension (REC), which aims to ground a local visual region via natural language, is a task that heavily relies on multimodal alignment. Most existing methods utilize powerful pre-trained models to transfer visual/linguistic knowledge by full fine-tuning. However, full fine-tuning the entire backbone not only breaks the rich prior knowledge embedded in the pre-training, but also incurs significant computational costs. Motivated by the recent emergence of Parameter-Efficient Transfer Learning (PETL) methods, we aim to solve the REC task in an effective and efficient manner. Directly applying these PETL methods to the REC task is inappropriate, as they lack the specific-domain abilities for precise local visual perception and visual-language alignment. Therefore, we propose a novel framework of Multimodal Prior-guided Parameter Efficient Tuning, namely MaPPER. Specifically, MaPPER comprises Dynamic Prior Adapters guided by an aligned prior, and Local Convolution Adapters to extract precise local semantics for better visual perception. Moreover, the Prior-Guided Text module is proposed to further utilize the prior for facilitating the cross-modal alignment. Experimental results on three widely-used benchmarks demonstrate that MaPPER achieves the best accuracy compared to the full fine-tuning and other PETL methods with only 1.41% tunable backbone parameters. Our code is available at https://github.com/liuting20/MaPPER., Comment: EMNLP 2024
- Published
- 2024
13. Ab Initio Device-Driven Screening of Sub-1-nm Thickness Oxide Semiconductors for Future CMOS Technology Nodes
- Author
-
Xu, Linqiang, Hu, Yue, Xu, Lianqiang, Xu, Lin, Li, Qiuhui, Wang, Aili, Lau, Chit Siong, Lu, Jing, and Ang, Yee Sin
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics ,Physics - Applied Physics - Abstract
Ultrathin oxide semiconductors with sub-1-nm thickness are promising building blocks for ultrascaled field-effect transistor (FET) applications due to their resilience against short-channel effects, high air stability, and potential for low-energy device operation. However, the n-type dominance of ultrathin oxide FET has hindered their integration into complementary metal-oxide-semiconductor (CMOS) technology, which requires both n-and p-type devices. Here we develop an ab initio device-driven computational screening workflow to identify sub-1-nm thickness oxide semiconductors for sub-5-nm FET applications. We demonstrate that ultrathin CaO2, CaO, and SrO are compatible with p-type device operations under both high-performance (HP) and low-power (LP) requirements specified by the International Technology Roadmap of Semiconductors (ITRS), thereby expanding the limited family of p-type oxide semiconductors. Notably, CaO and SrO emerge as the first-of-kind sub-1-nm thickness oxide semiconductors capable of simultaneously meeting the ITRS HP and LP criteria for both n-and p-type devices. CaO and SrO FETs outperform many existing low-dimensional semiconductors, exhibiting scalability below 5-nm gate length. Our findings offer a pioneering effort in the ab initio, device-driven screening of sub-1-nm thickness oxide semiconductors, significantly broadening the material candidate pool for future CMOS technology nodes., Comment: 23 pages, 5 figures, 1 table
- Published
- 2024
14. Mutagenesis screen to map the functions of parameters of Large Language Models
- Author
-
Hu, Yue, Hu, Kai, Zhao, Patrick X., Khan, Javed, and Xu, Chengming
- Subjects
Computer Science - Artificial Intelligence ,I.2.0 - Abstract
Large Language Models (LLMs) have significantly advanced artificial intelligence, excelling in numerous tasks. Although the functionality of a model is inherently tied to its parameters, a systematic method for exploring the connections between the parameters and the functionality are lacking. Models sharing similar structure and parameter counts exhibit significant performance disparities across various tasks, prompting investigations into the varying patterns that govern their performance. We adopted a mutagenesis screen approach inspired by the methods used in biological studies, to investigate Llama2-7b and Zephyr. This technique involved mutating elements within the models' matrices to their maximum or minimum values to examine the relationship between model parameters and their functionalities. Our research uncovered multiple levels of fine structures within both models. Many matrices showed a mixture of maximum and minimum mutations following mutagenesis, but others were predominantly sensitive to one type. Notably, mutations that produced phenotypes, especially those with severe outcomes, tended to cluster along axes. Additionally, the location of maximum and minimum mutations often displayed a complementary pattern on matrix in both models, with the Gate matrix showing a unique two-dimensional asymmetry after rearrangement. In Zephyr, certain mutations consistently resulted in poetic or conversational rather than descriptive outputs. These "writer" mutations grouped according to the high-frequency initial word of the output, with a marked tendency to share the row coordinate even when they are in different matrices. Our findings affirm that the mutagenesis screen is an effective tool for deciphering the complexities of large language models and identifying unexpected ways to expand their potential, providing deeper insights into the foundational aspects of AI systems., Comment: 10 pages, 6 figures, supplementary material available online
- Published
- 2024
15. M$^2$IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension
- Author
-
Liu, Xuyang, Liu, Ting, Huang, Siteng, Xin, Yi, Hu, Yue, Yin, Quanjun, Wang, Donglin, and Chen, Honggang
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Referring expression comprehension (REC) is a vision-language task to locate a target object in an image based on a language expression. Fully fine-tuning general-purpose pre-trained vision-language foundation models for REC yields impressive performance but becomes increasingly costly. Parameter-efficient transfer learning (PETL) methods have shown strong performance with fewer tunable parameters. However, directly applying PETL to REC faces two challenges: (1) insufficient multi-modal interaction between pre-trained vision-language foundation models, and (2) high GPU memory usage due to gradients passing through the heavy vision-language foundation models. To this end, we present M$^2$IST: Multi-Modal Interactive Side-Tuning with M$^3$ISAs: Mixture of Multi-Modal Interactive Side-Adapters. During fine-tuning, we keep the pre-trained uni-modal encoders fixed, updating M$^3$ISAs on side networks to progressively connect them, enabling more comprehensive vision-language alignment and efficient tuning for REC. Empirical results reveal that M$^2$IST achieves an optimal balance between performance and efficiency compared to most full fine-tuning and other PETL methods. With M$^2$IST, standard transformer-based REC methods present competitive or even superior performance compared to full fine-tuning, while utilizing only 2.11\% of the tunable parameters, 39.61\% of the GPU memory, and 63.46\% of the fine-tuning time required for full fine-tuning.
- Published
- 2024
16. Wide-binary eccentricity distribution in young star clusters: dependence on the binary separation and mass
- Author
-
Mathew, Sajay Sunny, Xu, Siyao, Federrath, Christoph, Hu, Yue, and Seta, Amit
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We study the wide-binary eccentricity ($e$) distribution in young star clusters and the role of turbulence in setting the form of the $e$ distribution using magnetohydrodynamical (MHD) simulations of star cluster formation. The simulations incorporate gravity, turbulence, magnetic fields, protostellar heating, and jets/outflows. We find that (1) simulations that employ purely compressive turbulence driving produce binaries with a superthermal $e$ distribution ($\alpha>1$ in $p(e) \propto e^\alpha$), while simulations with purely solenoidal driving or natural mixture of driving modes produce subthermal/thermal distributions ($\alpha \leq$ 1), (2) the $e$ distribution over the full range of binary separations in our simulations is set at the early stages of the star cluster formation process, (3) while binaries (separation of $r_{\mathrm{pair}} \leq 1000\, \mathrm{AU}$) have subthermal to thermal $e$ distributions ($\alpha \sim 0.8$), wide binaries ($r_{\mathrm{pair}} > 1000\, \mathrm{AU}$) have a superthermal distribution ($\alpha \sim 1.8$), and (4) low-mass binary systems (system masses of $M_{\mathrm{sys}} \leq 0.8\, \mathrm{M_\odot}$) have a highly superthermal distribution ($\alpha \sim 2.4$), whereas high-mass systems ($M_{\mathrm{sys}} > 0.8\, \mathrm{M_\odot}$) exhibit a subthermal/thermal distribution ($\alpha \sim 0.8$). The binary eccentricity distribution is often modelled as a thermal distribution. However, our results suggest that the $e$ distribution depends on the range of separation of the sampled binaries, which agrees with the findings from recent Gaia observations. We conclude that the dependence of the $e$ distribution on the binary separation and mass is linked to the binary formation mechanism governed by the turbulent properties of the parent cloud., Comment: 15 pages, 9 figures, 1 table (added additional citation, published in MNRAS)
- Published
- 2024
- Full Text
- View/download PDF
17. Exploring magnetic fields in merging galaxy: combining polarization and velocity gradient in the Centaurus Galaxy
- Author
-
Nguyen, Quynh Lan, Hu, Yue, and Lazarian, Alex
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
In this study, we apply the Velocity Gradient Technique (VGT) to the merging Centaurus galaxy. We compare gradient maps derived from the PHANGS-ALMA survey using CO emission lines with magnetic field tracings from dust polarization data obtained via the HAWC+ instrument. Our analysis reveals a strong correspondence between the directions indicated by these two tracers across most of the galactic image. Specifically, we identify jet regions as areas of anti-alignment, consistent with previous reports that gradients tend to rotate 90 degrees in outflow regions. Statistically, we find that the alignment of magnetic fields, as revealed by polarization, is most accurate in regions with the highest signal-to-noise ratios. Our findings underscore the utility of velocity gradients as a valuable complementary tool for probing magnetic fields and dynamical processes in merging galaxies., Comment: 7 pages, 4 figures
- Published
- 2024
18. AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field
- Author
-
Liu, Rong, Xu, Rui, Hu, Yue, Chen, Meida, and Feng, Andrew
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
3D Gaussian Splatting (3DGS) has recently advanced radiance field reconstruction by offering superior capabilities for novel view synthesis and real-time rendering speed. However, its strategy of blending optimization and adaptive density control might lead to sub-optimal results; it can sometimes yield noisy geometry and blurry artifacts due to prioritizing optimizing large Gaussians at the cost of adequately densifying smaller ones. To address this, we introduce AtomGS, consisting of Atomized Proliferation and Geometry-Guided Optimization. The Atomized Proliferation constrains ellipsoid Gaussians of various sizes into more uniform-sized Atom Gaussians. The strategy enhances the representation of areas with fine features by placing greater emphasis on densification in accordance with scene details. In addition, we proposed a Geometry-Guided Optimization approach that incorporates an Edge-Aware Normal Loss. This optimization method effectively smooths flat surfaces while preserving intricate details. Our evaluation shows that AtomGS outperforms existing state-of-the-art methods in rendering quality. Additionally, it achieves competitive accuracy in geometry reconstruction and offers a significant improvement in training speed over other SDF-based methods. More interactive demos can be found in our website (https://rongliu-leo.github.io/AtomGS/)., Comment: BMVC 2024
- Published
- 2024
19. DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
- Author
-
Liu, Ting, Liu, Xuyang, Huang, Siteng, Chen, Honggang, Yin, Quanjun, Qin, Long, Wang, Donglin, and Hu, Yue
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Multimedia - Abstract
Visual grounding (VG) is a challenging task to localize an object in an image based on a textual description. Recent surge in the scale of VG models has substantially improved performance, but also introduced a significant burden on computational costs during fine-tuning. In this paper, we explore applying parameter-efficient transfer learning (PETL) to efficiently transfer the pre-trained vision-language knowledge to VG. Specifically, we propose \textbf{DARA}, a novel PETL method comprising \underline{\textbf{D}}omain-aware \underline{\textbf{A}}dapters (DA Adapters) and \underline{\textbf{R}}elation-aware \underline{\textbf{A}}dapters (RA Adapters) for VG. DA Adapters first transfer intra-modality representations to be more fine-grained for the VG domain. Then RA Adapters share weights to bridge the relation between two modalities, improving spatial reasoning. Empirical results on widely-used benchmarks demonstrate that DARA achieves the best accuracy while saving numerous updated parameters compared to the full fine-tuning and other PETL methods. Notably, with only \textbf{2.13\%} tunable backbone parameters, DARA improves average accuracy by \textbf{0.81\%} across the three benchmarks compared to the baseline model. Our code is available at \url{https://github.com/liuting20/DARA}., Comment: Accepted by ICME 2024 (Oral)
- Published
- 2024
20. Communication-Efficient Collaborative Perception via Information Filling with Codebook
- Author
-
Hu, Yue, Peng, Juntong, Liu, Sifei, Ge, Junhao, Liu, Si, and Chen, Siheng
- Subjects
Computer Science - Information Theory ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Multiagent Systems - Abstract
Collaborative perception empowers each agent to improve its perceptual ability through the exchange of perceptual messages with other agents. It inherently results in a fundamental trade-off between perception ability and communication cost. To address this bottleneck issue, our core idea is to optimize the collaborative messages from two key aspects: representation and selection. The proposed codebook-based message representation enables the transmission of integer codes, rather than high-dimensional feature maps. The proposed information-filling-driven message selection optimizes local messages to collectively fill each agent's information demand, preventing information overflow among multiple agents. By integrating these two designs, we propose CodeFilling, a novel communication-efficient collaborative perception system, which significantly advances the perception-communication trade-off and is inclusive to both homogeneous and heterogeneous collaboration settings. We evaluate CodeFilling in both a real-world dataset, DAIR-V2X, and a new simulation dataset, OPV2VH+. Results show that CodeFilling outperforms previous SOTA Where2comm on DAIR-V2X/OPV2VH+ with 1,333/1,206 times lower communication volume. Our code is available at https://github.com/PhyllisH/CodeFilling., Comment: 10 pages, Accepted by CVPR 2024
- Published
- 2024
21. Detection of circular permutations by Protein Language Models
- Author
-
Hu, Yue, Huang, Bin, and Zang, Chunzi
- Subjects
Quantitative Biology - Quantitative Methods - Abstract
Protein circular permutations are crucial for understanding protein evolution and functionality. Traditional detection methods, sequence-based or structure-based, struggle with accuracy and computational efficiency, the latter also limited by treating proteins as rigid bodies. The plmCP method, utilizing a protein language model, not only speeds up the detection process but also enhances the accuracy of identifying circular permutations, contributing significantly to protein research and engineering by acknowledging structural flexibility.
- Published
- 2024
22. An improved upper bound for planar Tur\'an number of double star $S_{2,5}$
- Author
-
Xu, Xin, Hu, Yue, and Zhang, Xu
- Subjects
Mathematics - Combinatorics - Abstract
The planar Tur\'{a}n number of a graph $H$, denoted by $ex_{\mathcal{P}}(n,H)$, is the maximum number of edges in an $n$-vertex $H$-free planar graph. Recently, D. Ghosh, et al. initiated the topic of double stars and prove that $ex_{\mathcal{P}}(n,S_{2,5})\leq \frac{20}{7}n$. In this paper, we continue to study this and give a sharp upper bound $ex_{\mathcal{P}}(n,S_{2,5})\leq \frac{19}{7}n-\frac{18}{7}$ for all $n\geq 1$, with equality when $n=12$. This improves Ghosh's result.
- Published
- 2024
- Full Text
- View/download PDF
23. Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System
- Author
-
Liu, Genjia, Hu, Yue, Xu, Chenxin, Mao, Weibo, Ge, Junhao, Huang, Zhengxiang, Lu, Yifan, Xu, Yinda, Xia, Junkai, Wang, Yafei, and Chen, Siheng
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Vehicle-to-everything-aided autonomous driving (V2X-AD) has a huge potential to provide a safer driving solution. Despite extensive researches in transportation and communication to support V2X-AD, the actual utilization of these infrastructures and communication resources in enhancing driving performances remains largely unexplored. This highlights the necessity of collaborative autonomous driving: a machine learning approach that optimizes the information sharing strategy to improve the driving performance of each vehicle. This effort necessitates two key foundations: a platform capable of generating data to facilitate the training and testing of V2X-AD, and a comprehensive system that integrates full driving-related functionalities with mechanisms for information sharing. From the platform perspective, we present V2Xverse, a comprehensive simulation platform for collaborative autonomous driving. This platform provides a complete pipeline for collaborative driving. From the system perspective, we introduce CoDriving, a novel end-to-end collaborative driving system that properly integrates V2X communication over the entire autonomous pipeline, promoting driving with shared perceptual information. The core idea is a novel driving-oriented communication strategy. Leveraging this strategy, CoDriving improves driving performance while optimizing communication efficiency. We make comprehensive benchmarks with V2Xverse, analyzing both modular performance and closed-loop driving performance. Experimental results show that CoDriving: i) significantly improves the driving score by 62.49% and drastically reduces the pedestrian collision rate by 53.50% compared to the SOTA end-to-end driving method, and ii) achieves sustaining driving performance superiority over dynamic constraint communication conditions.
- Published
- 2024
24. A broad linewidth, compact, millimeter-bright molecular emission line source near the Galactic Center
- Author
-
Ginsburg, Adam, Bally, John, Barnes, Ashley T., Battersby, Cara, Budaiev, Nazar, Butterfield, Natalie O., Caselli, Paola, Colzi, Laura, Dutkowska, Katarzyna M., García, Pablo, Gramze, Savannah, Henshaw, Jonathan D., Hu, Yue, Jeff, Desmond, Jiménez-Serra, Izaskun, Kauffmann, Jens, Klessen, Ralf S., Levesque, Emily M., Longmore, Steven N., Lu, Xing, Mills, Elisabeth A. C., Morris, Mark R., Nogueras-Lara, Francisco, Oka, Tomoharu, Pineda, Jaime E., Pillai, Thushara G. S., Rivilla, Víctor M., Sánchez-Monge, Álvaro, Santa-Maria, Miriam G., Smith, Howard A., Sofue, Yoshiaki, Sormani, Mattia C., Tremblay, Grant R., Vermariën, Gijs, Vikhlinin, Alexey, Viti, Serena, Walker, Dan, Wang, Q. Daniel, Xu, Fengwei, and Zhang, Qizhou
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
A compact source, G0.02467-0.0727, was detected in ALMA \threemm observations in continuum and very broad line emission. The continuum emission has a spectral index $\alpha\approx3.3$, suggesting that the emission is from dust. The line emission is detected in several transitions of CS, SO, and SO$_2$ and exhibits a line width FWHM $\approx160$ \kms. The line profile appears Gaussian. The emission is weakly spatially resolved, coming from an area on the sky $\lesssim1"$ in diameter ($\lesssim10^4$ AU at the distance of the Galactic Center; GC). The centroid velocity is $v_{LSR}\approx40$-$50$ \kms, which is consistent with a location in the Galactic Center. With multiple SO lines detected, and assuming local thermodynamic equilibrium (LTE) conditions, $T_\mathrm{LTE} = 13$ K, which is colder than seen in typical GC clouds, though we cannot rule out low-density, subthermally excited, warmer gas. Despite the high velocity dispersion, no emission is observed from SiO, suggesting that there are no strong ($\gtrsim10~\mathrm{km~s}^{-1}$) shocks in the molecular gas. There are no detections at other wavelengths, including X-ray, infrared, and radio. We consider several explanations for the Millimeter Ultra-Broad Line Object (MUBLO), including protostellar outflow, explosive outflow, collapsing cloud, evolved star, stellar merger, high-velocity compact cloud, intermediate mass black hole, and background galaxy. Most of these conceptual models are either inconsistent with the data or do not fully explain it. The MUBLO is, at present, an observationally unique object., Comment: Accepted to ApJL
- Published
- 2024
25. Probing Three-Dimensional Magnetic Fields: III -- Synchrotron Emission and Machine Learning
- Author
-
Hu, Yue and Lazarian, Alex
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Synchrotron observation serves as a tool for studying magnetic fields in the interstellar medium and intracluster medium, yet its ability to unveil three-dimensional (3D) magnetic fields, meaning probing the field'splane-of-the-sky (POS) orientation, inclination angle relative to the line of sight, and magnetization from one observational data, remains largely underexplored. Inspired by the latest insights into anisotropic magnetohydrodynamic (MHD) turbulence, we found that synchrotron emission's intensity structures inherently reflect this anisotropy, providing crucial information to aid in 3D magnetic field studies: (i) the structure's elongation gives the magnetic field's POS orientation and (ii) the structure's anisotropy degree and topology reveal the inclination angle and magnetization. Capitalizing on this foundation, we integrate a machine learning approach-Convolutional Neural Network (CNN)-to extract this latent information, thereby facilitating the exploration of 3D magnetic fields. The model is trained on synthetic synchrotron emission maps, derived from 3D MHD turbulence simulations encompassing a range of sub-Alfv\'enic to super-Alfv\'enic conditions. We show that the CNN is physically interpretable and the CNN is capable of obtaining the POS orientation, inclination angle, and magnetization. Additionally, we test the CNN against the noise effect and the missing low-spatial frequency. We show that this CNN-based approach maintains a high degree of robustness even when only high-spatial frequencies are maintained. This renders the method particularly suitable for application to interferometric data lacking single-dish measurements. We applied this trained CNN to the synchrotron observations of a diffuse region. The CNN-predicted POS magnetic field orientation shows a statistical agreement with that derived from synchrotron polarization., Comment: 15 pages, 11 figures, accepted for publication in ApJ
- Published
- 2024
26. Magnetic Field of Molecular Gas Measured with the Velocity Gradient Technique II: Curved Magnetic Field in kpc-Scale Bubble of NGC\,628
- Author
-
Zhao, Mengke, Zhou, Jianjun, Baan, Willem A., Hu, Yue, Lazarian, A., Tang, Xindi, Esimbek, Jarken, He, Yuxin, Li, Dalei, Ji, Weiguang, Chang, Zhengxue, and Tursun, Kadirya
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We report the detection of the ordered alignment between the magnetic field and kpc-scale bubbles in the nearby spiral galaxy, NGC\,628. Applying the Velocity Gradient Technique (VGT) on CO spectroscopic data from the ALMA-PHANGS, the magnetic field of NGC\,628 is measured at the scale of 191\,pc ($\sim$ 4\,$''$). The large-scale magnetic field is oriented parallel to the spiral arms and curves around the galactic bubble structures in the mid-infrared emission observed by the James Webb Space Telescope (JWST). Twenty-one bubble structures have been identified at the edges of spiral arms with scales over 300\,pc, which includes two kpc-scale structures. These bubbles are caused by supernova remnants and prolonged star formation and are similar to the outflow chimneys found in neutral hydrogen in galactic disks. At the edge of the bubbles, the shocks traced by the OIII emission present a curved magnetic field that parallels the bubble's shell. The magnetic field follows the bubble expansion and binds the gas in the shell to trigger further star formation. By analyzing the larger sample of 1694 bubbles, we found a distinct radial-size distribution of bubbles in NGC\,628 indicating the star formation history in the galaxy., Comment: 15 pages, 7 figures, Accepted by ApJ
- Published
- 2024
27. GCAM: Gaussian and causal-attention model of food fine-grained recognition
- Author
-
Zhuang, Guohang, Hu, Yue, Yan, Tianxing, and Gao, JiaZhan
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Currently, most food recognition relies on deep learning for category classification. However, these approaches struggle to effectively distinguish between visually similar food samples, highlighting the pressing need to address fine-grained issues in food recognition. To mitigate these challenges, we propose the adoption of a Gaussian and causal-attention model for fine-grained object recognition.In particular, we train to obtain Gaussian features over target regions, followed by the extraction of fine-grained features from the objects, thereby enhancing the feature mapping capabilities of the target regions. To counteract data drift resulting from uneven data distributions, we employ a counterfactual reasoning approach. By using counterfactual interventions, we analyze the impact of the learned image attention mechanism on network predictions, enabling the network to acquire more useful attention weights for fine-grained image recognition. Finally, we design a learnable loss strategy to balance training stability across various modules, ultimately improving the accuracy of the final target recognition. We validate our approach on four relevant datasets, demonstrating its excellent performance across these four datasets.We experimentally show that GCAM surpasses state-of-the-art methods on the ETH-FOOD101, UECFOOD256, and Vireo-FOOD172 datasets. Furthermore, our approach also achieves state-of-the-art performance on the CUB-200 dataset., Comment: 23 pages, 11 figures
- Published
- 2024
28. On the properties and implications of collapse-driven MHD turbulence
- Author
-
Vázquez-Semadeni, Enrique, Hu, Yue, Xu, Siyao, Guerrero-Gamboa, Rubén, and Lazarian, Alex
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We numerically investigate the driving of MHD turbulence by gravitational contraction using simulations of an initially spherical, magnetically supercritical cloud core with initially transonic and trans-Alfv\'enic turbulence. We perform a Helmholtz decomposition of the velocity field, and investigate the evolution of its solenoidal and compressible parts, as well as of the velocity component along the gravitational acceleration vector, a proxy for the infall component of the velocity field. We find that: 1) In spite of being supercritical, the core first contracts to a sheet perpendicular to the mean field, and the sheet itself collapses. 2) The solenoidal component of the turbulence remains at roughly its initial level throughout the simulation, while the compressible component increases continuously. This implies that turbulence does {\it not} dissipate towards the center of the core. 3) The distribution of simulation cells in the $B$-$\rho$ plane occupies a wide triangular region at low densities, bounded below by the expected trend for fast MHD waves ($B \propto \rho$, applicable for high local Alfv\'enic Mach number $\Ma$) and above by the trend expected for slow waves ($B \sim$ constant, applicable for low local $\Ma$). At high densities, the distribution follows a single trend $B \propto \rho^{\gamef}$, with $1/2 < \gamef < 2/3$, as expected for gravitational compression. 4) The measured mass-to-magnetic flux ratio $\lambda$ increases with radius $r$, due to the different scalings of the mass and magnetic flux with $r$. At a fixed radius, $\lambda$ increases with time due to the accretion of material along field lines. 5) The solenoidal energy fraction is much smaller than the total turbulent component, indicating that the collapse drives the turbulence mainly compressibly, even in directions orthogonal to that of the collapse., Comment: Resubmitted to MNRAS after first set of reviewer's recommendations. Comments welcome
- Published
- 2024
29. Lectin-Like Oxidized Low-Density Lipoprotein Receptor-1 (LOX-1): A Potential Therapeutic Target in Ischemic Stroke
- Author
-
Hu, Yue, Li, Yuhao, Luo, Yumin, Wang, Ningqun, and Zheng, Yangmin
- Published
- 2024
- Full Text
- View/download PDF
30. ADAR1 enhances tumor proliferation and radioresistance in non-small cell lung cancer by interacting with Rad18
- Author
-
Tian, Chen, Li, Chang, Wang, Juanjuan, Liu, Yuting, Gao, Jiaqi, Hong, Xiaohua, Gu, Feifei, Zhang, Kai, Hu, Yue, Fan, Hongjie, Liu, Li, and Zeng, Yulan
- Published
- 2024
- Full Text
- View/download PDF
31. Oncolytic virus encoding 4-1BBL and IL15 enhances the efficacy of tumor-infiltrating lymphocyte adoptive therapy in HCC
- Author
-
Ye, Kai, Yan, Yongfeng, Su, Rui, Dai, Qinghai, Qiao, Kunyan, Cao, Yu, Xu, Jian, Yan, Lihua, Huo, Zhixiao, Liu, Wei, Hu, Yue, Zhu, Yu, Xu, Liang, and Mi, Yuqiang
- Published
- 2024
- Full Text
- View/download PDF
32. Effect of L-Carnitine Level on the Risk of Neuromyelitis Optica Spectrum Disorders: A Two-Sample Mendelian Randomization Study
- Author
-
Hu, Wenyu, Hu, Yue, Li, Jiahong, Men, Yi, Xia, Jiangwei, Zheng, Wenxu, and Zhao, Yinan
- Published
- 2024
- Full Text
- View/download PDF
33. Single-dose methamphetamine administration impairs ORM retrieval in mice via excessive DA-mediated inhibition of PrLGlu activity
- Author
-
Ma, Jian-chi, Che, Xiao-hang, Zhu, Xiao-na, Ren, Ao-xin, Hu, Yue, Yang, Cheng-li, Xu, Zhong-tian, Li, Yu-ting, Wu, Chun-fu, and Yang, Jing-yu
- Published
- 2024
- Full Text
- View/download PDF
34. Dual-physical network PVA hydrogel commensurate with articular cartilage bearing lubrication enabled by harnessing nanoscale crystalline domains
- Author
-
Hu, Danli, Liu, Desheng, Hu, Yue, Wang, Yixian, Lu, Yaozhong, Bai, Changcheng, Hossain, Khan Rajib, Jiang, Pan, and Wang, Xiaolong
- Published
- 2024
- Full Text
- View/download PDF
35. Using Small Punch Test to Investigate the Mechanical Properties of X42 Exposed to Gaseous Hydrogen: Effect of Pressure, Pre-charge Time, Punch Velocity and Oxygen Content
- Author
-
Wang, Hu-Yue, Ming, Hong-Liang, Hou, Dong-Ceng, Wang, Jian-Qiu, Ke, Wei, and Han, En-Hou
- Published
- 2024
- Full Text
- View/download PDF
36. Investigating the impact of structured knowledge feedback on collaborative academic writing
- Author
-
Li, Xu, Jiang, Shiyan, Hu, Yue, Feng, Xiaoxiao, Chen, Wenzhi, and Ouyang, Fan
- Published
- 2024
- Full Text
- View/download PDF
37. Targeting the Ferroptosis and Endoplasmic Reticulum Stress Signaling Pathways by CBX7 in Myocardial Ischemia/reperfusion Injury
- Author
-
Jiang, Weipeng, Yan, Zeyu, Zheng, Xueou, Huang, Shiyi, Hu, Yue, Xiong, Fengjuan, He, Bufan, Wu, Yingzhi, Fu, Qiang, Li, Zhiliang, and Zhou, Baihua
- Published
- 2024
- Full Text
- View/download PDF
38. Prognostic Significance of Plasma VEGFA and VEGFR2 in Acute Ischemic Stroke-a Prospective Cohort Study
- Author
-
Hu, Yue, Huang, Shuangfeng, Shen, Tong, Wang, Rongliang, Geng, Meng, Wang, Yilin, Zheng, Yangmin, Luo, Yumin, and Li, Sijie
- Published
- 2024
- Full Text
- View/download PDF
39. Room-temperature sub-100 nm N\'eel-type skyrmions in non-stoichiometric van der Waals ferromagnet $\rm Fe_{3-x}GaTe_{2}$ with ultrafast laser writability
- Author
-
Li, Zefang, Zhang, Huai, Li, Guanqi, Guo, Jiangteng, Wang, Qingping, Deng, Ying, Hu, Yue, Hu, Xuange, Liu, Can, Qin, Minghui, Shen, Xi, Yu, Richeng, Gao, Xingsen, Liao, Zhimin, Liu, Junming, Hou, Zhipeng, Zhu, Yimei, and Fu, Xuewen
- Subjects
Condensed Matter - Materials Science - Abstract
Realizing room-temperature magnetic skyrmions in two-dimensional van der Waals ferromagnets offers unparalleled prospects for future spintronic applications. However, due to the intrinsic spin fluctuations that suppress atomic long-range magnetic order and the inherent inversion crystal symmetry that excludes the presence of the Dzyaloshinskii-Moriya interaction, achieving room-temperature skyrmions in 2D magnets remains a formidable challenge. In this study, we target room-temperature 2D magnet $\rm Fe_3GaTe_2$ and unveil that the introduction of iron-deficient into this compound enables spatial inversion symmetry breaking, thus inducing a significant Dzyaloshinskii-Moriya interaction that brings about room-temperature N\'eel-type skyrmions with unprecedentedly small size. To further enhance the practical applications of this finding, we employ a homemade in-situ optical Lorentz transmission electron microscopy to demonstrate ultrafast writing of skyrmions in $\rm Fe_{3-x}GaTe_2$ using a single femtosecond laser pulse. Our results manifest the $\rm Fe_{3-x}GaTe_2$ as a promising building block for realizing skyrmion-based magneto-optical functionalities.
- Published
- 2024
- Full Text
- View/download PDF
40. An Extensible Framework for Open Heterogeneous Collaborative Perception
- Author
-
Lu, Yifan, Hu, Yue, Zhong, Yiqi, Wang, Dequan, Wang, Yanfeng, and Chen, Siheng
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Collaborative perception aims to mitigate the limitations of single-agent perception, such as occlusions, by facilitating data exchange among multiple agents. However, most current works consider a homogeneous scenario where all agents use identity sensors and perception models. In reality, heterogeneous agent types may continually emerge and inevitably face a domain gap when collaborating with existing agents. In this paper, we introduce a new open heterogeneous problem: how to accommodate continually emerging new heterogeneous agent types into collaborative perception, while ensuring high perception performance and low integration cost? To address this problem, we propose HEterogeneous ALliance (HEAL), a novel extensible collaborative perception framework. HEAL first establishes a unified feature space with initial agents via a novel multi-scale foreground-aware Pyramid Fusion network. When heterogeneous new agents emerge with previously unseen modalities or models, we align them to the established unified space with an innovative backward alignment. This step only involves individual training on the new agent type, thus presenting extremely low training costs and high extensibility. To enrich agents' data heterogeneity, we bring OPV2V-H, a new large-scale dataset with more diverse sensor types. Extensive experiments on OPV2V-H and DAIR-V2X datasets show that HEAL surpasses SOTA methods in performance while reducing the training parameters by 91.5% when integrating 3 new agent types. We further implement a comprehensive codebase at: https://github.com/yifanlu0227/HEAL, Comment: Accepted by ICLR 2024. The code and data are open-sourced at https://github.com/yifanlu0227/HEAL
- Published
- 2024
41. Pragmatic Communication in Multi-Agent Collaborative Perception
- Author
-
Hu, Yue, Pang, Xianghe, Qin, Xiaoqi, Eldar, Yonina C., Chen, Siheng, Zhang, Ping, and Zhang, Wenjun
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Collaborative perception allows each agent to enhance its perceptual abilities by exchanging messages with others. It inherently results in a trade-off between perception ability and communication costs. Previous works transmit complete full-frame high-dimensional feature maps among agents, resulting in substantial communication costs. To promote communication efficiency, we propose only transmitting the information needed for the collaborator's downstream task. This pragmatic communication strategy focuses on three key aspects: i) pragmatic message selection, which selects task-critical parts from the complete data, resulting in spatially and temporally sparse feature vectors; ii) pragmatic message representation, which achieves pragmatic approximation of high-dimensional feature vectors with a task-adaptive dictionary, enabling communicating with integer indices; iii) pragmatic collaborator selection, which identifies beneficial collaborators, pruning unnecessary communication links. Following this strategy, we first formulate a mathematical optimization framework for the perception-communication trade-off and then propose PragComm, a multi-agent collaborative perception system with two key components: i) single-agent detection and tracking and ii) pragmatic collaboration. The proposed PragComm promotes pragmatic communication and adapts to a wide range of communication conditions. We evaluate PragComm for both collaborative 3D object detection and tracking tasks in both real-world, V2V4Real, and simulation datasets, OPV2V and V2X-SIM2.0. PragComm consistently outperforms previous methods with more than 32.7K times lower communication volume on OPV2V. Code is available at github.com/PhyllisH/PragComm., Comment: 18 pages
- Published
- 2024
42. Helping Language Models Learn More: Multi-dimensional Task Prompt for Few-shot Tuning
- Author
-
Weng, Jinta, Zhang, Jiarui, Hu, Yue, Fa, Daidong, Xuand, Xiaofeng, and Huang, Heyan
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Large language models (LLMs) can be used as accessible and intelligent chatbots by constructing natural language queries and directly inputting the prompt into the large language model. However, different prompt' constructions often lead to uncertainty in the answers and thus make it hard to utilize the specific knowledge of LLMs (like ChatGPT). To alleviate this, we use an interpretable structure to explain the prompt learning principle in LLMs, which certificates that the effectiveness of language models is determined by position changes of the task's related tokens. Therefore, we propose MTPrompt, a multi-dimensional task prompt learning method consisting based on task-related object, summary, and task description information. By automatically building and searching for appropriate prompts, our proposed MTPrompt achieves the best results on few-shot samples setting and five different datasets. In addition, we demonstrate the effectiveness and stability of our method in different experimental settings and ablation experiments. In interaction with large language models, embedding more task-related information into prompts will make it easier to stimulate knowledge embedded in large language models., Comment: arXiv admin note: text overlap with arXiv:2210.16489
- Published
- 2023
43. On the Feasibility of Fingerprinting Collaborative Robot Traffic
- Author
-
Tang, Cheng, Barradas, Diogo, Hengartner, Urs, and Hu, Yue
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Robotics - Abstract
This study examines privacy risks in collaborative robotics, focusing on the potential for traffic analysis in encrypted robot communications. While previous research has explored low-level command recovery, our work investigates high-level motion recovery from command message sequences. We evaluate the efficacy of traditional website fingerprinting techniques (k-FP, KNN, and CUMUL) and their limitations in accurately identifying robotic actions due to their inability to capture detailed temporal relationships. To address this, we introduce a traffic classification approach using signal processing techniques, demonstrating high accuracy in action identification and highlighting the vulnerability of encrypted communications to privacy breaches. Additionally, we explore defenses such as packet padding and timing manipulation, revealing the challenges in balancing traffic analysis resistance with network efficiency. Our findings emphasize the need for continued development of practical defenses in robotic privacy and security., Comment: 12 pages
- Published
- 2023
44. The Impact of Robots' Facial Emotional Expressions on Light Physical Exercises
- Author
-
Abdulazeem, Nourhan and Hu, Yue
- Subjects
Computer Science - Robotics - Abstract
To address the global challenge of population aging, our goal is to enhance successful aging through the introduction of robots capable of assisting in daily physical activities and promoting light exercises, which would enhance the cognitive and physical well-being of older adults. Previous studies have shown that facial expressions can increase engagement when interacting with robots. This study aims to investigate how older adults perceive and interact with a robot capable of displaying facial emotions while performing a physical exercise task together. We employed a collaborative robotic arm with a flat panel screen to encourage physical exercise across three different facial emotion conditions. We ran the experiment with older adults aged between 66 and 88. Our findings suggest that individuals perceive robots exhibiting facial expressions as less competent than those without such expressions. Additionally, the presence of facial expressions does not appear to significantly impact participants' levels of engagement, unlike other state-of-the-art studies. This observation is likely linked to our study's emphasis on collaborative physical human-robot interaction (pHRI) applications, as opposed to socially oriented pHRI applications. Additionally, we foresee a requirement for more suitable non-verbal social behavior to effectively enhance participants' engagement levels.
- Published
- 2023
- Full Text
- View/download PDF
45. A Case of Asymptomatic Duodenal Foreign Body Perforation
- Author
-
Ye, Cheng, Sun, Shaopeng, Zeng, Xuyan, Xu, Li, Hu, Yue, Lv, Bin, and Cao, Haijun
- Published
- 2024
- Full Text
- View/download PDF
46. Neural Network Mechanisms Underlying General Anesthesia: Cortical and Subcortical Nuclei
- Author
-
Hu, Yue, Wang, Yun, Zhang, Lingjing, Luo, Mengqiang, and Wang, Yingwei
- Published
- 2024
- Full Text
- View/download PDF
47. Clinical characteristics and detection of MYB-QKI fusions in patients with angiocentric glioma
- Author
-
Li, Tiemin, Aihemaitiniyazi, Adilijiang, Zhang, Huawei, Wei, Da, Hu, Yue, Guan, Yuguang, Zhou, Jian, Qi, Xueling, Wang, Mengyang, Wu, Bin, Zhu, Mingwang, Zhang, Linpeng, Luan, Guoming, and Liu, Changqing
- Published
- 2024
- Full Text
- View/download PDF
48. Nitrogen and fluorine co-doped graphene for ultra-stable lithium metal anodes
- Author
-
Li, Pan, Liu, Yifan, Bao, Xujian, Xie, Jinghao, Li, Zhao, Li, Hongcheng, Ren, Qiang, Feng, Xiaomiao, Hu, Yue, and Ma, Yanwen
- Published
- 2024
- Full Text
- View/download PDF
49. Applying Grain Boundary Engineering and Stabilizing Heat Treatment to 321 Stainless Steel for Enhancing Intergranular Corrosion Resistance
- Author
-
Hu, Yue, Bai, Qin, Xia, Shuang, Liu, Ke, He, Qinqin, and Xu, Gang
- Published
- 2024
- Full Text
- View/download PDF
50. Efficacy and safety of adjunctive perampanel treatment in pediatric patients with epilepsy aged 4–12 years: a real-world study
- Author
-
Zeng, Qiao, Xia, Xueqian, Jiang, Li, Chen, Jin, Liu, Yuhang, and Hu, Yue
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.