97,538 results on '"Chen, Li"'
Search Results
2. A Contemporaneous Interpretation of Chung-Shu Lo’s Reply to the UNESCO Human Rights Survey
- Author
-
Chen, Li
- Published
- 2022
- Full Text
- View/download PDF
3. Oregon's Pioneer Chinese Law Students and Their Untold Stories
- Author
-
Voices, Oregon and Chen, Li
- Published
- 2022
- Full Text
- View/download PDF
4. Unveiling the Binary Nature of NGC 2323
- Author
-
Qin, Songmei, Zhong, Jing, Tang, Tong, Jiang, Yueyue, Wang, Long, Wu, Kai, Anders, Friedrich, Balaguer-Núñez, Lola, Liu, Guimei, Li, Chunyan, Hou, Jinliang, and Chen, Li
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - Solar and Stellar Astrophysics - Abstract
As a well-known open cluster, NGC 2323 (also called M50) has been widely investigated for over a hundred years and has always been considered a classical single cluster. In this work, with the help of Gaia DR3, we study the binary structure nature of this cluster. Although indistinguishable in the spatial space, the small but undeniable difference in the proper motion indicates that they may be two individual clusters. After investigating the properties of the two clusters, it is found that they have very close positions (three-dimensional $\Delta$pos = 12.3 pc, $\sigma_{\Delta \mathrm{pos}} = 3.4$ pc) and similar tangential velocities (two-dimensional $\Delta$V = 2.2 km s$^{-1}$, $\sigma_{\Delta \mathrm{V}} = 0.02$ km s$^{-1}$), indicating the existence of their physical association. Moreover, the best isochrone fitting ages of the two clusters are the same (158 Myr), further proving their possibly common origin. To comprehensively understand the formation and evolution of this binary cluster, we employ the PETAR $N$-body code to trace back their birthplace and deduce their dynamical evolutionary fate. With observational mean cluster properties, the simulations suggest that they may form together, and then orbit each other as a binary cluster for over 200 Myr. After that, because of their gradual mass loss, the two clusters will eventually separate and evolve into two independent clusters. Meanwhile, the numerical $N$-body simulation suggests that the less massive cluster is unlikely to be the cluster tidal tails created by the differential rotation of the Milky Way., Comment: 14 pages, 8 figures
- Published
- 2024
5. QCS:Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition
- Author
-
Wang, Chengpeng, Chen, Li, Wang, Lili, Li, Zhaofan, and Lv, Xuebin
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
On facial expression datasets with complex and numerous feature types, where the significance and dominance of labeled features are difficult to predict, facial expression recognition(FER) encounters the challenges of inter-class similarity and intra-class variances, making it difficult to mine effective features. We aim to solely leverage the feature similarity among facial samples to address this. We introduce the Cross Similarity Attention (CSA), an input-output position-sensitive attention mechanism that harnesses feature similarity across different images to compute the corresponding global spatial attention. Based on this, we propose a four-branch circular framework, called Quadruplet Cross Similarity (QCS), to extract discriminative features from the same class and eliminate redundant ones from different classes synchronously to refine cleaner features. The symmetry of the network ensures balanced and stable training and reduces the amount of CSA interaction matrix. Contrastive residual distillation is utilized to transfer the information learned in the cross module back to the base network. The cross-attention module exists during training, and only one base branch is retained during inference. our proposed QCS model outperforms state-of-the-art methods on several popular FER datasets, without requiring additional landmark information or other extra training data. The code is available at https://github.com/birdwcp/QCS.
- Published
- 2024
6. Observation of O+ Characteristics During the Terrestrial Alfv\'en Wing State Induced by the April 2023 Coronal Mass Ejection
- Author
-
Liang, Haoming, Chen, Li-Jen, Fuselier, Stephen A., Gomez, Roman G., Burkholder, Brandon, Bessho, Naoki, Gurram, Harsha, Rice, Rachel C., Shuster, Jason, and Ardakani, Akhtar S.
- Subjects
Physics - Space Physics ,Astrophysics - Solar and Stellar Astrophysics - Abstract
We report Magnetospheric Multiscale observations of oxygen ions (O+) during a coronal mass ejection in April 2023 when the solar wind was sub-Alfv\'enic and Alfv\'en wings formed. For the first time, O+ characteristics are studied at the contact region between the unshocked solar wind and the magnetosphere. The O+ ions show energies between 100s eV and ~30 keV. The possible sources are the ring current, the warm plasma cloak, and the ionosphere. The O+ ions exhibit bi-directional streaming along newly-formed closed field lines (CFLs), and dominantly anti-parallel on earlier-formed CFLs. Escaping O+ ions in the unshocked solar wind are observed. During the recovery phase, the O+ pitch-angle distribution associated with flux tubes shows dispersion, indicating potential loss to the solar wind. Our results show escaping as well as trapped O+ ions in the region where a magnetic cloud, an Alfv\'en wing, and magnetospheric field lines are mixed.
- Published
- 2024
7. Interaction of the Prominence Plasma within the Magnetic Cloud of an ICME with the Earth's Bow Shock
- Author
-
Madanian, Hadi, Chen, Li-Jen, Ng, Jonathan, Starkey, Michael J., Fuselier, Stephen A., Bessho, Naoki, Gershman, Daniel J., and Liu, Terry Z.
- Subjects
Physics - Space Physics ,Physics - Plasma Physics - Abstract
The magnetic cloud within an interplanetary coronal mass ejection (ICME) is characterized by high magnetic field intensities. In this study, we investigate the interaction of a magnetic cloud carrying a density structure with the Earth's bow shock during the ICME event on 24 April 2023. Elevated abundances of cold protons and heavier ions, namely alpha particles and singly charged helium ions, associated with the prominence plasma are observed within this structure. The plasma downstream of the bow shock exhibits an irregular compression pattern which could be due to the presence of heavy ions. Heavy ions carry a significant fraction of the upstream flow energy; however, due to their different charge per mass ratio and rigidity, they are less scattered by the electromagnetic and electrostatic waves at the shock. We find that downstream of the shock, while the thermal ion energy is only a small fraction of the background magnetic energy density, nevertheless increased ion fluxes reduce the characteristic wave speeds in the that region. As such, we observe a transition state of an unstable bow shock layer across which the plasma flow is super Alfv\'enic in both upstream and downstream regions. Our findings help with understanding the intense space weather impacts of such events.
- Published
- 2024
- Full Text
- View/download PDF
8. Class-RAG: Content Moderation with Retrieval Augmented Generation
- Author
-
Chen, Jianfa, Shen, Emily, Bavalatti, Trupti, Lin, Xiaowen, Wang, Yongkai, Hu, Shuming, Subramanyam, Harihar, Vepuri, Ksheeraj Sai, Jiang, Ming, Qi, Ji, Chen, Li, Jiang, Nan, and Jain, Ankit
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
Robust content moderation classifiers are essential for the safety of Generative AI systems. Content moderation, or safety classification, is notoriously ambiguous: differences between safe and unsafe inputs are often extremely subtle, making it difficult for classifiers (and indeed, even humans) to properly distinguish violating vs. benign samples without further context or explanation. Furthermore, as these technologies are deployed across various applications and audiences, scaling risk discovery and mitigation through continuous model fine-tuning becomes increasingly challenging and costly. To address these challenges, we propose a Classification approach employing Retrieval-Augmented Generation (Class-RAG). Class-RAG extends the capability of its base LLM through access to a retrieval library which can be dynamically updated to enable semantic hotfixing for immediate, flexible risk mitigation. Compared to traditional fine-tuned models, Class-RAG demonstrates flexibility and transparency in decision-making. As evidenced by empirical studies, Class-RAG outperforms on classification and is more robust against adversarial attack. Besides, our findings suggest that Class-RAG performance scales with retrieval library size, indicating that increasing the library size is a viable and low-cost approach to improve content moderation., Comment: 11 pages, submit to ACL
- Published
- 2024
9. Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
- Author
-
Bu, Qingwen, Li, Hongyang, Chen, Li, Cai, Jisong, Zeng, Jia, Cui, Heming, Yao, Maoqing, and Qiao, Yu
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence - Abstract
The increasing demand for versatile robotic systems to operate in diverse and dynamic environments has emphasized the importance of a generalist policy, which leverages a large cross-embodiment data corpus to facilitate broad adaptability and high-level reasoning. However, the generalist would struggle with inefficient inference and cost-expensive training. The specialist policy, instead, is curated for specific domain data and excels at task-level precision with efficiency. Yet, it lacks the generalization capacity for a wide range of applications. Inspired by these observations, we introduce RoboDual, a synergistic dual-system that supplements the merits of both generalist and specialist policy. A diffusion transformer-based specialist is devised for multi-step action rollouts, exquisitely conditioned on the high-level task understanding and discretized action output of a vision-language-action (VLA) based generalist. Compared to OpenVLA, RoboDual achieves 26.7% improvement in real-world setting and 12% gain on CALVIN by introducing a specialist policy with merely 20M trainable parameters. It maintains strong performance with 5% of demonstration data only, and enables a 3.8 times higher control frequency in real-world deployment. Code would be made publicly available. Our project page is hosted at: https://opendrivelab.com/RoboDual/, Comment: Project page: https://opendrivelab.com/RoboDual/
- Published
- 2024
10. Integrating Planning into Single-Turn Long-Form Text Generation
- Author
-
Liang, Yi, Wu, You, Zhuang, Honglei, Chen, Li, Shen, Jiaming, Jia, Yiling, Qin, Zhen, Sanghai, Sumit, Wang, Xuanhui, Yang, Carl, and Bendersky, Michael
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Generating high-quality, in-depth textual documents, such as academic papers, news articles, Wikipedia entries, and books, remains a significant challenge for Large Language Models (LLMs). In this paper, we propose to use planning to generate long form content. To achieve our goal, we generate intermediate steps via an auxiliary task that teaches the LLM to plan, reason and structure before generating the final text. Our main novelty lies in a single auxiliary task that does not require multiple rounds of prompting or planning. To overcome the scarcity of training data for these intermediate steps, we leverage LLMs to generate synthetic intermediate writing data such as outlines, key information and summaries from existing full articles. Our experiments demonstrate on two datasets from different domains, namely the scientific news dataset SciNews and Wikipedia datasets in KILT-Wiki and FreshWiki, that LLMs fine-tuned with the auxiliary task generate higher quality documents. We observed +2.5% improvement in ROUGE-Lsum, and a strong 3.60 overall win/loss ratio via human SxS evaluation, with clear wins in organization, relevance, and verifiability.
- Published
- 2024
11. Deep Transfer Learning-based Detection for Flash Memory Channels
- Author
-
Mei, Zhen, Cai, Kui, Shi, Long, Li, Jun, Chen, Li, and Immink, Kees A. Schouhamer
- Subjects
Computer Science - Information Theory ,Electrical Engineering and Systems Science - Signal Processing - Abstract
The NAND flash memory channel is corrupted by different types of noises, such as the data retention noise and the wear-out noise, which lead to unknown channel offset and make the flash memory channel non-stationary. In the literature, machine learning-based methods have been proposed for data detection for flash memory channels. However, these methods require a large number of training samples and labels to achieve a satisfactory performance, which is costly. Furthermore, with a large unknown channel offset, it may be impossible to obtain enough correct labels. In this paper, we reformulate the data detection for the flash memory channel as a transfer learning (TL) problem. We then propose a model-based deep TL (DTL) algorithm for flash memory channel detection. It can effectively reduce the training data size from $10^6$ samples to less than 104 samples. Moreover, we propose an unsupervised domain adaptation (UDA)-based DTL algorithm using moment alignment, which can detect data without any labels. Hence, it is suitable for scenarios where the decoding of error-correcting code fails and no labels can be obtained. Finally, a UDA-based threshold detector is proposed to eliminate the need for a neural network. Both the channel raw error rate analysis and simulation results demonstrate that the proposed DTL-based detection schemes can achieve near-optimal bit error rate (BER) performance with much less training data and/or without using any labels., Comment: This paper has been accepted for publication in IEEE Transactions on Communications
- Published
- 2024
12. Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild
- Author
-
Zhao, Xinyu, Sun, Guoheng, Cai, Ruisi, Zhou, Yukun, Li, Pingzhi, Wang, Peihao, Tan, Bowen, He, Yexiao, Chen, Li, Liang, Yi, Chen, Beidi, Yuan, Binhang, Wang, Hongyi, Li, Ang, Wang, Zhangyang, and Chen, Tianlong
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
As Large Language Models (LLMs) excel across tasks and specialized domains, scaling LLMs based on existing models has garnered significant attention, which faces the challenge of decreasing performance when combining disparate models. Various techniques have been proposed for the aggregation of pre-trained LLMs, including model merging, Mixture-of-Experts, and stacking. Despite their merits, a comprehensive comparison and synergistic application of them to a diverse model zoo is yet to be adequately addressed. In light of this research gap, this paper introduces Model-GLUE, a holistic LLM scaling guideline. First, our work starts with a benchmarking of existing LLM scaling techniques, especially selective merging, and variants of mixture. Utilizing the insights from the benchmark results, we formulate an strategy for the selection and aggregation of a heterogeneous model zoo characterizing different architectures and initialization. Our methodology involves the clustering of mergeable models and optimal merging strategy selection, and the integration of clusters through a model mixture. Finally, evidenced by our experiments on a diverse Llama-2-based model zoo, Model-GLUE shows an average performance enhancement of 5.61%, achieved without additional training. Codes are available at: https://github.com/Model-GLUE/Model-GLUE., Comment: 24 pages, 4 figures, accepted to NeurIPS 2024 Datasets and Benchmarks Track
- Published
- 2024
13. Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving
- Author
-
Liu, Haochen, Chen, Li, Qiao, Yu, Lv, Chen, and Li, Hongyang
- Subjects
Computer Science - Robotics - Abstract
Autonomous driving system aims for safe and social-consistent driving through the behavioral integration among interactive agents. However, challenges remain due to multi-agent scene uncertainty and heterogeneous interaction. Current dense and sparse behavioral representations struggle with inefficiency and inconsistency in multi-agent modeling, leading to instability of collective behavioral patterns when integrating prediction and planning (IPP). To address this, we initiate a topological formation that serves as a compliant behavioral foreground to guide downstream trajectory generations. Specifically, we introduce Behavioral Topology (BeTop), a pivotal topological formulation that explicitly represents the consensual behavioral pattern among multi-agent future. BeTop is derived from braid theory to distill compliant interactive topology from multi-agent future trajectories. A synergistic learning framework (BeTopNet) supervised by BeTop facilitates the consistency of behavior prediction and planning within the predicted topology priors. Through imitative contingency learning, BeTop also effectively manages behavioral uncertainty for prediction and planning. Extensive verification on large-scale real-world datasets, including nuPlan and WOMD, demonstrates that BeTop achieves state-of-the-art performance in both prediction and planning tasks. Further validations on the proposed interactive scenario benchmark showcase planning compliance in interactive cases.
- Published
- 2024
14. Impact of the Out-of-Plane Flow Shear on Magnetic Reconnection at the Flanks of Earth's Magnetopause
- Author
-
Liang, Haoming, Chen, Li-Jen, Bessho, Naoki, and Ng, Jonathan
- Subjects
Physics - Space Physics ,Physics - Plasma Physics - Abstract
Magnetic reconnection changes the magnetic field topology and facilitates the energy and particle exchange at magnetospheric boundaries such as the Earth's magnetopause. The flow shear perpendicular to the reconnecting plane prevails at the flank magnetopause under southward interplanetary magnetic field (IMF) conditions. However, the effect of the out-of-plane flow shear on asymmetric reconnection is an open question. In this study, we utilize kinetic simulations to investigate the impact of the out-of-plane flow shear on asymmetric reconnection. By systematically varying the flow shear strength, we analyze the flow shear effects on the reconnection rate, the diffusion region structure, and the energy conversion rate. We find that the reconnection rate increases with the upstream out-of-plane flow shear, and for the same upstream conditions, it is higher at the dusk side than at the dawn side. The diffusion region is squeezed in the outflow direction due to magnetic pressure which is proportional to the square of the Alfv\'en Mach number of the shear flow. The out-of-plane flow shear increases the energy conversion rate J \cdot E', and for the same upstream conditions, the magnitude of J \cdot E' is larger at the dusk side than at the dawn side. This study reveals that out-of-plane flow shear not only enhances the reconnection rate but also significantly boosts energy conversion, with more pronounced effects on the dusk-side flank than on the dawn-side flank. These insights pave the way for better understanding the solar wind-magnetosphere interactions., Comment: 20 pages, 8 figures
- Published
- 2024
15. Imagine yourself: Tuning-Free Personalized Image Generation
- Author
-
He, Zecheng, Sun, Bo, Juefei-Xu, Felix, Ma, Haoyu, Ramchandani, Ankit, Cheung, Vincent, Shah, Siddharth, Kalia, Anmol, Subramanyam, Harihar, Zareian, Alireza, Chen, Li, Jain, Ankit, Zhang, Ning, Zhang, Peizhao, Sumbaly, Roshan, Vajda, Peter, and Sinha, Animesh
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Diffusion models have demonstrated remarkable efficacy across various image-to-image tasks. In this research, we introduce Imagine yourself, a state-of-the-art model designed for personalized image generation. Unlike conventional tuning-based personalization techniques, Imagine yourself operates as a tuning-free model, enabling all users to leverage a shared framework without individualized adjustments. Moreover, previous work met challenges balancing identity preservation, following complex prompts and preserving good visual quality, resulting in models having strong copy-paste effect of the reference images. Thus, they can hardly generate images following prompts that require significant changes to the reference image, \eg, changing facial expression, head and body poses, and the diversity of the generated images is low. To address these limitations, our proposed method introduces 1) a new synthetic paired data generation mechanism to encourage image diversity, 2) a fully parallel attention architecture with three text encoders and a fully trainable vision encoder to improve the text faithfulness, and 3) a novel coarse-to-fine multi-stage finetuning methodology that gradually pushes the boundary of visual quality. Our study demonstrates that Imagine yourself surpasses the state-of-the-art personalization model, exhibiting superior capabilities in identity preservation, visual quality, and text alignment. This model establishes a robust foundation for various personalization applications. Human evaluation results validate the model's SOTA superiority across all aspects (identity preservation, text faithfulness, and visual appeal) compared to the previous personalization models.
- Published
- 2024
16. Correlations of Methyl Formate (CH3OCHO), Dimethyl Ether (CH3OCH3) and Ketene (H2CCO) in High-mass Star-forming Regions
- Author
-
Li, Chuanshou, Qin, Sheng-Li, Liu, Tie, Liu, Sheng-Yuan, Tang, Mengyao, Liu, Hong-Li, Chen, Li, Li, Xiaohu, Xu, Fengwei, Zhang, Tianwei, Liu, Meizhu, Shi, Hongqiong, and Wu, Yuefang
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We present high-spatial-resolution (0.7 to 1.0 arcsec) submillimeter observations of continuum and molecular lines of CH3OCHO, CH3OCH3, and H2CCO toward 11 high-mass star-forming regions using the Atacama Large Millimetre/submillimetre Array (ALMA). A total of 19 separate cores from 9 high-mass star-forming regions are found to be line-rich, including high-, intermediate-, and low-mass line-rich cores. The three molecules are detected in these line-rich cores. We map the emission of CH3OCHO, CH3OCH3, and H2CCO in 9 high-mass star-forming regions. The spatial distribution of the three molecules is very similar and concentrated in the areas of intense continuum emission. We also calculate the rotation temperatures, column densities, and abundances of CH3OCHO, CH3OCH3, and H2CCO under the local thermodynamic equilibrium (LTE) assumption. The abundances relative to H2 and CH3OH, and line widths of the three molecules are significantly correlated. The abundances relative to H2, temperatures and line widths of the three molecules tend to be higher in cores with higher mass and outflows detected. The possible chemical links of the three molecules are discussed., Comment: 35 pages, 9 figures
- Published
- 2024
- Full Text
- View/download PDF
17. Hardware-efficient quantum error correction using concatenated bosonic qubits
- Author
-
Putterman, Harald, Noh, Kyungjoo, Hann, Connor T., MacCabe, Gregory S., Aghaeimeibodi, Shahriar, Patel, Rishi N., Lee, Menyoung, Jones, William M., Moradinejad, Hesam, Rodriguez, Roberto, Mahuli, Neha, Rose, Jefferson, Owens, John Clai, Levine, Harry, Rosenfeld, Emma, Reinhold, Philip, Moncelsi, Lorenzo, Alcid, Joshua Ari, Alidoust, Nasser, Arrangoiz-Arriola, Patricio, Barnett, James, Bienias, Przemyslaw, Carson, Hugh A., Chen, Cliff, Chen, Li, Chinkezian, Harutiun, Chisholm, Eric M., Chou, Ming-Han, Clerk, Aashish, Clifford, Andrew, Cosmic, R., Curiel, Ana Valdes, Davis, Erik, DeLorenzo, Laura, D'Ewart, J. Mitchell, Diky, Art, D'Souza, Nathan, Dumitrescu, Philipp T., Eisenmann, Shmuel, Elkhouly, Essam, Evenbly, Glen, Fang, Michael T., Fang, Yawen, Fling, Matthew J., Fon, Warren, Garcia, Gabriel, Gorshkov, Alexey V., Grant, Julia A., Gray, Mason J., Grimberg, Sebastian, Grimsmo, Arne L., Haim, Arbel, Hand, Justin, He, Yuan, Hernandez, Mike, Hover, David, Hung, Jimmy S. C., Hunt, Matthew, Iverson, Joe, Jarrige, Ignace, Jaskula, Jean-Christophe, Jiang, Liang, Kalaee, Mahmoud, Karabalin, Rassul, Karalekas, Peter J., Keller, Andrew J., Khalajhedayati, Amirhossein, Kubica, Aleksander, Lee, Hanho, Leroux, Catherine, Lieu, Simon, Ly, Victor, Madrigal, Keven Villegas, Marcaud, Guillaume, McCabe, Gavin, Miles, Cody, Milsted, Ashley, Minguzzi, Joaquin, Mishra, Anurag, Mukherjee, Biswaroop, Naghiloo, Mahdi, Oblepias, Eric, Ortuno, Gerson, Pagdilao, Jason, Pancotti, Nicola, Panduro, Ashley, Paquette, JP, Park, Minje, Peairs, Gregory A., Perello, David, Peterson, Eric C., Ponte, Sophia, Preskill, John, Qiao, Johnson, Refael, Gil, Resnick, Rachel, Retzker, Alex, Reyna, Omar A., Runyan, Marc, Ryan, Colm A., Sahmoud, Abdulrahman, Sanchez, Ernesto, Sanil, Rohan, Sankar, Krishanu, Sato, Yuki, Scaffidi, Thomas, Siavoshi, Salome, Sivarajah, Prasahnt, Skogland, Trenton, Su, Chun-Ju, Swenson, Loren J., Teo, Stephanie M., Tomada, Astrid, Torlai, Giacomo, Wollack, E. Alex, Ye, Yufeng, Zerrudo, Jessica A., Zhang, Kailing, Brandão, Fernando G. S. L., Matheny, Matthew H., and Painter, Oskar
- Subjects
Quantum Physics - Abstract
In order to solve problems of practical importance, quantum computers will likely need to incorporate quantum error correction, where a logical qubit is redundantly encoded in many noisy physical qubits. The large physical-qubit overhead typically associated with error correction motivates the search for more hardware-efficient approaches. Here, using a microfabricated superconducting quantum circuit, we realize a logical qubit memory formed from the concatenation of encoded bosonic cat qubits with an outer repetition code of distance $d=5$. The bosonic cat qubits are passively protected against bit flips using a stabilizing circuit. Cat-qubit phase-flip errors are corrected by the repetition code which uses ancilla transmons for syndrome measurement. We realize a noise-biased CX gate which ensures bit-flip error suppression is maintained during error correction. We study the performance and scaling of the logical qubit memory, finding that the phase-flip correcting repetition code operates below threshold, with logical phase-flip error decreasing with code distance from $d=3$ to $d=5$. Concurrently, the logical bit-flip error is suppressed with increasing cat-qubit mean photon number. The minimum measured logical error per cycle is on average $1.75(2)\%$ for the distance-3 code sections, and $1.65(3)\%$ for the longer distance-5 code, demonstrating the effectiveness of bit-flip error suppression throughout the error correction cycle. These results, where the intrinsic error suppression of the bosonic encodings allows us to use a hardware-efficient outer error correcting code, indicate that concatenated bosonic codes are a compelling paradigm for reaching fault-tolerant quantum computation., Comment: Comments on the manuscript welcome!
- Published
- 2024
18. Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition
- Author
-
Wang, Chien-Chun, Chen, Li-Wei, Chou, Cheng-Kang, Lee, Hung-Shin, Chen, Berlin, and Wang, Hsin-Min
- Subjects
Computer Science - Sound ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
While pre-trained automatic speech recognition (ASR) systems demonstrate impressive performance on matched domains, their performance often degrades when confronted with channel mismatch stemming from unseen recording environments and conditions. To mitigate this issue, we propose a novel channel-aware data simulation method for robust ASR training. Our method harnesses the synergistic power of channel-extractive techniques and generative adversarial networks (GANs). We first train a channel encoder capable of extracting embeddings from arbitrary audio. On top of this, channel embeddings are extracted using a minimal amount of target-domain data and used to guide a GAN-based speech synthesizer. This synthesizer generates speech that faithfully preserves the phonetic content of the input while mimicking the channel characteristics of the target domain. We evaluate our method on the challenging Hakka Across Taiwan (HAT) and Taiwanese Across Taiwan (TAT) corpora, achieving relative character error rate (CER) reductions of 20.02% and 9.64%, respectively, compared to the baselines. These results highlight the efficacy of our channel-aware data simulation method for bridging the gap between source- and target-domain acoustics., Comment: Submitted to ICASSP 2025
- Published
- 2024
19. Differentiable Collision-Supervised Tooth Arrangement Network with a Decoupling Perspective
- Author
-
He, Zhihui, Wang, Chengyuan, Yang, Shidong, Chen, Li, Zhou, Yanheng, and Wang, Shuo
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Tooth arrangement is an essential step in the digital orthodontic planning process. Existing learning-based methods use hidden teeth features to directly regress teeth motions, which couples target pose perception and motion regression. It could lead to poor perceptions of three-dimensional transformation. They also ignore the possible overlaps or gaps between teeth of predicted dentition, which is generally unacceptable. Therefore, we propose DTAN, a differentiable collision-supervised tooth arrangement network, decoupling predicting tasks and feature modeling. DTAN decouples the tooth arrangement task by first predicting the hidden features of the final teeth poses and then using them to assist in regressing the motions between the beginning and target teeth. To learn the hidden features better, DTAN also decouples the teeth-hidden features into geometric and positional features, which are further supervised by feature consistency constraints. Furthermore, we propose a novel differentiable collision loss function for point cloud data to constrain the related gestures between teeth, which can be easily extended to other 3D point cloud tasks. We propose an arch-width guided tooth arrangement network, named C-DTAN, to make the results controllable. We construct three different tooth arrangement datasets and achieve drastically improved performance on accuracy and speed compared with existing methods., Comment: 16 pages, 13 figures
- Published
- 2024
20. Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
- Author
-
Aldeneh, Zakaria, Higuchi, Takuya, Jung, Jee-weon, Chen, Li-Wei, Shum, Stephen, Abdelaziz, Ahmed Hussen, Watanabe, Shinji, Likhomanenko, Tatiana, and Theobald, Barry-John
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Sound - Abstract
Iterative self-training, or iterative pseudo-labeling (IPL)--using an improved model from the current iteration to provide pseudo-labels for the next iteration--has proven to be a powerful approach to enhance the quality of speaker representations. Recent applications of IPL in unsupervised speaker recognition start with representations extracted from very elaborate self-supervised methods (e.g., DINO). However, training such strong self-supervised models is not straightforward (they require hyper-parameters tuning and may not generalize to out-of-domain data) and, moreover, may not be needed at all. To this end, we show the simple, well-studied, and established i-vector generative model is enough to bootstrap the IPL process for unsupervised learning of speaker representations. We also systematically study the impact of other components on the IPL process, which includes the initial model, the encoder, augmentations, the number of clusters, and the clustering algorithm. Remarkably, we find that even with a simple and significantly weaker initial model like i-vector, IPL can still achieve speaker verification performance that rivals state-of-the-art methods., Comment: Submitted to ICASSP 2025
- Published
- 2024
21. Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models
- Author
-
Chen, Li-Wei, Higuchi, Takuya, Bai, He, Abdelaziz, Ahmed Hussen, Rudnicky, Alexander, Watanabe, Shinji, Likhomanenko, Tatiana, Theobald, Barry-John, and Aldeneh, Zakaria
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Sound - Abstract
Speech foundation models, such as HuBERT and its variants, are pre-trained on large amounts of unlabeled speech for various downstream tasks. These models use a masked prediction objective, where the model learns to predict information about masked input segments from the unmasked context. The choice of prediction targets in this framework can influence performance on downstream tasks. For example, targets that encode prosody are beneficial for speaker-related tasks, while targets that encode phonetics are more suited for content-related tasks. Additionally, prediction targets can vary in the level of detail they encode; targets that encode fine-grained acoustic details are beneficial for denoising tasks, while targets that encode higher-level abstractions are more suited for content-related tasks. Despite the importance of prediction targets, the design choices that affect them have not been thoroughly studied. This work explores the design choices and their impact on downstream task performance. Our results indicate that the commonly used design choices for HuBERT can be suboptimal. We propose novel approaches to create more informative prediction targets and demonstrate their effectiveness through improvements across various downstream tasks., Comment: Submitted to ICASSP 2025
- Published
- 2024
22. Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
- Author
-
Bu, Qingwen, Zeng, Jia, Chen, Li, Yang, Yanchao, Zhou, Guyue, Yan, Junchi, Luo, Ping, Cui, Heming, Ma, Yi, and Li, Hongyang
- Subjects
Computer Science - Robotics - Abstract
Despite significant progress in robotics and embodied AI in recent years, deploying robots for long-horizon tasks remains a great challenge. Majority of prior arts adhere to an open-loop philosophy and lack real-time feedback, leading to error accumulation and undesirable robustness. A handful of approaches have endeavored to establish feedback mechanisms leveraging pixel-level differences or pre-trained visual representations, yet their efficacy and adaptability have been found to be constrained. Inspired by classic closed-loop control systems, we propose CLOVER, a closed-loop visuomotor control framework that incorporates feedback mechanisms to improve adaptive robotic control. CLOVER consists of a text-conditioned video diffusion model for generating visual plans as reference inputs, a measurable embedding space for accurate error quantification, and a feedback-driven controller that refines actions from feedback and initiates replans as needed. Our framework exhibits notable advancement in real-world robotic tasks and achieves state-of-the-art on CALVIN benchmark, improving by 8% over previous open-loop counterparts. Code and checkpoints are maintained at https://github.com/OpenDriveLab/CLOVER., Comment: Accepted at NeurIPS 2024. Code and models: https://github.com/OpenDriveLab/CLOVER
- Published
- 2024
23. Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages
- Author
-
Cheng, Yao-Fei, Chen, Li-Wei, Lee, Hung-Shin, and Wang, Hsin-Min
- Subjects
Computer Science - Computation and Language ,Computer Science - Sound ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
This study investigates the efficacy of data augmentation techniques for low-resource automatic speech recognition (ASR), focusing on two endangered Austronesian languages, Amis and Seediq. Recognizing the potential of self-supervised learning (SSL) in low-resource settings, we explore the impact of data volume on the continued pre-training of SSL models. We propose a novel data-selection scheme leveraging a multilingual corpus to augment the limited target language data. This scheme utilizes a language classifier to extract utterance embeddings and employs one-class classifiers to identify utterances phonetically and phonologically proximate to the target languages. Utterances are ranked and selected based on their decision scores, ensuring the inclusion of highly relevant data in the SSL-ASR pipeline. Our experimental results demonstrate the effectiveness of this approach, yielding substantial improvements in ASR performance for both Amis and Seediq. These findings underscore the feasibility and promise of data augmentation through cross-lingual transfer learning for low-resource language ASR.
- Published
- 2024
24. Multiplex Graph Contrastive Learning with Soft Negatives
- Author
-
Zhao, Zhenhao, Zhu, Minhong, Wang, Chen, Wang, Sijia, Zhang, Jiqiang, Chen, Li, and Cai, Weiran
- Subjects
Computer Science - Machine Learning - Abstract
Graph Contrastive Learning (GCL) seeks to learn nodal or graph representations that contain maximal consistent information from graph-structured data. While node-level contrasting modes are dominating, some efforts commence to explore consistency across different scales. Yet, they tend to lose consistent information and be contaminated by disturbing features. Here, we introduce MUX-GCL, a novel cross-scale contrastive learning paradigm that utilizes multiplex representations as effective patches. While this learning mode minimizes contaminating noises, a commensurate contrasting strategy using positional affinities further avoids information loss by correcting false negative pairs across scales. Extensive downstream experiments demonstrate that MUX-GCL yields multiple state-of-the-art results on public datasets. Our theoretical analysis further guarantees the new objective function as a stricter lower bound of mutual information of raw input features and output embeddings, which rationalizes this paradigm. Code is available at https://github.com/MUX-GCL/Code.
- Published
- 2024
25. Revisiting the Time Cost Model of AllReduce
- Author
-
Xiong, Dian, Chen, Li, Jiang, Youhe, Li, Dan, Wang, Shuai, and Wang, Songtao
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
AllReduce is an important and popular collective communication primitive, which has been widely used in areas such as distributed machine learning and high performance computing. To design, analyze, and choose from various algorithms and implementations of AllReduce, the time cost model plays a crucial role, and the predominant one is the $(\alpha,\beta,\gamma)$ model. In this paper, we revisit this model, and reveal that it cannot well characterize the time cost of AllReduce on modern clusters; thus must be updated. We perform extensive measurements to identify two additional terms contributing to the time cost: the incast term and the memory access term. We augment the $(\alpha,\beta,\gamma)$ model with these two terms, and present GenModel as a result. Using GenModel, we discover two new optimalities for AllReduce algorithms, and prove that they cannot be achieved simultaneously. Finally, striking the balance between the two new optimalities, we design GenTree, an AllReduce plan generation algorithm specialized for tree-like topologies. Experiments on a real testbed with 64 GPUs show that GenTree can achieve 1.22$\times$ to 1.65$\times$ speed-up against NCCL. Large-scale simulations also confirm that GenTree can improve the state-of-the-art AllReduce algorithm by a factor of $1.2$ to $7.4$ in scenarios where the two new terms dominate.
- Published
- 2024
26. VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka
- Author
-
Chen, Li-Wei, Lee, Hung-Shin, and Chang, Chen-Chi
- Subjects
Computer Science - Sound ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
This paper introduces VoxHakka, a text-to-speech (TTS) system designed for Taiwanese Hakka, a critically under-resourced language spoken in Taiwan. Leveraging the YourTTS framework, VoxHakka achieves high naturalness and accuracy and low real-time factor in speech synthesis while supporting six distinct Hakka dialects. This is achieved by training the model with dialect-specific data, allowing for the generation of speaker-aware Hakka speech. To address the scarcity of publicly available Hakka speech corpora, we employed a cost-effective approach utilizing a web scraping pipeline coupled with automatic speech recognition (ASR)-based data cleaning techniques. This process ensured the acquisition of a high-quality, multi-speaker, multi-dialect dataset suitable for TTS training. Subjective listening tests conducted using comparative mean opinion scores (CMOS) demonstrate that VoxHakka significantly outperforms existing publicly available Hakka TTS systems in terms of pronunciation accuracy, tone correctness, and overall naturalness. This work represents a significant advancement in Hakka language technology and provides a valuable resource for language preservation and revitalization efforts., Comment: Accepted to O-COCOSDA 2024
- Published
- 2024
27. Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation
- Author
-
Wang, Chien-Chun, Chen, Li-Wei, Lee, Hung-Shin, Chen, Berlin, and Wang, Hsin-Min
- Subjects
Computer Science - Sound ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Cross-domain speech enhancement (SE) is often faced with severe challenges due to the scarcity of noise and background information in an unseen target domain, leading to a mismatch between training and test conditions. This study puts forward a novel data simulation method to address this issue, leveraging noise-extractive techniques and generative adversarial networks (GANs) with only limited target noisy speech data. Notably, our method employs a noise encoder to extract noise embeddings from target-domain data. These embeddings aptly guide the generator to synthesize utterances acoustically fitted to the target domain while authentically preserving the phonetic content of the input clean speech. Furthermore, we introduce the notion of dynamic stochastic perturbation, which can inject controlled perturbations into the noise embeddings during inference, thereby enabling the model to generalize well to unseen noise conditions. Experiments on the VoiceBank-DEMAND benchmark dataset demonstrate that our domain-adaptive SE method outperforms an existing strong baseline based on data simulation., Comment: Accepted to IEEE SLT 2024
- Published
- 2024
28. ProphetFuzz: Fully Automated Prediction and Fuzzing of High-Risk Option Combinations with Only Documentation via Large Language Model
- Author
-
Wang, Dawei, Zhou, Geng, Chen, Li, Li, Dan, and Miao, Yukai
- Subjects
Computer Science - Cryptography and Security - Abstract
Vulnerabilities related to option combinations pose a significant challenge in software security testing due to their vast search space. Previous research primarily addressed this challenge through mutation or filtering techniques, which inefficiently treated all option combinations as having equal potential for vulnerabilities, thus wasting considerable time on non-vulnerable targets and resulting in low testing efficiency. In this paper, we utilize carefully designed prompt engineering to drive the large language model (LLM) to predict high-risk option combinations (i.e., more likely to contain vulnerabilities) and perform fuzz testing automatically without human intervention. We developed a tool called ProphetFuzz and evaluated it on a dataset comprising 52 programs collected from three related studies. The entire experiment consumed 10.44 CPU years. ProphetFuzz successfully predicted 1748 high-risk option combinations at an average cost of only \$8.69 per program. Results show that after 72 hours of fuzzing, ProphetFuzz discovered 364 unique vulnerabilities associated with 12.30\% of the predicted high-risk option combinations, which was 32.85\% higher than that found by state-of-the-art in the same timeframe. Additionally, using ProphetFuzz, we conducted persistent fuzzing on the latest versions of these programs, uncovering 140 vulnerabilities, with 93 confirmed by developers and 21 awarded CVE numbers., Comment: Preprint
- Published
- 2024
- Full Text
- View/download PDF
29. Effects of food restriction on energy metabolism in male Apodemus chevrieri from Hengduan mountain region of China
- Author
-
Gong, Xue-na, Chen, Li-xin, Zhang, Hao, and Zhu, Wan-long
- Published
- 2020
- Full Text
- View/download PDF
30. Earth's Alfv\'{e}n Wings: Unveiling Dynamic Variations of Field-line Topologies with Electron Distributions
- Author
-
Gurram, Harsha, Shuster, Jason R., Chen, Li-Jen, Hasegawa, Hiroshi, Denton, Richard E., Burkholder, Brandon L., Beedle, Jason, Gershman, Daniel J., and Burch, James
- Subjects
Physics - Space Physics ,Physics - Plasma Physics - Abstract
The magnetic cloud (MC) of the Coronal Mass Ejection on April 24, 2023, contains sub-Alfv\'{e}nic solar wind, transforming Earth's magnetosphere from conventional bow-shock magnetotail configuration to Alfv\'{e}n wings. Utilizing measurements from the Magnetosphere Multiscale (MMS) mission, we present for the first time electron distribution signatures as the spacecraft traverses through various magnetic topologies during this transformation. Specifically, we characterize electrons inside the sub-Alfv\'{e}nic MC, on the dawn-dusk wing field lines and on the closed field lines. The signatures include strahl electrons in MC regions and energetic keV electrons streaming along the dawn and dusk wing field lines. We demonstrate the distribution signatures of dual wing reconnection, defined as reconnection between dawn-dusk Alfv\'{e}n wing field lines and the IMF. These signatures include four electron populations comprised of partially-depleted MC electrons and bi-directional energetic electrons with variations in energy and pitch-angle. The distributions reveal evidence of bursty magnetic reconnection under northward IMF., Comment: 11pages 4 figures
- Published
- 2024
31. In-Lab High Resolution Mid-infrared Up-conversion Stellar Interferometer Based on Synthetic Long Base-Line
- Author
-
Han, Zhao-Qi-Zhi, Ge, Zheng, Luo, Wen-Tao, Cai, Yi-Fu, Wang, Xiao-Hua, Chen, Li, Li, Wu-Zhen, Zhou, Zhi-Yuan, and Shi, Bao-Sen
- Subjects
Astrophysics - Instrumentation and Methods for Astrophysics ,Physics - Applied Physics ,Physics - Optics - Abstract
Detecting mid-infrared (MIR) radiation has significant astronomical applications, although limited by unsatisfactory MIR detectors. Here we reported on the realization of a MIR up-conversion interferometer based on synthetic long base-line (SLBL) in the laboratory. The experimental system consisted of an interferometer and subsequent up-conversion detection part of mid-infrared signal, which streamlined the structure and enhanced the reliability of the system. By using a tungsten filament lamp as an imitated star, we not only achieved the single target angle resolution of 1.10 times 10^(-4) rad, but also obtained the field angle resolution of 3.0 times 10^(-4) rad of double star targets. The angular resolution is in inverse proportion to the length of baseline. The maximum length of simulated baseline in the laboratory is about 3cm. In a Keck Interferometer (KI) liked program, the base line can reach up to 85m leading to a corresponding angular resolution of 3.0 times 10^(-9) rad (about 1.8mas). The study will offer potential benefits in extending the usage of mid-infrared light in astronomical exploration., Comment: 11 pages, 4 figures. Accepted by Physics Review D
- Published
- 2024
- Full Text
- View/download PDF
32. BackFlip: The Impact of Local and Global Data Augmentations on Artistic Image Aesthetic Assessment
- Author
-
Strafforello, Ombretta, Odriozola, Gonzalo Muradas, Behrad, Fatemeh, Chen, Li-Wei, Maerten, Anne-Sofie, Soydaner, Derya, and Wagemans, Johan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Assessing the aesthetic quality of artistic images presents unique challenges due to the subjective nature of aesthetics and the complex visual characteristics inherent to artworks. Basic data augmentation techniques commonly applied to natural images in computer vision may not be suitable for art images in aesthetic evaluation tasks, as they can change the composition of the art images. In this paper, we explore the impact of local and global data augmentation techniques on artistic image aesthetic assessment (IAA). We introduce BackFlip, a local data augmentation technique designed specifically for artistic IAA. We evaluate the performance of BackFlip across three artistic image datasets and four neural network architectures, comparing it with the commonly used data augmentation techniques. Then, we analyze the effects of components within the BackFlip pipeline through an ablation study. Our findings demonstrate that local augmentations, such as BackFlip, tend to outperform global augmentations on artistic IAA in most cases, probably because they do not perturb the composition of the art images. These results emphasize the importance of considering both local and global augmentations in future computational aesthetics research., Comment: Published at the VISART VII workshop at ECCV 2024. Ombretta Strafforello, Gonzalo Muradas Odriozola, Fatemeh Behrad, Li-Wei Chen, Anne-Sofie Maerten and Derya Soydaner contributed equally to this work
- Published
- 2024
33. Exploiting Student Parallelism for Low-latency GPU Inference of BERT-like Models in Online Services
- Author
-
Wang, Weiyan, Jin, Yilun, Zhang, Yiming, Wei, Victor Junqiu, Tian, Han, Chen, Li, and Chen, Kai
- Subjects
Computer Science - Machine Learning - Abstract
Due to high accuracy, BERT-like models have been widely adopted by discriminative text mining and web searching. However, large BERT-like models suffer from inefficient online inference, as they face the following two problems on GPUs. First, they rely on the large model depth to achieve high accuracy, which linearly increases the sequential computation on GPUs. Second, stochastic and dynamic online workloads cause extra costs. In this paper, we present Academus for low-latency online inference of BERT-like models. At the core of Academus is the novel student parallelism, which adopts boosting ensemble and stacking distillation to distill the original deep model into an equivalent group of parallel and shallow student models. This enables Academus to achieve the lower model depth (e.g., two layers) than baselines and consequently the lowest inference latency without affecting the accuracy.For occasional workload bursts, it can temporarily decrease the number of students with minimal accuracy loss to improve throughput. Additionally, it employs specialized system designs for student parallelism to better handle stochastic online workloads. We conduct comprehensive experiments to verify the effectiveness. The results show that Academus outperforms the baselines by 4.1X~1.6X in latency without compromising accuracy, and achieves up to 22.27X higher throughput for workload bursts.
- Published
- 2024
34. Quantum entanglement and non-Hermiticity in free-fermion systems
- Author
-
Chen, Li-Mei, Zhou, Yao, Chen, Shuai A., and Ye, Peng
- Subjects
Quantum Physics ,Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Strongly Correlated Electrons - Abstract
This topical review article reports rapid progress on the generalization and application of entanglement in non-Hermitian free-fermion quantum systems. We begin by examining the realization of non-Hermitian quantum systems through the Lindblad master equation, alongside a review of typical non-Hermitian free-fermion systems that exhibit unique features. A pedagogical discussion is provided on the relationship between entanglement quantities and the correlation matrix in Hermitian systems. Building on this foundation, we focus on how entanglement concepts are extended to non-Hermitian systems from their Hermitian free-fermion counterparts, with a review of the general properties that emerge. Finally, we highlight various concrete studies, demonstrating that entanglement entropy remains a powerful diagnostic tool for characterizing non-Hermitian physics. The entanglement spectrum also reflects the topological characteristics of non-Hermitian topological systems, while unique non-Hermitian entanglement behaviors are also discussed. The review is concluded with several future directions. Through this review, we hope to provide a useful guide for researchers who are interested in entanglement in non-Hermitian quantum systems., Comment: A Topical Review of the Interplay of Entanglement and Non-Hermitian Physics (to appear in the Special Issue of Non-Hermitian Physics in Chin. Phys. Lett.). version 3; ~15p, 1figure, texts and refs. updated, approximate to final version
- Published
- 2024
35. Foreshock Ultra-Low Frequency Waves at Mars: Consequence on the Particle Acceleration Mechanisms at the Martian Bow Shock
- Author
-
Andrés, Nahuel, Romanelli, Norberto, Mazelle, Christian, Chen, Li-Jen, Gruesbeck, Jacob R., and Espley, Jared R.
- Subjects
Physics - Plasma Physics ,Physics - Space Physics - Abstract
Using Mars Atmosphere and Volatile EvolutioN Magnetometer observations, we report the first statistical study of ultra-low frequency (ULF) waves at the Martian foreshock. The analyzed foreshock ULF wave events are observed in the 0.008-0.086 Hz frequency range, with nearly circular and elliptical left-handed polarization in the spacecraft reference frame. These waves are propagated quasi-parallel to the ambient magnetic field, with a moderate wave amplitude. All these properties are consistent with fast magnetosonic waves, most likely generated through the ion-ion right-hand resonant instability. In addition, our results suggest that the associated resonant backstreaming protons' velocities parallel to the mean magnetic field in the solar wind reference frame is $1.33 \pm 0.40$ times the solar wind velocity. The similarity between our results and previous reports at other foreshocks may indicate the presence of a common acceleration process acting in planetary bow shocks and that is responsible for this particular backstreaming population.
- Published
- 2024
36. Quantum key distribution based on mid-infrared and telecom band two-color entanglement source
- Author
-
Li, Wu-Zhen, Zhou, Chun, Wang, Yang, Chen, Li, Chen, Ren-Hui, Han, Zhao-Qi-Zhi, Gao, Ming-Yuan, Wang, Xiao-Hua, Zheng, Di-Yuan, Xie, Meng-Yu, Li, Yin-Hai, Zhou, Zhi-Yuan, Bao, Wan-Su, and Shi, Bao-Sen
- Subjects
Quantum Physics - Abstract
Due to the high noise caused by solar background radiation, the existing satellite-based free-space quantum key distribution (QKD) experiments are mainly carried out at night, hindering the establishment of a practical all-day real-time global-scale quantum network. Given that the 3-5 {\mu}m mid-infrared (MIR) band has extremely low solar background radiation and strong scattering resistance, it is one of the ideal bands for free-space quantum communication. Here, firstly, we report on the preparation of a high-quality MIR (3370 nm) and telecom band (1555 nm) two-color polarization-entangled photon source, then we use this source to realize a principle QKD based on free-space and fiber hybrid channels in a laboratory. The theoretical analysis clearly shows that a long-distance QKD over 500 km of free-space and 96 km of fiber hybrid channels can be reached simultaneously. This work represents a significant step toward developing all-day global-scale quantum communication networks., Comment: 24 pages, 9 figures
- Published
- 2024
37. Quantum-Enhanced Polarimetric Imaging
- Author
-
Xie, Meng-Yu, Niu, Su-Jian, Han, Zhao-Qi-Zhi, Li, Yin-Hai, Chen, Ren-Hui, Wang, Xiao-Hua, Gao, Ming-Yuan, Chen, Li, Song, Yue-Wei, Zhou, Zhi-Yuan, and Shi, Bao-Sen
- Subjects
Physics - Optics ,Quantum Physics - Abstract
Polarimetric imaging, a technique that captures the invisible polarization-related properties of given materials, has broad applications from fundamental physics to advanced fields such as target recognition, stress detection, biomedical diagnosis and remote sensing. The introduction of quantum sources into classical imaging systems has demonstrated distinct advantages, yet few studies have explored their combination with polarimetric imaging. In this study, we present a quantum polarimetric imaging system that integrates polarization-entangled photon pairs into a polarizer-sample-compensator-analyzer (PSRA)-type polarimeter. Our system visualizes the birefringence properties of a periodical-distributed anisotropic material under decreasing illumination levels and diverse disturbing light sources. Compared to the classical system, the quantum approach reveals the superior sensitivity and robustness in low-light conditions, particularly useful in biomedical studies where the low illumination and non-destructive detection are urgently needed. The study also highlights the nonlocality of entangled photons in birefringence measurement, indicating the potential of quantum polarimetric system in the remote sensing domain.
- Published
- 2024
38. Comprehensive characterization of tumor therapeutic response with simultaneous mapping cell size, density, and transcytolemmal water exchange
- Author
-
Shi, Diwei, Li, Sisi, Liu, Fan, Jiang, Xiaoyu, Wu, Lei, Chen, Li, Zheng, Quanshui, Bao, Haihua, Guo, Hua, and Xu, Junzhong
- Subjects
Physics - Medical Physics - Abstract
Early assessment of tumor therapeutic response is an important topic in precision medicine to optimize personalized treatment regimens and reduce unnecessary toxicity, cost, and delay. Although diffusion MRI (dMRI) has shown potential to address this need, its predictive accuracy is limited, likely due to its unspecific sensitivity to overall pathological changes. In this work, we propose a new quantitative dMRI-based method dubbed EXCHANGE (MRI of water Exchange, Confined and Hindered diffusion under Arbitrary Gradient waveform Encodings) for simultaneous mapping of cell size, cell density, and transcytolemmal water exchange. Such rich microstructural information comprehensively evaluates tumor pathologies at the cellular level. Validations using numerical simulations and in vitro cell experiments confirmed that the EXCHANGE method can accurately estimate mean cell size, density, and water exchange rate constants. The results from in vivo animal experiments show the potential of EXCHANGE for monitoring tumor treatment response. Finally, the EXCHANGE method was implemented in breast cancer patients with neoadjuvant chemotherapy, demonstrating its feasibility in assessing tumor therapeutic response in clinics. In summary, a new, quantitative dMRI-based EXCHANGE method was proposed to comprehensively characterize tumor microstructural properties at the cellular level, suggesting a unique means to monitor tumor treatment response in clinical practice., Comment: 40 pages, 6 figures
- Published
- 2024
39. Comparison of Large Language Models for Generating Contextually Relevant Questions
- Author
-
Molina, Ivo Lodovico, Švábenský, Valdemar, Minematsu, Tsubasa, Chen, Li, Okubo, Fumiya, and Shimada, Atsushi
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Computers and Society ,K.3 - Abstract
This study explores the effectiveness of Large Language Models (LLMs) for Automatic Question Generation in educational settings. Three LLMs are compared in their ability to create questions from university slide text without fine-tuning. Questions were obtained in a two-step pipeline: first, answer phrases were extracted from slides using Llama 2-Chat 13B; then, the three models generated questions for each answer. To analyze whether the questions would be suitable in educational applications for students, a survey was conducted with 46 students who evaluated a total of 246 questions across five metrics: clarity, relevance, difficulty, slide relation, and question-answer alignment. Results indicate that GPT-3.5 and Llama 2-Chat 13B outperform Flan T5 XXL by a small margin, particularly in terms of clarity and question-answer alignment. GPT-3.5 especially excels at tailoring questions to match the input answers. The contribution of this research is the analysis of the capacity of LLMs for Automatic Question Generation in education., Comment: Published in Springer ECTEL 2024 conference proceedings, see https://doi.org/10.1007/978-3-031-72312-4_18
- Published
- 2024
- Full Text
- View/download PDF
40. Evolution of cooperation in the public goods game with Q-learning
- Author
-
Zheng, Guozhong, Zhang, Jiqiang, Deng, Shengfeng, Cai, Weiran, and Chen, Li
- Subjects
Quantitative Biology - Populations and Evolution ,Condensed Matter - Statistical Mechanics ,Nonlinear Sciences - Adaptation and Self-Organizing Systems - Abstract
Recent paradigm shifts from imitation learning to reinforcement learning (RL) is shown to be productive in understanding human behaviors. In the RL paradigm, individuals search for optimal strategies through interaction with the environment to make decisions. This implies that gathering, processing, and utilizing information from their surroundings are crucial. However, existing studies typically study pairwise games such as the prisoners' dilemma and employ a self-regarding setup, where individuals play against one opponent based solely on their own strategies, neglecting the environmental information. In this work, we investigate the evolution of cooperation with the multiplayer game -- the public goods game using the Q-learning algorithm by leveraging the environmental information. Specifically, the decision-making of players is based upon the cooperation information in their neighborhood. Our results show that cooperation is more likely to emerge compared to the case of imitation learning by using Fermi rule. Of particular interest is the observation of an anomalous non-monotonic dependence which is revealed when voluntary participation is further introduced. The analysis of the Q-table explains the mechanisms behind the cooperation evolution. Our findings indicate the fundamental role of environment information in the RL paradigm to understand the evolution of cooperation, and human behaviors in general., Comment: 16 pages, 12 figures, comments are appreciated
- Published
- 2024
41. The evolution of cooperation with Q-learning: the impact of information perception
- Author
-
Zheng, Guozhong, Ding, Zhenwei, Zhang, Jiqiang, Deng, Shengfeng, Cai, Weiran, and Chen, Li
- Subjects
Quantitative Biology - Populations and Evolution ,Condensed Matter - Statistical Mechanics ,Nonlinear Sciences - Adaptation and Self-Organizing Systems - Abstract
The inherent huge complexities in human beings show a remarkable diversity in response to complex surroundings, enabling us to tackle problems from different perspectives. In the realm of cooperation studies, however, existing work assumes that individuals get access to the same kind of information to make their decisions, in contrast to the facts that individuals often perceive differently. Here, within the reinforcement learning framework, we investigate the impact of information perception on the evolution of cooperation in a 2-person scenario when playing the prisoner's dilemma game. We demonstrate that distinctly different evolution processes are observed in three information perception scenarios, revealing that the structure of information significantly affects the emergence of cooperation. Notably, the asymmetric information scenario exhibits a rich dynamical process, including the cooperation emergence, breakdown, and reconstruction, akin to psychological changes in humans. Our findings indicate that the information structure is vital to the emergence of cooperation, shedding new light on establishing mutually stable cooperative relationships and understanding human behavioral complexities in general., Comment: 12pages, 13figures, comments are appreciated
- Published
- 2024
42. Mean-Field Control for Diffusion Aggregation system with Coulomb Interaction
- Author
-
Chen, Li, Wang, Yucheng, and Wang, Zhao
- Subjects
Mathematics - Optimization and Control ,Mathematics - Analysis of PDEs - Abstract
The mean-field control problem for a multi-dimensional diffusion-aggregation system with Coulomb interaction (the so called parabolic elliptic Keller-Segel system) is considered. The existence of optimal control is proved through the $\Gamma$-convergence of the corresponding control problem of the interacting particle system. There are three building blocks in the whole argument. Firstly, for the optimal control problem on the particle level, instead of using classical method for stochastic system, we study directly the control problem of high-dimensional parabolic equation, i.e. the Liouville equation of it. Secondly, we obtain a strong propagation of chaos result for the interacting particle system by combining the convergence in probability and relative entropy method. Due to this strong mean field limit result, we avoid giving compact support requirement for control functions, which has been often used in the literature. Thirdly, because of strong aggregation effect, additional difficulties arise from control function in obtaining the well-posedness theory of the diffusion-aggregation equation, so that the known method cannot be directly applied. Instead, we use a combination of local existence result and bootstrap argument to obtain the global solution in the sub-critical regime.
- Published
- 2024
43. Almost-Linear Time Algorithms for Decremental Graphs: Min-Cost Flow and More via Duality
- Author
-
Brand, Jan van den, Chen, Li, Kyng, Rasmus, Liu, Yang P., Meierhans, Simon, Gutenberg, Maximilian Probst, and Sachdeva, Sushant
- Subjects
Computer Science - Data Structures and Algorithms - Abstract
We give the first almost-linear total time algorithm for deciding if a flow of cost at most $F$ still exists in a directed graph, with edge costs and capacities, undergoing decremental updates, i.e., edge deletions, capacity decreases, and cost increases. This implies almost-linear time algorithms for approximating the minimum-cost flow value and $s$-$t$ distance on such decremental graphs. Our framework additionally allows us to maintain decremental strongly connected components in almost-linear time deterministically. These algorithms also improve over the current best known runtimes for statically computing minimum-cost flow, in both the randomized and deterministic settings. We obtain our algorithms by taking the dual perspective, which yields cut-based algorithms. More precisely, our algorithm computes the flow via a sequence of $m^{1+o(1)}$ dynamic min-ratio cut problems, the dual analog of the dynamic min-ratio cycle problem that underlies recent fast algorithms for minimum-cost flow. Our main technical contribution is a new data structure that returns an approximately optimal min-ratio cut in amortized $m^{o(1)}$ time by maintaining a tree-cut sparsifier. This is achieved by devising a new algorithm to maintain the dynamic expander hierarchy of [Goranci-R\"{a}cke-Saranurak-Tan, SODA 2021] that also works in capacitated graphs. All our algorithms are deterministc, though they can be sped up further using randomized techniques while still working against an adaptive adversary., Comment: 61 pages, Accepted to FOCS 2024
- Published
- 2024
44. FedClust: Tackling Data Heterogeneity in Federated Learning through Weight-Driven Client Clustering
- Author
-
Islam, Md Sirajul, Javaherian, Simin, Xu, Fei, Yuan, Xu, Chen, Li, and Tzeng, Nian-Feng
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Federated learning (FL) is an emerging distributed machine learning paradigm that enables collaborative training of machine learning models over decentralized devices without exposing their local data. One of the major challenges in FL is the presence of uneven data distributions across client devices, violating the well-known assumption of independent-and-identically-distributed (IID) training samples in conventional machine learning. To address the performance degradation issue incurred by such data heterogeneity, clustered federated learning (CFL) shows its promise by grouping clients into separate learning clusters based on the similarity of their local data distributions. However, state-of-the-art CFL approaches require a large number of communication rounds to learn the distribution similarities during training until the formation of clusters is stabilized. Moreover, some of these algorithms heavily rely on a predefined number of clusters, thus limiting their flexibility and adaptability. In this paper, we propose {\em FedClust}, a novel approach for CFL that leverages the correlation between local model weights and the data distribution of clients. {\em FedClust} groups clients into clusters in a one-shot manner by measuring the similarity degrees among clients based on the strategically selected partial weights of locally trained models. We conduct extensive experiments on four benchmark datasets with different non-IID data settings. Experimental results demonstrate that {\em FedClust} achieves higher model accuracy up to $\sim$45\% as well as faster convergence with a significantly reduced communication cost up to 2.7$\times$ compared to its state-of-the-art counterparts.
- Published
- 2024
45. T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy
- Author
-
Duan, Fan, Yu, Jiahao, and Chen, Li
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Point clouds are commonly used in various practical applications such as autonomous driving and the manufacturing industry. However, these point clouds often suffer from incompleteness due to limited perspectives, scanner resolution and occlusion. Therefore the prediction of missing parts performs a crucial task. In this paper, we propose a novel method for point cloud completion. We utilize a spherical template to guide the generation of the coarse complete template and generate the dynamic query tokens through a correspondence pooling (Corres-Pooling) query generator. Specifically, we first generate the coarse complete template by embedding a Gaussian spherical template into the partial input and transforming the template to best match the input. Then we use the Corres-Pooling query generator to refine the coarse template and generate dynamic query tokens which could be used to predict the complete point proxies. Finally, we generate the complete point cloud with a FoldingNet following the coarse-to-fine paradigm, according to the fine template and the predicted point proxies. Experimental results demonstrate that our T-CorresNet outperforms the state-of-the-art methods on several benchmarks. Our Codes are available at https://github.com/df-boy/T-CorresNet., Comment: Accepted to ECCV 2024
- Published
- 2024
46. Linear algebra of quadratic forms and polynomial identity
- Author
-
Chen, Li
- Subjects
Mathematics - Number Theory ,Mathematics - Functional Analysis - Abstract
Let $S_1=\{p_1,p_2,\cdots, p_l\}\subset\cc[z_1,z_2\cdots,z_n]$ be a set of quadratic forms such that $p_i=q_i^2$ where $\{q_i\}_{i=1}^l$ are linear forms. For $1\leq k\leq l$, let $S_k=\{p_{i_1}p_{i_2}\cdots p_{i_k}|1\leq i_1
- Published
- 2024
47. Can GPT-4 Help Detect Quit Vaping Intentions? An Exploration of Automatic Data Annotation Approach
- Author
-
Vuruma, Sai Krishna Revanth, Wu, Dezhi, Gupta, Saborny Sen, Aust, Lucas, Lookingbill, Valerie, Bellamy, Wyatt, Ren, Yang, Kasson, Erin, Chen, Li-Shiun, Cavazos-Rehg, Patricia, Hu, Dian, and Huang, Ming
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Emerging Technologies ,Computer Science - Human-Computer Interaction ,Computer Science - Social and Information Networks - Abstract
In recent years, the United States has witnessed a significant surge in the popularity of vaping or e-cigarette use, leading to a notable rise in cases of e-cigarette and vaping use-associated lung injury (EVALI) that caused hospitalizations and fatalities during the EVALI outbreak in 2019, highlighting the urgency to comprehend vaping behaviors and develop effective strategies for cessation. Due to the ubiquity of social media platforms, over 4.7 billion users worldwide use them for connectivity, communications, news, and entertainment with a significant portion of the discourse related to health, thereby establishing social media data as an invaluable organic data resource for public health research. In this study, we extracted a sample dataset from one vaping sub-community on Reddit to analyze users' quit-vaping intentions. Leveraging OpenAI's latest large language model GPT-4 for sentence-level quit vaping intention detection, this study compares the outcomes of this model against layman and clinical expert annotations. Using different prompting strategies such as zero-shot, one-shot, few-shot and chain-of-thought prompting, we developed 8 prompts with varying levels of detail to explain the task to GPT-4 and also evaluated the performance of the strategies against each other. These preliminary findings emphasize the potential of GPT-4 in social media data analysis, especially in identifying users' subtle intentions that may elude human detection., Comment: Accepted for the AI Applications in Public Health and Social Services workshop at the 22nd International Conference on Artificial Intelligence in Medicine (AIME 2024)
- Published
- 2024
48. Pistis-RAG: Enhancing Retrieval-Augmented Generation with Human Feedback
- Author
-
Bai, Yu, Miao, Yukai, Chen, Li, Wang, Dawei, Li, Dan, Ren, Yanyu, Xie, Hongtao, Yang, Ce, and Cai, Xuhui
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
RAG systems face limitations when semantic relevance alone does not guarantee improved generation quality. This issue becomes particularly evident due to the sensitivity of large language models (LLMs) to the ordering of few-shot prompts, which can affect model performance. To address this challenge, aligning LLM outputs with human preferences using structured feedback, such as options to copy, regenerate, or dislike, offers a promising method for improvement. This feedback is applied to the entire list of inputs rather than giving specific ratings for individual documents, making it a Listwide Labels Learning-to-Rank task. To address this task, we propose Pistis-RAG, a new RAG framework designed with a content-centric approach to better align LLMs with human preferences. Pistis-RAG effectively utilizes human feedback, enhancing content ranking and generation quality. To validate our framework, we use public datasets to simulate human feedback, allowing us to evaluate and refine our method effectively. Experimental results indicate that Pistis-RAG improves alignment with human preferences relative to the baseline RAG system, showing a 6.06% increase in MMLU (English) and a 7.08% increase in C-EVAL (Chinese) accuracy metrics. These results highlight Pistis-RAG's effectiveness in overcoming the limitations associated with traditional RAG approaches.
- Published
- 2024
49. Catalytic evolution of cooperation in a population with behavioural bimodality
- Author
-
Sheng, Anhui, Zhang, Jing, Zheng, Guozhong, Zhang, Jiqiang, Cai, Weiran, and Chen, Li
- Subjects
Quantitative Biology - Populations and Evolution ,Condensed Matter - Disordered Systems and Neural Networks ,Nonlinear Sciences - Adaptation and Self-Organizing Systems - Abstract
The remarkable adaptability of humans in response to complex environments is often demonstrated by the context-dependent adoption of different behavioral modes. However, the existing game-theoretic studies mostly focus on the single-mode assumption, and the impact of this behavioral multimodality on the evolution of cooperation remains largely unknown. Here, we study how cooperation evolves in a population with two behavioral modes. Specifically, we incorporate Q-learning and Tit-for-Tat (TFT) rules into our toy model, where prisoner's dilemma game is played and we investigate the impact of the mode mixture on the evolution of cooperation. While players in Q-learning mode aim to maximize their accumulated payoffs, players within TFT mode repeat what their neighbors have done to them. In a structured mixing implementation where the updating rule is fixed for each individual, we find that the mode mixture greatly promotes the overall cooperation prevalence. The promotion is even more significant in the probabilistic mixing, where players randomly select one of the two rules at each step. Finally, this promotion is robust when players are allowed to adaptively choose the two modes by real-time comparison. In all three scenarios, players within the Q-learning mode act as catalyzer that turns the TFT players to be more cooperative, and as a result drive the whole population to be highly cooperative. The analysis of Q-tables explains the underlying mechanism of cooperation promotion, which captures the ``psychologic evolution" in the players' mind. Our study indicates that the variety of behavioral modes is non-negligible, and could be crucial to clarify the emergence of cooperation in the real world., Comment: 11 pages, 12 figure. Comments are appreciated
- Published
- 2024
50. FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation
- Author
-
Mu, Xinzhi, Chen, Li, Chen, Bohan, Gu, Shuyang, Bao, Jianmin, Chen, Dong, Li, Ji, and Yuan, Yuhui
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recently, the application of modern diffusion-based text-to-image generation models for creating artistic fonts, traditionally the domain of professional designers, has garnered significant interest. Diverging from the majority of existing studies that concentrate on generating artistic typography, our research aims to tackle a novel and more demanding challenge: the generation of text effects for multilingual fonts. This task essentially requires generating coherent and consistent visual content within the confines of a font-shaped canvas, as opposed to a traditional rectangular canvas. To address this task, we introduce a novel shape-adaptive diffusion model capable of interpreting the given shape and strategically planning pixel distributions within the irregular canvas. To achieve this, we curate a high-quality shape-adaptive image-text dataset and incorporate the segmentation mask as a visual condition to steer the image generation process within the irregular-canvas. This approach enables the traditionally rectangle canvas-based diffusion model to produce the desired concepts in accordance with the provided geometric shapes. Second, to maintain consistency across multiple letters, we also present a training-free, shape-adaptive effect transfer method for transferring textures from a generated reference letter to others. The key insights are building a font effect noise prior and propagating the font effect information in a concatenated latent space. The efficacy of our FontStudio system is confirmed through user preference studies, which show a marked preference (78% win-rates on aesthetics) for our system even when compared to the latest unrivaled commercial product, Adobe Firefly., Comment: Project-page: https://font-studio.github.io/
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.