Author: "Wang, Yuxin" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wang, Yuxin"' showing total 6,102 results

Start Over Author "Wang, Yuxin"

6,102 results on '"Wang, Yuxin"'

1. Category-Selective Neurons in Deep Networks: Comparing Purely Visual and Visual-Language Models

Author: Lu, Zitong and Wang, Yuxin
Subjects: Quantitative Biology - Neurons and Cognition, Computer Science - Computer Vision and Pattern Recognition
Abstract: Category-selective regions in the human brain, such as the fusiform face area (FFA), extrastriate body area (EBA), parahippocampal place area (PPA), and visual word form area (VWFA), play a crucial role in high-level visual processing. Here, we investigate whether artificial neural networks (ANNs) exhibit similar category-selective neurons and how these neurons vary across model layers and between purely visual and vision-language models. Inspired by fMRI functional localizer experiments, we presented images from different categories (faces, bodies, scenes, words, scrambled scenes, and scrambled words) to deep networks and identified category-selective neurons using statistical criteria. Comparing ResNet and the structurally controlled ResNet-based CLIP model, we found that both models contain category-selective neurons, with their proportion increasing across layers, mirroring category selectivity in higher-level visual brain regions. However, CLIP exhibited a higher proportion but lower specificity of category-selective neurons compared to ResNet. Additionally, CLIP's category-selective neurons were more evenly distributed across feature maps and demonstrated greater representational consistency across layers. These findings suggest that language learning increases the number of category-selective neurons while reducing their selectivity strength, reshaping visual representations in deep networks. Our study provides insights into how ANNs mirror biological vision and how multimodal learning influences category-selective representations.
Published: 2025

2. DreamDDP: Accelerating Data Parallel Distributed LLM Training with Layer-wise Scheduled Partial Synchronization

Author: Tang, Zhenheng, Tang, Zichen, Huang, Junlin, Pan, Xinglin, Yan, Rudan, Wang, Yuxin, Zhou, Amelie Chi, Shi, Shaohuai, Chu, Xiaowen, and Li, Bo
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: The growth of large language models (LLMs) increases challenges of accelerating distributed training across multiple GPUs in different data centers. Moreover, concerns about data privacy and data exhaustion have heightened interest in geo-distributed data centers. Communication in geo-distributed data parallel training (DDP) with stochastic gradient descent (S-SGD) is the main bottleneck in low-bandwidth environments. Local SGD mitigates communication overhead by reducing synchronization frequency, and recent studies have successfully applied it to geo-distributedly pre-train LLMs. However, we identify that its model synchronization mechanism prevents overlapping communication and computation, which makes the system lose opportunities to overlap communication and computation. To overcome this limitation, we expand the design space of local SGD by layer-wisely decoupling model synchronization. In each iteration, only some layers are synchronized instead of the entire model after a specific number of iterations. Leveraging this methodology, we introduce DreamDDP, a training framework to accelerate low-bandwidth distributed training with three key innovations: (1) partial local SGD with theoretical assurances of convergence rates comparable to S-SGD; (2) overlapping parameter synchronization with computation without extra GPU memory occupation; (3) identifying and exploiting three properties to schedule the communication and computation to reduce the training time based on fine-grained profiling of layer-wise communication and computation time. Empirical evaluations conducted on 32 GPUs using prominent deep learning models, including ResNet-18, ResNet-50, GPT-2, and Llama-2, demonstrate that DreamDDP enhances the convergence properties of Local SGD (and Adam) and achieves speedups ranging from $1.49\times$ to $3.91\times$ over leading baseline methods.
Published: 2025

3. Field induced density wave in a kagome superconductor

Author: Hossain, Md Shafayat, Zhang, Qi, Ingham, Julian, Liu, Jinjin, Shao, Sen, Li, Yangmu, Wang, Yuxin, Pokharel, Bal K., Cheng, Zi-Jia, Jiang, Yu-Xiao, Litskevich, Maksim, Kim, Byunghoon, Yang, Xian, Li, Yongkai, Cochran, Tyler A., Yao, Yugui, Popović, Dragana, Wang, Zhiwei, Chang, Guoqing, Thomale, Ronny, Balicas, Luis, and Hasan, M. Zahid
Subjects: Condensed Matter - Strongly Correlated Electrons, Condensed Matter - Mesoscale and Nanoscale Physics, Condensed Matter - Materials Science
Abstract: On the kagome lattice, electrons benefit from the simultaneous presence of band topology, flat electronic bands, and van Hove singularities, forming competing or cooperating orders. Understanding the interrelation between these distinct order parameters remains a significant challenge, leaving much of the associated physics unexplored. In the kagome superconductor KV3Sb5, which exhibits a charge density wave (CDW) state below T = 78 K, we uncover an unpredicted field-induced phase transition below 6 K. The observed transition is marked by a hysteretic anomaly in the resistivity, nonlinear electrical transport, and a change in the symmetry of the electronic response as probed via the angular dependence of the magnetoresistivity. These observations surprisingly suggest the emergence of an unanticipated broken symmetry state coexisting with the original CDW. To understand this experimental observation, we developed a theoretical minimal model for the normal state inside the high-temperature parent CDW phase where an incommensurate CDW order emerges as an instability sub-leading to superconductivity. The incommensurate CDW emerges when superconducting fluctuations become fully suppressed by large magnetic fields. Our results suggest that, in kagome superconductors, quantum states can either coexist or are nearly degenerate in energy, indicating that these are rich platforms to expose new correlated phenomena.
Published: 2025

4. The Mottness and the Anderson localization in bilayer nickelate La$_3$Ni$_2$O$_{7-\delta}$

Author: Wang, Yuxin, Chen, Ziyan, Zhang, Yi, Jiang, Kun, and Hu, Jiangping
Subjects: Condensed Matter - Strongly Correlated Electrons, Condensed Matter - Superconductivity
Abstract: The oxygen content plays a pivotal role in determining the electronic properties of the recently discovered La$_3$Ni$_2$O$_{7-\delta}$ superconductors. In this work, we investigate the impact of oxygen vacancies on the insulating behavior of La$_3$Ni$_2$O$_{7-\delta}$ across the doping range $\delta = 0$ to $0.5$. At $\delta = 0.5$, we construct a bilayer Hubbard model to describe the system. Using dynamical mean-field theory, we demonstrate that the model captures the characteristics of a bilayer Mott insulator. To explore the effects of disorder within the range $\delta = 0$ to $0.5$, we treat the system as a mixture of metallic and Mott insulating phases. By analyzing the geometric average of the local density of states, we identify an Anderson localization transition occurring around $\delta \sim 0.2$. These results provide a quantitative explanation of recent experimental observations and highlight the critical influence of oxygen content on the physical properties of La$_3$Ni$_2$O$_{7-\delta}$., Comment: 6 pages, 4 figures and Supplemental Materials
Published: 2025

5. F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting

Author: Wang, Yuxin, Wu, Qianyi, and Xu, Dan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper tackles the problem of generalizable 3D-aware generation from monocular datasets, e.g., ImageNet. The key challenge of this task is learning a robust 3D-aware representation without multi-view or dynamic data, while ensuring consistent texture and geometry across different viewpoints. Although some baseline methods are capable of 3D-aware generation, the quality of the generated images still lags behind state-of-the-art 2D generation approaches, which excel in producing high-quality, detailed images. To address this severe limitation, we propose a novel feed-forward pipeline based on pixel-aligned Gaussian Splatting, coined as F3D-Gaus, which can produce more realistic and reliable 3D renderings from monocular inputs. In addition, we introduce a self-supervised cycle-consistent constraint to enforce cross-view consistency in the learned 3D representation. This training strategy naturally allows aggregation of multiple aligned Gaussian primitives and significantly alleviates the interpolation limitations inherent in single-view pixel-aligned Gaussian Splatting. Furthermore, we incorporate video model priors to perform geometry-aware refinement, enhancing the generation of fine details in wide-viewpoint scenarios and improving the model's capability to capture intricate 3D textures. Extensive experiments demonstrate that our approach not only achieves high-quality, multi-view consistent 3D-aware generation from monocular datasets, but also significantly improves training and inference efficiency., Comment: Project Page: https://w-ted.github.io/publications/F3D-Gaus
Published: 2025

6. Field-applicable loop-mediated isothermal amplification for the detection of seven common human papillomavirus subtypes

Author: Li, Hongyi, He, Tan, Lv, Xiaona, Zhiqiang, Han, Wang, Yuxin, Gao, Shijue, Zhang, Ruiqin, Shen, Xinxin, Ma, Xuejun, and Yanqing, Tie
Published: 2024

7. The electronic structure and disorder effect of La$_3$Ni$_2$O$_{7}$ superconductor

Author: Wang, Yuxin, Zhang, Yi, and Jiang, Kun
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Strongly Correlated Electrons
Abstract: Determining the electronic structure of La$_3$Ni$_2$O$_7$ is an essential step towards uncovering their superconducting mechanism. It is widely believed that the bilayer apical oxygens play an important role in the bilayer La$_3$Ni$_2$O$_7$ electronic structure. Applying the hybrid exchange-correlation functionals, we obtain a more accurate electronic structure of La$_3$Ni$_2$O$_7$ at its high-pressure phase, where the binding $d_{z^2}$ band is below the Fermi level owing to apical oxygen. The symmetry properties of this electronic structure and its corresponding tight-binding model are further analyzed. We find the antisymmetric part is highly entangled leading to a minimal nearly degenerate two-orbital model. Then, the apical oxygen vacancies effect is studied using the dynamical cluster approximation. This disorder effect strongly destroys the antisymmetric $\beta$ Fermi surface leading to the possible disappearance of superconductivity., Comment: 6 pages, 4 figures and appendix
Published: 2024

8. Low-temperature mean valence of nickel ions in pressurized La$_3$Ni$_2$O$_7$

Author: Cai, Shu, Zhou, Yazhou, Sun, Hualei, Zhang, Kai, Zhao, Jinyu, Huo, Mengwu, Nataf, Lucie, Wang, Yuxin, Li, Jie, Guo, Jing, Jiang, Kun, Wang, Meng, Ding, Yang, Yang, Wenge, Lu, Yi, Kong, Qingyu, Wu, Qi, Hu, Jiangping, Xiang, Tao, Mao, Ho-kwang, and Sun, Liling
Subjects: Condensed Matter - Superconductivity
Abstract: The discovery of high critical temperature (Tc) superconductivity in pressurized La$_3$Ni$_2$O$_7$ has ignited renewed excitement in the search of novel high-Tc superconducting compounds with 3d transition metals. Compared to other ambient-pressure superconductors, such as copper-oxide and iron-oxypnictides, unraveling the mechanisms of the pressure-induced superconductivity poses significant and unique challenges. A critical factor in this phenomenon seems to be related to the electronic configuration of 3d orbitals, which may play a fundamental role in driving high-Tc superconductivity. However, the pressure effects on the mixed-valence states of 3d-orbital cations and their influence on the emergence of high-Tc superconductivity remain poorly understood. Here, we use high-pressure (P) and low-temperature synchrotron X-ray absorption spectroscopy to investigate the influence of pressure on the mean valence change of Ni ions in La$_3$Ni$_2$O$_7$. Our results demonstrate that at a low-temperature of 20 K, the mean valence remains relatively stable across the pressures range from 1 atm to 40 GPa. Based on analyzing the absorption data, we find that, at a critical pressure, the ambient-pressure ordered phases disappear and both the structural and the superconducting phase transition occur. The pressure-induced structural phase transition revealed by our absorption results is consistent with that determined by X-ray diffraction, offering new information for a comprehensive understanding on the pressure-induced superconductivity in La$_3$Ni$_2$O$_7$., Comment: 10 pages and 4 figures
Published: 2024

9. Skew generalized von Neumann-Jordan constant for the Bana\'s-Fr\k{a}czek space

Author: Chen, Linhui, Liu, Qi, Tan, Xiewei, and Wang, Yuxin
Subjects: Mathematics - Functional Analysis, 46B20, F.2.2, I.2.7
Abstract: For any $\lambda>1, R_\lambda^2$ is Bana\'s-Fr\k{a}czek space, the exact value of the skew generalized von Neumann-Jordan constant $C_{\mathrm{NJ}}^p\left(\xi, \eta, R_\lambda^2\right)$ is calculated. By careful calculations, $C_{\mathrm{NJ}}^p\left(\xi, \eta, R_\lambda^2\right)=\frac{(\xi+\eta)^p+\left[(\eta+\xi)^2-\frac{4 \xi \eta}{\lambda^2}\right]^{p / 2}}{2^{p-1}\left(\xi^p+\eta^p\right)}$ is given., Comment: for associated mpeg file, see http://myhost.domain/file.mpg
Published: 2024
Full Text: View/download PDF

10. Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets

Author: Wang, Yuxin, Schröder, Maresa, Frauen, Dennis, Schweisthal, Jonas, Hess, Konstantin, and Feuerriegel, Stefan
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Constructing confidence intervals (CIs) for the average treatment effect (ATE) from patient records is crucial to assess the effectiveness and safety of drugs. However, patient records typically come from different hospitals, thus raising the question of how multiple observational datasets can be effectively combined for this purpose. In our paper, we propose a new method that estimates the ATE from multiple observational datasets and provides valid CIs. Our method makes little assumptions about the observational datasets and is thus widely applicable in medical practice. The key idea of our method is that we leverage prediction-powered inferences and thereby essentially `shrink' the CIs so that we offer more precise uncertainty quantification as compared to na\"ive approaches. We further prove the unbiasedness of our method and the validity of our CIs. We confirm our theoretical results through various numerical experiments. Finally, we provide an extension of our method for constructing CIs from combinations of experimental and observational datasets.
Published: 2024

11. A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions

Author: Wang, Jiankang, Xu, Jianjun, Wang, Xiaorui, Wang, Yuxin, Xing, Mengting, Fang, Shancheng, Chen, Zhineng, Xie, Hongtao, and Zhang, Yongdong
Subjects: Computer Science - Computation and Language
Abstract: Synthesizing high-quality reasoning data for continual training has been proven to be effective in enhancing the performance of Large Language Models (LLMs). However, previous synthetic approaches struggle to easily scale up data and incur high costs in the pursuit of high quality. In this paper, we propose the Graph-based Synthetic Data Pipeline (GSDP), an economical and scalable framework for high-quality reasoning data synthesis. Inspired by knowledge graphs, we extracted knowledge points from seed data and constructed a knowledge point relationships graph to explore their interconnections. By exploring the implicit relationships among knowledge, our method achieves $\times$255 data expansion. Furthermore, GSDP led by open-source models, achieves synthesis quality comparable to GPT-4-0613 while maintaining $\times$100 lower costs. To tackle the most challenging mathematical reasoning task, we present the GSDP-MATH dataset comprising over 1.91 million pairs of math problems and answers. After fine-tuning on GSDP-MATH, GSDP-7B based on Mistral-7B achieves 37.7% accuracy on MATH and 78.4% on GSM8K, demonstrating the effectiveness of our method. The dataset and models trained in this paper will be available.
Published: 2024

12. Real-time volumetric free-hand ultrasound imaging for large-sized organs: A study of imaging the whole spine

Author: Li, Caozhe, Shen, Enxiang, Wang, Haoyang, Wang, Yuxin, Yuan, Jie, Gong, Li, Zhao, Di, Zhang, Weijing, and Jin, Zhibin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Three-dimensional (3D) ultrasound imaging can overcome the limitations of conventional two dimensional (2D) ultrasound imaging in structural observation and measurement. However, conducting volumetric ultrasound imaging for large-sized organs still faces difficulties including long acquisition time, inevitable patient movement, and 3D feature recognition. In this study, we proposed a real-time volumetric free-hand ultrasound imaging system optimized for the above issues and applied it to the clinical diagnosis of scoliosis. This study employed an incremental imaging method coupled with algorithmic acceleration to enable real-time processing and visualization of the large amounts of data generated when scanning large-sized organs. Furthermore, to deal with the difficulty of image feature recognition, we proposed two tissue segmentation algorithms to reconstruct and visualize the spinal anatomy in 3D space by approximating the depth at which the bone structures are located and segmenting the ultrasound images at different depths. We validated the adaptability of our system by deploying it to multiple models of ultra-sound equipment and conducting experiments using different types of ultrasound probes. We also conducted experiments on 6 scoliosis patients and 10 normal volunteers to evaluate the performance of our proposed method. Ultrasound imaging of a volunteer spine from shoulder to crotch (more than 500 mm) was performed in 2 minutes, and the 3D imaging results displayed in real-time were compared with the corresponding X-ray images with a correlation coefficient of 0.96 in spinal curvature. Our proposed volumetric ultrasound imaging system might hold the potential to be clinically applied to other large-sized organs.
Published: 2024

13. Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing

Author: Qu, Yadong, Wang, Yuxin, Zhou, Bangbang, Wang, Zixiao, Xie, Hongtao, and Zhang, Yongdong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing scene text recognition (STR) methods struggle to recognize challenging texts, especially for artistic and severely distorted characters. The limitation lies in the insufficient exploration of character morphologies, including the monotonousness of widely used synthetic training data and the sensitivity of the model to character morphologies. To address these issues, inspired by the human learning process of viewing and summarizing, we facilitate the contrastive learning-based STR framework in a self-motivated manner by leveraging synthetic and real unlabeled data without any human cost. In the viewing process, to compensate for the simplicity of synthetic data and enrich character morphology diversity, we propose an Online Generation Strategy to generate background-free samples with diverse character styles. By excluding background noise distractions, the model is encouraged to focus on character morphology and generalize the ability to recognize complex samples when trained with only simple synthetic data. To boost the summarizing process, we theoretically demonstrate the derivation error in the previous character contrastive loss, which mistakenly causes the sparsity in the intra-class distribution and exacerbates ambiguity on challenging samples. Therefore, a new Character Unidirectional Alignment Loss is proposed to correct this error and unify the representation of the same characters in all samples by aligning the character features in the student model with the reference features in the teacher model. Extensive experiment results show that our method achieves SOTA performance (94.7\% and 70.9\% average accuracy on common benchmarks and Union14M-Benchmark). Code will be available at https://github.com/qqqyd/ViSu.
Published: 2024

14. ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentence

Author: Wang, Yuxin, Zhu, Xiaomeng, Lyu, Weimin, Hassanpour, Saeed, and Vosoughi, Soroush
Subjects: Computer Science - Computation and Language
Abstract: Handling implicit language is essential for natural language processing systems to achieve precise text understanding and facilitate natural interactions with users. Despite its importance, the absence of a metric for accurately measuring the implicitness of language significantly constrains the depth of analysis possible in evaluating models' comprehension capabilities. This paper addresses this gap by developing a scalar metric that quantifies the implicitness level of language without relying on external references. Drawing on principles from traditional linguistics, we define "implicitness" as the divergence between semantic meaning and pragmatic interpretation. To operationalize this definition, we introduce ImpScore, a reference-free metric formulated through an interpretable regression model. This model is trained using pairwise contrastive learning on a specially curated dataset consisting of (implicit sentence, explicit sentence) pairs. We validate ImpScore through a user study that compares its assessments with human evaluations on out-of-distribution data, demonstrating its accuracy and strong correlation with human judgments. Additionally, we apply ImpScore to hate speech detection datasets, illustrating its utility and highlighting significant limitations in current large language models' ability to understand highly implicit content. Our metric is publicly available at https://github.com/audreycs/ImpScore., Comment: Accepted to ICLR 2025
Published: 2024

15. Electronic structure and superconducting properties of LaNiO$_2$

Author: Chen, Ziyan, Wang, Yuxin, Jiang, Kun, and Hu, Jiangping
Subjects: Condensed Matter - Strongly Correlated Electrons, Condensed Matter - Superconductivity
Abstract: Motivated by recent photoemission measurements on the La$_{0.8}$Sr$_{0.2}$NiO$_2$, we carry out a systematic study of the infinite-layer nickelate using both dynamical mean-field theory and density matrix embedding theory. The renormalized electronic structure and Fermi surface of correlated La$_{0.8}$Sr$_{0.2}$NiO$_2$ are studied in an effective two-band model through the dynamical mean-field calculation. We find the correlation effects reflect mainly on the Ni $d$ band, which is consistent with the experimental findings. We further study the ground state including magnetism and superconductivity through the density matrix embedding theory. Within the experimental doping range and rigid-band approximation, we show that the $d$-wave superconductivity is the lowest energy state, while the static magnetism is absent except very close to zero doping. These findings provide a new understanding of infinite-layer nickelate superconductivity., Comment: 6 pages, 3 figures and Supplemental Material
Published: 2024

16. ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference

Author: He, Xin, Zhang, Shunkang, Wang, Yuxin, Yin, Haiyan, Zeng, Zihao, Shi, Shaohuai, Tang, Zhenheng, Chu, Xiaowen, Tsang, Ivor, and Soon, Ong Yew
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Sparse Mixture of Experts (MoE) models, while outperforming dense Large Language Models (LLMs) in terms of performance, face significant deployment challenges during inference due to their high memory demands. Existing offloading techniques, which involve swapping activated and idle experts between the GPU and CPU, often suffer from rigid expert caching mechanisms. These mechanisms fail to adapt to dynamic routing, leading to inefficient cache utilization, or incur prohibitive costs for prediction training. To tackle these inference-specific challenges, we introduce ExpertFlow, a comprehensive system specifically designed to enhance inference efficiency by accommodating flexible routing and enabling efficient expert scheduling between CPU and GPU. This reduces overhead and boosts system performance. Central to our approach is a predictive routing path-based offloading mechanism that utilizes a lightweight predictor to accurately forecast routing paths before computation begins. This proactive strategy allows for real-time error correction in expert caching, significantly increasing cache hit ratios and reducing the frequency of expert transfers, thereby minimizing I/O overhead. Additionally, we implement a dynamic token scheduling strategy that optimizes MoE inference by rearranging input tokens across different batches. This method not only reduces the number of activated experts per batch but also improves computational efficiency. Our extensive experiments demonstrate that ExpertFlow achieves up to 93.72\% GPU memory savings and enhances inference speed by 2 to 10 times compared to baseline methods, highlighting its effectiveness and utility as a robust solution for resource-constrained inference scenarios., Comment: Mixture-of-Experts, Inference, Offloading
Published: 2024

17. FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression

Author: Tang, Zhenheng, Kang, Xueze, Yin, Yiming, Pan, Xinglin, Wang, Yuxin, He, Xin, Wang, Qiang, Zeng, Rongfei, Zhao, Kaiyong, Shi, Shaohuai, Zhou, Amelie Chi, Li, Bo, He, Bingsheng, and Chu, Xiaowen
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: To alleviate hardware scarcity in training large deep neural networks (DNNs), particularly large language models (LLMs), we present FusionLLM, a decentralized training system designed and implemented for training DNNs using geo-distributed GPUs across different computing clusters or individual devices. Decentralized training faces significant challenges regarding system design and efficiency, including: 1) the need for remote automatic differentiation (RAD), 2) support for flexible model definitions and heterogeneous software, 3) heterogeneous hardware leading to low resource utilization or the straggler problem, and 4) slow network communication. To address these challenges, in the system design, we represent the model as a directed acyclic graph of operators (OP-DAG). Each node in the DAG represents the operator in the DNNs, while the edge represents the data dependency between operators. Based on this design, 1) users are allowed to customize any DNN without caring low-level operator implementation; 2) we enable the task scheduling with the more fine-grained sub-tasks, offering more optimization space; 3) a DAG runtime executor can implement RAD withour requiring the consistent low-level ML framework versions. To enhance system efficiency, we implement a workload estimator and design an OP-Fence scheduler to cluster devices with similar bandwidths together and partition the DAG to increase throughput. Additionally, we propose an AdaTopK compressor to adaptively compress intermediate activations and gradients at the slowest communication links. To evaluate the convergence and efficiency of our system and algorithms, we train ResNet-101 and GPT-2 on three real-world testbeds using 48 GPUs connected with 8 Mbps~10 Gbps networks. Experimental results demonstrate that our system and method can achieve 1.45 - 9.39x speedup compared to baseline methods while ensuring convergence.
Published: 2024

18. CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information

Author: Wang, Yuxin, Ma, Minghua, Wang, Zekun, Chen, Jingchang, Fan, Huiming, Shan, Liping, Yang, Qing, Xu, Dongliang, Liu, Ming, and Qin, Bing
Subjects: Computer Science - Computation and Language
Abstract: The colossal parameters and computational overhead of Large Language Models (LLMs) challenge their real-world applications. Network pruning, which targets unstructured or structured sparsity by removing redundant parameters, has recently been explored for LLM acceleration. Existing LLM pruning works focus on unstructured pruning, which typically requires special hardware support for a practical speed-up. In contrast, structured pruning can reduce latency on general devices. However, it remains a challenge to perform structured pruning efficiently and maintain performance, especially at high sparsity ratios. To this end, we introduce an efficient structured pruning framework named CFSP, which leverages both Coarse (interblock) and Fine-grained (intrablock) activation information as an importance criterion to guide pruning. The pruning is highly efficient, as it only requires one forward pass to compute feature activations. Specifically, we first allocate the sparsity budget across blocks based on their importance and then retain important weights within each block. In addition, we introduce a recovery fine-tuning strategy that adaptively allocates training overhead based on coarse-grained importance to further improve performance. Experimental results demonstrate that CFSP outperforms existing methods on diverse models across various sparsity budgets. Our code will be available at https://github.com/wyxscir/CFSP., Comment: Proc. The 31st International Conference on Computational Linguistics (COLING2025)
Published: 2024

19. Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling

Author: Wang, Zixiao, Xie, Hongtao, Wang, YuXin, Qu, Yadong, Guo, Fengjun, and Liu, Pengwei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing scene text removal (STR) task suffers from insufficient training data due to the expensive pixel-level labeling. In this paper, we aim to address this issue by introducing a Text-aware Masked Image Modeling algorithm (TMIM), which can pretrain STR models with low-cost text detection labels (e.g., text bounding box). Different from previous pretraining methods that use indirect auxiliary tasks only to enhance the implicit feature extraction ability, our TMIM first enables the STR task to be directly trained in a weakly supervised manner, which explores the STR knowledge explicitly and efficiently. In TMIM, first, a Background Modeling stream is built to learn background generation rules by recovering the masked non-text region. Meanwhile, it provides pseudo STR labels on the masked text region. Second, a Text Erasing stream is proposed to learn from the pseudo labels and equip the model with end-to-end STR ability. Benefiting from the two collaborative streams, our STR model can achieve impressive performance only with the public text detection datasets, which greatly alleviates the limitation of the high-cost STR labels. Experiments demonstrate that our method outperforms other pretrain methods and achieves state-of-the-art performance (37.35 PSNR on SCUT-EnsText). Code will be available at https://github.com/wzx99/TMIM., Comment: Accepted by ECCV 2024
Published: 2024

20. Bandwidth-Aware and Overlap-Weighted Compression for Communication-Efficient Federated Learning

Author: Tang, Zichen, Huang, Junlin, Yan, Rudan, Wang, Yuxin, Tang, Zhenheng, Shi, Shaohuai, Zhou, Amelie Chi, and Chu, Xiaowen
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Machine Learning
Abstract: Current data compression methods, such as sparsification in Federated Averaging (FedAvg), effectively enhance the communication efficiency of Federated Learning (FL). However, these methods encounter challenges such as the straggler problem and diminished model performance due to heterogeneous bandwidth and non-IID (Independently and Identically Distributed) data. To address these issues, we introduce a bandwidth-aware compression framework for FL, aimed at improving communication efficiency while mitigating the problems associated with non-IID data. First, our strategy dynamically adjusts compression ratios according to bandwidth, enabling clients to upload their models at a close pace, thus exploiting the otherwise wasted time to transmit more data. Second, we identify the non-overlapped pattern of retained parameters after compression, which results in diminished client update signals due to uniformly averaged weights. Based on this finding, we propose a parameter mask to adjust the client-averaging coefficients at the parameter level, thereby more closely approximating the original updates, and improving the training convergence under heterogeneous environments. Our evaluations reveal that our method significantly boosts model accuracy, with a maximum improvement of 13% over the uncompressed FedAvg. Moreover, it achieves a $3.37\times$ speedup in reaching the target accuracy compared to FedAvg with a Top-K compressor, demonstrating its effectiveness in accelerating convergence with compression. The integration of common compression techniques into our framework further establishes its potential as a versatile foundation for future cross-device, communication-efficient FL research, addressing critical challenges in FL and advancing the field of distributed machine learning.
Published: 2024
Full Text: View/download PDF

21. HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection

Author: Wang, Yuxin, Feng, Duanyu, Dai, Yongfu, Chen, Zhengyu, Huang, Jimin, Ananiadou, Sophia, Xie, Qianqian, and Wang, Hao
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Cryptography and Security
Abstract: Data serves as the fundamental foundation for advancing deep learning, particularly tabular data presented in a structured format, which is highly conducive to modeling. However, even in the era of LLM, obtaining tabular data from sensitive domains remains a challenge due to privacy or copyright concerns. Hence, exploring how to effectively use models like LLMs to generate realistic and privacy-preserving synthetic tabular data is urgent. In this paper, we take a step forward to explore LLMs for tabular data synthesis and privacy protection, by introducing a new framework HARMONIC for tabular data generation and evaluation. In the tabular data generation of our framework, unlike previous small-scale LLM-based methods that rely on continued pre-training, we explore the larger-scale LLMs with fine-tuning to generate tabular data and enhance privacy. Based on idea of the k-nearest neighbors algorithm, an instruction fine-tuning dataset is constructed to inspire LLMs to discover inter-row relationships. Then, with fine-tuning, LLMs are trained to remember the format and connections of the data rather than the data itself, which reduces the risk of privacy leakage. In the evaluation part of our framework, we develop specific privacy risk metrics DLT for LLM synthetic data generation, as well as performance evaluation metrics LLE for downstream LLM tasks. Our experiments find that this tabular data generation framework achieves equivalent performance to existing methods with better privacy, which also demonstrates our evaluation framework for the effectiveness of synthetic data and privacy risks in LLM scenarios.
Published: 2024

22. Bulk high-temperature superconductivity in the high-pressure tetragonal phase of bilayer La2PrNi2O7

Author: Wang, Ningning, Wang, Gang, Shen, Xiaoling, Hou, Jun, Luo, Jun, Ma, Xiaoping, Yang, Huaixin, Shi, Lifen, Dou, Jie, Feng, Jie, Yang, Jie, Shi, Yunqing, Ren, Zhian, Ma, Hanming, Yang, Pengtao, Liu, Ziyi, Liu, Yue, Zhang, Hua, Dong, Xiaoli, Wang, Yuxin, Jiang, Kun, Hu, Jiangping, Calder, Stuart, Yan, Jiaqiang, Sun, Jianping, Wang, Bosen, Zhou, Rui, Uwatoko, Yoshiya, and Cheng, Jinguang
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Strongly Correlated Electrons
Abstract: The Ruddlesden-Popper (R-P) bilayer nickelate, La3Ni2O7, was recently found to show signatures of high-temperature superconductivity (HTSC) at pressures above 14 GPa. Subsequent investigations achieved zero resistance in single- and poly-crystalline samples under hydrostatic pressure conditions. Yet, obvious diamagnetic signals, the other hallmark of superconductors, are still lacking owing to the filamentary nature with low superconducting volume fraction. The presence of a novel "1313" polymorph and competing R-P phases obscured proper identification of the phase for HTSC. Thus, achieving bulk HTSC and identifying the phase at play are the most prominent tasks at present. Here, we address these issues in the praseodymium (Pr)-doped La2PrNi2O7 polycrystalline samples. We find that the substitutions of Pr for La effectively inhibits the intergrowth of different R-P phases, resulting in nearly pure bilayer structure. For La2PrNi2O7, pressure-induced orthorhombic-to-tetragonal structural transition takes place at Pc ~ 11 GPa, above which HTSC emerges gradually upon further compression. The superconducting transition temperatures at 18-20 GPa reach Tconset = 82.5 K and Tczero = 60 K, which are the highest values among known nickelate superconductors. More importantly, bulk HTSC was testified by detecting clear diamagnetic signals below ~75 K corresponding to an estimated superconducting volume fraction ~ 57(5)% at 20 GPa. Our results not only resolve the existing controversies but also illuminate directions for exploring bulk HTSC in the bilayer nickelates.
Published: 2024

23. Stock closing price prediction based on ICEEMDAN-FA-BiLSTM–GM combined model

Author: Xie, Lewei, Wan, Ruibo, Wang, Yuxin, and Li, Fangjian
Published: 2025
Full Text: View/download PDF

24. Self-supervised indoor scene point cloud completion from a single panorama

Author: Li, Tong, Zhang, Zhaoxuan, Wang, Yuxin, Cui, Yan, Li, Yuqi, Zhou, Dongsheng, Yin, Baocai, and Yang, Xin
Published: 2025
Full Text: View/download PDF

25. An underwater target recognition algorithm incorporating improved attention mechanism and downsampling: An underwater target recognition algorithm incorporating improved...

Author: Zhu, QiGuang, Cen, Qiang, Wang, YuXin, Chen, WeiDong, and Liu, Shuo
Published: 2025
Full Text: View/download PDF

26. Facile synthesis of porous M-doped Co3O4 as high-performance oxidase-mimicking nanozymes for colorimetric assay of L-cysteine

Author: Chen, Chao, Xia, Lei, Luo, Yuxi, Fan, Siyang, and Wang, Yuxin
Published: 2025
Full Text: View/download PDF

27. Anthropogenic land consolidation intensifies zoonotic host diversity loss and disease transmission in human habitats

Author: Pei, Shan, Yu, Pengbo, Raghwani, Jayna, Wang, Yuxin, Liu, Ziyan, Li, Yidan, Cheng, Yanchao, Lin, Qiushi, Song, Chuliang, Dharmarajan, Guha, Faust, Christina L., Tian, Yunyu, Xu, Yiting, Liang, Yilin, Qu, Jianhui, Wei, Jing, Li, Shen, Zhang, Tongjun, Ma, Chaofeng, Bharti, Nita, Cazelles, Bernard, Yang, Ruifu, Pybus, Oliver G., Dobson, Andrew P., Stenseth, Nils Chr., and Tian, Huaiyu
Published: 2025
Full Text: View/download PDF

28. An Application of Machine Learning to Etiological Diagnosis of Secondary Hypertension: Retrospective Study Using Electronic Medical Records

Author: Diao, Xiaolin, Huo, Yanni, Yan, Zhanzheng, Wang, Haibin, Yuan, Jing, Wang, Yuxin, Cai, Jun, and Zhao, Wei
Subjects: Computer applications to medicine. Medical informatics, R858-859.7
Abstract: BackgroundSecondary hypertension is a kind of hypertension with a definite etiology and may be cured. Patients with suspected secondary hypertension can benefit from timely detection and treatment and, conversely, will have a higher risk of morbidity and mortality than those with primary hypertension. ObjectiveThe aim of this study was to develop and validate machine learning (ML) prediction models of common etiologies in patients with suspected secondary hypertension. MethodsThe analyzed data set was retrospectively extracted from electronic medical records of patients discharged from Fuwai Hospital between January 1, 2016, and June 30, 2019. A total of 7532 unique patients were included and divided into 2 data sets by time: 6302 patients in 2016-2018 as the training data set for model building and 1230 patients in 2019 as the validation data set for further evaluation. Extreme Gradient Boosting (XGBoost) was adopted to develop 5 models to predict 4 etiologies of secondary hypertension and occurrence of any of them (named as composite outcome), including renovascular hypertension (RVH), primary aldosteronism (PA), thyroid dysfunction, and aortic stenosis. Both univariate logistic analysis and Gini Impurity were used for feature selection. Grid search and 10-fold cross-validation were used to select the optimal hyperparameters for each model. ResultsValidation of the composite outcome prediction model showed good performance with an area under the receiver-operating characteristic curve (AUC) of 0.924 in the validation data set, while the 4 prediction models of RVH, PA, thyroid dysfunction, and aortic stenosis achieved AUC of 0.938, 0.965, 0.959, and 0.946, respectively, in the validation data set. A total of 79 clinical indicators were identified in all and finally used in our prediction models. The result of subgroup analysis on the composite outcome prediction model demonstrated high discrimination with AUCs all higher than 0.890 among all age groups of adults. ConclusionsThe ML prediction models in this study showed good performance in detecting 4 etiologies of patients with suspected secondary hypertension; thus, they may potentially facilitate clinical diagnosis decision making of secondary hypertension in an intelligent way.
Published: 2021
Full Text: View/download PDF

29. Leveraging Text Localization for Scene Text Removal via Text-Aware Masked Image Modeling

Author: Wang, Zixiao, Xie, Hongtao, Wang, YuXin, Qu, Yadong, Guo, Fengjun, Liu, Pengwei, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

30. Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal

Author: Wang, Yuxin, Wu, Qianyi, Zhang, Guofeng, Xu, Dan, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

31. De Bruijn Polyominoes

Author: Condon, D., Wang, Yuxin, and Yang, E.
Subjects: Mathematics - Combinatorics
Abstract: We introduce the notions of de Bruijn polyominoes and prismatic polyominoes, which generalize the notions of de Bruijn sequences and arrays. Given a small fixed polyomino $p$ and a set of colors $[n]$, a de Bruijn polyomino for $(p,n)$ is a colored fixed polyomino $P$ with cells colored from $[n]$ such that every possible coloring of $p$ from $[n]$ exists as a subset of $P$. We call de Bruijn polyominoes for $(p,n)$ of minimum size $(p,n)$-prismatic. We discuss for some values of $p$ and $n$ the shape of a $(p,n)$-prismatic polyomino $P$, the construction of a coloring of $P$, and the enumeration of the colorings of $P$. We find evidence that the difficulty of these problems may depend on the parity of the size of $p$
Published: 2024

32. MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations

Author: Wang, Yuxin, Yang, Ivory, Hassanpour, Saeed, and Vosoughi, Soroush
Subjects: Computer Science - Computation and Language
Abstract: Mental manipulation, a significant form of abuse in interpersonal conversations, presents a challenge to identify due to its context-dependent and often subtle nature. The detection of manipulative language is essential for protecting potential victims, yet the field of Natural Language Processing (NLP) currently faces a scarcity of resources and research on this topic. Our study addresses this gap by introducing a new dataset, named ${\rm M{\small ental}M{\small anip}}$, which consists of $4,000$ annotated movie dialogues. This dataset enables a comprehensive analysis of mental manipulation, pinpointing both the techniques utilized for manipulation and the vulnerabilities targeted in victims. Our research further explores the effectiveness of leading-edge models in recognizing manipulative dialogue and its components through a series of experiments with various configurations. The results demonstrate that these models inadequately identify and categorize manipulative content. Attempts to improve their performance by fine-tuning with existing datasets on mental health and toxicity have not overcome these limitations. We anticipate that ${\rm M{\small ental}M{\small anip}}$ will stimulate further research, leading to progress in both understanding and mitigating the impact of mental manipulation in conversations., Comment: Accepted at ACL 2024
Published: 2024

33. Predictive Modeling of Flexible EHD Pumps using Kolmogorov-Arnold Networks

Author: Peng, Yanhong, Wang, Yuxin, Hu, Fangchao, He, Miao, Mao, Zebing, Huang, Xia, and Ding, Jun
Subjects: Computer Science - Machine Learning, Computer Science - Robotics, Computer Science - Symbolic Computation
Abstract: We present a novel approach to predicting the pressure and flow rate of flexible electrohydrodynamic pumps using the Kolmogorov-Arnold Network. Inspired by the Kolmogorov-Arnold representation theorem, KAN replaces fixed activation functions with learnable spline-based activation functions, enabling it to approximate complex nonlinear functions more effectively than traditional models like Multi-Layer Perceptron and Random Forest. We evaluated KAN on a dataset of flexible EHD pump parameters and compared its performance against RF, and MLP models. KAN achieved superior predictive accuracy, with Mean Squared Errors of 12.186 and 0.001 for pressure and flow rate predictions, respectively. The symbolic formulas extracted from KAN provided insights into the nonlinear relationships between input parameters and pump performance. These findings demonstrate that KAN offers exceptional accuracy and interpretability, making it a promising alternative for predictive modeling in electrohydrodynamic pumping.
Published: 2024
Full Text: View/download PDF

34. Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition

Author: Gao, Zuan, Wang, Yuxin, Qu, Yadong, Zhang, Boqiang, Wang, Zixiao, Xu, Jianjun, and Xie, Hongtao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In text recognition, self-supervised pre-training emerges as a good solution to reduce dependence on expansive annotated real data. Previous studies primarily focus on local visual representation by leveraging mask image modeling or sequence contrastive learning. However, they omit modeling the linguistic information in text images, which is crucial for recognizing text. To simultaneously capture local character features and linguistic information in visual space, we propose Symmetric Superimposition Modeling (SSM). The objective of SSM is to reconstruct the direction-specific pixel and feature signals from the symmetrically superimposed input. Specifically, we add the original image with its inverted views to create the symmetrically superimposed inputs. At the pixel level, we reconstruct the original and inverted images to capture character shapes and texture-level linguistic context. At the feature level, we reconstruct the feature of the same original image and inverted image with different augmentations to model the semantic-level linguistic context and the local character discrimination. In our design, we disrupt the character shape and linguistic rules. Consequently, the dual-level reconstruction facilitates understanding character shapes and linguistic information from the perspective of visual texture and feature semantics. Experiments on various text recognition benchmarks demonstrate the effectiveness and generality of SSM, with 4.1% average performance gains and 86.6% new state-of-the-art average word accuracy on Union14M benchmarks. The code is available at https://github.com/FaltingsA/SSM., Comment: Accepted to IJCAI2024
Published: 2024

35. Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing

Author: Zhang, Boqiang, Xie, Hongtao, Gao, Zuan, and Wang, Yuxin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Scene text images contain not only style information (font, background) but also content information (character, texture). Different scene text tasks need different information, but previous representation learning methods use tightly coupled features for all tasks, resulting in sub-optimal performance. We propose a Disentangled Representation Learning framework (DARLING) aimed at disentangling these two types of features for improved adaptability in better addressing various downstream tasks (choose what you really need). Specifically, we synthesize a dataset of image pairs with identical style but different content. Based on the dataset, we decouple the two types of features by the supervision design. Clearly, we directly split the visual representation into style and content features, the content features are supervised by a text recognition loss, while an alignment loss aligns the style features in the image pairs. Then, style features are employed in reconstructing the counterpart image via an image decoder with a prompt that indicates the counterpart's content. Such an operation effectively decouples the features based on their distinctive properties. To the best of our knowledge, this is the first time in the field of scene text that disentangles the inherent properties of the text images. Our method achieves state-of-the-art performance in Scene Text Recognition, Removal, and Editing., Comment: Accepted to CVPR 2024
Published: 2024

36. In-situ Doppler-free spectroscopy with pulsed optical fields

Author: Wang, Yuxin, Zheng, Zhiyue, Zhang, Qiuxin, Lai, Yonglang, Ge, Zongqi, Wang, Tianyi, Ding, Liangyu, Vasilii, Smirnov, Semerikov, Ilya, Zhang, Shuaining, Zhang, Wei, and Zhang, Xiang
Subjects: Physics - Optics
Abstract: We propose a novel pulsed optical field method that alternately switches the pump beam in conventional saturation absorption to time-division multiplex the same probe beam into both probe and reference beams, followed by digital differential processing to achieve deterministic zero-background Doppler-free spectroscopy. This method effectively mitigates Doppler broadening and common-mode optical noise by addressing disturbances such as non-uniform background absorption and environmental noise, thereby offering enhanced accuracy and robustness. Using this technique, we measured the absolute frequency of Yb$^{+}$ isotopes in the $6s^2\ ^{1}S_0\to 6s6p ^{1}P_1$ transition. By employing an error signal derived from the first-derivative demodulated spectrum of $^{174}\mathrm{Yb}^{+}$, we achieved efficient stabilization of a 369.5 nm ultraviolet diode laser, demonstrating a frequency stability of $3 \times 10^{-11}$ over a 1500-second averaging period and a locking point uncertainty of 850 kHz sustained over 10 days. Furthermore, we report the first in-situ observation of Doppler-free Zeeman sub-level spectra, highlighting the precision of this method and its potential application in measuring magnetic field gradients., Comment: 6 pages, 6 figures
Published: 2024

37. GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal

Author: Wang, Yuxin, Wu, Qianyi, Zhang, Guofeng, and Xu, Dan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper tackles the intricate challenge of object removal to update the radiance field using the 3D Gaussian Splatting. The main challenges of this task lie in the preservation of geometric consistency and the maintenance of texture coherence in the presence of the substantial discrete nature of Gaussian primitives. We introduce a robust framework specifically designed to overcome these obstacles. The key insight of our approach is the enhancement of information exchange among visible and invisible areas, facilitating content restoration in terms of both geometry and texture. Our methodology begins with optimizing the positioning of Gaussian primitives to improve geometric consistency across both removed and visible areas, guided by an online registration process informed by monocular depth estimation. Following this, we employ a novel feature propagation mechanism to bolster texture coherence, leveraging a cross-attention design that bridges sampling Gaussians from both uncertain and certain areas. This innovative approach significantly refines the texture coherence within the final radiance field. Extensive experiments validate that our method not only elevates the quality of novel view synthesis for scenes undergoing object removal but also showcases notable efficiency gains in training and rendering speeds., Comment: Project Page: https://w-ted.github.io/publications/gscream
Published: 2024

38. Orthogonality of skew type and characterization of inner product spaces

Author: Xia, Jinyu, Liu, Qi, Wang, Yuxin, Xu, Wenhui, Hu, Yongmo, and Li, Yongjin
Published: 2024
Full Text: View/download PDF

39. Polypyrrole-coated triple-layer yolk-shell Fe2O3 anode materials with their superior overall performance in lithium-ion batteries

Author: He, Zhen, Liu, Jiaming, Wei, Yuqian, Song, Yunfei, Yang, Wuxin, Yang, Aobo, Wang, Yuxin, and Li, Bo
Published: 2024
Full Text: View/download PDF

40. Structure and corrosion behavior of FeCoCrNiMo high-entropy alloy coatings prepared by mechanical alloying and plasma spraying

Author: Tian, Yun, Liu, Jianing, Xue, Mingming, Zhang, Dongyao, Wang, Yuxin, Geng, Keping, Dong, Yanchun, and Yang, Yong
Published: 2024
Full Text: View/download PDF

41. Nuclear-localized pyruvate kinases control phosphorylation of histone H3 on threonine 11

Author: Hu, Pengcheng, Xu, Yanmei, Su, Yanhua, Wang, Yuxin, Xiong, Yan, and Ding, Yong
Published: 2024
Full Text: View/download PDF

42. The indoor-outdoor relationship of Polycyclic Aromatic Hydrocarbons (PAHs) from PM2.5 in Beijing during APEC

Author: Wang, Yuxin and Xie, Yangyang
Published: 2024
Full Text: View/download PDF

43. Bulk high-temperature superconductivity in pressurized tetragonal La2PrNi2O7

Author: Wang, Ningning, Wang, Gang, Shen, Xiaoling, Hou, Jun, Luo, Jun, Ma, Xiaoping, Yang, Huaixin, Shi, Lifen, Dou, Jie, Feng, Jie, Yang, Jie, Shi, Yunqing, Ren, Zhian, Ma, Hanming, Yang, Pengtao, Liu, Ziyi, Liu, Yue, Zhang, Hua, Dong, Xiaoli, Wang, Yuxin, Jiang, Kun, Hu, Jiangping, Nagasaki, Shoko, Kitagawa, Kentaro, Calder, Stuart, Yan, Jiaqiang, Sun, Jianping, Wang, Bosen, Zhou, Rui, Uwatoko, Yoshiya, and Cheng, Jinguang
Published: 2024
Full Text: View/download PDF

44. A longitudinal Study on the Effect of Media Exposure on Travel Intention during Public Health Crises

Author: Zhou, Bin, Wang, Yuxin, Li, Ping, Xiong, Qihao, and Ryan, Chris
Published: 2024
Full Text: View/download PDF

45. YOLOv9-YX: lightweight algorithm for underwater target detection

Author: Cen, Qiang, Zhu, QiGuang, Wang, YuXin, Chen, WeiDong, and Liu, Shuo
Published: 2024
Full Text: View/download PDF

46. Polyoxometalate-based flexible conductive materials with superionic conductivity

Author: Wang, Yuxin, Xue, Shuping, Geng, Jun, Lu, Ying, Li, Teng, Duan, Xiaozheng, Bai, Xue, Yang, Yanli, Yang, Jingqi, and Liu, Shuxia
Published: 2024
Full Text: View/download PDF

47. 4-Octyl itaconate promotes alveolar ridge preservation following tooth extraction

Author: Du, Yanmei, He, Yanyan, Xie, Jianli, Wang, Yuxin, Sun, Xiaodong, and Yu, Xijiao
Published: 2024
Full Text: View/download PDF

48. Microstructure Evolution and Strengthening Mechanisms of Cast Al–Cu–Mn Alloy via Ti–Zr-Synergistic Micro-alloying

Author: Li, Jihao, Xiang, Zhilei, Shen, Gaoliang, Huang, Jingcun, Zhou, Zongyi, Sun, Wencao, Han, Yang, Wang, Yuxin, Wang, Andong, Chen, Yilan, and Chen, Ziyong
Published: 2024
Full Text: View/download PDF

49. A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges

Author: Xu, Xinrun, Wang, Yuxin, Xu, Chaoyi, Ding, Ziluo, Jiang, Jiechuan, Ding, Zhiming, and Karlsson, Börje F.
Subjects: Computer Science - Artificial Intelligence
Abstract: The swift evolution of Large-scale Models (LMs), either language-focused or multi-modal, has garnered extensive attention in both academy and industry. But despite the surge in interest in this rapidly evolving area, there are scarce systematic reviews on their capabilities and potential in distinct impactful scenarios. This paper endeavours to help bridge this gap, offering a thorough examination of the current landscape of LM usage in regards to complex game playing scenarios and the challenges still open. Here, we seek to systematically review the existing architectures of LM-based Agents (LMAs) for games and summarize their commonalities, challenges, and any other insights. Furthermore, we present our perspective on promising future research avenues for the advancement of LMs in games. We hope to assist researchers in gaining a clear understanding of the field and to generate more interest in this highly impactful research direction. A corresponding resource, continuously updated, can be found in our GitHub repository., Comment: 13 pages, 3 figures
Published: 2024

50. Mixer is more than just a model

Author: Ji, Qingfeng, Wang, Yuxin, and Sun, Letong
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Recently, MLP structures have regained popularity, with MLP-Mixer standing out as a prominent example. In the field of computer vision, MLP-Mixer is noted for its ability to extract data information from both channel and token perspectives, effectively acting as a fusion of channel and token information. Indeed, Mixer represents a paradigm for information extraction that amalgamates channel and token information. The essence of Mixer lies in its ability to blend information from diverse perspectives, epitomizing the true concept of "mixing" in the realm of neural network architectures. Beyond channel and token considerations, it is possible to create more tailored mixers from various perspectives to better suit specific task requirements. This study focuses on the domain of audio recognition, introducing a novel model named Audio Spectrogram Mixer with Roll-Time and Hermit FFT (ASM-RH) that incorporates insights from both time and frequency domains. Experimental results demonstrate that ASM-RH is particularly well-suited for audio data and yields promising outcomes across multiple classification tasks. The models and optimal weights files will be published.
Published: 2024

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

6,102 results on '"Wang, Yuxin"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources