Author: "Yang, Yujie" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yang, Yujie"' showing total 1,846 results

Start Over Author "Yang, Yujie"

1,846 results on '"Yang, Yujie"'

1. Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning

Author: Zhou, Hang, Tang, Yehui, Qin, Haochen, Yang, Yujie, Jin, Renren, Xiong, Deyi, Han, Kai, and Wang, Yunhe
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The efficacy of large language models (LLMs) on downstream tasks usually hinges on instruction tuning, which relies critically on the quality of training data. Unfortunately, collecting high-quality and diverse data is both expensive and time-consuming. To mitigate this issue, we propose a novel Star-Agents framework, which automates the enhancement of data quality across datasets through multi-agent collaboration and assessment. The framework adopts a three-pronged strategy. It initially generates diverse instruction data with multiple LLM agents through a bespoke sampling method. Subsequently, the generated data undergo a rigorous evaluation using a dual-model method that assesses both difficulty and quality. Finaly, the above process evolves in a dynamic refinement phase, where more effective LLMs are prioritized, enhancing the overall data quality. Our empirical studies, including instruction tuning experiments with models such as Pythia and LLaMA, demonstrate the effectiveness of the proposed framework. Optimized datasets have achieved substantial improvements, with an average increase of 12% and notable gains in specific metrics, such as a 40% improvement in Fermi, as evidenced by benchmarks like MT-bench, Vicuna bench, and WizardLM testset.
Published: 2024

2. Verification of Neural Control Barrier Functions with Symbolic Derivative Bounds Propagation

Author: Hu, Hanjiang, Yang, Yujie, Wei, Tianhao, and Liu, Changliu
Subjects: Computer Science - Robotics, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control, Mathematics - Optimization and Control
Abstract: Control barrier functions (CBFs) are important in safety-critical systems and robot control applications. Neural networks have been used to parameterize and synthesize CBFs with bounded control input for complex systems. However, it is still challenging to verify pre-trained neural networks CBFs (neural CBFs) in an efficient symbolic manner. To this end, we propose a new efficient verification framework for ReLU-based neural CBFs through symbolic derivative bound propagation by combining the linearly bounded nonlinear dynamic system and the gradient bounds of neural CBFs. Specifically, with Heaviside step function form for derivatives of activation functions, we show that the symbolic bounds can be propagated through the inner product of neural CBF Jacobian and nonlinear system dynamics. Through extensive experiments on different robot dynamics, our results outperform the interval arithmetic based baselines in verified rate and verification time along the CBF boundary, validating the effectiveness and efficiency of the proposed method with different model complexity. The code can be found at https://github.com/intelligent-control-lab/ verify-neural-CBF., Comment: Accepted to CoRL 2024, 18 pages, 6 figures, 4 tables
Published: 2024

3. Scalable Synthesis of Formally Verified Neural Value Function for Hamilton-Jacobi Reachability Analysis

Author: Yang, Yujie, Hu, Hanjiang, Wei, Tianhao, Li, Shengbo Eben, and Liu, Changliu
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Hamilton-Jacobi (HJ) reachability analysis provides a formal method for guaranteeing safety in constrained control problems. It synthesizes a value function to represent a long-term safe set called feasible region. Early synthesis methods based on state space discretization cannot scale to high-dimensional problems, while recent methods that use neural networks to approximate value functions result in unverifiable feasible regions. To achieve both scalability and verifiability, we propose a framework for synthesizing verified neural value functions for HJ reachability analysis. Our framework consists of three stages: pre-training, adversarial training, and verification-guided training. We design three techniques to address three challenges to improve scalability respectively: boundary-guided backtracking (BGB) to improve counterexample search efficiency, entering state regularization (ESR) to enlarge feasible region, and activation pattern alignment (APA) to accelerate neural network verification. We also provide a neural safety certificate synthesis and verification benchmark called Cersyve-9, which includes nine commonly used safe control tasks and supplements existing neural network verification benchmarks. Our framework successfully synthesizes verified neural value functions on all tasks, and our proposed three techniques exhibit superior scalability and efficiency compared with existing methods.
Published: 2024

4. Rocket Landing Control with Random Annealing Jump Start Reinforcement Learning

Author: Jiang, Yuxuan, Yang, Yujie, Lan, Zhiqian, Zhan, Guojian, Li, Shengbo Eben, Sun, Qi, Ma, Jian, Yu, Tianwen, and Zhang, Changwu
Subjects: Computer Science - Machine Learning
Abstract: Rocket recycling is a crucial pursuit in aerospace technology, aimed at reducing costs and environmental impact in space exploration. The primary focus centers on rocket landing control, involving the guidance of a nonlinear underactuated rocket with limited fuel in real-time. This challenging task prompts the application of reinforcement learning (RL), yet goal-oriented nature of the problem poses difficulties for standard RL algorithms due to the absence of intermediate reward signals. This paper, for the first time, significantly elevates the success rate of rocket landing control from 8% with a baseline controller to 97% on a high-fidelity rocket model using RL. Our approach, called Random Annealing Jump Start (RAJS), is tailored for real-world goal-oriented problems by leveraging prior feedback controllers as guide policy to facilitate environmental exploration and policy learning in RL. In each episode, the guide policy navigates the environment for the guide horizon, followed by the exploration policy taking charge to complete remaining steps. This jump-start strategy prunes exploration space, rendering the problem more tractable to RL algorithms. The guide horizon is sampled from a uniform distribution, with its upper bound annealing to zero based on performance metrics, mitigating distribution shift and mismatch issues in existing methods. Additional enhancements, including cascading jump start, refined reward and terminal condition, and action smoothness regulation, further improve policy performance and practical applicability. The proposed method is validated through extensive evaluation and Hardware-in-the-Loop testing, affirming the effectiveness, real-time feasibility, and smoothness of the proposed controller., Comment: IROS 2024 Oral
Published: 2024

5. RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Author: Yang, Bing, Quan, Changsheng, Wang, Yabo, Wang, Pengyu, Yang, Yujie, Fang, Ying, Shao, Nian, Bu, Hui, Xu, Xin, and Li, Xiaofei
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: The training of deep learning-based multichannel speech enhancement and source localization systems relies heavily on the simulation of room impulse response and multichannel diffuse noise, due to the lack of large-scale real-recorded datasets. However, the acoustic mismatch between simulated and real-world data could degrade the model performance when applying in real-world scenarios. To bridge this simulation-to-real gap, this paper presents a new relatively large-scale Real-recorded and annotated Microphone Array speech&Noise (RealMAN) dataset. The proposed dataset is valuable in two aspects: 1) benchmarking speech enhancement and localization algorithms in real scenarios; 2) offering a substantial amount of real-world training data for potentially improving the performance of real-world applications. Specifically, a 32-channel array with high-fidelity microphones is used for recording. A loudspeaker is used for playing source speech signals (about 35 hours of Mandarin speech). A total of 83.7 hours of speech signals (about 48.3 hours for static speaker and 35.4 hours for moving speaker) are recorded in 32 different scenes, and 144.5 hours of background noise are recorded in 31 different scenes. Both speech and noise recording scenes cover various common indoor, outdoor, semi-outdoor and transportation environments, which enables the training of general-purpose speech enhancement and source localization networks. To obtain the task-specific annotations, speaker location is annotated with an omni-directional fisheye camera by automatically detecting the loudspeaker. The direct-path signal is set as the target clean speech for speech enhancement, which is obtained by filtering the source speech signal with an estimated direct-path propagation filter., Comment: accepted by NIPS 2024
Published: 2024

6. Structure and expression analysis of TaGW5 in common wheat

Author: Ding, Puyang, Ma, Jian, Yang, Yujie, Luo, Wei, Zou, Yaya, Li, Ting, Mu, Yang, Tang, Huaping, and Lan, Xiujin
Published: 2019
Full Text: View/download PDF

7. EPIDetect: Video-based convulsive seizure detection in chronic epilepsy mouse model for anti-epilepsy drug screening

Author: Ren, Junming, Xiao, Zhoujian, Zhang, Yujia, Yang, Yujie, He, Ling, Yoon, Ezra, Bello, Stephen Temitayo, Chen, Xi, Wu, Dapeng, Tortorella, Micky, and He, Jufang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In the preclinical translational studies, drug candidates with remarkable anti-epileptic efficacy demonstrate long-term suppression of spontaneous recurrent seizures (SRSs), particularly convulsive seizures (CSs), in mouse models of chronic epilepsy. However, the current methods for monitoring CSs have limitations in terms of invasiveness, specific laboratory settings, high cost, and complex operation, which hinder drug screening efforts. In this study, a camera-based system for automated detection of CSs in chronically epileptic mice is first established to screen potential anti-epilepsy drugs.
Published: 2024

8. Controllability Test for Nonlinear Datatic Systems

Author: Yang, Yujie, Tao, Letian, Wang, Likun, and Li, Shengbo Eben
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Controllability is a fundamental property of control systems, serving as the prerequisite for controller design. While controllability test is well established in modelic (i.e., model-driven) control systems, extending it to datatic (i.e., data-driven) control systems is still a challenging task due to the absence of system models. In this study, we propose a general controllability test method for nonlinear systems with datatic description, where the system behaviors are merely described by data. In this situation, the state transition information of a dynamic system is available only at a limited number of data points, leaving the behaviors beyond these points unknown. Different from traditional exact controllability, we introduce a new concept called $\epsilon$-controllability, which extends the definition from point-to-point form to point-to-region form. Accordingly, our focus shifts to checking whether the system state can be steered to a closed state ball centered on the target state, rather than exactly at that target state. On its basis, we propose a tree search algorithm called maximum expansion of controllable subset (MECS) to identify controllable states in the dataset. Starting with a specific target state, our algorithm can iteratively propagate controllability from a known state ball to a new one. This iterative process gradually enlarges the $\epsilon$-controllable subset by incorporating new controllable balls until all $\epsilon$-controllable states are searched. Besides, a simplified version of MECS is proposed by solving a special shortest path problem, called Floyd expansion with radius fixed (FERF). FERF maintains a fixed radius of all controllable balls based on a mutual controllability assumption of neighboring states. The effectiveness of our method is validated in three datatic control systems whose dynamic behaviors are described by sampled data.
Published: 2024

9. The Feasibility of Constrained Reinforcement Learning Algorithms: A Tutorial Study

Author: Yang, Yujie, Zheng, Zhilong, Li, Shengbo Eben, Tomizuka, Masayoshi, and Liu, Changliu
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Satisfying safety constraints is a priority concern when solving optimal control problems (OCPs). Due to the existence of infeasibility phenomenon, where a constraint-satisfying solution cannot be found, it is necessary to identify a feasible region before implementing a policy. Existing feasibility theories built for model predictive control (MPC) only consider the feasibility of optimal policy. However, reinforcement learning (RL), as another important control method, solves the optimal policy in an iterative manner, which comes with a series of non-optimal intermediate policies. Feasibility analysis of these non-optimal policies is also necessary for iteratively improving constraint satisfaction; but that is not available under existing MPC feasibility theories. This paper proposes a feasibility theory that applies to both MPC and RL by filling in the missing part of feasibility analysis for an arbitrary policy. The basis of our theory is to decouple policy solving and implementation into two temporal domains: virtual-time domain and real-time domain. This allows us to separately define initial and endless, state and policy feasibility, and their corresponding feasible regions. Based on these definitions, we analyze the containment relationships between different feasible regions, which enables us to describe the feasible region of an arbitrary policy. We further provide virtual-time constraint design rules along with a practical design tool called feasibility function that helps to achieve the maximum feasible region. We review most of existing constraint formulations and point out that they are essentially applications of feasibility functions in different forms. We demonstrate our feasibility theory by visualizing different feasible regions under both MPC and RL policies in an emergency braking control task.
Published: 2024

10. Policy Bifurcation in Safe Reinforcement Learning

Author: Zou, Wenjun, Lyu, Yao, Li, Jie, Yang, Yujie, Li, Shengbo Eben, Duan, Jingliang, Zhan, Xianyuan, Liu, Jingjing, Zhang, Yaqin, and Li, Keqiang
Subjects: Computer Science - Machine Learning
Abstract: Safe reinforcement learning (RL) offers advanced solutions to constrained optimal control problems. Existing studies in safe RL implicitly assume continuity in policy functions, where policies map states to actions in a smooth, uninterrupted manner; however, our research finds that in some scenarios, the feasible policy should be discontinuous or multi-valued, interpolating between discontinuous local optima can inevitably lead to constraint violations. We are the first to identify the generating mechanism of such a phenomenon, and employ topological analysis to rigorously prove the existence of policy bifurcation in safe RL, which corresponds to the contractibility of the reachable tuple. Our theorem reveals that in scenarios where the obstacle-free state space is non-simply connected, a feasible policy is required to be bifurcated, meaning its output action needs to change abruptly in response to the varying state. To train such a bifurcated policy, we propose a safe RL algorithm called multimodal policy optimization (MUPO), which utilizes a Gaussian mixture distribution as the policy output. The bifurcated behavior can be achieved by selecting the Gaussian component with the highest mixing coefficient. Besides, MUPO also integrates spectral normalization and forward KL divergence to enhance the policy's capability of exploring different modes. Experiments with vehicle control tasks show that our algorithm successfully learns the bifurcated policy and ensures satisfying safety, while a continuous policy suffers from inevitable constraint violations.
Published: 2024

11. A robust audio deepfake detection system via multi-view feature

Author: Yang, Yujie, Qin, Haochen, Zhou, Hang, Wang, Chengcheng, Guo, Tianyu, Han, Kai, and Wang, Yunhe
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: With the advancement of generative modeling techniques, synthetic human speech becomes increasingly indistinguishable from real, and tricky challenges are elicited for the audio deepfake detection (ADD) system. In this paper, we exploit audio features to improve the generalizability of ADD systems. Investigation of the ADD task performance is conducted over a broad range of audio features, including various handcrafted features and learning-based features. Experiments show that learning-based audio features pretrained on a large amount of data generalize better than hand-crafted features on out-of-domain scenarios. Subsequently, we further improve the generalizability of the ADD system using proposed multi-feature approaches to incorporate complimentary information from features of different views. The model trained on ASV2019 data achieves an equal error rate of 24.27\% on the In-the-Wild dataset., Comment: 5 pages, 2 figures
Published: 2024

12. CBDC, cash, and financial intermediary in HANK

Author: Yang, Yujie, Zhang, Chenxing, and Hou, Wenwen
Published: 2024
Full Text: View/download PDF

13. Accurate Prediction of Coronary Heart Disease for Patients With Hypertension From Electronic Health Records With Big Data and Machine-Learning Methods: Model Development and Performance Evaluation

Author: Du, Zhenzhen, Yang, Yujie, Zheng, Jing, Li, Qi, Lin, Denan, Li, Ye, Fan, Jianping, Cheng, Wen, Chen, Xie-Hui, and Cai, Yunpeng
Subjects: Computer applications to medicine. Medical informatics, R858-859.7
Abstract: BackgroundPredictions of cardiovascular disease risks based on health records have long attracted broad research interests. Despite extensive efforts, the prediction accuracy has remained unsatisfactory. This raises the question as to whether the data insufficiency, statistical and machine-learning methods, or intrinsic noise have hindered the performance of previous approaches, and how these issues can be alleviated. ObjectiveBased on a large population of patients with hypertension in Shenzhen, China, we aimed to establish a high-precision coronary heart disease (CHD) prediction model through big data and machine-learning MethodsData from a large cohort of 42,676 patients with hypertension, including 20,156 patients with CHD onset, were investigated from electronic health records (EHRs) 1-3 years prior to CHD onset (for CHD-positive cases) or during a disease-free follow-up period of more than 3 years (for CHD-negative cases). The population was divided evenly into independent training and test datasets. Various machine-learning methods were adopted on the training set to achieve high-accuracy prediction models and the results were compared with traditional statistical methods and well-known risk scales. Comparison analyses were performed to investigate the effects of training sample size, factor sets, and modeling approaches on the prediction performance. ResultsAn ensemble method, XGBoost, achieved high accuracy in predicting 3-year CHD onset for the independent test dataset with an area under the receiver operating characteristic curve (AUC) value of 0.943. Comparison analysis showed that nonlinear models (K-nearest neighbor AUC 0.908, random forest AUC 0.938) outperform linear models (logistic regression AUC 0.865) on the same datasets, and machine-learning methods significantly surpassed traditional risk scales or fixed models (eg, Framingham cardiovascular disease risk models). Further analyses revealed that using time-dependent features obtained from multiple records, including both statistical variables and changing-trend variables, helped to improve the performance compared to using only static features. Subpopulation analysis showed that the impact of feature design had a more significant effect on model accuracy than the population size. Marginal effect analysis showed that both traditional and EHR factors exhibited highly nonlinear characteristics with respect to the risk scores. ConclusionsWe demonstrated that accurate risk prediction of CHD from EHRs is possible given a sufficiently large population of training data. Sophisticated machine-learning methods played an important role in tackling the heterogeneity and nonlinear nature of disease prediction. Moreover, accumulated EHR data over multiple time points provided additional features that were valuable for risk prediction. Our study highlights the importance of accumulating big data from EHRs for accurate disease predictions.
Published: 2020
Full Text: View/download PDF

14. DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models

Author: He, Wei, Han, Kai, Tang, Yehui, Wang, Chengcheng, Yang, Yujie, Guo, Tianyu, and Wang, Yunhe
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large language models (LLMs) face a daunting challenge due to the excessive computational and memory requirements of the commonly used Transformer architecture. While state space model (SSM) is a new type of foundational network architecture offering lower computational complexity, their performance has yet to fully rival that of Transformers. This paper introduces DenseSSM, a novel approach to enhance the flow of hidden information between layers in SSMs. By selectively integrating shallowlayer hidden states into deeper layers, DenseSSM retains fine-grained information crucial for the final output. Dense connections enhanced DenseSSM still maintains the training parallelizability and inference efficiency. The proposed method can be widely applicable to various SSM types like RetNet and Mamba. With similar model size, DenseSSM achieves significant improvements, exemplified by DenseRetNet outperforming the original RetNet with up to 5% accuracy improvement on public benchmarks. code is avalaible at https://github.com/WailordHe/DenseSSM
Published: 2024

15. SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution

Author: Wang, Chengcheng, Hao, Zhiwei, Tang, Yehui, Guo, Jianyuan, Yang, Yujie, Han, Kai, and Wang, Yunhe
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Diffusion-based super-resolution (SR) models have recently garnered significant attention due to their potent restoration capabilities. But conventional diffusion models perform noise sampling from a single distribution, constraining their ability to handle real-world scenes and complex textures across semantic regions. With the success of segment anything model (SAM), generating sufficiently fine-grained region masks can enhance the detail recovery of diffusion-based SR model. However, directly integrating SAM into SR models will result in much higher computational cost. In this paper, we propose the SAM-DiffSR model, which can utilize the fine-grained structure information from SAM in the process of sampling noise to improve the image quality without additional computational cost during inference. In the process of training, we encode structural position information into the segmentation mask from SAM. Then the encoded mask is integrated into the forward diffusion process by modulating it to the sampled noise. This adjustment allows us to independently adapt the noise mean within each corresponding segmentation area. The diffusion model is trained to estimate this modulated noise. Crucially, our proposed framework does NOT change the reverse diffusion process and does NOT require SAM at inference. Experimental results demonstrate the effectiveness of our proposed method, showcasing superior performance in suppressing artifacts, and surpassing existing diffusion-based methods by 0.74 dB at the maximum in terms of PSNR on DIV2K dataset. The code and dataset are available at https://github.com/lose4578/SAM-DiffSR.
Published: 2024

16. On the Stability of Datatic Control Systems

Author: Yang, Yujie, Zheng, Zhilong, and Li, Shengbo Eben
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: The development of feedback controllers is undergoing a paradigm shift from $\textit{modelic}$ (model-driven) control to $\textit{datatic}$ (data-driven) control. Stability, as a fundamental property in control, is less well studied in datatic control paradigm. The difficulty is that traditional stability criteria rely on explicit system models, which are not available in those systems with datatic description. Some pioneering works explore stability criteria for datatic systems with special forms such as linear systems, homogeneous systems, and polynomial systems. However, these systems imply too strong assumptions on the inherent connection among data points, which do not hold in general nonlinear systems. This paper proposes a stability verification algorithm for general datatic control systems called $\eta$-testing. Our stability criterion only relies on a weak assumption of Lipschitz continuity so as to extend information from known data points to unmeasured regions. This information restricts the time derivative of any unknown state to the intersection of a set of closed balls. Inside the intersection, the worst-case time derivative of Lyapunov function is estimated by solving a quadratically constrained linear program (QCLP). By comparing the optimal values of QCLPs to zero in the whole state space, a sufficient condition of system stability can be checked. We test our algorithm on three datatic control systems, including both linear and nonlinear ones. Results show that our algorithm successfully verifies the stability, instability, and critical stability of tested systems.
Published: 2024

17. Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

Author: Zheng, Yinan, Li, Jianxiong, Yu, Dongjie, Yang, Yujie, Li, Shengbo Eben, Zhan, Xianyuan, and Liu, Jingjing
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Robotics
Abstract: Safe offline RL is a promising way to bypass risky online interactions towards safe policy learning. Most existing methods only enforce soft constraints, i.e., constraining safety violations in expectation below thresholds predetermined. This can lead to potentially unsafe outcomes, thus unacceptable in safety-critical scenarios. An alternative is to enforce the hard constraint of zero violation. However, this can be challenging in offline setting, as it needs to strike the right balance among three highly intricate and correlated aspects: safety constraint satisfaction, reward maximization, and behavior regularization imposed by offline datasets. Interestingly, we discover that via reachability analysis of safe-control theory, the hard safety constraint can be equivalently translated to identifying the largest feasible region given the offline dataset. This seamlessly converts the original trilogy problem to a feasibility-dependent objective, i.e., maximizing reward value within the feasible region while minimizing safety risks in the infeasible region. Inspired by these, we propose FISOR (FeasIbility-guided Safe Offline RL), which allows safety constraint adherence, reward maximization, and offline policy learning to be realized via three decoupled processes, while offering strong safety performance and stability. In FISOR, the optimal policy for the translated optimization problem can be derived in a special form of weighted behavior cloning. Thus, we propose a novel energy-guided diffusion model that does not require training a complicated time-dependent classifier to extract the policy, greatly simplifying the training. We compare FISOR against baselines on DSRL benchmark for safe offline RL. Evaluation results show that FISOR is the only method that can guarantee safety satisfaction in all tasks, while achieving top returns in most tasks., Comment: ICLR 2024, 30pages, 11 figures
Published: 2024

18. Congenital aniridia: bilateral phacoemulsification and IOL implantation

Author: Yang, Lidan, Yang, Yujie, and Yang, YongLi
Published: 2024
Full Text: View/download PDF

19. Suppressed ion migration for high-performance X-ray detectors based on atmosphere-controlled EFG-grown perovskite CsPbBr3 single crystals

Author: Hua, Yunqiu, Zhang, Guodong, Sun, Xue, Zhang, Peng, Hao, Yingying, Xu, Yadong, Yang, Yujie, Lin, Qianqian, Li, Xiang, Zhai, Zhongjun, Cui, Fucai, Liu, Hongjie, Liu, Jiaxin, and Tao, Xutang
Published: 2024
Full Text: View/download PDF

20. A robust multi-view knowledge transfer-based rough fuzzy C-means clustering algorithm

Author: Zhao, Feng, Yang, Yujie, Liu, Hanqiang, and Wang, Chaofei
Published: 2024
Full Text: View/download PDF

21. Functional connectivity changes of the hippocampal subregions in anti-N-methyl-D-aspartate receptor encephalitis

Author: Yang, Yujie, Fu, Shishun, Jiang, Guihua, Xu, Guang, Tian, Junzhang, and Ma, Xiaofen
Published: 2024
Full Text: View/download PDF

22. Safe Reinforcement Learning with Dual Robustness

Author: Li, Zeyang, Hu, Chuxiong, Wang, Yunan, Yang, Yujie, and Li, Shengbo Eben
Subjects: Computer Science - Machine Learning
Abstract: Reinforcement learning (RL) agents are vulnerable to adversarial disturbances, which can deteriorate task performance or compromise safety specifications. Existing methods either address safety requirements under the assumption of no adversary (e.g., safe RL) or only focus on robustness against performance adversaries (e.g., robust RL). Learning one policy that is both safe and robust remains a challenging open problem. The difficulty is how to tackle two intertwined aspects in the worst cases: feasibility and optimality. Optimality is only valid inside a feasible region, while identification of maximal feasible region must rely on learning the optimal policy. To address this issue, we propose a systematic framework to unify safe RL and robust RL, including problem formulation, iteration scheme, convergence analysis and practical algorithm design. This unification is built upon constrained two-player zero-sum Markov games. A dual policy iteration scheme is proposed, which simultaneously optimizes a task policy and a safety policy. The convergence of this iteration scheme is proved. Furthermore, we design a deep RL algorithm for practical implementation, called dually robust actor-critic (DRAC). The evaluations with safety-critical benchmarks demonstrate that DRAC achieves high performance and persistent safety under all scenarios (no adversary, safety adversary, performance adversary), outperforming all baselines significantly.
Published: 2023

23. Effect of Gd3+ doping on microstructure and magnetic properties of Ni–Zn–Co ferrite

Author: Zhang, Zhengyu, Yang, Yujie, Li, Hao, Geng, Zhihao, and Ding, Hongyu
Published: 2024
Full Text: View/download PDF

24. Numerical analysis of soil salt leaching influenced by subsurface seepage pipe with negative pressure

Author: LU Peirong, YANG Yujie, XIA Congxuan, LIU Yaxin, and WANG Ce
Subjects: soil water-salt transport, numerical simulation, negative-pressure seepage pipe, surface infiltration, salt leaching, Agriculture (General), S1-972, Irrigation engineering. Reclamation of wasteland. Drainage, TC801-978
Abstract: 【Objective】 Subsurface seepage pipe assisted with vacuum negative pressure is an emerging technology aimed at regulating soils water dynamics. In this paper, we examine the effects of negative-pressure seepage pipe (NPSP) on soil salt leaching. 【Method】 We used the HYDRUS model to simulate salt leaching in soils with five soil textures: silty clay, clay, clay loam, sandy clay loam, and sandy loam. Five infiltration water heads (2, 4, 6, 8 and 10 cm) and five negative pressure heads (200, 500, 1 000, 1 500 and 2 000 cm) in the subsurface pipes were considered in the simulations. Based on the simulated results, we calculated the changes in soil salt content before and after a leaching event, as well as the salt leaching efficiency (salt leached per unit of irrigation water). 【Result】 ① Water discharged by the NPSP was primarily originated from the wetting zone in soil profile formed during the infiltration process, and the maximum pipe discharge was obtained as the wetting zone was saturated. ② Soil texture was a key factor in influencing water and salt discharge, with both the desalination rate and salt leaching efficiency increasing as sand content increased. ③ For all the tested soil textures, the desalination rate was positively correlated with both the surface infiltration head and the inner-pipe negative pressure. However, for sandy clay loam and sandy loam soils, increasing the infiltration head lead to a lower salt leaching efficiency. ④ As compared with no-pipe control, the maximum increase of the desalination rate under the application of NPSP was 7.91%, while increased desalination rate were obtained in the soil profile above the NPSP, and the values decreasing with the lateral distance away from the NPSP; There was a salt accumulation zone directly below the NPSP, and its region expanded with the decrease of the applied negative pressure. 【Conclusion】 NPSP is feasible to facilitate soil desalination without increase the water consumption for leaching practice, and the desalination performance of NPSP is likely be improved in field with sandy soil.
Published: 2024
Full Text: View/download PDF

25. S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields

Author: Xie, Zeke, Yang, Xindi, Yang, Yujie, Sun, Qi, Jiang, Yixiang, Wang, Haoran, Cai, Yunfeng, and Sun, Mingming
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Recently, Neural Radiance Field (NeRF) has shown great success in rendering novel-view images of a given scene by learning an implicit representation with only posed RGB images. NeRF and relevant neural field methods (e.g., neural surface representation) typically optimize a point-wise loss and make point-wise predictions, where one data point corresponds to one pixel. Unfortunately, this line of research failed to use the collective supervision of distant pixels, although it is known that pixels in an image or scene can provide rich structural information. To the best of our knowledge, we are the first to design a nonlocal multiplex training paradigm for NeRF and relevant neural field methods via a novel Stochastic Structural SIMilarity (S3IM) loss that processes multiple data points as a whole set instead of process multiple inputs independently. Our extensive experiments demonstrate the unreasonable effectiveness of S3IM in improving NeRF and neural surface representation for nearly free. The improvements of quality metrics can be particularly significant for those relatively difficult tasks: e.g., the test MSE loss unexpectedly drops by more than 90% for TensoRF and DVGO over eight novel view synthesis tasks; a 198% F-score gain and a 64% Chamfer $L_{1}$ distance reduction for NeuS over eight surface reconstruction tasks. Moreover, S3IM is consistently robust even with sparse inputs, corrupted images, and dynamic scenes., Comment: ICCV 2023 main conference. Code: https://github.com/Madaoer/S3IM. 14 pages, 5 figures, 17 tables
Published: 2023

26. Care to dare: cross-lagged effects of mentor secure-base support on newcomers' workplace courage

Author: Dong, Yuge, Yang, Yujie, Zheng, Lu, and Long, Lirong
Published: 2024
Full Text: View/download PDF

27. A humanized neutralizing antibody protects against human adenovirus type 7 infection in humanized desmoglein-2 and CD46 double-receptor transgenic mice

Author: Zhou, Chengxing, Liao, Xiaohong, Zhou, Zhichao, Mo, Chuncong, Yang, Yujie, Liao, Hui, Liu, Minglei, Zhang, Qiong, Li, Qiuru, Tian, Xingui, Zhou, Rong, and Cao, Hong
Published: 2024
Full Text: View/download PDF

28. Machine-learning-based prediction of cardiovascular events for hyperlipidemia population with lipid variability and remnant cholesterol as biomarkers

Author: Du, Zhenzhen, Wang, Shuang, Yang, Ouzhou, He, Juan, Yang, Yujie, Zheng, Jing, Zhao, Honglei, and Cai, Yunpeng
Published: 2024
Full Text: View/download PDF

29. A systematic review of spatial and temporal epidemiological approaches, focus on lung cancer risk associated with particulate matter

Author: Neupane, Basanta Kumar, Acharya, Bipin Kumar, Cao, Chunxiang, Xu, Min, Bhattarai, Hemraj, Yang, Yujie, and Wang, Shaohua
Published: 2024
Full Text: View/download PDF

30. Prevalence and severity of sarcopenia in patients on maintenance hemodialysis: a cross-sectional study

Author: Yang, Yujie, Zeng, Ying, Lv, Wenmei, Fu, Ping, and Yuan, Huaihong
Published: 2024
Full Text: View/download PDF

31. Bioelectrical impedance analysis–derived phase angle predicts possible Sarcopenia in patients on maintenance hemodialysis: a retrospective study

Author: Zeng, Ying, Chen, Yang, Yang, Yujie, Qiu, Ying, Fu, Ping, and Yuan, Huaihong
Published: 2024
Full Text: View/download PDF

32. Prevalence and potential influencing factors for social frailty among community-dwelling older adults: a systematic review and meta-analysis

Author: Li, Jie, Zhu, Linfang, Yang, Yujie, Li, Yajuan, Fu, Ping, and Yuan, Huaihong
Published: 2024
Full Text: View/download PDF

33. Physiologically-based pharmacokinetic/pharmacodynamic modeling of meropenem in critically ill patients

Author: Yang, Yujie, Wang, Yirong, Zeng, Wei, Zhou, Jinhua, Xu, Min, Lan, Ying, Liu, Lvye, Shen, Jian, Zhang, Chuan, and He, Qin
Published: 2024
Full Text: View/download PDF

34. Bioelectrical impedance phase angle combined with physical function predicts pre-frailty in maintenance hemodialysis patients: a prospective study

Author: Yang, Yujie, Lv, Wenmei, Zeng, Ying, Chen, Yang, and Yuan, Huaihong
Published: 2024
Full Text: View/download PDF

35. Graph Convolutional Network with Syntactic Dependency for Aspect-Based Sentiment Analysis

Author: Zhang, Fan, Zheng, Wenbin, and Yang, Yujie
Published: 2024
Full Text: View/download PDF

36. Management of triplet excitons transition: fine regulation of Förster and dexter energy transfer simultaneously

Author: Wang, Jiaqiang, Yang, Yujie, Sun, Xinnan, Li, Xiaoning, Zhang, Liyao, and Li, Zhen
Published: 2024
Full Text: View/download PDF

37. Developing a risk assessment tool for cancer-related venous thrombosis in China: a modified Delphi-analytic hierarchy process study

Author: Qin, Xiaoli, Gao, Xiurong, Yang, Yujie, Ou, Shunlong, Luo, Jing, Wei, Hua, and Jiang, Qian
Published: 2024
Full Text: View/download PDF

38. Bidirectional association between pneumonia and intestinal infection: an analysis of the MIMIC-IV database

Author: Hou, Weiqian, Zhu, Yi, Lai, Xigui, and Yang, Yujie
Published: 2024
Full Text: View/download PDF

39. Trade Friction in Two-Country HANK with Financial Friction

Author: Zhang, Chenxin, Yang, Yujie, and Hou, Wenwen
Published: 2024
Full Text: View/download PDF

40. Feasible Policy Iteration

Author: Yang, Yujie, Zheng, Zhilong, Li, Shengbo Eben, Duan, Jingliang, Liu, Jingjing, Zhan, Xianyuan, and Zhang, Ya-Qin
Subjects: Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control
Abstract: Safe reinforcement learning (RL) aims to find the optimal policy and its feasible region in a constrained optimal control problem (OCP). Ensuring feasibility and optimality simultaneously has been a major challenge. Existing methods either attempt to solve OCPs directly with constrained optimization algorithms, leading to unstable training processes and unsatisfactory feasibility, or restrict policies in overly small feasible regions, resulting in excessive conservativeness with sacrificed optimality. To address this challenge, we propose an indirect safe RL framework called feasible policy iteration, which guarantees that the feasible region monotonically expands and converges to the maximum one, and the state-value function monotonically improves and converges to the optimal one. We achieve this by designing a policy update principle called region-wise policy improvement, which maximizes the state-value function under the constraint of the constraint decay function (CDF) inside the feasible region and minimizes the CDF outside the feasible region simultaneously. This update scheme ensures that the state-value function monotonically increases state-wise in the feasible region and the CDF monotonically decreases state-wise in the entire state space. We prove that the CDF converges to the solution of the risky Bellman equation while the state-value function converges to the solution of the feasible Bellman equation. The former represents the maximum feasible region and the latter manifests the optimal state-value function. Experiments show that our algorithm learns strictly safe and near-optimal policies with accurate feasible regions on classic control tasks. It also achieves fewer constraint violations with performance better than (or comparable to) baselines on Safety Gym.
Published: 2023

41. Influence of Ni2+ and Sn4+ ions content on the microstructure and magnetic properties of the NiCuZnSn ferrite materials

Author: Zhang, Yingming, Yang, Yujie, Chen, Congliang, Chen, Dongyang, and Meng, Yuting
Published: 2024
Full Text: View/download PDF

42. Coronin 2B deficiency induces nucleolar stress and neuronal apoptosis

Author: Wu, Hongjiao, Yang, Yujie, Yi, Wanying, Qiu, Yue, Ma, Shuangshuang, Xu, Jinying, Fan, Yingying, Chen, Yuewen, and Chen, Yu
Published: 2024
Full Text: View/download PDF

43. Preparation of low-loss amorphous soft magnetic powder cores by joint coating of resin and ZnO nanoparticles

Author: Li, Hao, Yang, Yujie, Chen, Congliang, Chen, Dongyang, Zhang, Yingming, Meng, Yuting, and Zhang, Zhengyu
Published: 2024
Full Text: View/download PDF

44. McNet: Fuse Multiple Cues for Multichannel Speech Enhancement

Author: Yang, Yujie, Quan, Changsheng, and Li, Xiaofei
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Sound, Electrical Engineering and Systems Science - Signal Processing
Abstract: In multichannel speech enhancement, both spectral and spatial information are vital for discriminating between speech and noise. How to fully exploit these two types of information and their temporal dynamics remains an interesting research problem. As a solution to this problem, this paper proposes a multi-cue fusion network named McNet, which cascades four modules to respectively exploit the full-band spatial, narrow-band spatial, sub-band spectral, and full-band spectral information. Experiments show that each module in the proposed network has its unique contribution and, as a whole, notably outperforms other state-of-the-art methods., Comment: submitted to icassp 2023
Published: 2022

45. Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

Author: Yu, Dongjie, Zou, Wenjun, Yang, Yujie, Ma, Haitong, Li, Shengbo Eben, Duan, Jingliang, and Chen, Jianyu
Subjects: Computer Science - Robotics, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control
Abstract: Safe reinforcement learning (RL) that solves constraint-satisfactory policies provides a promising way to the broader safety-critical applications of RL in real-world problems such as robotics. Among all safe RL approaches, model-based methods reduce training time violations further due to their high sample efficiency. However, lacking safety robustness against the model uncertainties remains an issue in safe model-based RL, especially in training time safety. In this paper, we propose a distributional reachability certificate (DRC) and its Bellman equation to address model uncertainties and characterize robust persistently safe states. Furthermore, we build a safe RL framework to resolve constraints required by the DRC and its corresponding shield policy. We also devise a line search method to maintain safety and reach higher returns simultaneously while leveraging the shield policy. Comprehensive experiments on classical benchmarks such as constrained tracking and navigation indicate that the proposed algorithm achieves comparable returns with much fewer constraint violations during training., Comment: 12 pages, 6 figures
Published: 2022

46. How does transition finance influence green innovation of high-polluting and high-energy-consuming enterprises? Evidence from China

Author: Liu, Chao, Yang, Yujie, and Chen, Shuai
Published: 2024
Full Text: View/download PDF

47. Correction: Protective Effect of Raspberry Ketone on Deep Vein Thrombosis and the Molecular Mechanism

Author: Zhang, Dalin, Lin, Shusen, Yang, Yujie, and Wang, Hecheng
Published: 2024
Full Text: View/download PDF

48. Influence of four kinds of organic binders on magnetic properties of FeSiCrC magnetic powder and magnetic powder core

Author: Meng, Yuting, Yang, Yujie, Chen, Dongyang, Zhang, Yingming, Chen, Congliang, Li, Hao, and Zhang, Zhenyu
Published: 2024
Full Text: View/download PDF

49. Effects of different light and heavy rare-earth compositions on structure and magnetic properties of high-entropy garnet ceramics

Author: Chen, Dongyang, Yang, Yujie, Zhang, Yingming, Chen, Congliang, Li, Hao, Meng, Yuting, and Zhang, Zhengyu
Published: 2024
Full Text: View/download PDF

50. Oral English Course Online Recommendation System Based on Support Vector Machine

Author: Yang, Yujie, primary and Yuan, Gang, additional
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

1,846 results on '"Yang, Yujie"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources