63,593 results on '"Yunfei An"'
Search Results
2. Corrigendum: Heterogeneous phenotype of a Chinese Familial WHIM syndrome with CXCR4V340fs gain-of-function mutation
- Author
-
Yu Huang, Lu Li, Ran Chen, Lang Yu, Shunkai Zhao, Yanjun Jia, Ying Dou, Zhiyong Zhang, Yunfei An, Xuemei Tang, Xiaodong Zhao, and Lina Zhou
- Subjects
CXCR4 variant ,gain-of-function ,inborn error of immunity ,WHIM syndrome ,heterogeneous phenotype ,Immunologic diseases. Allergy ,RC581-607 - Published
- 2025
- Full Text
- View/download PDF
3. Case report: A cyclic neutropenia patient with ELANE mutation accompanied by hemophagocytic lymphohistiocytosis
- Author
-
Lang Yu, Yulin Li, Wenhui Li, Yishi Zhang, Wenli He, Xuemei Tang, Yunfei An, and Xiaodong Zhao
- Subjects
cyclic congenital neutropenia ,CYN ,SCN ,hemophagocytic lymphohistiocytosis ,HLH ,Immunologic diseases. Allergy ,RC581-607 - Abstract
Many inborn errors of immunity may accompany secondary hemophagocytic lymphohistiocytosis (HLH), a condition typically characterized by impaired cytotoxic T and NK cell function. A considerable proportion of HLH cases also stem from chronic granulomatosis with phagocytic dysfunction. However, the development of secondary HLH in patients with severe congenital neutropenia (SCN) or cyclic neutropenia (CyN) with abnormal phagocytic cell counts has been less frequently reported. Herein, we present a case of a pediatric patient with ELANE mutation-associated CyN who developed HLH subsequent to severe bacterial, fungal, and viral infections. Notable observations included impaired NK cell degranulation function (CD107a). To the best of our knowledge, this represents the first documented instance of HLH in patients with CyN attributed to an ELANE mutation. Thus, our study establishes a link between ELANE-related CyN and HLH, underscoring the importance of considering HLH as a potential complication in these patients.
- Published
- 2024
- Full Text
- View/download PDF
4. VSV infection and LPS treatment alter serum bile acid profiles, bile acid biosynthesis, and bile acid receptors in mice
- Author
-
Yamei Li, Yan Luo, Chao Wang, Lei Xu, Xinhua Dai, Yunfei An, Lin He, Dongmei Zeng, Yangjuan Bai, and Hua Zhang
- Subjects
VSV infection ,LPS treatment ,bile acid profiles ,bile acid biosynthesis ,bile acid receptors ,Microbiology ,QR1-502 - Abstract
ABSTRACT Pathogen infections remain a significant public health problem worldwide. Accumulating evidence regarding the crosstalk between bile acid (BA) metabolism and immune response reveals that BA metabolism regulates host immunity and microbial pathogenesis, making it an attractive target for disease prevention and infection control. However, the effect of infection on circulating BA profiles, the biosynthesis-related enzymes, and their receptors remains to be depicted. Here, we investigated the effect of viral (vesicular stomatitis virus, VSV) and bacterial (lipopolysaccharide, LPS) infections on BA metabolism and signaling. Infection models were successfully established by intraperitoneally injecting VSV and LPS, respectively. VSV and LPS injection significantly changed the circulating BA profiles, with highly increased levels of taurine-conjugated BAs and significant decreases in unconjugated BAs. Consistent with the decreased levels of circulating cholic acid (CA) and chenodeoxycholic acid (CDCA), the expression of BA biosynthesis-related rate-limiting enzymes (Cyp7a1, Cyp27a1, Cyp8b1, and Hsd3b7) were significantly reduced. Furthermore, hepatic and pulmonary BA receptors (BARs) expression varied in different infection models. LPS treatment had an extensive impact on tested hepatic and pulmonary BARs, resulting in the upregulation of TGR5, S1PR2, and VDR, while VSV infection only promoted VDR expression. Our study provides insights into the involvement of BA metabolism in the pathophysiology of infection, which may provide potential clues for targeting BA metabolism and BAR signaling to boost innate immunity and control infection.IMPORTANCEThis study focuses on the crosstalk between bile acid (BA) metabolism and immune response in VSV infection and LPS treatment models and depicts the effect of infection on circulating BA profiles, the biosynthesis-related enzymes, and their receptors. These findings provide insights into the effect of infection on BA metabolism and signaling, adding a more comprehensive understanding to the relationship between infection, BA metabolism and immune responses.
- Published
- 2024
- Full Text
- View/download PDF
5. Heterogeneous phenotype of a Chinese Familial WHIM syndrome with CXCR4V340fs gain-of-function mutation
- Author
-
Yu Huang, Lu Li, Ran Chen, Lang Yu, Shunkai Zhao, Yanjun Jia, Ying Dou, Zhiyong Zhang, Yunfei An, Xuemei Tang, Xiaodong Zhao, and Lina Zhou
- Subjects
CXCR4 variant ,gain-of-function ,inborn error of immunity ,WHIM syndrome ,heterogeneous phenotype ,Immunologic diseases. Allergy ,RC581-607 - Abstract
BackgroundWHIM syndrome is a rare, autosomal dominant inborn error of immunity characterized by warts, hypogammaglobulinemia, infection, and myelokathexis. It is caused mainly by heterozygous mutations at the C-terminus of the C-X-C chemokine receptor type 4 (CXCR4) gene.MethodsWe described the detailed clinical, genetic, immunological and treatment characteristic of four WHIM patients from a single Chinese family.ResultsHere, we report four patients from a family carrying a variant of CXCR4 (c.1016_1017dupCT), which introduces a frameshift at codon V340, resulting in an extension of 14 amino acids (p.V340L fs*27). We provide an in-depth analysis of their clinical, genetic, immunological and treatment characteristic, noting that these patients exhibited an atypical clinical phenotype when compared to reported CXCR4R334X patients. Additionally, the frameshift variant CXCR4V340fs led to impaired receptor downregulation in patients’ PBMCs, and in HEK293T cells transfected with the variant plasmids.ConclusionsOur study provided detailed clinical features of four CXCR4V340fs WHIM patients from one Chinese family who presented atypical phenotype and enrich the spectrum of WHIM syndrome.
- Published
- 2024
- Full Text
- View/download PDF
6. Mechanism and target treatment of primary immunodeficiency diseases with systemic lupus erythematosus‐like phenotype
- Author
-
Shan Liu, Zhiyong Zhang, Xuemei Tang, Xiaodong Zhao, and Yunfei An
- Subjects
inborn errors of immunity (IEI) ,primary immunodeficiency diseases (PIDs) ,systemic lupus erythematosus (SLE) ,Pediatrics ,RJ1-570 - Abstract
Abstract Primary immunodeficiency diseases (PIDs) present a heterogeneous group of diseases with aberrant immune response caused by monogenic mutations. Due to the immune dysfunction and dysregulation, PIDs have a wide clinical spectrum such as infections, autoimmunity, autoinflammation, allergy, and malignancies. Systemic lupus erythematosus (SLE) is a systemic autoimmune disease characterized with multiple autoantibodies and multiple organ damage, which could be the predominant phenotype in patients with PIDs. In recent years, the increasing identification of monogenic causes of SLE and PIDs discloses the partially shared genetic background and common pathogenic process. The study of PIDs with SLE‐like phenotype paves the way for the exploration of lupus pathogenesis and new perspectives in targeted therapies concurrently.
- Published
- 2024
- Full Text
- View/download PDF
7. Individuals carrying the HLA-B*15 allele exhibit favorable responses to COVID-19 vaccines but are more susceptible to Omicron BA.5.2 and XBB.1.16 infection
- Author
-
Lingxin Meng, Yue Pan, Yueping Liu, Rui He, Yuting Sun, Chenhui Wang, Lei Fei, Airu Zhu, Zhongfang Wang, Yunfei An, Yuzhang Wu, Bo Diao, and Yongwen Chen
- Subjects
COVID-19 ,Omicron variants ,HLA-B*15 ,vaccination ,SARS-CoV-2 ,Immunologic diseases. Allergy ,RC581-607 - Abstract
BackgroundNatural infection or vaccination have provided robust immune defense against SARS-CoV-2 invasion, nevertheless, Omicron variants still successfully cause breakthrough infection, and the underlying mechanisms are poorly understood.MethodsSequential blood samples were continuously collected at different time points from 252 volunteers who were received the CanSino Ad5-nCoV (n= 183) vaccine or the Sinovac CoronaVac inactivated vaccine (n= 69). The anti-SARS-CoV-2 prototype and Omicron BA.5.2 as well as XBB.1.16 variant neutralizing antibodies (Nab) in sera were detected by ELISA. Sera were also used to measure pseudo and live virus neutralization assay. The associations between the anti-prototype Nab levels and different HLA-ABC alleles were analyzed using artificial intelligence (AI)-deep learning techniques. The frequency of B cells in PBMCs was investigated by flow cytometry assay (FACs).ResultsIndividuals carrying the HLA-B*15 allele manifested the highest concentrations of anti-SARS-CoV-2 prototype Nab after vax administration. Unfortunately, these volunteers are more susceptible to Omicron BA.5.2 breakthrough infection due to their sera have poorer anti-BA.5.2 Nab and lower levels of viral neutralization efficacy. FACs confirmed that a significant decrease in CD19+CD27+RBD+ memory B cells in these HLA-B*15 population compared to other cohorts. Importantly, generating lower concentrations of cross-reactive anti-XBB.1.16 Nab post-BA.5.2 infection caused HLA-B*15 individuals to be further infected by XBB.1.16 variant.ConclusionsIndividuals carrying the HLA-B*15 allele respond better to COVID-19 vax including the CanSino Ad5-nCoV and the Sinovac CoronaVac inactivated vaccines, but are more susceptible to Omicron variant infection, thus, a novel vaccine against this population is necessary for COVID-19 pandemic control in the future.
- Published
- 2024
- Full Text
- View/download PDF
8. An efficient and successful outcome after haematopoietic stem cell transplantation in a patient with an LPS-responsive beige-like anchor gene mutation
- Author
-
Cen Shen, Luying Zhang, Yan Meng, Lu Yang, Wenli He, Xiaoying Lei, Lina Zhou, Yunfei An, and Ying Dou
- Subjects
LRBA mutation ,haematopoietic stem cell transplantation ,neurological change ,infection ,graft-versus-host disease ,Pediatrics ,RJ1-570 - Abstract
Lipopolysaccharide (LPS)-responsive beige ankyrin (LRBA) gene mutations were first reported as the cause of immunodeficiency syndromes and autoimmunity in 2012. The majority of LRBA patients have multiple organ system involvement and a complex clinical phenotype. Herein we present a comprehensive account on the disease progression and transplantation procedure in a patient with LRBA deficiency who exhibited progressive autoimmune disease symptoms along with recurrent pulmonary infections since the age of 6 years old. Despite receiving abatacept therapy and immunoglobulin replacement treatments to manage the symptoms, but the symptoms still progressed. Therefore, nine years after disease onset, patients were treated with allogeneic haematopoietic stem cell transplantation (allo-HSCT). The patient experienced acute and chronic graft-versus-host disease (GVHD) and recurrent infections after transplantation. During one and a half years of follow-up, we found that allogeneic haematopoietic stem cell transplantation can relieve the symptoms of autoimmune disease in patients with LRBA deficiency, and marked clinical improvement and recovery of immune function were observed following stem cell transplantation.
- Published
- 2024
- Full Text
- View/download PDF
9. Deep Learning Radiomics Features of Mediastinal Fat and Pulmonary Nodules on Lung CT Images Distinguish Benignancy and Malignancy
- Author
-
Hongzhuo Qi, Qifan Xuan, Pingping Liu, Yunfei An, Wenjuan Huang, Shidi Miao, Qiujun Wang, Zengyao Liu, and Ruitao Wang
- Subjects
pulmonary nodules ,nomogram ,mediastinal fat ,benign ,malignant ,deep learning ,Biology (General) ,QH301-705.5 - Abstract
This study investigated the relationship between mediastinal fat and pulmonary nodule status, aiming to develop a deep learning-based radiomics model for diagnosing benign and malignant pulmonary nodules. We proposed a combined model using CT images of both pulmonary nodules and the fat around the chest (mediastinal fat). Patients from three centers were divided into training, validation, internal testing, and external testing sets. Quantitative radiomics and deep learning features from CT images served as predictive factors. A logistic regression model was used to combine data from both pulmonary nodules and mediastinal adipose regions, and personalized nomograms were created to evaluate the predictive performance. The model incorporating mediastinal fat outperformed the nodule-only model, with C-indexes of 0.917 (training), 0.903 (internal testing), 0.942 (external testing set 1), and 0.880 (external testing set 2). The inclusion of mediastinal fat significantly improved predictive performance (NRI = 0.243, p < 0.05). A decision curve analysis indicated that incorporating mediastinal fat features provided greater patient benefits. Mediastinal fat offered complementary information for distinguishing benign from malignant nodules, enhancing the diagnostic capability of this deep learning-based radiomics model. This model demonstrated strong diagnostic ability for benign and malignant pulmonary nodules, providing a more accurate and beneficial approach for patient care.
- Published
- 2024
- Full Text
- View/download PDF
10. Assessing the role of global food commodity prices in achieving the 2030 agenda for SDGs
- Author
-
Yifan Shen, Yanan Chen, Xunpeng Shi, Yunfei An, Muyi Yang, and Yunting Qi
- Subjects
Agricultural resource economics ,Food economics ,Science - Abstract
Summary: Food plays a vital role in human sustenance and well-being, and the fluctuations in its price exert a significant impact on the attainment of the Sustainable Development Goals (SDGs) from social, economic, and environmental perspectives. This paper conducts an analysis utilizing data from 163 countries, revealing that an upsurge in global food commodity prices entails trade-offs with 13 SDGs, while exhibiting synergies with a few others. By considering specific food products, various types of countries, and the supply and demand shocks, further analysis confirms predominantly negative associations between spikes in food prices and the SDGs. Our findings highlight the urgent imperative to mitigate abrupt increases in food prices, such as those witnessed during the 2022 food crisis, to ensure the comprehensive fulfillment of the 2030 agenda for SDGs.
- Published
- 2024
- Full Text
- View/download PDF
11. Successful use of extracorporeal membrane oxygenation for life‐threatening macrophage activation syndrome after treatment with tocilizumab in an systemic juvenile idiopathic arthritis patient
- Author
-
Xi Yang, Yingfu Chen, Rongxin Dai, Yunfei An, Xin Yan, Xiaodong Zhao, and Xuemei Tang
- Subjects
extracorporeal membrane oxygenation ,macrophage activation syndrome ,systemic juvenile idiopathic arthritis ,tocilizumab ,Pediatrics ,RJ1-570 - Abstract
Abstract Macrophage activation syndrome (MAS) is a rare, potentially life‐threatening condition in rheumatic diseases. The primary treatments consist of high‐dose corticosteroids and immunosuppressive drugs, although more recently, cytokine inhibitors like anakinra or tocilizumab (TCZ) have been reported. We present a case of a child with systemic juvenile idiopathic arthritis (sJIA). After receiving a single infusion of TCZ, the child developed progressive hypoxia and was subsequently transferred to the pediatric intensive care unit (PICU) after 4 days. An immediate postintubation chest X‐ray revealed a diffuse exudative lesion. Despite continuous efforts to provide mechanical ventilation and respiratory support, the patient's oxygen saturation continued to decline. Moreover, the patient developed hemodynamic compromise, necessitating the administration of norepinephrine. Eventually, vasopressin and dopamine were added to maintain stable hemodynamics. After an intensive but ineffective treatment, extracorporeal membrane oxygenation (ECMO) was initiated in the PICU after 16 h. The patient successfully recovered and was weaned off ECMO support after 60 h. Following discharge from the PICU, given the severe refractory clinical features, we made an attempt to readminister TCZ treatment. However, within half an hour of TCZ infusion, the patient experienced anaphylaxis characterized by palpitations and chest tightness, leading to the discontinuation of TCZ. TCZ, as a biological agent, is commonly used in the treatment of sJIA. Nonetheless, the occurrence of MAS and anaphylaxis following TCZ administration for sJIA may be more prevalent than previously recognized. Pediatric rheumatologists should exercise caution when initiating TCZ for active sJIA. Furthermore, we want to underscore the importance of life‐saving techniques such as ECMO for sJIA patients in emergency situations.
- Published
- 2023
- Full Text
- View/download PDF
12. Explainable Machine Learning Explores Association Between Sarcopenia and Breast Cancer Distant Metastasis
- Author
-
Hongzhuo Qi, Yunfei An, Xiaohui Hu, Shidi Miao, and Jing Li
- Subjects
Breast cancer ,cut-off value ,distant metastasis ,multi-objective optimization ,multimodal ,sarcopenia ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
The impact of sarcopenia on the prognosis of breast cancer (BC) carries important clinical significance. However, there is no internationally standardized cut-off value for defining sarcopenia. This study proposed an explainable machine learning model for identifying risk factors of BC distant metastases and discussing the division of body composition cut-off values. Combining computed tomography (CT) image data of $11^{th}$ thoracic vertebrae (T11) and $4^{th}$ thoracic vertebrae (T4), a multi-objective optimized genetic algorithm was developed to select features and predict BC distant metastasis. Feature selection results were analyzed by Cox regression. The results showed that skeletal muscle index (SMI/T11) was a risk factor for predicting BC distant metastasis and an independent prognostic factor for distant metastasis-free survival (DMFS). A cut-off value of 21cm2/m2 for SMI/T11 was obtained by shapley additive explanations, in addition, The DMFS and overall survival (OS) of the low-risk group were significantly better than those of the high-risk group. The combination of multimodal data further confirms that sarcopenia is associated with poorer DMFS and OS in BC patients, and explores for the first time the issue of the cut-off values of sarcopenia and BC distant metastasis.
- Published
- 2023
- Full Text
- View/download PDF
13. Breastfeeding by a mother taking cyclosporine for nephrotic syndrome
- Author
-
Ruizhe Li, Chuan Zhang, Hongjing Wang, and Yunfei An
- Subjects
Cyclosporine ,Breastfeeding ,Nephrotic syndrome ,Pediatrics ,RJ1-570 ,Public aspects of medicine ,RA1-1270 - Abstract
Abstract Background Cyclosporine is widely used for immunosuppressive treatment of various systematic and local autoimmune diseases. Breastfeeding is conventionally contraindicated when treating with cyclosporine due to its excretion into breast milk, which may cause immune suppression of exposed infants and affect infants` growth. A few cases have tested cyclosporine levels in random breast milk samples and concluded the infants exposed to safe cyclosporine levels during breastfeeding. Since infants do not maintain a fixed feeding schedule, we monitored cyclosporine levels in breast milk at different times of the day to assess the safety of breast milk for infants throughout the day. Case presentation A 32-year-old dichorionic twin-pregnancy woman had nephrotic syndrome with renal biopsy confirmed type V lupus nephritis for over five years. She was treated only with prednisone 10 mg a day before pregnancy and during early pregnancy. Cyclosporine was added in her regimen from 22 weeks gestation and was adjusted to 225 mg a day from 28 weeks gestation. After parturition, she partially breastfed her twin infants while being treated with cyclosporine 3 mg/kg a day as well as prednisone and hydroxychloroquine sulfate. The cyclosporine level in maternal blood was determined, and several breast milk samples were collected for consecutive 48 h beginning on the ninth day after parturition. The concentration of cyclosporine in breast milk was measured and ranged from 0.443 to 5.307 mcg/L. Both infants grew and developed normally at the three-month follow-up, with no adverse effects observed. The study was conducted at West China Second University Hospital of Sichuan University, started in September 2021, with the consent of the participant and the approval of the ethics committee. Conclusion In this case, cyclosporine levels in breast milk were low at all times of the day. The growth and development of both infants were normal at three months postpartum. Thus, breastfeeding may still be an option for mothers with nephrotic syndrome who are treated with cyclosporine.
- Published
- 2022
- Full Text
- View/download PDF
14. Identity-Preserving Video Dubbing Using Motion Warping
- Author
-
Liu, Runzhen, Lin, Qinjie, Liu, Yunfei, Lin, Lijian, Zhu, Ye, Li, Yu, Xian, Chuhua, and Hong, Fa-Ting
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Video dubbing aims to synthesize realistic, lip-synced videos from a reference video and a driving audio signal. Although existing methods can accurately generate mouth shapes driven by audio, they often fail to preserve identity-specific features, largely because they do not effectively capture the nuanced interplay between audio cues and the visual attributes of reference identity . As a result, the generated outputs frequently lack fidelity in reproducing the unique textural and structural details of the reference identity. To address these limitations, we propose IPTalker, a novel and robust framework for video dubbing that achieves seamless alignment between driving audio and reference identity while ensuring both lip-sync accuracy and high-fidelity identity preservation. At the core of IPTalker is a transformer-based alignment mechanism designed to dynamically capture and model the correspondence between audio features and reference images, thereby enabling precise, identity-aware audio-visual integration. Building on this alignment, a motion warping strategy further refines the results by spatially deforming reference images to match the target audio-driven configuration. A dedicated refinement process then mitigates occlusion artifacts and enhances the preservation of fine-grained textures, such as mouth details and skin features. Extensive qualitative and quantitative evaluations demonstrate that IPTalker consistently outperforms existing approaches in terms of realism, lip synchronization, and identity retention, establishing a new state of the art for high-quality, identity-consistent video dubbing., Comment: v2, Under Review
- Published
- 2025
15. Exploring Optimal Latent Trajetory for Zero-shot Image Editing
- Author
-
Li, Maomao, Li, Yu, Liu, Yunfei, and Xu, Dong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Editability and fidelity are two essential demands for text-driven image editing, which expects that the editing area should align with the target prompt and the rest should remain unchanged separately. The current cutting-edge editing methods usually obey an "inversion-then-editing" pipeline, where the source image is first inverted to an approximate Gaussian noise ${z}_T$, based on which a sampling process is conducted using the target prompt. Nevertheless, we argue that it is not a good choice to use a near-Gaussian noise as a pivot for further editing since it almost lost all structure fidelity. We verify this by a pilot experiment, discovering that some intermediate-inverted latents can achieve a better trade-off between editability and fidelity than the fully-inverted ${z}_T$. Based on this, we propose a novel editing paradigm dubbed ZZEdit, which gentlely strengthens the target guidance on a sufficient-for-editing while structure-preserving latent. Specifically, we locate such an editing pivot by searching the first point on the inversion trajectory which has larger response levels toward the target prompt than the source one. Then, we propose a ZigZag process to perform mild target guiding on this pivot, which fulfills denoising and inversion iteratively, approaching the target while still holding fidelity. Afterwards, to achieve the same number of inversion and denoising steps, we perform a pure sampling process under the target prompt. Extensive experiments highlight the effectiveness of our ZZEdit in diverse image editing scenarios compared with the "inversion-then-editing" pipeline., Comment: 16 pages
- Published
- 2025
16. EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks
- Author
-
Liu, Shixuan, Cheng, Haoxiang, Wang, Yunfei, He, Yue, Fan, Changjun, and Liu, Zhong
- Subjects
Computer Science - Social and Information Networks - Abstract
Heterogeneous Information Networks (HINs) encapsulate diverse entity and relation types, with meta-paths providing essential meta-level semantics for knowledge reasoning, although their utility is constrained by discovery challenges. While Large Language Models (LLMs) offer new prospects for meta-path discovery due to their extensive knowledge encoding and efficiency, their adaptation faces challenges such as corpora bias, lexical discrepancies, and hallucination. This paper pioneers the mitigation of these challenges by presenting EvoPath, an innovative framework that leverages LLMs to efficiently identify high-quality meta-paths. EvoPath is carefully designed, with each component aimed at addressing issues that could lead to potential knowledge conflicts. With a minimal subset of HIN facts, EvoPath iteratively generates and evolves meta-paths by dynamically replaying meta-paths in the buffer with prioritization based on their scores. Comprehensive experiments on three large, complex HINs with hundreds of relations demonstrate that our framework, EvoPath, enables LLMs to generate high-quality meta-paths through effective prompting, confirming its superior performance in HIN reasoning tasks. Further ablation studies validate the effectiveness of each module within the framework.
- Published
- 2025
- Full Text
- View/download PDF
17. Universal Online Temporal Calibration for Optimization-based Visual-Inertial Navigation Systems
- Author
-
Fan, Yunfei, Zhao, Tianyu, Guo, Linan, Chen, Chen, Wang, Xin, and Zhou, Fengyi
- Subjects
Computer Science - Robotics ,Computer Science - Computer Vision and Pattern Recognition - Abstract
6-Degree of Freedom (6DoF) motion estimation with a combination of visual and inertial sensors is a growing area with numerous real-world applications. However, precise calibration of the time offset between these two sensor types is a prerequisite for accurate and robust tracking. To address this, we propose a universal online temporal calibration strategy for optimization-based visual-inertial navigation systems. Technically, we incorporate the time offset td as a state parameter in the optimization residual model to align the IMU state to the corresponding image timestamp using td, angular velocity and translational velocity. This allows the temporal misalignment td to be optimized alongside other tracking states during the process. As our method only modifies the structure of the residual model, it can be applied to various optimization-based frameworks with different tracking frontends. We evaluate our calibration method with both EuRoC and simulation data and extensive experiments demonstrate that our approach provides more accurate time offset estimation and faster convergence, particularly in the presence of noisy sensor data., Comment: 7 pages
- Published
- 2025
18. Enhancing Neural Adaptive Wireless Video Streaming via Lower-Layer Information Exposure and Online Tuning
- Author
-
Zhao, Lingzhi, Cui, Ying, Jia, Yuhang, Zhang, Yunfei, and Nahrstedt, Klara
- Subjects
Computer Science - Multimedia - Abstract
Deep reinforcement learning (DRL) demonstrates its promising potential in the realm of adaptive video streaming and has recently received increasing attention. However, existing DRL-based methods for adaptive video streaming use only application (APP) layer information, adopt heuristic training methods, and train generalized neural networks with pre-collected data. This paper aims to boost the quality of experience (QoE) of adaptive wireless video streaming by using lower-layer information, deriving a rigorous training method, and adopting online tuning with real-time data. First, we formulate a more comprehensive and accurate adaptive wireless video streaming problem as an infinite stage discounted Markov decision process (MDP) problem by additionally incorporating past and lower-layer information, allowing a flexible tradeoff between QoE and costs for obtaining system information and solving the problem. In the offline scenario (only with pre-collected data), we propose an enhanced asynchronous advantage actor-critic (eA3C) method by jointly optimizing the parameters of parameterized policy and value function. Specifically, we build an eA3C network consisting of a policy network and a value network that can utilize cross-layer, past, and current information and jointly train the eA3C network using pre-collected samples. In the online scenario (with additional real-time data), we propose two continual learning-based online tuning methods for designing better policies for a specific user with different QoE and training time tradeoffs. Finally, experimental results show that the proposed offline policy can improve the QoE by 6.8~14.4% compared to the state-of-arts in the offline scenario, and the proposed online policies can further achieve 6~28% gains in QoE over the proposed offline policy in the online scenario., Comment: technical report for IEEE TMM, 17 pages, 10 figures
- Published
- 2025
19. Phenotypic characterization of patients with activated PI3Kδ syndrome 1 presenting with features of systemic lupus erythematosus
- Author
-
Yanping Wang, Qiuyun Yang, Xuemei Chen, Wenjing Tang, Lina Zhou, Zhi Chen, Yunfei An, Zhiyong Zhang, Xuemei Tang, and Xiaodong Zhao
- Subjects
Activated phosphoinositide 3-kinase δsyndrome 1 ,Autoimmune disease ,Immunosuppressive therapy ,PIK3CD ,Systemic lupus erythematosus ,Medicine (General) ,R5-920 ,Genetics ,QH426-470 - Abstract
Activated phosphoinositide 3-kinase δ syndrome 1 (APDS1) is a primary immunodeficiency disease caused by gain-of-function mutations in PIK3CD. Clinical features of autoimmune disease have been reported in patients with APDS1. In this study, we reported three patients with APDS1 presenting with systemic lupus erythematosus (SLE) phenotype. The clinical manifestations included recurrent respiratory tract infection, lymphoproliferation, Coombs-positive hemolytic anemia, decreased complement fractions, positive antinuclear antibodies, renal complications related to SLE associated diseases, which met the clinical spectrum of APDS1 and the classification criteria of SLE. The immunological phenotype included an inversion in the CD4:CD8 ratio, an increase in both non-circulating Tfh CD4+ memory T and circulating Tfh populations, a low level of recent thymic emigrant T cells, overexpression of CD57 on T cells, and a decrease in B cells with fewer antibody class switch recombination. These phenotypes detected in patients with APDS1 presenting with SLE were resemble that in patients with APDS1 presenting without SLE. Meanwhile, we described the effect of glucocorticoids and rapamycin therapy on patients with APDS1. The phosphorylation of S6 at Ser235/236 was inhibited in patients with APDS1 who underwent glucocorticoids therapy, including two who presented with SLE phenotype. The phosphorylation of AKT at Ser473 and phosphorylation of S6 at Ser235/236 were inhibited in other patients with APDS1 who underwent rapamycin therapy. Here, we showed the coexistence of immunodeficiency and SLE phenotype in APDS1, and the inhibition of rapamycin in activated Akt-mTOR signaling pathway.
- Published
- 2021
- Full Text
- View/download PDF
20. Ensuring Consistency for In-Image Translation
- Author
-
Fu, Chengpeng, Feng, Xiaocheng, Huang, Yichong, Huo, Wenshuai, Li, Baohang, Zhang, Zhirui, Lu, Yunfei, Tu, Dandan, Tang, Duyu, Wang, Hui, Qin, Bing, and Liu, Ting
- Subjects
Computer Science - Computation and Language - Abstract
The in-image machine translation task involves translating text embedded within images, with the translated results presented in image format. While this task has numerous applications in various scenarios such as film poster translation and everyday scene image translation, existing methods frequently neglect the aspect of consistency throughout this process. We propose the need to uphold two types of consistency in this task: translation consistency and image generation consistency. The former entails incorporating image information during translation, while the latter involves maintaining consistency between the style of the text-image and the original image, ensuring background integrity. To address these consistency requirements, we introduce a novel two-stage framework named HCIIT (High-Consistency In-Image Translation) which involves text-image translation using a multimodal multilingual large language model in the first stage and image backfilling with a diffusion model in the second stage. Chain of thought learning is utilized in the first stage to enhance the model's ability to leverage image information during translation. Subsequently, a diffusion model trained for style-consistent text-image generation ensures uniformity in text style within images and preserves background details. A dataset comprising 400,000 style-consistent pseudo text-image pairs is curated for model training. Results obtained on both curated test sets and authentic image test sets validate the effectiveness of our framework in ensuring consistency and producing high-quality translated images.
- Published
- 2024
21. SongGLM: Lyric-to-Melody Generation with 2D Alignment Encoding and Multi-Task Pre-Training
- Author
-
Yu, Jiaxing, Wu, Xinda, Xu, Yunfei, Zhang, Tieyao, Wu, Songruoyao, Ma, Le, and Zhang, Kejun
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Artificial Intelligence ,Computer Science - Sound - Abstract
Lyric-to-melody generation aims to automatically create melodies based on given lyrics, requiring the capture of complex and subtle correlations between them. However, previous works usually suffer from two main challenges: 1) lyric-melody alignment modeling, which is often simplified to one-syllable/word-to-one-note alignment, while others have the problem of low alignment accuracy; 2) lyric-melody harmony modeling, which usually relies heavily on intermediates or strict rules, limiting model's capabilities and generative diversity. In this paper, we propose SongGLM, a lyric-to-melody generation system that leverages 2D alignment encoding and multi-task pre-training based on the General Language Model (GLM) to guarantee the alignment and harmony between lyrics and melodies. Specifically, 1) we introduce a unified symbolic song representation for lyrics and melodies with word-level and phrase-level (2D) alignment encoding to capture the lyric-melody alignment; 2) we design a multi-task pre-training framework with hierarchical blank infilling objectives (n-gram, phrase, and long span), and incorporate lyric-melody relationships into the extraction of harmonized n-grams to ensure the lyric-melody harmony. We also construct a large-scale lyric-melody paired dataset comprising over 200,000 English song pieces for pre-training and fine-tuning. The objective and subjective results indicate that SongGLM can generate melodies from lyrics with significant improvements in both alignment and harmony, outperforming all the previous baseline methods., Comment: Extended version of paper accepted to AAAI 2025
- Published
- 2024
22. T$^3$-S2S: Training-free Triplet Tuning for Sketch to Scene Generation
- Author
-
Sun, Zhenhong, Wang, Yifu, Ng, Yonhon, Duan, Yunfei, Dong, Daoyi, Li, Hongdong, and Ji, Pan
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Computation and Language ,Computer Science - Graphics - Abstract
Scene generation is crucial to many computer graphics applications. Recent advances in generative AI have streamlined sketch-to-image workflows, easing the workload for artists and designers in creating scene concept art. However, these methods often struggle for complex scenes with multiple detailed objects, sometimes missing small or uncommon instances. In this paper, we propose a Training-free Triplet Tuning for Sketch-to-Scene (T3-S2S) generation after reviewing the entire cross-attention mechanism. This scheme revitalizes the existing ControlNet model, enabling effective handling of multi-instance generations, involving prompt balance, characteristics prominence, and dense tuning. Specifically, this approach enhances keyword representation via the prompt balance module, reducing the risk of missing critical instances. It also includes a characteristics prominence module that highlights TopK indices in each channel, ensuring essential features are better represented based on token sketches. Additionally, it employs dense tuning to refine contour details in the attention map, compensating for instance-related regions. Experiments validate that our triplet tuning approach substantially improves the performance of existing sketch-to-image models. It consistently generates detailed, multi-instance 2D images, closely adhering to the input prompts and enhancing visual quality in complex multi-instance scenes. Code is available at https://github.com/chaos-sun/t3s2s.git.
- Published
- 2024
23. XTransplant: A Probe into the Upper Bound Performance of Multilingual Capability and Culture Adaptability in LLMs via Mutual Cross-lingual Feed-forward Transplantation
- Author
-
Ye, Yangfan, Feng, Xiaocheng, Feng, Xiachong, Qin, Libo, Huang, Yichong, Huang, Lei, Ma, Weitao, Zhang, Zhirui, Lu, Yunfei, Yan, Xiaohui, Tang, Duyu, Tu, Dandan, and Qin, Bing
- Subjects
Computer Science - Computation and Language - Abstract
Current large language models (LLMs) often exhibit imbalances in multilingual capabilities and cultural adaptability, largely due to their English-centric pretraining data. To address this imbalance, we propose a probing method named XTransplant that explores cross-lingual latent interactions via cross-lingual feed-forward transplantation during inference stage, with the hope of enabling the model to leverage the strengths of both English and non-English languages. Through extensive pilot experiments, we empirically prove that both the multilingual capabilities and cultural adaptability of LLMs hold the potential to be significantly improved by XTransplant, respectively from En -> non-En and non-En -> En, highlighting the underutilization of current LLMs' multilingual potential. And the patterns observed in these pilot experiments further motivate an offline scaling inference strategy, which demonstrates consistent performance improvements in multilingual and culture-aware tasks, sometimes even surpassing multilingual supervised fine-tuning. And we do hope our further analysis and discussion could help gain deeper insights into XTransplant mechanism.
- Published
- 2024
24. Domain-Pair Intertwined Topological Domain Structure in Elemental Bi Monolayer
- Author
-
Hong, Yunfei, Deng, Junkai, Yang, Yang, He, Ri, Zhong, Zhicheng, Ding, Xiangdong, Sun, Jun, and Liu, Jefferson Zhe
- Subjects
Condensed Matter - Materials Science - Abstract
Ferroelectric domain structures, separated by domain walls, often display unconventional physics and hold significant potential for applications in nano-devices. Most naturally growth domain walls are charge-neutral to avoid increased electrostatic energy, while the intrinsically stable charged 180{\deg} domain walls in Bi monolayer challenged this conventional knowledge and emerged an unexplored field. Here, using machine-learning potential and molecular dynamics (MD) simulations, we investigated the finite-temperature dynamics of domain walls and discovered a domain-pair intertwined topological domain structure in Bi monolayer. In 180{\deg} domain walls, a unique polarization switching mechanism is observed, characterized by the out-of-plane shuffle of Bi atoms without bond breaking. This shuffle mechanism reverses the charge properties of Bi atoms, transforming Bi anions into cations and vice versa, ultimately reversing the polarization. Then, we observed a topological multi-domain structure with two groups of domain pairs intertwined. The charged 180{\deg} domain walls form local domain pairs, with the 90{\deg} domain walls emerge between different domain pairs. This multi-domain maintains a stable topological structure within the strain range ({\epsilon}_x = 0 to 4.70%) and exhibits rich domain wall reactions under further applied strain. Our findings provide insights into the charged 180{\deg} domain walls and the related topological domain structures, enabling new opportunities for applications in electronic and nano-electronic devices., Comment: 25 pages, 4 main figures and 17 supplemental figures
- Published
- 2024
25. Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals
- Author
-
Luo, Yunfei, Chen, Yuliang, Salekin, Asif, and Rahman, Tauhidur
- Subjects
Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Signal Processing - Abstract
Time-series foundation models have the ability to run inference, mainly forecasting, on any type of time series data, thanks to the informative representations comprising waveform features. Wearable sensing data, on the other hand, contain more variability in both patterns and frequency bands of interest and generally emphasize more on the ability to infer healthcare-related outcomes. The main challenge of crafting a foundation model for wearable sensing physiological signals is to learn generalizable representations that support efficient adaptation across heterogeneous sensing configurations and applications. In this work, we propose NormWear, a step toward such a foundation model, aiming to extract generalized and informative wearable sensing representations. NormWear has been pretrained on a large set of physiological signals, including PPG, ECG, EEG, GSR, and IMU, from various public resources. For a holistic assessment, we perform downstream evaluation on 11 public wearable sensing datasets, spanning 18 applications in the areas of mental health, body state inference, biomarker estimations, and disease risk evaluations. We demonstrate that NormWear achieves a better performance improvement over competitive baselines in general time series foundation modeling. In addition, leveraging a novel representation-alignment-match-based method, we align physiological signals embeddings with text embeddings. This alignment enables our proposed foundation model to perform zero-shot inference, allowing it to generalize to previously unseen wearable signal-based health applications. Finally, we perform nonlinear dynamic analysis on the waveform features extracted by the model at each intermediate layer. This analysis quantifies the model's internal processes, offering clear insights into its behavior and fostering greater trust in its inferences among end users., Comment: The code is available at: http://github.com/Mobile-Sensing-and-UbiComp-Laboratory/NormWear
- Published
- 2024
26. GAMED: Knowledge Adaptive Multi-Experts Decoupling for Multimodal Fake News Detection
- Author
-
Shen, Lingzhi, Long, Yunfei, Cai, Xiaohao, Razzak, Imran, Chen, Guanming, Liu, Kang, and Jameel, Shoaib
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Multimodal fake news detection often involves modelling heterogeneous data sources, such as vision and language. Existing detection methods typically rely on fusion effectiveness and cross-modal consistency to model the content, complicating understanding how each modality affects prediction accuracy. Additionally, these methods are primarily based on static feature modelling, making it difficult to adapt to the dynamic changes and relationships between different data modalities. This paper develops a significantly novel approach, GAMED, for multimodal modelling, which focuses on generating distinctive and discriminative features through modal decoupling to enhance cross-modal synergies, thereby optimizing overall performance in the detection process. GAMED leverages multiple parallel expert networks to refine features and pre-embed semantic knowledge to improve the experts' ability in information selection and viewpoint sharing. Subsequently, the feature distribution of each modality is adaptively adjusted based on the respective experts' opinions. GAMED also introduces a novel classification technique to dynamically manage contributions from different modalities, while improving the explainability of decisions. Experimental results on the Fakeddit and Yang datasets demonstrate that GAMED performs better than recently developed state-of-the-art models. The source code can be accessed at https://github.com/slz0925/GAMED.
- Published
- 2024
27. Fair Primal Dual Splitting Method for Image Inverse Problems
- Author
-
Qu, Yunfei and Han, Deren
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Mathematics - Optimization and Control - Abstract
Image inverse problems have numerous applications, including image processing, super-resolution, and computer vision, which are important areas in image science. These application models can be seen as a three-function composite optimization problem solvable by a variety of primal dual-type methods. We propose a fair primal dual algorithmic framework that incorporates the smooth term not only into the primal subproblem but also into the dual subproblem. We unify the global convergence and establish the convergence rates of our proposed fair primal dual method. Experiments on image denoising and super-resolution reconstruction demonstrate the superiority of the proposed method over the current state-of-the-art.
- Published
- 2024
28. Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
- Author
-
Yu, Qifan, Shen, Zhebei, Yue, Zhongqi, Wu, Yang, Zhang, Wenqiao, Li, Yunfei, Li, Juncheng, Tang, Siliang, and Zhuang, Yueting
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Instruction tuning fine-tunes pre-trained Multi-modal Large Language Models (MLLMs) to handle real-world tasks. However, the rapid expansion of visual instruction datasets introduces data redundancy, leading to excessive computational costs. We propose a collaborative framework, DataTailor, which leverages three key principles--informativeness, uniqueness, and representativeness--for effective data selection. We argue that a valuable sample should be informative of the task, non-redundant, and represent the sample distribution (i.e., not an outlier). We further propose practical ways to score against each principle, which automatically adapts to a given dataset without tedious hyperparameter tuning. Comprehensive experiments on various benchmarks demonstrate that DataTailor achieves 100.8% of the performance of full-data fine-tuning with only 15% of the data, significantly reducing computational costs while maintaining superior results. This exemplifies the "Less is More" philosophy in MLLM development., Comment: 14 pages, 7 figures
- Published
- 2024
29. Gaze into the Pattern: Characterizing Spatial Patterns with Internal Temporal Correlations for Hardware Prefetching
- Author
-
Chen, Zixiao, Wu, Chentao, Gu, Yunfei, Jia, Ranhao, Li, Jie, and Guo, Minyi
- Subjects
Computer Science - Hardware Architecture - Abstract
Hardware prefetching is one of the most widely-used techniques for hiding long data access latency. To address the challenges faced by hardware prefetching, architects have proposed to detect and exploit the spatial locality at the granularity of spatial region. When a new region is activated, they try to find similar previously accessed regions for footprint prediction based on system-level environmental features such as the trigger instruction or data address. However, we find that such context-based prediction cannot capture the essential characteristics of access patterns, leading to limited flexibility, practicality and suboptimal prefetching performance. In this paper, inspired by the temporal property of memory accessing, we note that the temporal correlation exhibited within the spatial footprint is a key feature of spatial patterns. To this end, we propose Gaze, a simple and efficient hardware spatial prefetcher that skillfully utilizes footprint-internal temporal correlations to efficiently characterize spatial patterns. Meanwhile, we observe a unique unresolved challenge in utilizing spatial footprints generated by spatial streaming, which exhibit extremely high access density. Therefore, we further enhance Gaze with a dedicated two-stage approach that mitigates the over-prefetching problem commonly encountered in conventional schemes. Our comprehensive and diverse set of experiments show that Gaze can effectively enhance the performance across a wider range of scenarios. Specifically, Gaze improves performance by 5.7\% and 5.4\% at single-core, 11.4\% and 8.8\% at eight-core, compared to most recent low-cost solutions PMP and vBerti., Comment: Accepted by HPCA 2025
- Published
- 2024
30. Mixed-Precision Quantization: Make the Best Use of Bits Where They Matter Most
- Author
-
Fang, Yiming, Chen, Li, Chen, Yunfei, Wang, Weidong, and You, Changsheng
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Computer Science - Information Theory - Abstract
Mixed-precision quantization offers superior performance to fixed-precision quantization. It has been widely used in signal processing, communication systems, and machine learning. In mixed-precision quantization, bit allocation is essential. Hence, in this paper, we propose a new bit allocation framework for mixed-precision quantization from a search perspective. First, we formulate a general bit allocation problem for mixed-precision quantization. Then we introduce the penalized particle swarm optimization (PPSO) algorithm to address the integer consumption constraint. To improve efficiency and avoid iterations on infeasible solutions within the PPSO algorithm, a greedy criterion particle swarm optimization (GC-PSO) algorithm is proposed. The corresponding convergence analysis is derived based on dynamical system theory. Furthermore, we apply the above framework to some specific classic fields, i.e., finite impulse response (FIR) filters, receivers, and gradient descent. Numerical examples in each application underscore the superiority of the proposed framework to the existing algorithms., Comment: 15 pages, 10 figures
- Published
- 2024
31. WEM-GAN: Wavelet transform based facial expression manipulation
- Author
-
Sun, Dongya, Hu, Yunfei, Zhang, Xianzhe, and Hu, Yingsong
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
Facial expression manipulation aims to change human facial expressions without affecting face recognition. In order to transform the facial expressions to target expressions, previous methods relied on expression labels to guide the manipulation process. However, these methods failed to preserve the details of facial features, which causes the weakening or the loss of identity information in the output image. In our work, we propose WEM-GAN, in short for wavelet-based expression manipulation GAN, which puts more efforts on preserving the details of the original image in the editing process. Firstly, we take advantage of the wavelet transform technique and combine it with our generator with a U-net autoencoder backbone, in order to improve the generator's ability to preserve more details of facial features. Secondly, we also implement the high-frequency component discriminator, and use high-frequency domain adversarial loss to further constrain the optimization of our model, providing the generated face image with more abundant details. Additionally, in order to narrow the gap between generated facial expressions and target expressions, we use residual connections between encoder and decoder, while also using relative action units (AUs) several times. Extensive qualitative and quantitative experiments have demonstrated that our model performs better in preserving identity features, editing capability, and image generation quality on the AffectNet dataset. It also shows superior performance in metrics such as Average Content Distance (ACD) and Expression Distance (ED).
- Published
- 2024
32. MLD-EA: Check and Complete Narrative Coherence by Introducing Emotions and Actions
- Author
-
Zhang, Jinming and Long, Yunfei
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Narrative understanding and story generation are critical challenges in natural language processing (NLP), with much of the existing research focused on summarization and question-answering tasks. While previous studies have explored predicting plot endings and generating extended narratives, they often neglect the logical coherence within stories, leaving a significant gap in the field. To address this, we introduce the Missing Logic Detector by Emotion and Action (MLD-EA) model, which leverages large language models (LLMs) to identify narrative gaps and generate coherent sentences that integrate seamlessly with the story's emotional and logical flow. The experimental results demonstrate that the MLD-EA model enhances narrative understanding and story generation, highlighting LLMs' potential as effective logic checkers in story writing with logical coherence and emotional consistency. This work fills a gap in NLP research and advances border goals of creating more sophisticated and reliable story-generation systems.
- Published
- 2024
33. Al0.68Sc0.32N/SiC based metal-ferroelectric-semiconductor capacitors operating up to 900 C
- Author
-
He, Yunfei, Moore, David C., Wang, Yubo, Ware, Spencer, Ma, Sizhe, Pradhan, Dhiren K., Hu, Zekun, Du, Xingyu, Kennedy, W. Joshua, Glavin, Nicholas R., Olsson III, Roy H., and Jariwala, Deep
- Subjects
Condensed Matter - Materials Science - Abstract
Ferroelectric (FE)-based devices show great promise for non-volatile memory applications, yet few demonstrate reliable operation at elevated temperatures. In this work, we fabricated and characterized metal ferroelectric semiconductor capacitors integrating Aluminum Scandium Nitride onto Silicon Carbide, a prospective high temperature semiconductor for logic operations in extreme environments. The resultant Ni/Al0.68Sc0.32N/4H-SiC structure was evaluated for non-volatile memory performance from room temperature to high-temperature conditions. The 30-nm thick Al0.68Sc0.32N/SiC-based ferroelectric capacitors demonstrated ferroelectric switching at 900 C. The coercive field of the FE layer decreased linearly from -6.4/+11.9 MV cm-1 at room temperature to -3.1/+7.8 MV cm-1 at 800 C. Using positive-up negative-down measurements, we characterized the temperature dependence of remanent polarization. At 600 C, the devices achieved remarkable reliability, demonstrating endurance of ~2000 cycles and retention exceeding 100 hours with negligible polarization loss. Further reliability measurements extended to 800 C with 10,000 secs retention and > 300 endurance cycles, establish these devices as promising candidates for high-temperature memory applications.
- Published
- 2024
34. The Game-Theoretic Symbiosis of Trust and AI in Networked Systems
- Author
-
Ge, Yunfei and Zhu, Quanyan
- Subjects
Computer Science - Artificial Intelligence - Abstract
This chapter explores the symbiotic relationship between Artificial Intelligence (AI) and trust in networked systems, focusing on how these two elements reinforce each other in strategic cybersecurity contexts. AI's capabilities in data processing, learning, and real-time response offer unprecedented support for managing trust in dynamic, complex networks. However, the successful integration of AI also hinges on the trustworthiness of AI systems themselves. Using a game-theoretic framework, this chapter presents approaches to trust evaluation, the strategic role of AI in cybersecurity, and governance frameworks that ensure responsible AI deployment. We investigate how trust, when dynamically managed through AI, can form a resilient security ecosystem. By examining trust as both an AI output and an AI requirement, this chapter sets the foundation for a positive feedback loop where AI enhances network security and the trust placed in AI systems fosters their adoption.
- Published
- 2024
35. Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms
- Author
-
Gao, Minghe, Bu, Wendong, Miao, Bingchen, Wu, Yang, Li, Yunfei, Li, Juncheng, Tang, Siliang, Wu, Qi, Zhuang, Yueting, and Wang, Meng
- Subjects
Computer Science - Multiagent Systems - Abstract
In this paper, we introduce the Generalist Virtual Agent (GVA), an autonomous entity engineered to function across diverse digital platforms and environments, assisting users by executing a variety of tasks. This survey delves into the evolution of GVAs, tracing their progress from early intelligent assistants to contemporary implementations that incorporate large-scale models. We explore both the philosophical underpinnings and practical foundations of GVAs, addressing their developmental challenges and the methodologies currently employed in their design and operation. By presenting a detailed taxonomy of GVA environments, tasks, and capabilities, this paper aims to bridge the theoretical and practical aspects of GVAs, concluding those that operate in environments closely mirroring the real world are more likely to demonstrate human-like intelligence. We discuss potential future directions for GVA research, highlighting the necessity for realistic evaluation metrics and the enhancement of long-sequence decision-making capabilities to advance the field toward more systematic or embodied applications. This work not only synthesizes the existing body of literature but also proposes frameworks for future investigations, contributing significantly to the ongoing development of intelligent systems.
- Published
- 2024
36. WavChat: A Survey of Spoken Dialogue Models
- Author
-
Ji, Shengpeng, Chen, Yifu, Fang, Minghui, Zuo, Jialong, Lu, Jingyu, Wang, Hanting, Jiang, Ziyue, Zhou, Long, Liu, Shujie, Cheng, Xize, Yang, Xiaoda, Wang, Zehan, Yang, Qian, Li, Jian, Jiang, Yidi, He, Jingzhen, Chu, Yunfei, Xu, Jin, and Zhao, Zhou
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Computation and Language ,Computer Science - Machine Learning ,Computer Science - Multimedia ,Computer Science - Sound - Abstract
Recent advancements in spoken dialogue models, exemplified by systems like GPT-4o, have captured significant attention in the speech domain. Compared to traditional three-tier cascaded spoken dialogue models that comprise speech recognition (ASR), large language models (LLMs), and text-to-speech (TTS), modern spoken dialogue models exhibit greater intelligence. These advanced spoken dialogue models not only comprehend audio, music, and other speech-related features, but also capture stylistic and timbral characteristics in speech. Moreover, they generate high-quality, multi-turn speech responses with low latency, enabling real-time interaction through simultaneous listening and speaking capability. Despite the progress in spoken dialogue systems, there is a lack of comprehensive surveys that systematically organize and analyze these systems and the underlying technologies. To address this, we have first compiled existing spoken dialogue systems in the chronological order and categorized them into the cascaded and end-to-end paradigms. We then provide an in-depth overview of the core technologies in spoken dialogue models, covering aspects such as speech representation, training paradigm, streaming, duplex, and interaction capabilities. Each section discusses the limitations of these technologies and outlines considerations for future research. Additionally, we present a thorough review of relevant datasets, evaluation metrics, and benchmarks from the perspectives of training and evaluating spoken dialogue systems. We hope this survey will contribute to advancing both academic research and industrial applications in the field of spoken dialogue systems. The related material is available at https://github.com/jishengpeng/WavChat., Comment: 60 papes, working in progress
- Published
- 2024
37. The surgical safety checklist: a quantitative study on attitudes and barriers among gynecological surgery teams
- Author
-
Junming Gong, Yushan Ma, Yunfei An, Qi Yuan, Yun Li, and Juan Hu
- Subjects
Surgical safety checklist ,Patient safety ,Operating room ,Gynecologists ,Anesthesiologists ,Operating room registered nurses ,Public aspects of medicine ,RA1-1270 - Abstract
Abstract Background Implementation of the surgical safety checklist (SSC) plays a significant role in improving surgical patient safety, but levels of compliance to a SSC implementation by surgical team members vary significantly. We aimed to investigate the factors affecting satisfaction levels of gynecologists, anesthesiologists, and operating room registered nurses (OR-RNs) with SSC implementation. Methods We conducted a survey based on 267 questionnaires completed by 85 gynecologists from 14 gynecological surgery teams, 86 anesthesiologists, and 96 OR-RNs at a hospital in China from March 3 to March 16, 2020. The self-reported questionnaire was used to collect respondent’s demographic information, levels of satisfaction with overall implementation of the SSC and its implementation in each of the three phases of a surgery, namely sign-in, time-out, and sign-out, and reasons for not giving a satisfaction score of 10 to its implementation in all phases. Results The subjective ratings regarding the overall implementation of the SSC between the surgical team members were different significantly. “Too many operations to check” was the primary factor causing gynecologists and anesthesiologists not to assign a score of 10 to sign-in implementation. The OR-RNs gave the lowest score to time-out implementation and 82 (85.42%) did not assign a score of 10 to it. “Surgeon is eager to start for surgery” was recognized as a major factor ranking first by OR-RNs and ranking second by anesthesiologists, and 57 (69.51%) OR-RNs chose “Too many operations to check” as the reason for not giving a score of 10 to time-out implementation. “No one initiates” and “Surgeon is not present for ‘sign out’” were commonly cited as the reasons for not assigning a score of 10 to sign-out implementation. Conclusion Factors affecting satisfaction with SSC implementation were various. These factors might be essentially related to heavy workloads and lack of ability about SSC implementation. It is advisable to reduce surgical team members’ excessive workloads and enhance their understanding of importance of SSC implementation, thereby improving surgical team members’ satisfaction with SSC implementation and facilitating compliance of SSC completion.
- Published
- 2021
- Full Text
- View/download PDF
38. Clinical and immunological characteristics of five patients with immune dysregulation, polyendocrinopathy, enteropathy, X-linked syndrome in China–expanding the atypical phenotypes
- Author
-
Yu Huang, Shuyu Fang, Ting Zeng, Junjie Chen, Lu Yang, Gan Sun, Rongxin Dai, Yunfei An, Xuemei Tang, Ying Dou, Xiaodong Zhao, and Lina Zhou
- Subjects
IPEX syndrome ,primary immunodeficiency disease ,FOXP3 mutations ,regulatory T cells ,atypical phenotypes ,Immunologic diseases. Allergy ,RC581-607 - Abstract
BackgroundImmune dysregulation, polyendocrinopathy, enteropathy, X-linked (IPEX) syndrome is a rare disorder of the immune regulatory system caused by forkhead box P3 (FOXP3) mutations. Abnormal numbers or functions of regulatory T (Treg) cells account for the various autoimmune symptoms. We aimed to explore the molecular genetics and phenotypic spectra of patients with atypical IPEX syndrome in China.MethodsWe analyzed the molecular, clinical and immune phenotype characteristics of five Chinese patients with FOXP3 mutations.ResultsWe summarized the molecular and phenotypic features of five patients with FOXP3 mutations, including two novel mutations. Four of the five patients displayed atypical phenotypes, and one developed immune-related peripheral neuropathy. Three of the five patients showed normal frequencies of Treg cells, but the proportions of subsets of Treg cells, CD4+ T cells and B cells were out of balance.ConclusionsOur report broadens the understanding of the clinical features of atypical IPEX syndrome. Our detailed analyses of the immunological characteristics of these patients enhance the understanding of the possible mechanisms underlying the clinical manifestations.
- Published
- 2022
- Full Text
- View/download PDF
39. Generation of human induced pluripotent stem cell line from peripheral blood mononuclear cells from an activated phosphoinositide 3-kinase δ syndrome patient
- Author
-
Shuyu Fang, Anle Zeng, Qiling Xu, Lina Zhou, Zhiyong Zhang, Yunfei An, and Xiaodong Zhao
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Activated phosphoinositide 3-kinase δ syndrome (APDS) is a rare autosomal dominant primary immunodeficiency disease (PID) which was caused by the acquired mutation of PIK3CD gene. In this study, we generated a human induced pluripotent stem cell (hiPSC) line CHCMUi001-A from the peripheral blood mononuclear cells (PBMCs) of a APDS patient, who has a heterozygous mutation (c.3061 G > A) in the PIK3CD gene. This iPSC line presented a normal karyotype and exhibited characteristics of pluripotent stem cells. This iPSC line can be very useful for not only studying disease mechanisms but also developing new potential clinical treatments for APDS patients.
- Published
- 2022
- Full Text
- View/download PDF
40. A novel FOXP3 mutation in a Chinese child with IPEX‐associated membranous nephropathy
- Author
-
Liwen Tan, Yunfei An, Qin Yang, Haiping Yang, Gaofu Zhang, Qiu Li, and Mo Wang
- Subjects
FOXP3 ,IPEX syndrome ,membranous nephropathy ,Genetics ,QH426-470 - Abstract
Abstract Background Immune dysregulation, polyendocrinopathy, enteropathy, X‐linked (IPEX) syndrome is a monogenic immunodeficiency disease caused by forkhead box protein3 (FOXP3) mutation. The kidney is commonly involved in IPEX syndrome, but there were few studies focusing on renal involvement. Methods Whole‐exome sequencing was used to identify the novel FOXP3 mutation. We collected clinical manifestations, kidney pathology, and gene function of the proband. All the previously published studies with IPEX‐associated renal involvement were reviewed. Results We report a late‐onset Chinese child with IPEX‐associated membranous nephropathy (MN). Type 1 diabetes mellitus and nephrotic‐range proteinuria are the main clinical manifestations. Whole‐exome sequencing shows a novel c.766A > G mutation in the FOXP3 gene. The literature review indicates that renal manifestations include proteinuria, microscopic hematuria, and renal insufficiency. MN is the most common pathological type in children with IPEX, followed by tubulointerstitial nephritis, interstitial nephritis, minimal change nephrotic syndrome, and membranoproliferative glomerulonephritis. Conclusion In summary, we report a novel FOXP3 mutation (c.766A > G) with MN stage II in IPEX. In a literature review, MN is the most common pathological type in children with IPEX and proteinuria is the most prevalent clinical feature. IPEX should be considered in the differential diagnosis of MN patients with related endocrine diseases and immune disorders.
- Published
- 2022
- Full Text
- View/download PDF
41. Direct ladderization of cyclooctatetraene-containing, processable conjugated ladder polymers from annulated bis-zirconacyclopentadienes
- Author
-
Rothenberger, August J, Bergman, Harrison M, Li, He, Qi, Miao, Wang, Yunfei, Liu, Yi, and Tilley, T Don
- Subjects
Macromolecular and Materials Chemistry ,Chemical Sciences ,Chemical sciences - Abstract
Conjugated ladder polymers (CLPs) are difficult yet captivating synthetic targets due to their fully unsaturated fused backbones. Inherent challenges associated with their synthesis often lead to low yields, structural defects, and insoluble products. Here a new method to form CLPs is demonstrated, utilizing a high-yielding dimerization of annulated zirconacyclopentadienes to form cyclooctatetraene (COT) monomer units. The resulting COT-containing polymers form rapidly in a single ladderization step from the bis-zirconacyclopentadiene precursors and display M n up to 29.7 kg mol-1. The polymers represent rare examples of CLPs with negatively curved rings, resulting in the observation of unusual properties. The rigid tub-shaped COT units embedded in the backbone imbue the polymers with microporosity, exhibiting BET surface areas up to 555 m2 g-1. Additionally, the remarkable solubility of these CLPs in organic solvents enables the fabrication of thin films showcasing high dielectric performance with a discharged energy density as high as 6.54 J cm-3 at 650 MV m-1.
- Published
- 2024
42. ADAPT: A Game-Theoretic and Neuro-Symbolic Framework for Automated Distributed Adaptive Penetration Testing
- Author
-
Lei, Haozhe, Ge, Yunfei, and Zhu, Quanyan
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Artificial Intelligence ,Computer Science - Computer Science and Game Theory - Abstract
The integration of AI into modern critical infrastructure systems, such as healthcare, has introduced new vulnerabilities that can significantly impact workflow, efficiency, and safety. Additionally, the increased connectivity has made traditional human-driven penetration testing insufficient for assessing risks and developing remediation strategies. Consequently, there is a pressing need for a distributed, adaptive, and efficient automated penetration testing framework that not only identifies vulnerabilities but also provides countermeasures to enhance security posture. This work presents ADAPT, a game-theoretic and neuro-symbolic framework for automated distributed adaptive penetration testing, specifically designed to address the unique cybersecurity challenges of AI-enabled healthcare infrastructure networks. We use a healthcare system case study to illustrate the methodologies within ADAPT. The proposed solution enables a learning-based risk assessment. Numerical experiments are used to demonstrate effective countermeasures against various tactical techniques employed by adversarial AI.
- Published
- 2024
43. EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations
- Author
-
Li, Jia, Li, Ge, Zhang, Xuanming, Zhao, Yunfei, Dong, Yihong, Jin, Zhi, Li, Binhua, Huang, Fei, and Li, Yongbin
- Subjects
Computer Science - Computation and Language ,Computer Science - Software Engineering - Abstract
How to evaluate Large Language Models (LLMs) in code generation remains an open question. Existing benchmarks have two limitations - data leakage and lack of domain-specific evaluation. The former hurts the fairness of benchmarks, and the latter hinders practitioners from selecting superior LLMs for specific programming domains. To address these two limitations, we propose a new benchmark - EvoCodeBench, which has the following advances: (1) Evolving data. EvoCodeBench will be dynamically updated every period (e.g., 6 months) to avoid data leakage. This paper releases the first version - EvoCodeBench-2403, containing 275 samples from 25 repositories. (2) A domain taxonomy and domain labels. Based on the statistics of open-source communities, we design a programming domain taxonomy consisting of 10 popular domains. Based on the taxonomy, we annotate each sample in EvoCodeBench with a domain label. (3) Domain-specific evaluations. Besides the Pass@k, we compute the Domain-Specific Improvement (DSI) and define LLMs' comfort and strange domains. These evaluations help practitioners select superior LLMs in specific domains and discover the shortcomings of existing LLMs. We evaluate 8 popular LLMs (e.g., gpt-4, DeepSeek Coder) on EvoCodeBench and summarize some insights. EvoCodeBench reveals the actual abilities of these LLMs in real-world repositories. For example, the highest Pass@1 of gpt-4 on EvoCodeBench-2403 is only 20.74%. Besides, we evaluate LLMs in different domains and discover their comfort and strange domains. For example, gpt-4 performs best in most domains but falls behind others in the Internet domain. StarCoder 2-15B unexpectedly performs well in the Database domain and even outperforms 33B LLMs. EvoCodeBench has been released., Comment: Accepted by the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)
- Published
- 2024
44. SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging With Neural Networks Based on Ballistocardiograms
- Author
-
Li, Shuzhen, Chen, Yuxin, Chen, Xuesong, Gao, Ruiyang, Zhang, Yupeng, Yu, Chao, Li, Yunfei, Ye, Ziyi, Huang, Weijun, Yi, Hongliang, Leng, Yue, and Wu, Yi
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Computer Science - Machine Learning - Abstract
Sleep monitoring plays a crucial role in maintaining good health, with sleep staging serving as an essential metric in the monitoring process. Traditional methods, utilizing medical sensors like EEG and ECG, can be effective but often present challenges such as unnatural user experience, complex deployment, and high costs. Ballistocardiography~(BCG), a type of piezoelectric sensor signal, offers a non-invasive, user-friendly, and easily deployable alternative for long-term home monitoring. However, reliable BCG-based sleep staging is challenging due to the limited sleep monitoring data available for BCG. A restricted training dataset prevents the model from generalization across populations. Additionally, transferring to BCG faces difficulty ensuring model robustness when migrating from other data sources. To address these issues, we introduce SleepNetZero, a zero-shot learning based approach for sleep staging. To tackle the generalization challenge, we propose a series of BCG feature extraction methods that align BCG components with corresponding respiratory, cardiac, and movement channels in PSG. This allows models to be trained on large-scale PSG datasets that are diverse in population. For the migration challenge, we employ data augmentation techniques, significantly enhancing generalizability. We conducted extensive training and testing on large datasets~(12393 records from 9637 different subjects), achieving an accuracy of 0.803 and a Cohen's Kappa of 0.718. ZeroSleepNet was also deployed in real prototype~(monitoring pads) and tested in actual hospital settings~(265 users), demonstrating an accuracy of 0.697 and a Cohen's Kappa of 0.589. To the best of our knowledge, this work represents the first known reliable BCG-based sleep staging effort and marks a significant step towards in-home health monitoring., Comment: 25 pages
- Published
- 2024
45. Einstein Probe discovery of EP240408a: a peculiar X-ray transient with an intermediate timescale
- Author
-
Zhang, Wenda, Yuan, Weimin, Ling, Zhixing, Chen, Yong, Rea, Nanda, Rau, Arne, Cai, Zhiming, Cheng, Huaqing, Zelati, Francesco Coti, Dai, Lixin, Hu, Jingwei, Jia, Shumei, Jin, Chichuan, Li, Dongyue, O'Brien, Paul, Shen, Rongfeng, Shu, Xinwen, Sun, Shengli, Sun, Xiaojin, Wang, Xiaofeng, Yang, Lei, Zhang, Bing, Zhang, Chen, Zhang, Shuang-Nan, Zhang, Yonghe, An, Jie, Buckley, David, Coleiro, Alexis, Cordier, Bertrand, Dou, Liming, Eyles-Ferris, Rob, Fan, Zhou, Feng, Hua, Fu, Shaoyu, Fynbo, Johan P. U., Galbany, Lluis, Jha, Saurabh W., Jiang, Shuaiqing, Kong, Albert, Kuulkers, Erik, Lei, Weihua, Li, Wenxiong, Liu, Bifang, Liu, Mingjun, Liu, Xing, Liu, Yuan, Liu, Zhu, Maitra, Chandreyee, Marino, Alessio, Monageng, Itumeleng, Nandra, Kirpal, Sanders, Jeremy, Soria, Roberto, Tao, Lian, Wang, Junfeng, Wang, Song, Wang, Tinggui, Wang, Zhongxiang, Wu, Qingwen, Wu, Xuefeng, Xu, Dong, Xu, Yanjun, Xue, Suijian, Xue, Yongquan, Zhang, Zijian, Zhu, Zipei, Zou, Hu, Bao, Congying, Chen, Fansheng, Chen, Houlei, Chen, Tianxiang, Chen, Wei, Chen, Yehai, Chen, Yifan, Cui, Chenzhou, Cui, Weiwei, Dai, Yanfeng, Fan, Dongwei, Guan, Ju, Han, Dawei, Hou, Dongjie, Hu, Haibo, Huang, Maohai, Huo, Jia, Jia, Zhenqing, Jiang, Bowen, Jin, Ge, Li, Chengkui, Li, Junfei, Li, Longhui, Li, Maoshun, Li, Wei, Li, Zhengda, Lian, Tianying, Liu, Congzhan, Liu, Heyang, Liu, Huaqiu, Lu, Fangjun, Luo, Laidan, Ma, Jia, Mao, Xuan, Pan, Haiwu, Pan, Xin, Song, Liming, Sun, Hui, Tan, Yunyin, Tang, Qingjun, Tao, Yihan, Wang, Hao, Wang, Juan, Wang, Lei, Wang, Wenxin, Wang, Yilong, Wang, Yusa, Wu, Qinyu, Xu, Haitao, Xu, Jingjing, Xu, Xinpeng, Xu, Yunfei, Xu, Zhao, Xue, Changbin, Xue, Yulong, Yan, Ailiang, Yang, Haonan, Yang, Xiongtao, Yang, Yanji, Zhang, Juan, Zhang, Mo, Zhang, Wenjie, Zhang, Zhen, Zhang, Ziliang, Zhao, Donghua, Zhao, Haisheng, Zhao, Xiaofan, Zhao, Zijian, Zhou, Hongyan, Zhou, Yilin, Zhu, Yuxuan, and Zhu, Zhencai
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
We report the discovery of a peculiar X-ray transient, EP240408a, by Einstein Probe (EP) and follow-up studies made with EP, Swift, NICER, GROND, ATCA and other ground-based multi-wavelength telescopes. The new transient was first detected with Wide-field X-ray Telescope (WXT) on board EP on April 8th, 2024, manifested in an intense yet brief X-ray flare lasting for 12 seconds. The flare reached a peak flux of 3.9x10^(-9) erg/cm2/s in 0.5-4 keV, about 300 times brighter than the underlying X-ray emission detected throughout the observation. Rapid and more precise follow-up observations by EP/FXT, Swift and NICER confirmed the finding of this new transient. Its X-ray spectrum is non-thermal in 0.5-10 keV, with a power-law photon index varying within 1.8-2.5. The X-ray light curve shows a plateau lasting for about 4 days, followed by a steep decay till becoming undetectable about 10 days after the initial detection. Based on its temporal property and constraints from previous EP observations, an unusual timescale in the range of 7-23 days is found for EP240408a, which is intermediate between the commonly found fast and long-term transients. No counterparts have been found in optical and near-infrared, with the earliest observation at 17 hours after the initial X-ray detection, suggestive of intrinsically weak emission in these bands. We demonstrate that the remarkable properties of EP240408a are inconsistent with any of the transient types known so far, by comparison with, in particular, jetted tidal disruption events, gamma-ray bursts, X-ray binaries and fast blue optical transients. The nature of EP240408a thus remains an enigma. We suggest that EP240408a may represent a new type of transients with intermediate timescales of the order of about 10 days. The detection and follow-ups of more of such objects are essential for revealing their origin., Comment: 25 pages, 11 figures
- Published
- 2024
- Full Text
- View/download PDF
46. Sensing-Communication-Computing-Control Closed-Loop Optimization for 6G Unmanned Robotic Systems
- Author
-
Fang, Xinran, Lei, Chengleyang, Feng, Wei, Chen, Yunfei, Xiao, Ming, Ge, Ning, and Wang, Chengxiang
- Subjects
Electrical Engineering and Systems Science - Systems and Control ,Electrical Engineering and Systems Science - Signal Processing - Abstract
Rapid advancements in field robots have brought a new kind of cyber physical system (CPS)--unmanned robotic system--under the spotlight. In the upcoming sixth-generation (6G) era, these systems hold great potential to replace humans in hazardous tasks. This paper investigates an unmanned robotic system comprising a multi-functional unmanned aerial vehicle (UAV), sensors, and actuators. The UAV carries communication and computing modules, acting as an edge information hub (EIH) that transfers and processes information. During the task execution, the EIH gathers sensing data, calculates control commands, and transmits commands to actuators--leading to reflex-arc-like sensing-communication-computing-control ($\mathbf{SC}^3$) loops. Unlike existing studies that design $\mathbf{SC}^3$ loop components separately, we take each $\mathbf{SC}^3$ loop as an integrated structure and propose a goal-oriented closed-loop optimization scheme. This scheme jointly optimizes uplink and downlink (UL&DL) communication and computing within and across the $\mathbf{SC}^3$ loops to minimize the total linear quadratic regulator (LQR) cost. We derive optimal closed-form solutions for intra-loop allocation and propose an efficient iterative algorithm for inter-loop optimization. Under the condition of adequate CPU frequency availability, we derive an approximate closed-form solution for inter-loop bandwidth allocation. Simulation results demonstrate that the proposed scheme achieves a two-tier task-level balance within and across $\mathbf{SC}^3$ loops.
- Published
- 2024
47. Structured Connectivity for 6G Reflex Arc: Task-Oriented Virtual User and New Uplink-Downlink Tradeoff
- Author
-
Fang, Xinran, Lei, Chengleyang, Feng, Wei, Chen, Yunfei, Ge, Ning, and Jin, Shi
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
To accommodate the evolving demands of unmanned operations, the future sixth-generation (6G) network will support not only communication links but also sensing-communication-computing-control ($\mathbf{SC}^3$) loops. In each $\mathbf{SC}^3$ cycle, the sensor uploads sensing data to the computing center, and the computing center calculates the control command and sends it to the actuator to take action. To maintain the task-level connections between the sensor-computing center link and the computing center-actuator link, we propose to treat the sensor and actuator as a virtual user. In this way, the two communication links of the $\mathbf{SC}^3$ loop become the uplink and downlink (UL&DL) of the virtual user. Based on the virtual user, we propose a task-oriented UL&DL optimization scheme. This scheme jointly optimizes UL&DL transmit power, time, bandwidth, and CPU frequency to minimize the control linear quadratic regulator (LQR) cost. We decouple the complex problem into a convex UL&DL bandwidth allocation problem with the closed-form solution for the optimal time allocation. Simulation results demonstrate that the proposed scheme achieves a task-level balance between the UL&DL, surpassing conventional communication schemes that optimize each link separately.
- Published
- 2024
48. Similarity-Dissimilarity Loss with Supervised Contrastive Learning for Multi-label Classification
- Author
-
Huang, Guangming, Long, Yunfei, Luo, Cunjin, and Liu, Sheng
- Subjects
Computer Science - Machine Learning ,Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Supervised contrastive learning has been explored in making use of label information for multi-label classification, but determining positive samples in multi-label scenario remains challenging. Previous studies have examined strategies for identifying positive samples, considering label overlap proportion between anchors and samples. However, they ignore various relations between given anchors and samples, as well as how to dynamically adjust the weights in contrastive loss functions based on different relations, leading to great ambiguity. In this paper, we introduce five distinct relations between multi-label samples and propose a Similarity-Dissimilarity Loss with contrastive learning for multi-label classification. Our loss function re-weights the loss by computing the similarity and dissimilarity between positive samples and a given anchor based on the introduced relations. We mainly conduct experiments for multi-label text classification on MIMIC datasets, then further extend the evaluation on MS-COCO. The Experimental results show that our proposed loss effectively improves the performance on all encoders under supervised contrastive learning paradigm, demonstrating its effectiveness and robustness.
- Published
- 2024
49. Bulk electricity storage in 1-nm water channels
- Author
-
Artemov, Vasily, Babiy, Svetlana, Teng, Yunfei, Ma, Jiaming, Ryzhov, Alexander, Chen, Tzu-Heng, Navratilova, Lucie, Boureau, Victor, Schouwink, Pascal, Liseanskaia, Mariia, Huber, Patrick, Brushett, Fikile, Laloui, Lyesse, Tagliabue, Giulia, and Radenovic, Aleksandra
- Subjects
Condensed Matter - Soft Condensed Matter - Abstract
When water is confined within walls only a few molecular diameters apart, it displays unique behaviors that differ significantly from bulk water. This confinement reveals fascinating mechanical, thermodynamic, and dielectric anomalies. Nature has effectively used the confinement "trick" to achieve superior functionalities with abundant elements and water, avoiding scarce materials. The challenge, however, is to replicate this principle in scalable artificial device engineering. Here, we introduce the "blue battery", a scalable supercapacitive device utilizing pure water confined in 1-nm clay channels as its sole electrolyte. Made entirely from Earth-abundant materials via scalable nano-engineering, it preserves nearly 100% coulombic efficiency over 60,000 charge-discharge cycles, operates at voltages up to 1.65 V, and delivers competitive power and energy densities. Thus, achieving a high degree of sustainability via just the confinement effect, our concept establishes a versatile blueprint for environmentally neutral technologies, enabling the design of other "blue devices" for micro- to bulk-scale energy storage applications, even in extreme environments like Mars. Our research opens possibilities for environmentally neutral energy solutions inspired by nature., Comment: Main text: 17 pages, 4 figures, 55 references. Supplementary: 28 pages, 26 figures, 3 tables, 3 references
- Published
- 2024
50. DPL: Cross-quality DeepFake Detection via Dual Progressive Learning
- Author
-
Zhang, Dongliang, Li, Yunfei, Zhou, Jiaran, and Li, Yuezun
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Real-world DeepFake videos often undergo various compression operations, resulting in a range of video qualities. These varying qualities diversify the pattern of forgery traces, significantly increasing the difficulty of DeepFake detection. To address this challenge, we introduce a new Dual Progressive Learning (DPL) framework for cross-quality DeepFake detection. We liken this task to progressively drilling for underground water, where low-quality videos require more effort than high-quality ones. To achieve this, we develop two sequential-based branches to "drill waters" with different efforts. The first branch progressively excavates the forgery traces according to the levels of video quality, i.e., time steps, determined by a dedicated CLIP-based indicator. In this branch, a Feature Selection Module is designed to adaptively assign appropriate features to the corresponding time steps. Considering that different techniques may introduce varying forgery traces within the same video quality, we design a second branch targeting forgery identifiability as complementary. This branch operates similarly and shares the feature selection module with the first branch. Our design takes advantage of the sequential model where computational units share weights across different time steps and can memorize previous progress, elegantly achieving progressive learning while maintaining reasonable memory costs. Extensive experiments demonstrate the superiority of our method for cross-quality DeepFake detection., Comment: ACCV 2024
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.