Author: "Zhu, Lei" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhu, Lei"' showing total 18,275 results

Start Over Author "Zhu, Lei"

18,275 results on '"Zhu, Lei"'

51. Beyond Text: Frozen Large Language Models in Visual Signal Comprehension

Author: Zhu, Lei, Wei, Fangyun, and Lu, Yanye
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this work, we investigate the potential of a large language model (LLM) to directly comprehend visual signals without the necessity of fine-tuning on multi-modal datasets. The foundational concept of our method views an image as a linguistic entity, and translates it to a set of discrete words derived from the LLM's vocabulary. To achieve this, we present the Vision-to-Language Tokenizer, abbreviated as V2T Tokenizer, which transforms an image into a ``foreign language'' with the combined aid of an encoder-decoder, the LLM vocabulary, and a CLIP model. With this innovative image encoding, the LLM gains the ability not only for visual comprehension but also for image denoising and restoration in an auto-regressive fashion-crucially, without any fine-tuning. We undertake rigorous experiments to validate our method, encompassing understanding tasks like image recognition, image captioning, and visual question answering, as well as image denoising tasks like inpainting, outpainting, deblurring, and shift restoration. Code and models are available at https://github.com/zh460045050/V2L-Tokenizer., Comment: Accepted by CVPR 2024
Published: 2024

52. Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal

Author: Yang, Yijun, Wu, Hongtao, Aviles-Rivero, Angelica I., Zhang, Yulun, Qin, Jing, and Zhu, Lei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Real-world vision tasks frequently suffer from the appearance of unexpected adverse weather conditions, including rain, haze, snow, and raindrops. In the last decade, convolutional neural networks and vision transformers have yielded outstanding results in single-weather video removal. However, due to the absence of appropriate adaptation, most of them fail to generalize to other weather conditions. Although ViWS-Net is proposed to remove adverse weather conditions in videos with a single set of pre-trained weights, it is seriously blinded by seen weather at train-time and degenerates when coming to unseen weather during test-time. In this work, we introduce test-time adaptation into adverse weather removal in videos, and propose the first framework that integrates test-time adaptation into the iterative diffusion reverse process. Specifically, we devise a diffusion-based network with a novel temporal noise model to efficiently explore frame-correlated information in degraded video clips at training stage. During inference stage, we introduce a proxy task named Diffusion Tubelet Self-Calibration to learn the primer distribution of test video stream and optimize the model by approximating the temporal noise model for online adaptation. Experimental results, on benchmark datasets, demonstrate that our Test-Time Adaptation method with Diffusion-based network(Diff-TTA) outperforms state-of-the-art methods in terms of restoring videos degraded by seen weather conditions. Its generalizable capability is also validated with unseen weather conditions in both synthesized and real-world videos.
Published: 2024

53. Agile Multi-Source-Free Domain Adaptation

Author: Li, Xinyao, Li, Jingjing, Li, Fengling, Zhu, Lei, and Lu, Ke
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Efficiently utilizing rich knowledge in pretrained models has become a critical topic in the era of large models. This work focuses on adaptively utilizing knowledge from multiple source-pretrained models to an unlabeled target domain without accessing the source data. Despite being a practically useful setting, existing methods require extensive parameter tuning over each source model, which is computationally expensive when facing abundant source domains or larger source models. To address this challenge, we propose a novel approach which is free of the parameter tuning over source backbones. Our technical contribution lies in the Bi-level ATtention ENsemble (Bi-ATEN) module, which learns both intra-domain weights and inter-domain ensemble weights to achieve a fine balance between instance specificity and domain consistency. By slightly tuning source bottlenecks, we achieve comparable or even superior performance on a challenging benchmark DomainNet with less than 3% trained parameters and 8 times of throughput compared with SOTA method. Furthermore, with minor modifications, the proposed module can be easily equipped to existing methods and gain more than 4% performance boost. Code is available at https://github.com/TL-UESTC/Bi-ATEN., Comment: Accepted to AAAI2024
Published: 2024

54. Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation

Author: Du, Zhekai, Li, Xinyao, Li, Fengling, Lu, Ke, Zhu, Lei, and Li, Jingjing
Subjects: Computer Science - Artificial Intelligence
Abstract: Conventional Unsupervised Domain Adaptation (UDA) strives to minimize distribution discrepancy between domains, which neglects to harness rich semantics from data and struggles to handle complex domain shifts. A promising technique is to leverage the knowledge of large-scale pre-trained vision-language models for more guided adaptation. Despite some endeavors, current methods often learn textual prompts to embed domain semantics for source and target domains separately and perform classification within each domain, limiting cross-domain knowledge transfer. Moreover, prompting only the language branch lacks flexibility to adapt both modalities dynamically. To bridge this gap, we propose Domain-Agnostic Mutual Prompting (DAMP) to exploit domain-invariant semantics by mutually aligning visual and textual embeddings. Specifically, the image contextual information is utilized to prompt the language branch in a domain-agnostic and instance-conditioned way. Meanwhile, visual prompts are imposed based on the domain-agnostic textual prompt to elicit domain-invariant visual embeddings. These two branches of prompts are learned mutually with a cross-attention module and regularized with a semantic-consistency loss and an instance-discrimination contrastive loss. Experiments on three UDA benchmarks demonstrate the superiority of DAMP over state-of-the-art approaches.
Published: 2024

55. Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

Author: Chen, Haoyu, Li, Wenbo, Gu, Jinjin, Ren, Jingjing, Sun, Haoze, Zou, Xueyi, Zhang, Zhensong, Yan, Youliang, and Zhu, Lei
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: For image super-resolution (SR), bridging the gap between the performance on synthetic datasets and real-world degradation scenarios remains a challenge. This work introduces a novel "Low-Res Leads the Way" (LWay) training framework, merging Supervised Pre-training with Self-supervised Learning to enhance the adaptability of SR models to real-world images. Our approach utilizes a low-resolution (LR) reconstruction network to extract degradation embeddings from LR images, merging them with super-resolved outputs for LR reconstruction. Leveraging unseen LR images for self-supervised learning guides the model to adapt its modeling space to the target domain, facilitating fine-tuning of SR models without requiring paired high-resolution (HR) images. The integration of Discrete Wavelet Transform (DWT) further refines the focus on high-frequency details. Extensive evaluations show that our method significantly improves the generalization and detail restoration capabilities of SR models on unseen real-world datasets, outperforming existing methods. Our training regime is universally compatible, requiring no network architecture modifications, making it a practical solution for real-world SR applications., Comment: Accepted to CVPR 2024
Published: 2024

56. Carbon dioxide emissions from global overseas coal-fired power plants

Author: Guo, Peng, Shen, Huizhong, Chen, Yilin, Dai, Hancheng, Mai, Zelin, Xu, Ruibin, Zhang, Ruixin, Wang, Zhanxiang, He, Jinling, Zheng, Lianming, Sun, Haitong Zhe, Ke, Kainan, Meng, Jing, Liu, Maodian, Li, Jin, Adalibieke, Wulahati, Wang, Chen, Ye, Jianhuai, Zhu, Lei, Shen, Guofeng, Fu, Tzung-May, Tsang, Albert, Yang, Xin, Russell, Armistead G., Driscoll, Charles T., and Tao, Shu
Published: 2024
Full Text: View/download PDF

57. Adaptive and flexible ℓ1-norm graph embedding for unsupervised feature selection

Author: Jiang, Kun, Cao, Ting, Zhu, Lei, and Sun, Qindong
Published: 2024
Full Text: View/download PDF

58. Construction of a Coal Chemical Industry Park with Zero Carbon Emission by Integrating Renewable Energy Based on Life Cycle Analysis

Author: Zhu, Lei, Wang, Shuai, Wu, Le, Kang, Lixia, and Liu, Yongzhong
Published: 2024
Full Text: View/download PDF

59. Masked frequency-color fusion network for video instance-level hazy lane detection

Author: Liu, Ye, Zhu, Lei, Wan, Liang, and Wang, Xing
Published: 2024
Full Text: View/download PDF

60. Lumbar localized fat distribution parameters are independent predictors of osteoporotic vertebral compression re-fractures (OVCRFs) following Percutaneous Kyphoplasty (PKP): a retrospective matched case–control study

Author: Zhang, Fu-Yu, Zhu, Lei, Shi, Hang, Wang, Feng, Chen, Lu, Zhang, Zi-Jian, Jiang, Zan-Li, Yao, Jie, and Wu, Xiao-Tao
Published: 2024
Full Text: View/download PDF

61. Structural basis for the dynamic chaperoning of disordered clients by Hsp90

Author: Qu, Xiaozhan, Zhao, Shuo, Wan, Chanjuan, Zhu, Lei, Ji, Tuo, Rossi, Paolo, Wang, Junfeng, Kalodimos, Charalampos G., Wang, Chao, Xu, Weiya, and Huang, Chengdong
Published: 2024
Full Text: View/download PDF

62. Impact of EnKF assimilating Himawari-8 all-sky infrared radiance on the forecasting of a warm-sector rainstorm event

Author: Lou, Shanshan, Zhu, Lei, Qiu, Xuexing, Chen, Guangzhou, Yuan, Song, and Zhou, Shengnan
Published: 2024
Full Text: View/download PDF

63. Multiple-source distribution deep adaptive feature norm network for EEG emotion recognition

Author: Zhu, Lei, Yu, Fei, Ding, Wangpan, Huang, Aiai, Ying, Nanjiao, and Zhang, Jianhai
Published: 2024
Full Text: View/download PDF

64. Ran in Procambarus clarkii: molecular characterization and immune function

Author: Gu, Yanlong, Zhao, Tong, Wang, Xinru, Hou, Libo, Li, Hao, Zhu, Lei, and Kong, Xianghui
Published: 2024
Full Text: View/download PDF

65. Progress of organic photovoltaics towards 20% efficiency

Author: Zhu, Lei, Zhang, Ming, Zhou, Zichun, Zhong, Wenkai, Hao, Tianyu, Xu, Shengjie, Zeng, Rui, Zhuang, Jiaxing, Xue, Xiaonan, Jing, Hao, Zhang, Yongming, and Liu, Feng
Published: 2024
Full Text: View/download PDF

66. Impact of the COVID-19 Pandemic on the Incidence of Notifiable Infectious Diseases in China Based on SARIMA Models Between 2013 and 2021

Author: Liu, Jingwen, Zeng, Wu, Zhuo, Chao, Liu, Yu, Zhu, Lei, and Zou, Guanyang
Published: 2024
Full Text: View/download PDF

67. Covalent triazine frameworks materials for photo- and electro-catalysis

Author: Liang, Aoji, Li, Wenbin, Li, Anbai, Peng, Hui, Ma, Guofu, Zhu, Lei, Lei, Ziqiang, and Xu, Yuxi
Published: 2024
Full Text: View/download PDF

68. Road feature enhancement network for remote sensing images based on DeepLabV3Plus

Author: Dong, Liang, Zhu, Enci, Zhu, Lei, Wang, Quanxing, and Du, Wenchen
Published: 2024
Full Text: View/download PDF

69. Rab proteins in fish and crustaceans: an overview

Author: Zhu, Lei, Gu, Yanlong, Kong, Yiming, Wang, Xinru, Li, Hao, Hou, Libo, and Kong, Xianghui
Published: 2024
Full Text: View/download PDF

70. Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label

Author: Zhang, Xinliang, Zhu, Lei, He, Hangzhou, Jin, Lujia, and Lu, Yanye
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Scribble-based weakly-supervised semantic segmentation using sparse scribble supervision is gaining traction as it reduces annotation costs when compared to fully annotated alternatives. Existing methods primarily generate pseudo-labels by diffusing labeled pixels to unlabeled ones with local cues for supervision. However, this diffusion process fails to exploit global semantics and class-specific cues, which are important for semantic segmentation. In this study, we propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision. Directly adopting pseudo-labels might misguide the segmentation model, thus we design a localization rectification module to correct foreground representations in the feature space. To further combine the advantages of both supervisions, we also introduce a distance entropy loss for uncertainty reduction, which adapts per-pixel confidence weights according to the reliable region determined by the scribble and pseudo-label's boundary. Experiments on the ScribbleSup dataset with different qualities of scribble annotations outperform all the previous methods, demonstrating the superiority and robustness of our method.The code is available at https://github.com/Zxl19990529/Class-driven-Scribble-Promotion-Network.
Published: 2024

71. OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

Author: Engelmann, Francis, Takmaz, Ayca, Schult, Jonas, Fedele, Elisabetta, Wald, Johanna, Peng, Songyou, Wang, Xi, Litany, Or, Tang, Siyu, Tombari, Federico, Pollefeys, Marc, Guibas, Leonidas, Tian, Hongbo, Wang, Chunjie, Yan, Xiaosheng, Wang, Bingwen, Zhang, Xuanyang, Liu, Xiao, Nguyen, Phuc, Nguyen, Khoi, Tran, Anh, Pham, Cuong, Huang, Zhening, Wu, Xiaoyang, Chen, Xi, Zhao, Hengshuang, Zhu, Lei, and Lasenby, Joan
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023. The goal of this workshop series is to provide a platform for exploration and discussion of open-vocabulary 3D scene understanding tasks, including but not limited to segmentation, detection and mapping. We provide an overview of the challenge hosted at the workshop, present the challenge dataset, the evaluation methodology, and brief descriptions of the winning methods. For additional details, please see https://opensun3d.github.io/index_iccv23.html., Comment: Our OpenSUN3D workshop website for ICCV 2023: https://opensun3d.github.io/index_iccv23.html
Published: 2024

72. RelayAttention for Efficient Large Language Model Serving with Long System Prompts

Author: Zhu, Lei, Wang, Xinjiang, Zhang, Wayne, and Lau, Rynson W. H.
Subjects: Computer Science - Computation and Language
Abstract: A practical large language model (LLM) service may involve a long system prompt, which specifies the instructions, examples, and knowledge documents of the task and is reused across requests. However, the long system prompt causes throughput/latency bottlenecks as the cost of generating the next token grows w.r.t. the sequence length. This paper aims to improve the efficiency of LLM services that involve long system prompts. Our key observation is that handling these system prompts requires heavily redundant memory accesses in existing causal attention computation algorithms. Specifically, for batched requests, the cached hidden states (\ie, key-value pairs) of system prompts are transferred from off-chip DRAM to on-chip SRAM multiple times, each corresponding to an individual request. To eliminate such a redundancy, we propose RelayAttention, an attention algorithm that allows reading these hidden states from DRAM exactly once for a batch of input tokens. RelayAttention is a free lunch: it maintains the generation quality while requiring no model retraining, as it is based on a mathematical reformulation of causal attention. We have observed significant performance improvements to a production-level system, vLLM, through integration with RelayAttention. The improvements are even more profound with longer system prompts., Comment: accepted by the ACL 2024 main conference
Published: 2024

73. Data and Physics driven Deep Learning Models for Fast MRI Reconstruction: Fundamentals and Methodologies

Author: Huang, Jiahao, Wu, Yinzhe, Wang, Fanwen, Fang, Yingying, Nan, Yang, Alkan, Cagan, Abraham, Daniel, Liao, Congyu, Xu, Lei, Gao, Zhifan, Wu, Weiwen, Zhu, Lei, Chen, Zhaolin, Lally, Peter, Bangerter, Neal, Setsompop, Kawin, Guo, Yike, Rueckert, Daniel, Wang, Ge, and Yang, Guang
Subjects: Electrical Engineering and Systems Science - Signal Processing
Abstract: Magnetic Resonance Imaging (MRI) is a pivotal clinical diagnostic tool, yet its extended scanning times often compromise patient comfort and image quality, especially in volumetric, temporal and quantitative scans. This review elucidates recent advances in MRI acceleration via data and physics-driven models, leveraging techniques from algorithm unrolling models, enhancement-based methods, and plug-and-play models to the emerging full spectrum of generative model-based methods. We also explore the synergistic integration of data models with physics-based insights, encompassing the advancements in multi-coil hardware accelerations like parallel imaging and simultaneous multi-slice imaging, and the optimization of sampling patterns. We then focus on domain-specific challenges and opportunities, including image redundancy exploitation, image integrity, evaluation metrics, data heterogeneity, and model generalization. This work also discusses potential solutions and future research directions, with an emphasis on the role of data harmonization and federated learning for further improving the general applicability and performance of these methods in MRI reconstruction., Comment: Accepted by IEEE Reviews in Biomedical Engineering (RBME)
Published: 2024
Full Text: View/download PDF

74. An objective comparison of methods for augmented reality in laparoscopic liver resection by preoperative-to-intraoperative image fusion

Author: Ali, Sharib, Espinel, Yamid, Jin, Yueming, Liu, Peng, Güttner, Bianca, Zhang, Xukun, Zhang, Lihua, Dowrick, Tom, Clarkson, Matthew J., Xiao, Shiting, Wu, Yifan, Yang, Yijun, Zhu, Lei, Sun, Dai, Li, Lan, Pfeiffer, Micha, Farid, Shahid, Maier-Hein, Lena, Buc, Emmanuel, and Bartoli, Adrien
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Graphics, Computer Science - Machine Learning
Abstract: Augmented reality for laparoscopic liver resection is a visualisation mode that allows a surgeon to localise tumours and vessels embedded within the liver by projecting them on top of a laparoscopic image. Preoperative 3D models extracted from CT or MRI data are registered to the intraoperative laparoscopic images during this process. In terms of 3D-2D fusion, most of the algorithms make use of anatomical landmarks to guide registration. These landmarks include the liver's inferior ridge, the falciform ligament, and the occluding contours. They are usually marked by hand in both the laparoscopic image and the 3D model, which is time-consuming and may contain errors if done by a non-experienced user. Therefore, there is a need to automate this process so that augmented reality can be used effectively in the operating room. We present the Preoperative-to-Intraoperative Laparoscopic Fusion Challenge (P2ILF), held during the Medical Imaging and Computer Assisted Interventions (MICCAI 2022) conference, which investigates the possibilities of detecting these landmarks automatically and using them in registration. The challenge was divided into two tasks: 1) A 2D and 3D landmark detection task and 2) a 3D-2D registration task. The teams were provided with training data consisting of 167 laparoscopic images and 9 preoperative 3D models from 9 patients, with the corresponding 2D and 3D landmark annotations. A total of 6 teams from 4 countries participated, whose proposed methods were evaluated on 16 images and two preoperative 3D models from two patients. All the teams proposed deep learning-based methods for the 2D and 3D landmark segmentation tasks and differentiable rendering-based methods for the registration task. Based on the experimental outcomes, we propose three key hypotheses that determine current limitations and future directions for research in this domain., Comment: 24 pages
Published: 2024

75. On the Scarcity of Dense Cores ($n>10^{5}$ cm$^{-3}$) in High Latitude Planck Galactic Cold Clumps

Author: Xu, Fengwei, Wang, Ke, Liu, Tie, Eden, David, Liu, Xunchuan, Juvela, Mika, He, Jinhua, Johnstone, Doug, Goldsmith, Paul, Garay, Guido, Wu, Yuefang, Soam, Archana, Traficante, Alessio, Ristorcelli, Isabelle, Falgarone, Edith, Chen, Huei-Ru Vivien, Hirano, Naomi, Doi, Yasuo, Kwon, Woojin, White, Glenn J., Whitworth, Anthony, Sanhueza, Patricio, Rawlings, Mark G., Alina, Dana, Ren, Zhiyuan, Lee, Chang Won, Tatematsu, Ken'ichi, Zhang, Chuan-Peng, Zhou, Jianjun, Lai, Shih-Ping, Ward-Thompson, Derek, Liu, Sheng-Yuan, Gu, Qilao, Chakali, Eswaraiah, Zhu, Lei, Mardones, Diego, and Tóth, L. Viktor
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Solar and Stellar Astrophysics
Abstract: High-latitude ($|b|>30^{\circ}$) molecular clouds have virial parameters that exceed 1, but whether these clouds can form stars has not been studied systematically. Using JCMT SCUBA-2 archival data, we surveyed 70 fields that target high-latitude Planck galactic cold clumps (HLPCs) to find dense cores with density of $10^{5}$-$10^{6}$ cm$^{-3}$ and size of $<0.1$ pc. The sample benefits from both the representativeness of the parent sample and covering densest clumps at the high column density end ($>1\times10^{21}$ cm$^{-2}$). At an average noise rms of 15 mJy/beam, we detected Galactic dense cores in only one field, G6.04+36.77 (L183), while also identifying 12 extragalactic objects and two young stellar objects. Compared to the low-latitude clumps, dense cores are scarce in HLPCs. With synthetic observations, the densities of cores are constrained to be $n_c\lesssim10^5$ cm$^{-3}$, should they exist in HLPCs. Low-latitude clumps, Taurus clumps, and HLPCs form a sequence where a higher virial parameter corresponds to a lower dense core detection rate. If HLPCs were affected by the Local Bubble, the scarcity should favor turbulence-inhibited rather than supernova-driven star formation. Studies of the formation mechanism of the L183 molecular cloud are warranted., Comment: 9 pages for the main text. 4 figures, 1 table. Published in Astrophysical Journal Letter
Published: 2024
Full Text: View/download PDF

76. Vivim: a Video Vision Mamba for Medical Video Segmentation

Author: Yang, Yijun, Xing, Zhaohu, Yu, Lequan, Huang, Chunwang, Fu, Huazhu, and Zhu, Lei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Medical video segmentation gains increasing attention in clinical practice due to the redundant dynamic references in video frames. However, traditional convolutional neural networks have a limited receptive field and transformer-based networks are mediocre in constructing long-term dependency from the perspective of computational complexity. This bottleneck poses a significant challenge when processing longer sequences in medical video analysis tasks using available devices with limited memory. Recently, state space models (SSMs), famous by Mamba, have exhibited impressive achievements in efficient long sequence modeling, which develops deep neural networks by expanding the receptive field on many vision tasks significantly. Unfortunately, vanilla SSMs failed to simultaneously capture causal temporal cues and preserve non-casual spatial information. To this end, this paper presents a Video Vision Mamba-based framework, dubbed as Vivim, for medical video segmentation tasks. Our Vivim can effectively compress the long-term spatiotemporal representation into sequences at varying scales with our designed Temporal Mamba Block. We also introduce an improved boundary-aware affine constraint across frames to enhance the discriminative ability of Vivim on ambiguous lesions. Extensive experiments on thyroid segmentation, breast lesion segmentation in ultrasound videos, and polyp segmentation in colonoscopy videos demonstrate the effectiveness and efficiency of our Vivim, superior to existing methods. The code is available at: https://github.com/scott-yjyang/Vivim. The dataset will be released once accepted.
Published: 2024

77. SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

Author: Xing, Zhaohu, Ye, Tian, Yang, Yijun, Liu, Guang, and Zhu, Lei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The Transformer architecture has shown a remarkable ability in modeling global relationships. However, it poses a significant computational challenge when processing high-dimensional medical images. This hinders its development and widespread adoption in this task. Mamba, as a State Space Model (SSM), recently emerged as a notable manner for long-range dependencies in sequential modeling, excelling in natural language processing filed with its remarkable memory efficiency and computational speed. Inspired by its success, we introduce SegMamba, a novel 3D medical image \textbf{Seg}mentation \textbf{Mamba} model, designed to effectively capture long-range dependencies within whole volume features at every scale. Our SegMamba, in contrast to Transformer-based methods, excels in whole volume feature modeling from a state space model standpoint, maintaining superior processing speed, even with volume features at a resolution of {$64\times 64\times 64$}. Comprehensive experiments on the BraTS2023 dataset demonstrate the effectiveness and efficiency of our SegMamba. The code for SegMamba is available at: https://github.com/ge-xing/SegMamba, Comment: Code has released
Published: 2024

78. Filamentary Network and Magnetic Field Structures Revealed with BISTRO in the High-Mass Star-Forming Region NGC2264 : Global Properties and Local Magnetogravitational Configurations

Author: Wang, Jia-Wei, Koch, Patrick M., Clarke, Seamus D., Fuller, Gary, Peretto, Nicolas, Tang, Ya-Wen, Yen, Hsi-Wei, Lai, Shih-Ping, Ohashi, Nagayoshi, Arzoumanian, Doris, Johnstone, Doug, Furuya, Ray, Inutsuka, Shu-ichiro, Lee, Chang Won, Ward-Thompson, Derek, Gouellec, Valentin J. M. Le, Liu, Hong-Li, Fanciullo, Lapo, Hwang, Jihye, Pattle, Kate, Poidevin, Frédérick, Tahani, Mehrnoosh, Onaka, Takashi, Rawlings, Mark G., Chung, Eun Jung, Liu, Junhao, Lyo, A-Ran, Priestley, Felix, Hoang, Thiem, Tamura, Motohide, Berry, David, Bastien, Pierre, Ching, Tao-Chung, Coudé, Simon, Kwon, Woojin, Chen, Mike, Eswaraiah, Chakali, Soam, Archana, Hasegawa, Tetsuo, Qiu, Keping, Bourke, Tyler L., Byun, Do-Young, Chen, Zhiwei, Chen, Huei-Ru Vivien, Chen, Wen Ping, Cho, Jungyeon, Choi, Minho, Choi, Yunhee, Choi, Youngwoo, Chrysostomou, Antonio, Dai, Sophia, Di Francesco, James, Diep, Pham Ngoc, Doi, Yasuo, Duan, Yan, Duan, Hao-Yuan, Eden, David, Fiege, Jason, Fissel, Laura M., Franzmann, Erica, Friberg, Per, Friesen, Rachel, Gledhill, Tim, Graves, Sarah, Greaves, Jane, Griffin, Matt, Gu, Qilao, Han, Ilseung, Hayashi, Saeko, Houde, Martin, Inoue, Tsuyoshi, Iwasaki, Kazunari, Jeong, Il-Gyo, Könyves, Vera, Kang, Ji-hyun, Kang, Miju, Karoly, Janik, Kataoka, Akimasa, Kawabata, Koji, Khan, Zacariyya, Kim, Mi-Ryang, Kim, Kee-Tae, Kim, Kyoung Hee, Kim, Shinyoung, Kim, Jongsoo, Kim, Hyosung, Kim, Gwanjeong, Kirchschlager, Florian, Kirk, Jason, Kobayashi, Masato I. N., Kusune, Takayoshi, Kwon, Jungmi, Lacaille, Kevin, Law, Chi-Yan, Lee, Sang-Sung, Lee, Hyeseung, Lee, Jeong-Eun, Lee, Chin-Fei, Li, Dalei, Li, Hua-bai, Li, Guangxing, Li, Di, Lin, Sheng-Jun, Liu, Tie, Liu, Sheng-Yuan, Lu, Xing, Mairs, Steve, Matsumura, Masafumi, Matthews, Brenda, Moriarty-Schieven, Gerald, Nagata, Tetsuya, Nakamura, Fumitaka, Nakanishi, Hiroyuki, Ngoc, Nguyen Bich, Park, Geumsook, Parsons, Harriet, Pyo, Tae-Soo, Qian, Lei, Rao, Ramprasad, Rawlings, Jonathan, Retter, Brendan, Richer, John, Rigby, Andrew, Sadavoy, Sarah, Saito, Hiro, Savini, Giorgio, Seta, Masumichi, Sharma, Ekta, Shimajiri, Yoshito, Shinnaga, Hiroko, Tang, Xindi, Thuong, Hoang Duc, Tomisaka, Kohji, Tram, Le Ngoc, Tsukamoto, Yusuke, Viti, Serena, Wang, Hongchi, Whitworth, Anthony, Wu, Jintai, Xie, Jinjin, Yang, Meng-Zhe, Yoo, Hyunju, Yuan, Jinghua, Yun, Hyeong-Sik, Zenko, Tetsuya, Zhang, Chuan-Peng, Zhang, Yapeng, Zhang, Guoyin, Zhou, Jianjun, Zhu, Lei, de Looze, Ilse, André, Philippe, Dowell, C. Darren, Eyres, Stewart, Falle, Sam, Robitaille, Jean-François, and van Loo, Sven
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: We report 850 $\mu$m continuum polarization observations toward the filamentary high-mass star-forming region NGC 2264, taken as part of the B-fields In STar forming Regions Observations (BISTRO) large program on the James Clerk Maxwell Telescope (JCMT). These data reveal a well-structured non-uniform magnetic field in the NGC 2264C and 2264D regions with a prevailing orientation around 30 deg from north to east. Field strengths estimates and a virial analysis for the major clumps indicate that NGC 2264C is globally dominated by gravity while in 2264D magnetic, gravitational, and kinetic energies are roughly balanced. We present an analysis scheme that utilizes the locally resolved magnetic field structures, together with the locally measured gravitational vector field and the extracted filamentary network. From this, we infer statistical trends showing that this network consists of two main groups of filaments oriented approximately perpendicular to one another. Additionally, gravity shows one dominating converging direction that is roughly perpendicular to one of the filament orientations, which is suggestive of mass accretion along this direction. Beyond these statistical trends, we identify two types of filaments. The type-I filament is perpendicular to the magnetic field with local gravity transitioning from parallel to perpendicular to the magnetic field from the outside to the filament ridge. The type-II filament is parallel to the magnetic field and local gravity. We interpret these two types of filaments as originating from the competition between radial collapsing, driven by filament self-gravity, and the longitudinal collapsing, driven by the region's global gravity., Comment: Accepted for publication in the Astrophysical Journal. 43 pages, 32 figures, and 4 tables (including Appendix)
Published: 2024

79. SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese

Author: Xu, Liang, Xue, Hang, Zhu, Lei, and Zhao, Kangkang
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We introduce SuperCLUE-Math6(SC-Math6), a new benchmark dataset to evaluate the mathematical reasoning abilities of Chinese language models. SC-Math6 is designed as an upgraded Chinese version of the GSM8K dataset with enhanced difficulty, diversity, and application scope. It consists of over 2000 mathematical word problems requiring multi-step reasoning and providing natural language solutions. We propose an innovative scheme to quantify the reasoning capability of large models based on performance over problems with different reasoning steps. Experiments on 13 representative Chinese models demonstrate a clear stratification of reasoning levels, with top models like GPT-4 showing superior performance. SC-Math6 fills the gap in Chinese mathematical reasoning benchmarks and provides a comprehensive testbed to advance the intelligence of Chinese language models., Comment: Dataset revised and finalized, results updated with new model; 8 pages, 7 figures, 4 tables
Published: 2024

80. The ALMA-QUARKS survey: Detection of two extremely dense substructures in a massive prestellar core

Author: Mai, Xiaofeng, Liu, Tie, Liu, Xunchuan, Zhu, Lei, Garay, Guido, Goldsmith, Paul F., Juvela, Mika, Liu, Hongli, Mannfors, Emma, Tej, Anandmayee, Sanhueza, Patricio, Li, Shanghuo, Xu, Fengwei, Semadeni, Enrique Vazquez, Jiao, Wenyu, Peng, Yaping, Baug, T., Yang, Aiyuan, Dewangan, Lokesh, Bronfman, Leonardo, Gómez, Gilberto C., Palau, Aina, Lee, Chang Won, Qin, Sheng-Li, Tatematsu, Ken'ichi, Chibueze, James O., Yang, Dongting, Lu, Xing, Luo, Qiuyi, Gu, Qilao, Issac, Namitha, Zhang, Suinan, Li, Pak-Shing, Zhang, Bo, and Tóth, L. Viktor
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Solar and Stellar Astrophysics
Abstract: Only a handful of massive starless core candidates have been discovered so far, but none of them have been fully confirmed. Within the MM1 clump in the filamentary infrared dark cloud G34.43+0.24 that was covered by the ALMA-ATOMS survey at Band 3 ($\sim2\arcsec$, 6000\,au) and the ALMA-QUARKS survey at Band 6 ($\sim 0.3\arcsec$, 900\,au), two prestellar core candidates MM1-C and E1 with masses of 71 and 20 \solarmass~and radii of 2100--4400\,au were discovered. The two cores show no obvious sign of star-formation activities. In particular, MM1-C is a very promising massive prestellar core candidate with a total gas mass of 71\,\solarmass. Within MM1-C, we detected two extremely dense substructures, C1 and C2, as characterized by their high densities of $\rm n_{H_2}\sim 10^{8-9} cm^{-3}$. Moreover, evidence of further fragmentation in C2 was also revealed. We have detected the primordial fragmentation in the earliest stage of massive star formation, and we speculate that MM1-C would be the birthplace of a massive multiple system. However, we cannot fully rule out the possibility that the massive prestellar core MM1-C will just form a cluster of low-mass stars if it undergoes further fragmentation., Comment: 12 pages, 6 figures
Published: 2024

81. A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model

Author: Zhao, Dongdi, Ma, Jianbo, Lu, Lu, Li, Jinke, Ji, Xuan, Zhu, Lei, Fang, Fuming, Liu, Ming, and Jiang, Feijun
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Sound
Abstract: Far-field speech recognition is a challenging task that conventionally uses signal processing beamforming to attack noise and interference problem. But the performance has been found usually limited due to heavy reliance on environmental assumption. In this paper, we propose a unified multichannel far-field speech recognition system that combines the neural beamforming and transformer-based Listen, Spell, Attend (LAS) speech recognition system, which extends the end-to-end speech recognition system further to include speech enhancement. Such framework is then jointly trained to optimize the final objective of interest. Specifically, factored complex linear projection (fCLP) has been adopted to form the neural beamforming. Several pooling strategies to combine look directions are then compared in order to find the optimal approach. Moreover, information of the source direction is also integrated in the beamforming to explore the usefulness of source direction as a prior, which is usually available especially in multi-modality scenario. Experiments on different microphone array geometry are conducted to evaluate the robustness against spacing variance of microphone array. Large in-house databases are used to evaluate the effectiveness of the proposed framework and the proposed method achieve 19.26\% improvement when compared with a strong baseline.
Published: 2024

82. EPA: Neural Collapse Inspired Robust Out-of-Distribution Detector

Author: Zhang, Jiawei, Chen, Yufan, Jin, Cheng, Zhu, Lei, and Gu, Yuantao
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security
Abstract: Out-of-distribution (OOD) detection plays a crucial role in ensuring the security of neural networks. Existing works have leveraged the fact that In-distribution (ID) samples form a subspace in the feature space, achieving state-of-the-art (SOTA) performance. However, the comprehensive characteristics of the ID subspace still leave under-explored. Recently, the discovery of Neural Collapse ($\mathcal{NC}$) sheds light on novel properties of the ID subspace. Leveraging insight from $\mathcal{NC}$, we observe that the Principal Angle between the features and the ID feature subspace forms a superior representation for measuring the likelihood of OOD. Building upon this observation, we propose a novel $\mathcal{NC}$-inspired OOD scoring function, named Entropy-enhanced Principal Angle (EPA), which integrates both the global characteristic of the ID subspace and its inner property. We experimentally compare EPA with various SOTA approaches, validating its superior performance and robustness across different network architectures and OOD datasets., Comment: Accepted by ICASSP 2024
Published: 2024

83. Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis

Author: Ren, Jingjing, Xu, Cheng, Chen, Haoyu, Qin, Xinran, and Zhu, Lei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent progress in multi-modal conditioned face synthesis has enabled the creation of visually striking and accurately aligned facial images. Yet, current methods still face issues with scalability, limited flexibility, and a one-size-fits-all approach to control strength, not accounting for the differing levels of conditional entropy, a measure of unpredictability in data given some condition, across modalities. To address these challenges, we introduce a novel uni-modal training approach with modal surrogates, coupled with an entropy-aware modal-adaptive modulation, to support flexible, scalable, and scalable multi-modal conditioned face synthesis network. Our uni-modal training with modal surrogate that only leverage uni-modal data, use modal surrogate to decorate condition with modal-specific characteristic and serve as linker for inter-modal collaboration , fully learns each modality control in face synthesis process as well as inter-modal collaboration. The entropy-aware modal-adaptive modulation finely adjust diffusion noise according to modal-specific characteristics and given conditions, enabling well-informed step along denoising trajectory and ultimately leading to synthesis results of high fidelity and quality. Our framework improves multi-modal face synthesis under various conditions, surpassing current methods in image quality and fidelity, as demonstrated by our thorough experimental results.
Published: 2023

84. Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

Author: Nan, Yang, Xing, Xiaodan, Wang, Shiyi, Tang, Zeyu, Felder, Federico N, Zhang, Sheng, Ledda, Roberta Eufrasia, Ding, Xiaoliu, Yu, Ruiqi, Liu, Weiping, Shi, Feng, Sun, Tianyang, Cao, Zehong, Zhang, Minghui, Gu, Yun, Zhang, Hanxiao, Gao, Jian, Wang, Pingyu, Tang, Wen, Yu, Pengxin, Kang, Han, Chen, Junqiang, Lu, Xing, Zhang, Boyu, Mamalakis, Michail, Prinzi, Francesco, Carlini, Gianluca, Cuneo, Lisa, Banerjee, Abhirup, Xing, Zhaohu, Zhu, Lei, Mesbah, Zacharia, Jain, Dhruv, Mayet, Tsiry, Yuan, Hongyu, Lyu, Qing, Qayyum, Abdul, Mazher, Moona, Wells, Athol, Walsh, Simon LF, and Yang, Guang
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current public-available datasets concentrate on lung diseases with moderate morphological variations. The intricate honeycombing patterns present in the lung tissues of fibrotic lung disease patients exacerbate the challenges, often leading to various prediction errors. To address this issue, the 'Airway-Informed Quantitative CT Imaging Biomarker for Fibrotic Lung Disease 2023' (AIIB23) competition was organized in conjunction with the official 2023 International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI). The airway structures were meticulously annotated by three experienced radiologists. Competitors were encouraged to develop automatic airway segmentation models with high robustness and generalization abilities, followed by exploring the most correlated QIB of mortality prediction. A training set of 120 high-resolution computerised tomography (HRCT) scans were publicly released with expert annotations and mortality status. The online validation set incorporated 52 HRCT scans from patients with fibrotic lung disease and the offline test set included 140 cases from fibrosis and COVID-19 patients. The results have shown that the capacity of extracting airway trees from patients with fibrotic lung disease could be enhanced by introducing voxel-wise weighted general union loss and continuity loss. In addition to the competitive image biomarkers for prognosis, a strong airway-derived biomarker (Hazard ratio>1.5, p<0.0001) was revealed for survival prognostication compared with existing clinical measurements, clinician assessment and AI-based biomarkers., Comment: 19 pages
Published: 2023
Full Text: View/download PDF

85. A newly developed 10kA-level HTS conductor: innovative tenon-mortise-based modularized conductor (TMMC) based on China ancient architecture

Author: Zheng, Jinxing, Cheng, Yuan, Wang, Lei, Liu, Fei, liu, Haiyang, Li, Ming, and Zhu, Lei
Subjects: Condensed Matter - Superconductivity
Abstract: We propose a new type of high temperature superconducting (HTS) conductor concept: modularized conductors (MC) connected by Chinese traditional tenon mortise (TM) connection structure, reffered as TMMC. The conductor consists of multiple concentric round sub conductors with slots for stacking REBCO tapes. Innovatively, the REBCO stacks in the adjacent sub conductors are arranged with the fully misaligned configuration to enhance the critical current' s isotropy with respect to magnetic field and reduce ac loss. For example, the angle between the adjacent stacks in the two adjacent sub conductors is 45 degree if each subconductor contains 4 REBCO stacks. In order to construct the fully misaligned configuration, the sub conductors are designed with two open half circular formers and connected by tenonmortise structure which makes the conductor modulrized and simply to assembly and disassembly. Based on the design concept, a prototype conductor containing 160 REBCO tapes distributed in the four concentric sub conductors is fabricated. The conductor measured critical current is 13.69 kA at 77 K and sefl field, which is consistent to the simulaiton result. In order to further improve the TMMC' s engineering critical current density (Jce) and bending performance, we propose two enhancement approaches which are reducing the former' s thickness and rearrange stacks in the outer sub conductors. With the enhancements, both TMMC' s radius and Jce are comparable to the existing slotted core conductor. The study shows the TMMC' s advantages of nontwisted structures, easy assembly, high current carrying and low ac losses, which makes it promising for constructing large scale scientific devices.
Published: 2023

86. SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

Author: Luo, Xiangde, Fu, Jia, Zhong, Yunxin, Liu, Shuolin, Han, Bing, Astaraki, Mehdi, Bendazzoli, Simone, Toma-Dasu, Iuliana, Ye, Yiwen, Chen, Ziyang, Xia, Yong, Su, Yanzhou, Ye, Jin, He, Junjun, Xing, Zhaohu, Wang, Hongqiu, Zhu, Lei, Yang, Kaixiang, Fang, Xin, Wang, Zhiwei, Lee, Chan Woong, Park, Sang Joon, Chun, Jaehee, Ulrich, Constantin, Maier-Hein, Klaus H., Ndipenoch, Nchongmaje, Miron, Alina, Li, Yongmin, Zhang, Yimeng, Chen, Yu, Bai, Lu, Huang, Jinlong, An, Chengyang, Wang, Lisheng, Huang, Kaiwen, Gu, Yunqi, Zhou, Tao, Zhou, Mu, Zhang, Shichuan, Liao, Wenjun, Wang, Guotai, and Zhang, Shaoting
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results in many medical image segmentation tasks. However, for NPC OARs and GTVs segmentation, few public datasets are available for model development and evaluation. To alleviate this problem, the SegRap2023 challenge was organized in conjunction with MICCAI2023 and presented a large-scale benchmark for OAR and GTV segmentation with 400 Computed Tomography (CT) scans from 200 NPC patients, each with a pair of pre-aligned non-contrast and contrast-enhanced CT scans. The challenge's goal was to segment 45 OARs and 2 GTVs from the paired CT scans. In this paper, we detail the challenge and analyze the solutions of all participants. The average Dice similarity coefficient scores for all submissions ranged from 76.68\% to 86.70\%, and 70.42\% to 73.44\% for OARs and GTVs, respectively. We conclude that the segmentation of large-size OARs is well-addressed, and more efforts are needed for GTVs and small-size or thin-structure OARs. The benchmark will remain publicly available here: https://segrap2023.grand-challenge.org, Comment: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)
Published: 2023

87. Lite-Mind: Towards Efficient and Robust Brain Representation Network

Author: Gong, Zixuan, Zhang, Qi, Bao, Guangyin, Zhu, Lei, Liu, Ke, Hu, Liang, Miao, Duoqian, and Zhang, Yu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: The limited data availability and the low signal-to-noise ratio of fMRI signals lead to the challenging task of fMRI-to-image retrieval. State-of-the-art MindEye remarkably improves fMRI-to-image retrieval performance by leveraging a large model, i.e., a 996M MLP Backbone per subject, to align fMRI embeddings to the final hidden layer of CLIP's Vision Transformer (ViT). However, significant individual variations exist among subjects, even under identical experimental setups, mandating the training of large subject-specific models. The substantial parameters pose significant challenges in deploying fMRI decoding on practical devices. To this end, we propose Lite-Mind, a lightweight, efficient, and robust brain representation learning paradigm based on Discrete Fourier Transform (DFT), which efficiently aligns fMRI voxels to fine-grained information of CLIP. We elaborately design a DFT backbone with Spectrum Compression and Frequency Projector modules to learn informative and robust voxel embeddings. Our experiments demonstrate that Lite-Mind achieves an impressive 94.6% fMRI-to-image retrieval accuracy on the NSD dataset for Subject 1, with 98.7% fewer parameters than MindEye. Lite-Mind is also proven to be able to be migrated to smaller fMRI datasets and establishes a new state-of-the-art for zero-shot classification on the GOD dataset., Comment: 17 pages, ACM MM 2024 Oral
Published: 2023

88. How does common ownership affect corporate innovation after succession in Chinese family firms? A perspective on value cocreation

Author: Wu, Jiong, Zhu, Lei, and Hu, Yuheng
Published: 2024
Full Text: View/download PDF

89. Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration

Author: Liang, Zhihao, Zhang, Qi, Hu, Wenbo, Zhu, Lei, Feng, Ying, Jia, Kui, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

90. Partially Supervised Unpaired Multi-modal Learning for Label-Efficient Medical Image Segmentation

Author: Zhu, Lei, Xu, Yanyu, Fu, Huazhu, Xu, Xinxing, Goh, Rick Siow Mong, Liu, Yong, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Xu, Xuanang, editor, Cui, Zhiming, editor, Rekik, Islem, editor, Ouyang, Xi, editor, and Sun, Kaicong, editor
Published: 2025
Full Text: View/download PDF

91. Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint

Author: Chen, Sixiang, Ye, Tian, Zhang, Kai, Xing, Zhaohu, Lin, Yunlong, Zhu, Lei, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

92. Anchored Supervised Contrastive Learning for Long-Tailed Medical Image Regression

Author: Li, Zhaoying, Xing, Zhaohu, Liu, Hongying, Zhu, Lei, Wan, Liang, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lin, Zhouchen, editor, Cheng, Ming-Ming, editor, He, Ran, editor, Ubul, Kurban, editor, Silamu, Wushouer, editor, Zha, Hongbin, editor, Zhou, Jie, editor, and Liu, Cheng-Lin, editor
Published: 2025
Full Text: View/download PDF