Author: "Wang, Boyang" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wang, Boyang"' showing total 782 results

Start Over Author "Wang, Boyang"

782 results on '"Wang, Boyang"'

1. This&That: Language-Gesture Controlled Video Generation for Robot Planning

Author: Wang, Boyang, Sridhar, Nikhil, Feng, Chao, Van der Merwe, Mark, Fishman, Adam, Fazeli, Nima, and Park, Jeong Joon
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a robot learning method for communicating, planning, and executing a wide range of tasks, dubbed This&That. We achieve robot planning for general tasks by leveraging the power of video generative models trained on internet-scale data containing rich physical and semantic context. In this work, we tackle three fundamental challenges in video-based planning: 1) unambiguous task communication with simple human instructions, 2) controllable video generation that respects user intents, and 3) translating visual planning into robot actions. We propose language-gesture conditioning to generate videos, which is both simpler and clearer than existing language-only methods, especially in complex and uncertain environments. We then suggest a behavioral cloning design that seamlessly incorporates the video plans. This&That demonstrates state-of-the-art effectiveness in addressing the above three challenges, and justifies the use of video generation as an intermediate representation for generalizable task planning and execution. Project website: https://cfeng16.github.io/this-and-that/.
Published: 2024

2. Seeking Certainty In Uncertainty: Dual-Stage Unified Framework Solving Uncertainty in Dynamic Facial Expression Recognition

Author: Wang, Haoran, Mai, Xinji, Tao, Zeng, Tong, Xuan, Lin, Junxiong, Wang, Yan, Yu, Jiawen, Wang, Boyang, Yan, Shaoqi, Zhao, Qing, Zhou, Ziheng, Gao, Shuyong, and Zhang, Wenqiang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: The contemporary state-of-the-art of Dynamic Facial Expression Recognition (DFER) technology facilitates remarkable progress by deriving emotional mappings of facial expressions from video content, underpinned by training on voluminous datasets. Yet, the DFER datasets encompass a substantial volume of noise data. Noise arises from low-quality captures that defy logical labeling, and instances that suffer from mislabeling due to annotation bias, engendering two principal types of uncertainty: the uncertainty regarding data usability and the uncertainty concerning label reliability. Addressing the two types of uncertainty, we have meticulously crafted a two-stage framework aiming at \textbf{S}eeking \textbf{C}ertain data \textbf{I}n extensive \textbf{U}ncertain data (SCIU). This initiative aims to purge the DFER datasets of these uncertainties, thereby ensuring that only clean, verified data is employed in training processes. To mitigate the issue of low-quality samples, we introduce the Coarse-Grained Pruning (CGP) stage, which assesses sample weights and prunes those deemed unusable due to their low weight. For samples with incorrect annotations, the Fine-Grained Correction (FGC) stage evaluates prediction stability to rectify mislabeled data. Moreover, SCIU is conceived as a universally compatible, plug-and-play framework, tailored to integrate seamlessly with prevailing DFER methodologies. Rigorous experiments across prevalent DFER datasets and against numerous benchmark methods substantiates SCIU's capacity to markedly elevate performance metrics.
Published: 2024

3. Suppressing Uncertainties in Degradation Estimation for Blind Super-Resolution

Author: Lin, Junxiong, Tao, Zeng, Tong, Xuan, Mai, Xinji, Wang, Haoran, Wang, Boyang, Wang, Yan, Zhao, Qing, Yu, Jiawen, Lin, Yuxuan, Yan, Shaoqi, Gao, Shuyong, and Zhang, Wenqiang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The problem of blind image super-resolution aims to recover high-resolution (HR) images from low-resolution (LR) images with unknown degradation modes. Most existing methods model the image degradation process using blur kernels. However, this explicit modeling approach struggles to cover the complex and varied degradation processes encountered in the real world, such as high-order combinations of JPEG compression, blur, and noise. Implicit modeling for the degradation process can effectively overcome this issue, but a key challenge of implicit modeling is the lack of accurate ground truth labels for the degradation process to conduct supervised training. To overcome this limitations inherent in implicit modeling, we propose an \textbf{U}ncertainty-based degradation representation for blind \textbf{S}uper-\textbf{R}esolution framework (\textbf{USR}). By suppressing the uncertainty of local degradation representations in images, USR facilitated self-supervised learning of degradation representations. The USR consists of two components: Adaptive Uncertainty-Aware Degradation Extraction (AUDE) and a feature extraction network composed of Variable Depth Dynamic Convolution (VDDC) blocks. To extract Uncertainty-based Degradation Representation from LR images, the AUDE utilizes the Self-supervised Uncertainty Contrast module with Uncertainty Suppression Loss to suppress the inherent model uncertainty of the Degradation Extractor. Furthermore, VDDC block integrates degradation information through dynamic convolution. Rhe VDDC also employs an Adaptive Intensity Scaling operation that adaptively adjusts the degradation representation according to the network hierarchy, thereby facilitating the effective integration of degradation information. Quantitative and qualitative experiments affirm the superiority of our approach.
Published: 2024

4. McEval: Massively Multilingual Code Evaluation

Author: Chai, Linzheng, Liu, Shukai, Yang, Jian, Yin, Yuwei, Jin, Ke, Liu, Jiaheng, Sun, Tao, Zhang, Ge, Ren, Changyu, Guo, Hongcheng, Wang, Zekun, Wang, Boyang, Wu, Xianjie, Wang, Bing, Li, Tongliang, Yang, Liqun, Duan, Sufeng, and Li, Zhoujun
Subjects: Computer Science - Programming Languages
Abstract: Code large language models (LLMs) have shown remarkable advances in code understanding, completion, and generation tasks. Programming benchmarks, comprised of a selection of code challenges and corresponding test cases, serve as a standard to evaluate the capability of different LLMs in such tasks. However, most existing benchmarks primarily focus on Python and are still restricted to a limited number of languages, where other languages are translated from the Python samples (e.g. MultiPL-E) degrading the data diversity. To further facilitate the research of code LLMs, we propose a massively multilingual code benchmark covering 40 programming languages (McEval) with 16K test samples, which substantially pushes the limits of code LLMs in multilingual scenarios. The benchmark contains challenging code completion, understanding, and generation evaluation tasks with finely curated massively multilingual instruction corpora McEval-Instruct. In addition, we introduce an effective multilingual coder mCoder trained on McEval-Instruct to support multilingual programming language generation. Extensive experimental results on McEval show that there is still a difficult journey between open-source models and closed-source LLMs (e.g. GPT-series models) in numerous languages. The instruction corpora, evaluation benchmark, and leaderboard are available at \url{https://mceval.github.io/}., Comment: 22 pages
Published: 2024

5. Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission

Author: Yang, Mingyu, Liu, Bowen, Wang, Boyang, and Kim, Hun-Seok
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Information Theory, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Deep learning-based joint source-channel coding (deep JSCC) has been demonstrated as an effective approach for wireless image transmission. Nevertheless, current research has concentrated on minimizing a standard distortion metric such as Mean Squared Error (MSE), which does not necessarily improve the perceptual quality. To address this issue, we propose DiffJSCC, a novel framework that leverages pre-trained text-to-image diffusion models to enhance the realism of images transmitted over the channel. The proposed DiffJSCC utilizes prior deep JSCC frameworks to deliver an initial reconstructed image at the receiver. Then, the spatial and textual features are extracted from the initial reconstruction, which, together with the channel state information (e.g., signal-to-noise ratio, SNR), are passed to a control module to fine-tune the pre-trained Stable Diffusion model. Extensive experiments on the Kodak dataset reveal that our method significantly surpasses both conventional methods and prior deep JSCC approaches on perceptual metrics such as LPIPS and FID scores, especially with poor channel conditions and limited bandwidth. Notably, DiffJSCC can achieve highly realistic reconstructions for 768x512 pixel Kodak images with only 3072 symbols (<0.008 symbols per pixel) under 1dB SNR. Our code will be released in https://github.com/mingyuyng/DiffJSCC.
Published: 2024

6. Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution

Author: Lin, Junxiong, Wang, Yan, Tao, Zeng, Wang, Boyang, Zhao, Qing, Wang, Haorang, Tong, Xuan, Mai, Xinji, Lin, Yuxuan, Song, Wei, Yu, Jiawen, Yan, Shaoqi, and Zhang, Wenqiang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Pre-trained diffusion models utilized for image generation encapsulate a substantial reservoir of a priori knowledge pertaining to intricate textures. Harnessing the potential of leveraging this a priori knowledge in the context of image super-resolution presents a compelling avenue. Nonetheless, prevailing diffusion-based methodologies presently overlook the constraints imposed by degradation information on the diffusion process. Furthermore, these methods fail to consider the spatial variability inherent in the estimated blur kernel, stemming from factors such as motion jitter and out-of-focus elements in open-environment scenarios. This oversight results in a notable deviation of the image super-resolution effect from fundamental realities. To address these concerns, we introduce a framework known as Adaptive Multi-modal Fusion of \textbf{S}patially Variant Kernel Refinement with Diffusion Model for Blind Image \textbf{S}uper-\textbf{R}esolution (SSR). Within the SSR framework, we propose a Spatially Variant Kernel Refinement (SVKR) module. SVKR estimates a Depth-Informed Kernel, which takes the depth information into account and is spatially variant. Additionally, SVKR enhance the accuracy of depth information acquired from LR images, allowing for mutual enhancement between the depth map and blur kernel estimates. Finally, we introduce the Adaptive Multi-Modal Fusion (AMF) module to align the information from three modalities: low-resolution images, depth maps, and blur kernels. This alignment can constrain the diffusion model to generate more authentic SR results.
Published: 2024

7. APISR: Anime Production Inspired Real-World Anime Super-Resolution

Author: Wang, Boyang, Yang, Fengyu, Yu, Xihang, Zhang, Chao, and Zhao, Hanbin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: While real-world anime super-resolution (SR) has gained increasing attention in the SR community, existing methods still adopt techniques from the photorealistic domain. In this paper, we analyze the anime production workflow and rethink how to use characteristics of it for the sake of the real-world anime SR. First, we argue that video networks and datasets are not necessary for anime SR due to the repetition use of hand-drawing frames. Instead, we propose an anime image collection pipeline by choosing the least compressed and the most informative frames from the video sources. Based on this pipeline, we introduce the Anime Production-oriented Image (API) dataset. In addition, we identify two anime-specific challenges of distorted and faint hand-drawn lines and unwanted color artifacts. We address the first issue by introducing a prediction-oriented compression module in the image degradation model and a pseudo-ground truth preparation with enhanced hand-drawn lines. In addition, we introduce the balanced twin perceptual loss combining both anime and photorealistic high-level features to mitigate unwanted color artifacts and increase visual clarity. We evaluate our method through extensive experiments on the public benchmark, showing our method outperforms state-of-the-art anime dataset-trained approaches.
Published: 2024

8. LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection

Author: Guo, Hongcheng, Yang, Jian, Liu, Jiaheng, Bai, Jiaqi, Wang, Boyang, Li, Zhoujun, Zheng, Tieqiao, Zhang, Bo, peng, Junran, and Tian, Qi
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Software Engineering
Abstract: Log anomaly detection is a key component in the field of artificial intelligence for IT operations (AIOps). Considering log data of variant domains, retraining the whole network for unknown domains is inefficient in real industrial scenarios. However, previous deep models merely focused on extracting the semantics of log sequences in the same domain, leading to poor generalization on multi-domain logs. To alleviate this issue, we propose a unified Transformer-based framework for Log anomaly detection (LogFormer) to improve the generalization ability across different domains, where we establish a two-stage process including the pre-training and adapter-based tuning stage. Specifically, our model is first pre-trained on the source domain to obtain shared semantic knowledge of log data. Then, we transfer such knowledge to the target domain via shared parameters. Besides, the Log-Attention module is proposed to supplement the information ignored by the log-paring. The proposed method is evaluated on three public and one real-world datasets. Experimental results on multiple benchmarks demonstrate the effectiveness of our LogFormer with fewer trainable parameters and lower training costs., Comment: arXiv admin note: text overlap with arXiv:2201.00016
Published: 2024

9. VCISR: Blind Single Image Super-Resolution with Video Compression Synthetic Data

Author: Wang, Boyang, Liu, Bowen, Liu, Shiyu, and Yang, Fengyu
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: In the blind single image super-resolution (SISR) task, existing works have been successful in restoring image-level unknown degradations. However, when a single video frame becomes the input, these works usually fail to address degradations caused by video compression, such as mosquito noise, ringing, blockiness, and staircase noise. In this work, we for the first time, present a video compression-based degradation model to synthesize low-resolution image data in the blind SISR task. Our proposed image synthesizing method is widely applicable to existing image datasets, so that a single degraded image can contain distortions caused by the lossy video compression algorithms. This overcomes the leak of feature diversity in video data and thus retains the training efficiency. By introducing video coding artifacts to SISR degradation models, neural networks can super-resolve images with the ability to restore video compression degradations, and achieve better results on restoring generic distortions caused by image compression as well. Our proposed approach achieves superior performance in SOTA no-reference Image Quality Assessment, and shows better visual quality on various datasets. In addition, we evaluate the SISR neural network trained with our degradation model on video super-resolution (VSR) datasets. Compared to architectures specifically designed for the VSR purpose, our method exhibits similar or better performance, evidencing that the presented strategy on infusing video-based degradation is generalizable to address more complicated compression artifacts even without temporal cues.
Published: 2023

10. M2C: Towards Automatic Multimodal Manga Complement

Author: Guo, Hongcheng, Wang, Boyang, Bai, Jiaqi, Liu, Jiaheng, Yang, Jian, and Li, Zhoujun
Subjects: Computer Science - Computation and Language
Abstract: Multimodal manga analysis focuses on enhancing manga understanding with visual and textual features, which has attracted considerable attention from both natural language processing and computer vision communities. Currently, most comics are hand-drawn and prone to problems such as missing pages, text contamination, and aging, resulting in missing comic text content and seriously hindering human comprehension. In other words, the Multimodal Manga Complement (M2C) task has not been investigated, which aims to handle the aforementioned issues by providing a shared semantic space for vision and language understanding. To this end, we first propose the Multimodal Manga Complement task by establishing a new M2C benchmark dataset covering two languages. First, we design a manga argumentation method called MCoT to mine event knowledge in comics with large language models. Then, an effective baseline FVP-M$^{2}$ using fine-grained visual prompts is proposed to support manga complement. Extensive experimental results show the effectiveness of FVP-M$^{2}$ method for Multimodal Mange Complement., Comment: EMNLP2023. arXiv admin note: text overlap with arXiv:2210.15461
Published: 2023

11. Multibody dynamic modeling and motion analysis of flexible robot considering contact

Author: Wu, Tingke, Liu, Zhuyong, Ma, Ziqi, and Wang, Boyang
Published: 2024
Full Text: View/download PDF

12. Network pharmacology on the mechanism of Yi Qi Tong Qiao Pill inhibiting allergic rhinitis

Author: Wang, Boyang, Zhang, DingFan, Zhang, Tingyu, Sutcharitchan, Chayanis, Hua, Jianlin, Hua, Dongfang, Zhang, Bo, and Li, Shao
Subjects: Quantitative Biology - Quantitative Methods, None
Abstract: Objective: The purpose of this study is to reveal the mechanism of action of Yi Qi Tong Qiao Pill (YQTQP) in the treatment of allergic rhinitis (AR), as well as establish a paradigm for the researches on traditional Chinese medicine (TCM) from systematic perspective. Methods: Based on the data collected from TCM-related and disease-related databases, target profiles of compounds in YQTQP were calculated through network-based algorithms and holistic targets of TQTQP was constructed. Network target analysis was performed to explore the potential mechanisms of YQTQP in the treatment of AR and the mechanisms were classified into different modules according to their biological functions. Besides, animal and clinical experiments were conducted to validate our findings inferred from Network target analysis. Results: Network target analysis showed that YQTQP targeted 12 main pathways or biological processes related to AR, represented by those related to IL-4, IFN-{\gamma}, TNF-{\alpha} and IL-13. These results could be classified into 3 biological modules, including regulation of immune and inflammation, epithelial barrier disorder and cell adhesion. Finally, a series of experiments composed of animal and clinical experiments, proved our findings and confirmed that YQTQP could improve related symptoms of AR, like permeability of nasal mucosa epithelium. Conclusion: A combination of Network target analysis and the experimental validation indicated that YQTQP was effective in the treatment of AR and might provide a new insight on revealing the mechanism of TCM against diseases., Comment: 25 pages, 6 figures
Published: 2023

13. A Hierarchical Multi-Vehicle Coordinated Motion Planning Method based on Interactive Spatio-Temporal Corridors

Author: Zhang, Xiang, Wang, Boyang, Lu, Yaomin, Liu, Haiou, Gong, Jianwei, and Chen, Huiyan
Subjects: Computer Science - Robotics
Abstract: Multi-vehicle coordinated motion planning has always been challenged to safely and efficiently resolve conflicts under non-holonomic dynamic constraints. Constructing spatial-temporal corridors for multi-vehicle can decouple the high-dimensional conflicts and further reduce the difficulty of obtaining feasible trajectories. Therefore, this paper proposes a novel hierarchical method based on interactive spatio-temporal corridors (ISTCs). In the first layer, based on the initial guidance trajectories, Mixed Integer Quadratic Programming is designed to construct ISTCs capable of resolving conflicts in generic multi-vehicle scenarios. And then in the second layer, Non-Linear Programming is settled to generate in-corridor trajectories that satisfy the vehicle dynamics. By introducing ISTCs, the multi-vehicle coordinated motion planning problem is able to be decoupled into single-vehicle trajectory optimization problems, which greatly decentralizes the computational pressure and has great potential for real-world applications. Besides, the proposed method searches for feasible solutions in the 3-D $(x,y,t)$ configuration space, preserving more possibilities than the traditional velocity-path decoupling method. Simulated experiments in unsignalized intersection and challenging dense scenarios have been conduced to verify the feasibility and adaptability of the proposed framework.
Published: 2023

14. A network-based biomarkers discovery of Cold/Hot ZHENG chronic gastritis and Cold/Hot herbs of formulae

Author: Wang, Boyang, Chen, Pan, Zhang, Peng, and Li, Shao
Subjects: Quantitative Biology - Other Quantitative Biology
Abstract: Objective: To discover biomarkers and uncover the mechanism of Cold/Hot ZHENG (syndrome in traditional Chinese medicine) chronic gastritis (CG) and Cold/Hot herbs in traditional Chinese medicine (TCM) formulae on systematic biology. Background: CG is a common inflammatory disease and the diagnosis of CG in TCM can be classified into Cold ZHENG (Asthenic Cold) and Hot ZHENG (Excess Hot). However, the molecular features of Cold/Hot ZHENG in CG and the mechanism of Cold/Hot herbs in formulae for CG remained unclear. Methods: Based on data of 35 patients of Cold/Hot ZHENG CG and 3 scRNA-seq CG samples, we conduct analysis with transcriptomics datasets and algorithms, to discover biomarkers for Cold/Hot ZHENG CG. And we collected 25 formulae (with traditional effects related to Cold/Hot ZHENG) for CG and corresponding 89 Cold/Hot herbs (including Warm/Cool herbs) to discover features and construct target networks of Cold/Hot herbs on the basis of network target and enrichment analysis. Results: Biomarkers of Cold/Hot ZHENG CG represented by CCL2 and LEP suggested that Hot ZHENG CG might be characterized by over-inflammation and exuberant metabolism, and Cold ZHENG CG showed a trend of suppression in immune regulation and energy metabolism. And biomarkers of Cold/Hot ZHENG showed also significant changes in the progression of gastric cancer. And biomarkers and pathways of Hot herbs intend to regulate immune responses and energy metabolism, while those of Cold herbs were likely to participate in anti-inflammation effect. Conclusion: In this study, we found that the biomarkers and mechanism of Cold/Hot ZHENG CG and those of Cold/Hot herbs were closely related to the regulation of immune and metabolisms. These findings may reflect the mechanism, build bridges between multiple views of Cold/Hot ZHENG and Cold/Hot herbs, and provide a research paradigm for further achieving precision TCM., Comment: 17 pages (references not included), 7 figures
Published: 2023

15. Bionic soft robotic gripper with feedback control for adaptive grasping and capturing applications

Author: Wu, Tingke, Liu, Zhuyong, Ma, Ziqi, Wang, Boyang, Ma, Daolin, and Yu, Hexi
Published: 2024
Full Text: View/download PDF

16. Cytological and transcriptomic analyses provide insights into the pollen fertility of synthetic allodiploid Brassica juncea hybrids

Author: Wang, Boyang, Liang, Niannian, Shen, Xiaohan, Xie, Zhengqing, Zhang, Luyue, Tian, Baoming, Yuan, Yuxiang, Guo, Jialin, Zhang, Xiaowei, Wei, Fang, and Wei, Xiaochun
Published: 2024
Full Text: View/download PDF

17. Efficacy of bone defect therapy involving various surface treatments of titanium alloy implants: an in vivo and in vitro study

Author: Wang, Boyang, Guo, Yu, Xu, Jiuhui, Zeng, Fanwei, Ren, Tingting, and Guo, Wei
Published: 2023
Full Text: View/download PDF

18. Effectiveness of flipped classroom in pharmacy education – a meta-analysis

Author: Cui, He, Xie, Xinyu, Wang, Boyang, and Zhao, Yuan
Published: 2023
Full Text: View/download PDF

19. Uncovering the mechanisms of Yi Qi Tong Qiao Pill in the treatment of allergic rhinitis based on Network target analysis

Author: Wang, Boyang, Zhang, Dingfan, Zhang, Tingyu, Sutcharitchan, Chayanis, Hua, Jianlin, Hua, Dongfang, Zhang, Bo, and Li, Shao
Published: 2023
Full Text: View/download PDF

20. Clinical significance of distal femur morphology in a healthy Mongolian youth population

Author: Wang, Boyang, Zhang, Guoliang, Pu, Ribusurong, Li, Qiang, and Wang, Yuewen
Published: 2023
Full Text: View/download PDF

21. Prediction of Pedestrian Spatiotemporal Risk Levels for Intelligent Vehicles: A Data-driven Approach

Author: Zhang, Zheyu, Wang, Boyang, Lu, Chao, Li, Jinghang, Gong, Cheng, and Gong, Jianwei
Subjects: Computer Science - Robotics, Computer Science - Human-Computer Interaction
Abstract: In recent years, road safety has attracted significant attention from researchers and practitioners in the intelligent transport systems domain. As one of the most common and vulnerable groups of road users, pedestrians cause great concerns due to their unpredictable behavior and movement, as subtle misunderstandings in vehicle-pedestrian interaction can easily lead to risky situations or collisions. Existing methods use either predefined collision-based models or human-labeling approaches to estimate the pedestrians' risks. These approaches are usually limited by their poor generalization ability and lack of consideration of interactions between the ego vehicle and a pedestrian. This work tackles the listed problems by proposing a Pedestrian Risk Level Prediction system. The system consists of three modules. Firstly, vehicle-perspective pedestrian data are collected. Since the data contains information regarding the movement of both the ego vehicle and pedestrian, it can simplify the prediction of spatiotemporal features in an interaction-aware fashion. Using the long short-term memory model, the pedestrian trajectory prediction module predicts their spatiotemporal features in the subsequent five frames. As the predicted trajectory follows certain interaction and risk patterns, a hybrid clustering and classification method is adopted to explore the risk patterns in the spatiotemporal features and train a risk level classifier using the learned patterns. Upon predicting the spatiotemporal features of pedestrians and identifying the corresponding risk level, the risk patterns between the ego vehicle and pedestrians are determined. Experimental results verified the capability of the PRLP system to predict the risk level of pedestrians, thus supporting the collision risk assessment of intelligent vehicles and providing safety warnings to both vehicles and pedestrians.
Published: 2021

22. Direct contact between tumor cells and platelets initiates a FAK-dependent F3/TGF-β positive feedback loop that promotes tumor progression and EMT in osteosarcoma

Author: Shi, Qianyu, Xu, Jiuhui, Chen, Chenglong, Hu, Xueyu, Wang, Boyang, Zeng, Fanwei, Ren, Tingting, Huang, Yi, Guo, Wei, Tang, Xiaodong, and Ji, Tao
Published: 2024
Full Text: View/download PDF

23. Quantitative characterization of thin-interbedded coal measure reservoir configurations in the Wujiu depression, inner Mongolia: Sedimentary controlling mechanisms

Author: Li, Geng, Qin, Yong, Zhang, Hewei, Song, Xuejuan, Wang, Boyang, Wang, Ziwei, and Mi, Wentian
Published: 2024
Full Text: View/download PDF

24. Noncovalent hybridization of Fe single-atom with biochar for highly efficient peroxymonosulfate activation: Built-in electric field-driven radical and non-radical processes

Author: Fang, Shu, He, Yiyang, Cao, Xiao, Li, Yaru, Gu, Lin, Mao, Wei, Wang, Boyang, and Zhang, Hanlin
Published: 2024
Full Text: View/download PDF

25. A foreground-context dual-guided network for light-field salient object detection

Author: Zheng, Xin, Wang, Boyang, Liu, Deyang, Lv, Chengtao, Yan, Jiebin, and An, Ping
Published: 2024
Full Text: View/download PDF

26. A cotton endoreduplication gene, GaTOP6B, regulates trichome branching development

Author: Song, Jiaqi, Wang, Ao, Zhu, Wei, Yang, Lanlan, Xie, Zhengqing, Han, Xingzhou, Wang, Boyang, Tian, Baoming, Zhang, Luyue, Chen, Weiwei, Wei, Fang, and Shi, Gongyao
Published: 2024
Full Text: View/download PDF

27. Integrated design of multifunctional lightweight magnetic cellulose-based aerogel with 1D/2D/3D hierarchical network for efficient microwave absorption

Author: Wang, Boyang, Nan, Kai, Rao, Huichao, Chen, Yikun, Pei, Ruifeng, and Wang, Yan
Published: 2024
Full Text: View/download PDF

28. Deformation measurement by single spherical near-field intensity measurement for large reflector antenna

Author: Ye, Qian, Wang, Boyang, Yao, Qiang, Wang, Jinqing, Liu, Qinghui, and Shen, Zhiqiang
Subjects: Astrophysics - Instrumentation and Methods for Astrophysics
Abstract: This paper presents a new method to obtain the deformation distribution on the main reflector of an antenna only by measuring the electric intensity on a spherical surface with the focal point as the center of the sphere, regardless of phase. Combining the differential geometry theory with geometric optics method, this paper has derived a deformation-intensity equation to relate the surface deformation to the intensity distribution of a spherical near-field directly. Based on the Finite difference method (FDM) and Gauss-Seidel iteration, deformation has been calculated from intensity simulated by GO and PO method, respectively, with relatively small errors, which prove the effectiveness of the equation proposed in this paper. By means of this method , it is possible to measure the deformation only by scanning the electric intensity of a single hemispherical near-field whose area is only about $1/15$ of the aperture. And the measurement only needs a plane wave at any frequency as the incident wave, which means that both the signals from the outer space satellite and the far-field artificial beacon could be used as the sources. The scanning can be realized no matter what attitude and elevation angle the antenna is in because the size and angle of the hemisphere are changeable.
Published: 2021
Full Text: View/download PDF

29. Three-dimensional refractive index microscopy based on the multi-layer propagation model with obliquity factor correction

Author: Tong, Zhan, Ren, Xuesong, Zhang, Zihan, Wang, Boyang, Miao, Yubin, and Meng, Guoxiang
Published: 2024
Full Text: View/download PDF

30. Modified magnesium oxide/silver nanoparticles reinforced poly (butylene succinate-co-terephthalate) composite biofilms for food packaging application

Author: Zhang, Jianing, Zhang, Jie, Wang, Boyang, Li, Wei, Wang, Huifang, Guo, Ruijie, Yu, Wenwen, Xie, Lan, and Zheng, Qiang
Published: 2024
Full Text: View/download PDF

31. Mixed noise-guided mutual constraint framework for unsupervised anomaly detection in smart industries

Author: Zhao, Qing, Wang, Yan, Lin, Yuxuan, Yan, Shaoqi, Song, Wei, Wang, Boyang, Huang, Jun, Chang, Yang, Qi, Lizhe, and Zhang, Wenqiang
Published: 2024
Full Text: View/download PDF

32. Granular fuzzy rule-based model construction under the collaboration of multiple organizations

Author: Liu, Bingsheng, Wang, Boyang, Shen, Yinghua, Pedrycz, Witold, and Chen, Yuan
Published: 2024
Full Text: View/download PDF

33. Origin and geological control of desorbed gas in multi-thin coal seam in the Wujiu depression, Hailar Basin, China

Author: Li, Geng, Qin, Yong, Song, Xuejuan, Wang, Boyang, Yao, Haipeng, and Lin, Yabing
Published: 2023
Full Text: View/download PDF

34. Fluid seepage mechanism and permeability prediction model of multi-seam interbed coal measures

Author: Li, Geng, Qin, Yong, Wang, Boyang, Zhang, Miao, Lin, Yabing, Song, Xuejuan, and Mi, Wentian
Published: 2024
Full Text: View/download PDF

35. State-of-the-art of biomass-derived carbon dots: Preparation, properties, and applications

Author: Fang, Mengyuan, Wang, Boyang, Qu, Xiaoli, Li, Senrui, Huang, Jinsheng, Li, Jiangnan, Lu, Siyu, and Zhou, Nan
Published: 2024
Full Text: View/download PDF

36. Identification, evolution and geological indications of solid bitumen in shales: A case study of the first member of Cretaceous Qingshankou Formation in Songliao Basin, NE China

Author: LIU, Bo, WANG, Liu, FU, Xiaofei, HUO, Qiuli, BAI, Longhui, LYU, Jiancai, and WANG, Boyang
Published: 2023
Full Text: View/download PDF

37. FastReach: A system for privacy-preserving reachability queries over location data

Author: Quan, Hanyu, Wang, Boyang, Li, Ming, and Leontiadis, Iraklis
Published: 2023
Full Text: View/download PDF

38. The evolution of structure, properties and polar domains in rare earth and PbTiO3 co-substituted BiFeO3 ferroelectric ceramics

Author: Hu, Hao, Zhuang, Jian, Weng, Yunxiang, Zhang, Nan, Wang, Boyang, Wang, Dawei, Feng, Guobao, and Ren, Wei
Published: 2023
Full Text: View/download PDF

39. A Benchmark for Studying Diabetic Retinopathy: Segmentation, Grading, and Transferability

Author: Zhou, Yi, Wang, Boyang, Huang, Lei, Cui, Shanshan, and Shao, Ling
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: People with diabetes are at risk of developing an eye disease called diabetic retinopathy (DR). This disease occurs when high blood glucose levels cause damage to blood vessels in the retina. Computer-aided DR diagnosis is a promising tool for early detection of DR and severity grading, due to the great success of deep learning. However, most current DR diagnosis systems do not achieve satisfactory performance or interpretability for ophthalmologists, due to the lack of training data with consistent and fine-grained annotations. To address this problem, we construct a large fine-grained annotated DR dataset containing 2,842 images (FGADR). This dataset has 1,842 images with pixel-level DR-related lesion annotations, and 1,000 images with image-level labels graded by six board-certified ophthalmologists with intra-rater consistency. The proposed dataset will enable extensive studies on DR diagnosis. We set up three benchmark tasks for evaluation: 1. DR lesion segmentation; 2. DR grading by joint classification and segmentation; 3. Transfer learning for ocular multi-disease identification. Moreover, a novel inductive transfer learning method is introduced for the third task. Extensive experiments using different state-of-the-art methods are conducted on our FGADR dataset, which can serve as baselines for future research.
Published: 2020
Full Text: View/download PDF

40. Sensing of Acetaminophen Drug Using Silicon-Doped Graphdiyne: a DFT Inspection

Author: Zhu, He, Xing, Yanxia, An, Xiaowen, Wang, Boyang, Chang, Guifang, and Yang, Tao
Published: 2023
Full Text: View/download PDF

41. Fingerprinting Encrypted Voice Traffic on Smart Speakers with Deep Learning

Author: Wang, Chenggang, Kennedy, Sean, Li, Haipeng, Hudson, King, Atluri, Gowtham, Wei, Xuetao, Sun, Wenhai, and Wang, Boyang
Subjects: Computer Science - Cryptography and Security
Abstract: This paper investigates the privacy leakage of smart speakers under an encrypted traffic analysis attack, referred to as voice command fingerprinting. In this attack, an adversary can eavesdrop both outgoing and incoming encrypted voice traffic of a smart speaker, and infers which voice command a user says over encrypted traffic. We first built an automatic voice traffic collection tool and collected two large-scale datasets on two smart speakers, Amazon Echo and Google Home. Then, we implemented proof-of-concept attacks by leveraging deep learning. Our experimental results over the two datasets indicate disturbing privacy concerns. Specifically, compared to 1% accuracy with random guess, our attacks can correctly infer voice commands over encrypted traffic with 92.89\% accuracy on Amazon Echo. Despite variances that human voices may cause on outgoing traffic, our proof-of-concept attacks remain effective even only leveraging incoming traffic (i.e., the traffic from the server). This is because the AI-based voice services running on the server side response commands in the same voice and with a deterministic or predictable manner in text, which leaves distinguishable pattern over encrypted traffic. We also built a proof-of-concept defense to obfuscate encrypted traffic. Our results show that the defense can effectively mitigate attack accuracy on Amazon Echo to 32.18%.
Published: 2020

42. Data Inference from Encrypted Databases: A Multi-dimensional Order-Preserving Matching Approach

Author: Pan, Yanjun, Efrat, Alon, Li, Ming, Wang, Boyang, Quan, Hanyu, Mitchell, Joseph, Gao, Jie, and Arkin, Esther
Subjects: Computer Science - Cryptography and Security, Computer Science - Computational Geometry, Computer Science - Databases
Abstract: Due to increasing concerns of data privacy, databases are being encrypted before they are stored on an untrusted server. To enable search operations on the encrypted data, searchable encryption techniques have been proposed. Representative schemes use order-preserving encryption (OPE) for supporting efficient Boolean queries on encrypted databases. Yet, recent works showed the possibility of inferring plaintext data from OPE-encrypted databases, merely using the order-preserving constraints, or combined with an auxiliary plaintext dataset with similar frequency distribution. So far, the effectiveness of such attacks is limited to single-dimensional dense data (most values from the domain are encrypted), but it remains challenging to achieve it on high-dimensional datasets (e.g., spatial data) which are often sparse in nature. In this paper, for the first time, we study data inference attacks on multi-dimensional encrypted databases (with 2-D as a special case). We formulate it as a 2-D order-preserving matching problem and explore both unweighted and weighted cases, where the former maximizes the number of points matched using only order information and the latter further considers points with similar frequencies. We prove that the problem is NP-hard, and then propose a greedy algorithm, along with a polynomial-time algorithm with approximation guarantees. Experimental results on synthetic and real-world datasets show that the data recovery rate is significantly enhanced compared with the previous 1-D matching algorithm., Comment: 11 pages, 4 figures
Published: 2020

43. DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images

Author: Zhou, Yi, Wang, Boyang, He, Xiaodong, Cui, Shanshan, and Shao, Ling
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Diabetic retinopathy (DR) is a complication of diabetes that severely affects eyes. It can be graded into five levels of severity according to international protocol. However, optimizing a grading model to have strong generalizability requires a large amount of balanced training data, which is difficult to collect particularly for the high severity levels. Typical data augmentation methods, including random flipping and rotation, cannot generate data with high diversity. In this paper, we propose a diabetic retinopathy generative adversarial network (DR-GAN) to synthesize high-resolution fundus images which can be manipulated with arbitrary grading and lesion information. Thus, large-scale generated data can be used for more meaningful augmentation to train a DR grading and lesion segmentation model. The proposed retina generator is conditioned on the structural and lesion masks, as well as adaptive grading vectors sampled from the latent grading space, which can be adopted to control the synthesized grading severity. Moreover, a multi-scale spatial and channel attention module is devised to improve the generation ability to synthesize details. Multi-scale discriminators are designed to operate from large to small receptive fields, and joint adversarial losses are adopted to optimize the whole network in an end-to-end manner. With extensive experiments evaluated on the EyePACS dataset connected to Kaggle, as well as the FGADR dataset, we validate the effectiveness of our method, which can both synthesize highly realistic (1280 x 1280) controllable fundus images and contribute to the DR grading task., Comment: Extension work of our MICCAI paper
Published: 2019
Full Text: View/download PDF

44. Fabrication and chemical modification of carbon nanodots/monolayer hexagonal boron nitride/substrate heterostructures and their terahertz optoelectronic properties

Author: Wen, Hua, Wang, Boyang, Cheng, Xingjia, Song, Dan, Xiao, Huan, Xu, Wen, and Lu, Siyu
Published: 2023
Full Text: View/download PDF

45. Nanomechanical and chemical variations of inertinite and vitrinite within lacustrine shale during oil generation

Author: Gao, Yifei, Liu, Bo, Fu, Xiaofei, Tian, Shansi, Wang, Boyang, Wang, Liu, Gentzis, Thomas, and Ostadhassan, Mehdi
Published: 2023
Full Text: View/download PDF

46. Resilient or resistant to non-neutral environments? A comparative study on occupant thermal needs in buildings under natural ventilation, fee-free heating, and fee-charged heating modes

Author: Hu, Jinhua, He, Yingdong, Wang, Qiquan, Wang, Boyang, Hao, Xiaoli, Li, Nianping, Yin, Wei, and Liu, Lifang
Published: 2023
Full Text: View/download PDF

47. Depicting the regulatory role of JZOL on TRP channels in the treatment of Acute Bronchitis based on the combination of clinical trials, computational analysis and in vivo experiments

Author: Fan, Qinhua, primary, Du, Yawei, additional, Wu, Chongming, additional, Wang, Boyang, additional, Xie, Yanming, additional, Zhang, Zeling, additional, Su, Wenquan, additional, Wang, Zizhuo, additional, Xu, Changchang, additional, Li, Xueke, additional, Ding, Ying, additional, Xiao, Xinjiang, additional, Yu, Rong, additional, Li, Nan, additional, Wang, Juan, additional, Teng, Yiqun, additional, Lv, Hongfen, additional, Yang, Nian, additional, Wen, Yuling, additional, Huang, Xiaoli, additional, Pan, Wei, additional, Liu, Yufeng, additional, Xi, Xueqin, additional, Zhao, Qianye, additional, Liu, Changshan, additional, Xu, Jian, additional, Zhang, Haitao, additional, Zhuo, Lie, additional, Rong, Qiangquan, additional, Xia, Yu, additional, Shen, Qin, additional, Li, Shao, additional, Wang, Junhong, additional, and Wu, Shengxian, additional
Published: 2024
Full Text: View/download PDF

48. TinyPower: Side-Channel Attacks with Tiny Neural Networks

Author: Li, Haipeng, primary, Ninan, Mabon, additional, Wang, Boyang, additional, and Emmert, John M., additional
Published: 2024
Full Text: View/download PDF

49. Evaluation of Retinal Image Quality Assessment Networks in Different Color-spaces

Author: Fu, Huazhu, Wang, Boyang, Shen, Jianbing, Cui, Shanshan, Xu, Yanwu, Liu, Jiang, and Shao, Ling
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Retinal image quality assessment (RIQA) is essential for controlling the quality of retinal imaging and guaranteeing the reliability of diagnoses by ophthalmologists or automated analysis systems. Existing RIQA methods focus on the RGB color-space and are developed based on small datasets with binary quality labels (i.e., `Accept' and `Reject'). In this paper, we first re-annotate an Eye-Quality (EyeQ) dataset with 28,792 retinal images from the EyePACS dataset, based on a three-level quality grading system (i.e., `Good', `Usable' and `Reject') for evaluating RIQA methods. Our RIQA dataset is characterized by its large-scale size, multi-level grading, and multi-modality. Then, we analyze the influences on RIQA of different color-spaces, and propose a simple yet efficient deep network, named Multiple Color-space Fusion Network (MCF-Net), which integrates the different color-space representations at both a feature-level and prediction-level to predict image quality grades. Experiments on our EyeQ dataset show that our MCF-Net obtains a state-of-the-art performance, outperforming the other deep learning methods. Furthermore, we also evaluate diabetic retinopathy (DR) detection methods on images of different quality, and demonstrate that the performances of automated diagnostic systems are highly dependent on image quality., Comment: Accepted by MICCAI 2019. Corrected two typos in Table 1 as: (1) in training set, the number of "Usable + All" should be '1,876'; (2) In testing set, the number of "Total + DR-0" should be '11,362'. Project page: https://github.com/hzfu/EyeQ
Published: 2019
Full Text: View/download PDF

50. Regeneration and Joining of the Learned Motion Primitives for Automated Vehicle Motion Planning Applications

Author: Wang, Boyang, Gong, Jianwei, Liang, Wenli, and Chen, Huiyan
Subjects: Computer Science - Robotics
Abstract: How to integrate human factors into the motion planning system is of great significance for improving the acceptance of intelligent vehicles. Decomposing motion into primitives and then accurately and smoothly joining the motion primitives (MPs) is an essential issue in the motion planning system. Therefore, the purpose of this paper is to regenerate and join the learned MPs in the library. By applying a representation algorithm based on the modified dynamic movement primitives (DMPs) and singular value decomposition (SVD), our method separates the basic shape parameters and fine-tuning shape parameters from the same type of demonstration trajectories in the MP library. Moreover, we convert the MP joining problem into a re-representation problem and use the characteristics of the proposed representation algorithm to achieve an accurate and smooth transition. This paper demonstrates that the proposed method can effectively reduce the number of shape adjustment parameters when the MPs are regenerated without affecting the accuracy of the representation. Besides, we also present the ability of the proposed method to smooth the velocity jump when the MPs are connected and evaluate its effect on the accuracy of tracking the set target points. The results show that the proposed method can not only improve the adjustment ability of a single MP in response to different motion planning requirements but also meet the basic requirements of MP joining in the generation of MP sequences.
Published: 2019

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

782 results on '"Wang, Boyang"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources