7,680 results on '"Yalong An"'
Search Results
2. A 192 bp ERV fragment insertion in the first intron of porcine TLR6 may act as an enhancer associated with the increased expressions of TLR6 and TLR1
- Author
-
XiaoYan Wang, Zixuan Chen, Eduard Murani, Enrico D’Alessandro, Yalong An, Cai Chen, Kui Li, Grazia Galeano, Klaus Wimmers, and Chengyi Song
- Subjects
Pig ,Retrotransposon ,RIP ,ERV ,TLRs ,TLR6 ,Genetics ,QH426-470 - Abstract
Abstract Background Toll-like receptors (TLRs) play important roles in building innate immune and inducing adaptive immune responses. Associations of the TLR genes polymorphisms with disease susceptibility, which are the basis of molecular breeding for disease resistant animals, have been reported extensively. Retrotransposon insertion polymorphisms (RIPs), as a new type of molecular markers developed recently, have great potential in population genetics and quantitative trait locus mapping. In this study, bioinformatic prediction combined with PCR-based amplification was employed to screen for RIPs in porcine TLR genes. Their population distribution was examined, and for one RIP the impact on gene activity and phenotype was further evaluated. Results Five RIPs, located at the 3' flank of TLR3, 5' flank of TLR5, intron 1 of TLR6, intron 1 of TLR7, and 3' flank of TLR8 respectively, were identified. These RIPs were detected in different breeds with an uneven distribution among them. By using the dual luciferase activity assay a 192 bp endogenous retrovirus (ERV) in the intron 1 of TLR6 was shown to act as an enhancer increasing the activities of TLR6 putative promoter and two mini-promoters. Furthermore, real-time quantitative polymerase chain reaction (qPCR) analysis revealed significant association (p
- Published
- 2021
- Full Text
- View/download PDF
3. Stable room-temperature multiferroic skyrmions in lithium niobate with enhanced Pockels effect
- Author
-
Yu, Yalong, Xiong, Bo, Wu, Siqi, Ren, Yekai, Chen, Nuo, Mi, Qingjiao, Lou, Kangping, Wang, Rui, and Chu, Tao
- Subjects
Condensed Matter - Materials Science - Abstract
Lithium Niobate (LN) is a ferroelectric material with exceptional electrical characteristics, including high piezoelectricity, high Pockels effect, etc. These properties make it a promising platform for numerous fields such as high-speed communication, optical computation, and quantum information processing. Besides these, the introduction of magnetic structures to LN holds significant potential to achieve magnetoelectric coupling, which can be applied in magnetic memory and data-processing devices with high efficiency. Here, for the first time, we observe a special topological magnetic structure called magnetic skyrmion in LN (SK-LN) by the combination of magnetic field annealing and rapid annealing processes . Compared to the magnetic skyrmions reported in magnetic systems, SK-LN exhibit exceptionally high stability. Additionally, the center of the magnetic vortex exhibits spontaneous ferroelectric polarization, indicating its multiferroic characteristic. With the excitation of these multiferroic skyrmions, the modulation efficiency of the electro-optical (EO) modulator fabricated on thin film lithium niobate on insulator (LNOI) wafer was found to be enhanced from 1.98 V*cm to 0.63 V*cm. It is considered that the multiferroic skyrmions significantly enhance the Pockels coefficient of LN to 101 pm/V, nearly three times the result (32pm/V) reported previously.
- Published
- 2024
4. STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
- Author
-
Ma, Xiaoxiao, Zhou, Mohan, Liang, Tao, Bai, Yalong, Zhao, Tiejun, Chen, Huaian, and Jin, Yi
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We present STAR, a text-to-image model that employs scale-wise auto-regressive paradigm. Unlike VAR, which is limited to class-conditioned synthesis within a fixed set of predetermined categories, our STAR enables text-driven open-set generation through three key designs: To boost diversity and generalizability with unseen combinations of objects and concepts, we introduce a pre-trained text encoder to extract representations for textual constraints, which we then use as guidance. To improve the interactions between generated images and fine-grained textual guidance, making results more controllable, additional cross-attention layers are incorporated at each scale. Given the natural structure correlation across different scales, we leverage 2D Rotary Positional Encoding (RoPE) and tweak it into a normalized version. This ensures consistent interpretation of relative positions across token maps at different scales and stabilizes the training process. Extensive experiments demonstrate that STAR surpasses existing benchmarks in terms of fidelity,image text consistency, and aesthetic quality. Our findings emphasize the potential of auto-regressive methods in the field of high-quality image synthesis, offering promising new directions for the T2I field currently dominated by diffusion methods., Comment: 12 pages, 6 figures
- Published
- 2024
5. Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset
- Author
-
Zhu, Jie, Li, Junhui, Wen, Yalong, and Guo, Lifan
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
In light of recent breakthroughs in large language models (LLMs) that have revolutionized natural language processing (NLP), there is an urgent need for new benchmarks to keep pace with the fast development of LLMs. In this paper, we propose CFLUE, the Chinese Financial Language Understanding Evaluation benchmark, designed to assess the capability of LLMs across various dimensions. Specifically, CFLUE provides datasets tailored for both knowledge assessment and application assessment. In knowledge assessment, it consists of 38K+ multiple-choice questions with associated solution explanations. These questions serve dual purposes: answer prediction and question reasoning. In application assessment, CFLUE features 16K+ test instances across distinct groups of NLP tasks such as text classification, machine translation, relation extraction, reading comprehension, and text generation. Upon CFLUE, we conduct a thorough evaluation of representative LLMs. The results reveal that only GPT-4 and GPT-4-turbo achieve an accuracy exceeding 60\% in answer prediction for knowledge assessment, suggesting that there is still substantial room for improvement in current LLMs. In application assessment, although GPT-4 and GPT-4-turbo are the top two performers, their considerable advantage over lightweight LLMs is noticeably diminished. The datasets and scripts associated with CFLUE are openly accessible at https://github.com/aliyun/cflue., Comment: Accepted by ACL 2024
- Published
- 2024
6. Domain adaptive pose estimation via multi-level alignment
- Author
-
Chen, Yugan, Zhao, Lin, Xu, Yalong, Zu, Honglei, An, Xiaoqi, and Li, Guangyu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Domain adaptive pose estimation aims to enable deep models trained on source domain (synthesized) datasets produce similar results on the target domain (real-world) datasets. The existing methods have made significant progress by conducting image-level or feature-level alignment. However, only aligning at a single level is not sufficient to fully bridge the domain gap and achieve excellent domain adaptive results. In this paper, we propose a multi-level domain adaptation aproach, which aligns different domains at the image, feature, and pose levels. Specifically, we first utilize image style transer to ensure that images from the source and target domains have a similar distribution. Subsequently, at the feature level, we employ adversarial training to make the features from the source and target domains preserve domain-invariant characeristics as much as possible. Finally, at the pose level, a self-supervised approach is utilized to enable the model to learn diverse knowledge, implicitly addressing the domain gap. Experimental results demonstrate that significant imrovement can be achieved by the proposed multi-level alignment method in pose estimation, which outperforms previous state-of-the-art in human pose by up to 2.4% and animal pose estimation by up to 3.1% for dogs and 1.4% for sheep., Comment: accepted to icme2024
- Published
- 2024
7. VizGroup: An AI-Assisted Event-Driven System for Real-Time Collaborative Programming Learning Analytics
- Author
-
Tang, Xiaohang, Wong, Sam, Pu, Kevin, Chen, Xi, Yang, Yalong, and Chen, Yan
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Programming instructors often conduct collaborative learning activities, like Peer Instruction, to foster a deeper understanding in students and enhance their engagement with learning. These activities, however, may not always yield productive outcomes due to the diversity of student mental models and their ineffective collaboration. In this work, we introduce VizGroup, an AI-assisted system that enables programming instructors to easily oversee students' real-time collaborative learning behaviors during large programming courses. VizGroup leverages Large Language Models (LLMs) to recommend event specifications for instructors so that they can simultaneously track and receive alerts about key correlation patterns between various collaboration metrics and ongoing coding tasks. We evaluated VizGroup with 12 instructors using a dataset collected from a Peer Instruction activity that was conducted in a large programming lecture. The results showed that compared to a version of VizGroup without the suggested units, VizGroup with suggested units helped instructors create additional monitoring units on previously undetected patterns on their own, covered a more diverse range of metrics, and influenced the participants' following notification creation strategies.
- Published
- 2024
8. Evaluating Navigation and Comparison Performance of Computational Notebooks on Desktop and in Virtual Reality
- Author
-
In, Sungwon, Krokos, Erick, Whitley, Kirsten, North, Chris, and Yang, Yalong
- Subjects
Computer Science - Human-Computer Interaction - Abstract
The computational notebook serves as a versatile tool for data analysis. However, its conventional user interface falls short of keeping pace with the ever-growing data-related tasks, signaling the need for novel approaches. With the rapid development of interaction techniques and computing environments, there is a growing interest in integrating emerging technologies in data-driven workflows. Virtual reality, in particular, has demonstrated its potential in interactive data visualizations. In this work, we aimed to experiment with adapting computational notebooks into VR and verify the potential benefits VR can bring. We focus on the navigation and comparison aspects as they are primitive components in analysts' workflow. To further improve comparison, we have designed and implemented a Branching&Merging functionality. We tested computational notebooks on the desktop and in VR, both with and without the added Branching&Merging capability. We found VR significantly facilitated navigation compared to desktop, and the ability to create branches enhanced comparison., Comment: 15 pages, 10 figures, ACM CHI 2024
- Published
- 2024
9. Dynamic Prompt Optimizing for Text-to-Image Generation
- Author
-
Mo, Wenyi, Zhang, Tianyu, Bai, Yalong, Su, Bing, Wen, Ji-Rong, and Yang, Qing
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Text-to-image generative models, specifically those based on diffusion models like Imagen and Stable Diffusion, have made substantial advancements. Recently, there has been a surge of interest in the delicate refinement of text prompts. Users assign weights or alter the injection time steps of certain words in the text prompts to improve the quality of generated images. However, the success of fine-control prompts depends on the accuracy of the text prompts and the careful selection of weights and time steps, which requires significant manual intervention. To address this, we introduce the \textbf{P}rompt \textbf{A}uto-\textbf{E}diting (PAE) method. Besides refining the original prompts for image generation, we further employ an online reinforcement learning strategy to explore the weights and injection time steps of each word, leading to the dynamic fine-control prompts. The reward function during training encourages the model to consider aesthetic score, semantic consistency, and user preferences. Experimental results demonstrate that our proposed method effectively improves the original prompts, generating visually more appealing images while maintaining semantic alignment. Code is available at https://github.com/Mowenyii/PAE., Comment: Accepted to CVPR 2024
- Published
- 2024
10. A degeneration formula of Donaldson-Thomas theory on Calabi-Yau 4-folds
- Author
-
Cao, Yalong, Zhao, Gufang, and Zhou, Zijun
- Subjects
Mathematics - Algebraic Geometry ,High Energy Physics - Theory - Abstract
We prove a degeneration formula for Donaldson-Thomas theory on Calabi-Yau 4-folds, and apply it to compute zero dimensional invariants on $\mathbb{C}^4$ and on any local curve., Comment: 65 pages
- Published
- 2024
11. Non-autoregressive Generative Models for Reranking Recommendation
- Author
-
Ren, Yuxin, Yang, Qiya, Wu, Yichun, Xu, Wei, Wang, Yalong, and Zhang, Zhiqiang
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence - Abstract
Contemporary recommendation systems are designed to meet users' needs by delivering tailored lists of items that align with their specific demands or interests. In a multi-stage recommendation system, reranking plays a crucial role by modeling the intra-list correlations among items. The key challenge of reranking lies in the exploration of optimal sequences within the combinatorial space of permutations. Recent research proposes a generator-evaluator learning paradigm, where the generator generates multiple feasible sequences and the evaluator picks out the best sequence based on the estimated listwise score. The generator is of vital importance, and generative models are well-suited for the generator function. Current generative models employ an autoregressive strategy for sequence generation. However, deploying autoregressive models in real-time industrial systems is challenging. To address these issues, we propose a Non-AutoRegressive generative model for reranking Recommendation (NAR4Rec) designed to enhance efficiency and effectiveness. To tackle challenges such as sparse training samples and dynamic candidates, we introduce a matching model. Considering the diverse nature of user feedback, we employ a sequence-level unlikelihood training objective to differentiate feasible sequences from unfeasible ones. Additionally, to overcome the lack of dependency modeling in non-autoregressive models regarding target items, we introduce contrastive decoding to capture correlations among these items. Extensive offline experiments validate the superior performance of NAR4Rec over state-of-the-art reranking methods. Online A/B tests reveal that NAR4Rec significantly enhances the user experience. Furthermore, NAR4Rec has been fully deployed in a popular video app Kuaishou with over 300 million daily active users., Comment: Accepted by KDD 2024
- Published
- 2024
12. StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models
- Author
-
Zhou, Mohan, Bai, Yalong, Yang, Qing, and Zhao, Tiejun
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The ability to fine-tune generative models for text-to-image generation tasks is crucial, particularly facing the complexity involved in accurately interpreting and visualizing textual inputs. While LoRA is efficient for language model adaptation, it often falls short in text-to-image tasks due to the intricate demands of image generation, such as accommodating a broad spectrum of styles and nuances. To bridge this gap, we introduce StyleInject, a specialized fine-tuning approach tailored for text-to-image models. StyleInject comprises multiple parallel low-rank parameter matrices, maintaining the diversity of visual features. It dynamically adapts to varying styles by adjusting the variance of visual features based on the characteristics of the input signal. This approach significantly minimizes the impact on the original model's text-image alignment capabilities while adeptly adapting to various styles in transfer learning. StyleInject proves particularly effective in learning from and enhancing a range of advanced, community-fine-tuned generative models. Our comprehensive experiments, including both small-sample and large-scale data fine-tuning as well as base model distillation, show that StyleInject surpasses traditional LoRA in both text-image semantic consistency and human preference evaluation, all while ensuring greater parameter efficiency., Comment: 11 pages, 11 figures
- Published
- 2024
13. Interplay of Landau quantization and interminivalley scatterings in a weakly coupled moir\'e superlattice
- Author
-
Yuan, Yalong, Liu, Le, Zhu, Jundong, Dong, Jingwei, Chu, Yanbang, Wu, Fanfan, Du, Luojun, Watanabe, Kenji, Taniguchi, Takashi, Shi, Dongxia, Zhang, Guangyu, and Yang, Wei
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
Double layer quantum systems are promising platforms for realizing novel quantum phases. Here, we report a study of quantum oscillations (QOs) in a weakly coupled double layer system, composed of a large angle twisted double bilayer graphene (TDBG). We observe two different QOs at low temperature, one with a periodicity in carrier density (n), i.e. Shubnikov de Haas oscillation (SdHO) due to Landau quantization, and the other one in displacement field (D), resulting a grid pattern. We quantify the interlayer coupling strength by measuring the interlayer capacitance from the grid pattern with a capacitance model, revealing an electron hole asymmetry. At high temperature when SdHO are thermal smeared, we observe resistance peaks when LLs from two minivalleys in the moir\'e Brillion zone are aligned, regardless of carrier density; eventually, it results in a two fold increase of oscillating frequency in D, serving as a smoking gun evidence of the magneto intersubband oscillations (MISO) in a double layer system. The temperature dependence of MISO suggests electron-electron interaction between two minivalleys play a crucial rule in the scattering, and the scattering times obtained from MISO thermal damping are found to be correlated with the interlayer coupling strength. Our study reveals an intriguing interplay among Landau quantization, moir\'e band structure, and scatterings.
- Published
- 2024
14. Multiple measurements on the cosmic curvature using Gaussian process regression without calibration and a cosmological model
- Author
-
Gong, Xiaolong, Xu, Yifei, Liu, Tonghua, Cao, Shuo, Jiang, Jianyong, Nan, Yalong, and Ding, Ruobin
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
In this letter, we propose an improved cosmological model-independent method to measure cosmic curvature, combining the recent measurements of transverse and line-of-sight directions in the baryon acoustic oscillations (BAO) with cosmic chronometers (CC) datasets. Considering that the CC dataset is discrete and includes only 32 $H(z)$ measurements, we apply Gaussian process (GP) regression to fit the CC dataset and reconstruct them. Our methodology, which does not need the calibration or selection of any cosmological model, provide multiple measurements of spatial curvature ($\Omega_K$) at different redshifts (depending on the redshift coverage of BAO dataset). For combination of all BAO data, we find that the constraint result on cosmic curvature is $\Omega_K=-0.096^{+0.190}_{-0.195}$ with $1\sigma$ observational uncertainty. Although the measured $\Omega_K$ is in good agreement with zero cosmic curvature within 1$\sigma$ confidence level, our result revels the fact of a closed universe. More importantly, our results show that the obtained $\Omega_K$ measurements are almost unaffected by different priors of the Hubble constant. This could help solve the issue of the Hubble tension that may be caused by inconsistencies in the spatial curvature between the early and late universes., Comment: 12 pages, 2 figures, welcome to comment, submitted to Physics Letters B
- Published
- 2024
- Full Text
- View/download PDF
15. CscK metrics near the canonical class
- Author
-
Guo, Bin, Jian, Wangjian, Shi, Yalong, and Song, Jian
- Subjects
Mathematics - Differential Geometry ,Mathematics - Analysis of PDEs ,53C55, 35J60 - Abstract
Let $X$ be a K\"ahler manifold with semi-ample canonical bundle $K_X$. It is proved by Jian-Shi-Song that for any K\"ahler class $\gamma$, there exists $\delta>0$ such that for all $t\in (0, \delta)$ there exists a unique cscK metric $g_t$ in $K_X+ t \gamma $. In this paper, we prove that $\{ (X, g_t) \}_{ t\in (0, \delta)} $ have uniformly bounded K\"ahler potentials, volume forms and diameters. As a consequence, these metric spaces are pre-compact in the Gromov-Hausdorff sense., Comment: 13 pages, no figures. All comments are welcome!
- Published
- 2024
16. Green functions for GJMS operators on spheres, Gegenbauer polynomials and rigidity theorems
- Author
-
Chen, Xuezhang and Shi, Yalong
- Subjects
Mathematics - Differential Geometry ,Mathematics - Analysis of PDEs ,35J08, 53C24 - Abstract
We derive explicit representation formulae of Green functions for GJMS operators on $n$-spheres, including the fractional ones. These formulae not only have natural geometric interpretations concerning the extrinsic geometry of the round sphere, but also reflect the spherical rigidity among closed embedded hypersurfaces in $\mathbb{R}^{n+1}$., Comment: 36 pages, no figures. Related works are mentioned, typos corrected. Theorem 1(3) now includes more cases. We also provide an alternative proof of Theorem 3(2) when n is at least 5 in the appendix, by using the asymptotic expansion formula of Green functions. Comments are welcome!
- Published
- 2024
17. An Approach for Multi-Object Tracking with Two-Stage Min-Cost Flow
- Author
-
Li, Huining, Jiang, Yalong, Zeng, Xianlin, Li, Feng, and Wang, Zhipeng
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The minimum network flow algorithm is widely used in multi-target tracking. However, the majority of the present methods concentrate exclusively on minimizing cost functions whose values may not indicate accurate solutions under occlusions. In this paper, by exploiting the properties of tracklets intersections and low-confidence detections, we develop a two-stage tracking pipeline with an intersection mask that can accurately locate inaccurate tracklets which are corrected in the second stage. Specifically, we employ the minimum network flow algorithm with high-confidence detections as input in the first stage to obtain the candidate tracklets that need correction. Then we leverage the intersection mask to accurately locate the inaccurate parts of candidate tracklets. The second stage utilizes low-confidence detections that may be attributed to occlusions for correcting inaccurate tracklets. This process constructs a graph of nodes in inaccurate tracklets and low-confidence nodes and uses it for the second round of minimum network flow calculation. We perform sufficient experiments on popular MOT benchmark datasets and achieve 78.4 MOTA on the test set of MOT16, 79.2 on MOT17, and 76.4 on MOT20, which shows that the proposed method is effective.
- Published
- 2023
18. Discrete element analysis of deformation and failure characteristics in a slope affected by corrosion deterioration and underground mining: a case study of the Jiweishan landslide, China
- Author
-
Yang, Zhongping, Zhao, Qian, Li, Shiqi, Zhao, Yalong, Liu, Xinrong, and Zhong, Zuliang
- Published
- 2024
- Full Text
- View/download PDF
19. Modulating the thermophysical properties of diamond/SiC composites via controlling the diamond graphitization
- Author
-
Wang, Xulei, Li, Yikang, Huang, Yabo, Zhang, Yalong, Wang, Pei, Guan, Li, He, Xinbo, Liu, Rongjun, Qu, Xuanhui, and Wu, Xiaoge
- Published
- 2024
- Full Text
- View/download PDF
20. Crystal Plasticity-Based Assessment of Constitutive Laws for Microstructure and Rolling Texture Capture in Ferritic Stainless Steel During Cold Rolling
- Author
-
Song, Kangjie, Luo, Yalong, Zhang, Chi, Zhang, Liwen, Deng, Guanyu, and Zheng, Huaibei
- Published
- 2024
- Full Text
- View/download PDF
21. Plenodomus tracheiphilus, but not Dothiorella ulmi, causes wilt disease on elm trees in Alberta, Canada
- Author
-
Yang, Yalong, Fu, Heting, Zahr, Kher, Xue, Shiming, Calpas, James, Demilliano, Krista, Harding, Michael W., Feindel, David, and Feng, Jie
- Published
- 2024
- Full Text
- View/download PDF
22. Pathological response and tumor stroma immunogenic features predict long-term survival in non-small cell lung cancer after neoadjuvant chemotherapy
- Author
-
Wang, Shuaibo, Sun, Xujie, Dong, Jiyan, Liu, Li, Zhao, Hao, Li, Renda, Yang, Zhenlin, Cheng, Na, Wang, Yalong, Fu, Li, Yi, Hang, Lv, Zhuoheng, Huo, Huandong, Jin, Donghui, Mao, Yousheng, and Yang, Lin
- Published
- 2024
- Full Text
- View/download PDF
23. Crystal Plasticity Analysis of the Orientation-Dependent Grain Rotation and Fragmentation Behaviors in Ferritic Stainless Steel During Cold Rolling
- Author
-
Song, Kangjie, Miao, Luyang, Luo, Yalong, Zhang, Chi, Zhang, Liwen, and Deng, Guanyu
- Published
- 2024
- Full Text
- View/download PDF
24. Catalyzing Urban Dynamics: Fostering Information Exchange, Encouraging Innovation, and Facilitating Knowledge Creation in the Macau Peninsula
- Author
-
Pan, Chen, Li, Haibo, Wang, Lu, Wu, Jiawei, Lee, Mengshun, Xing, Yalong, and Liu, Xiaodong
- Published
- 2024
- Full Text
- View/download PDF
25. Inhibition of LSD1 via SP2509 attenuated the progression of rheumatoid arthritis
- Author
-
Yu, Ziliang, Li, Peipei, Gao, Dagong, Hu, Yalong, Xia, Fei, Liu, Lei, Liu, Jian, Liu, Wei, and Zhang, Haiping
- Published
- 2024
- Full Text
- View/download PDF
26. Exploring the Relationships between Perceived Transformational Leadership and Transactional Leadership and Teachers' Intellectual Style
- Author
-
Liu, Peng, Liu, Lili, Bo, Yalong, and Yang, Hui
- Abstract
The purpose of this study was to understand the relationship between perceived transformational leadership and transactional leadership style and teachers' intellectual style. Based on 967 middle school teachers' survey answers, this study identified that there are positive relationships between perceived transformational leadership and Type I intellectual style, as well as a positive relationship between perceived instructional leadership and Type II intellectual style. This study paves the way for a theoretical understanding of the relationship between leadership style and teachers' intellectual style and provides practical suggestions for education practitioners.
- Published
- 2023
27. This is the Table I Want! Interactive Data Transformation on Desktop and in Virtual Reality
- Author
-
In, Sungwon, Lin, Tica, North, Chris, Pfister, Hanspeter, and Yang, Yalong
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Data transformation is an essential step in data science. While experts primarily use programming to transform their data, there is an increasing need to support non-programmers with user interface-based tools. With the rapid development in interaction techniques and computing environments, we report our empirical findings about the effects of interaction techniques and environments on performing data transformation tasks. Specifically, we studied the potential benefits of direct interaction and virtual reality (VR) for data transformation. We compared gesture interaction versus a standard WIMP user interface, each on the desktop and in VR. With the tested data and tasks, we found time performance was similar between desktop and VR. Meanwhile, VR demonstrates preliminary evidence to better support provenance and sense-making throughout the data transformation process. Our exploration of performing data transformation in VR also provides initial affirmation for enabling an iterative and fully immersive data science workflow., Comment: IEEE Transactions on Visualization and Computer Graphics (TVCG), to appear
- Published
- 2023
- Full Text
- View/download PDF
28. Reasonable Anomaly Detection in Long Sequences
- Author
-
Jiang, Yalong and Li, Changkang
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Video anomaly detection is a challenging task due to the lack in approaches for representing samples. The visual representations of most existing approaches are limited by short-term sequences of observations which cannot provide enough clues for achieving reasonable detections. In this paper, we propose to completely represent the motion patterns of objects by learning from long-term sequences. Firstly, a Stacked State Machine (SSM) model is proposed to represent the temporal dependencies which are consistent across long-range observations. Then SSM model functions in predicting future states based on past ones, the divergence between the predictions with inherent normal patterns and observed ones determines anomalies which violate normal motion patterns. Extensive experiments are carried out to evaluate the proposed approach on the dataset and existing ones. Improvements over state-of-the-art methods can be observed. Our code is available at https://github.com/AllenYLJiang/Anomaly-Detection-in-Sequences., Comment: 8 pages, 1 figure
- Published
- 2023
29. Multi-Focus Querying of the Human Genome Information on Desktop and in Virtual Reality: an Evaluation
- Author
-
Reiske, Gunnar, In, Sungwon, and Yang, Yalong
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Graphics - Abstract
The human genome is incredibly information-rich, consisting of approximately 25,000 protein-coding genes spread out over 3.2 billion nucleotide base pairs contained within 24 unique chromosomes. The genome is important in maintaining spatial context, which assists in understanding gene interactions and relationships. However, existing methods of genome visualization that utilize spatial awareness are inefficient and prone to limitations in presenting gene information and spatial context. This study proposed an innovative approach to genome visualization and exploration utilizing virtual reality. To determine the optimal placement of gene information and evaluate its essentiality in a VR environment, we implemented and conducted a user study with three different interaction methods. Two interaction methods were developed in virtual reality to determine if gene information is better suited to be embedded within the chromosome ideogram or separate from the ideogram. The final ideogram interaction method was performed on a desktop and served as a benchmark to evaluate the potential benefits associated with the use of VR. Our study findings reveal a preference for VR, despite longer task completion times. In addition, the placement of gene information within the visualization had a notable impact on the ability of a user to complete tasks. Specifically, gene information embedded within the chromosome ideogram was better suited for single target identification and summarization tasks, while separating gene information from the ideogram better supported region comparison tasks., Comment: To be published at ISMAR 2023 conference
- Published
- 2023
30. Data-driven Storytelling in Hybrid Immersive Display Environments
- Author
-
Zhou, Xiaoyan, Yang, Yalong, Ortega, Francisco, Batmaz, Anil Ufuk, and Lee, Benjamin
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Data-driven stories seek to inform and persuade audiences through the use of data visualisations and engaging narratives. These stories have now been highly optimised to be viewed on desktop and mobile computers. In contrast, while immersive virtual and augmented reality (VR/AR) technologies have been shown to be more persuasive, no clear standard has yet emerged for such immersive stories. With this in mind, we propose that a hybrid data-driven storytelling approach can leverage the familiarity of 2D display devices with the immersiveness and presence afforded by VR/AR headsets. In this position paper, we characterise hybrid data-driven stories by describing its design opportunities, considerations, and challenges. In particular, we describe how both 2D and 3D display environments can play either complementary or symbiotic roles with each other for the purposes of storytelling. We hope that this work inspires researchers to investigate how hybrid user interfaces may be used for storytelling.
- Published
- 2023
31. RL-LABEL: A Deep Reinforcement Learning Approach Intended for AR Label Placement in Dynamic Scenarios
- Author
-
Zhu-Tian, Chen, Chiappalupi, Daniele, Lin, Tica, Yang, Yalong, Beyer, Johanna, and Pfister, Hanspeter
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Graphics - Abstract
Labels are widely used in augmented reality (AR) to display digital information. Ensuring the readability of AR labels requires placing them occlusion-free while keeping visual linkings legible, especially when multiple labels exist in the scene. Although existing optimization-based methods, such as force-based methods, are effective in managing AR labels in static scenarios, they often struggle in dynamic scenarios with constantly moving objects. This is due to their focus on generating layouts optimal for the current moment, neglecting future moments and leading to sub-optimal or unstable layouts over time. In this work, we present RL-LABEL, a deep reinforcement learning-based method for managing the placement of AR labels in scenarios involving moving objects. RL-LABEL considers the current and predicted future states of objects and labels, such as positions and velocities, as well as the user's viewpoint, to make informed decisions about label placement. It balances the trade-offs between immediate and long-term objectives. Our experiments on two real-world datasets show that RL-LABEL effectively learns the decision-making process for long-term optimization, outperforming two baselines (i.e., no view management and a force-based method) by minimizing label occlusions, line intersections, and label movement distance. Additionally, a user study involving 18 participants indicates that RL-LABEL excels over the baselines in aiding users to identify, compare, and summarize data on AR labels within dynamic scenes.
- Published
- 2023
- Full Text
- View/download PDF
32. Genome Mining from Agriculturally Relevant Fungi Led to a d-Glucose Esterified Polyketide with a Terpene-like Core Structure.
- Author
-
Yan, Chunsheng, Han, Wenyu, Zhou, Qingyang, Niwa, Kanji, Tang, Melody, Burch, Jessica, Zhang, Yalong, Delgadillo, David, Sun, Zuodong, Wu, Zhongshou, Jacobsen, Steven, Nelson, Hosea, Houk, K, and Tang, Yi
- Subjects
Terpenes ,Polyketides ,Multigene Family ,Fungi - Abstract
Comparison of biosynthetic gene clusters (BGCs) found in devastating plant pathogens and biocontrol fungi revealed an uncharacterized and conserved polyketide BGC. Genome mining identified the associated metabolite to be treconorin, which has a terpene-like, trans-fused 5,7-bicyclic core that is proposed to derive from a (4 + 3) cycloaddition. The core is esterified with d-glucose, which derives from the glycosidic cleavage of a trehalose ester precursor. This glycomodification strategy is different from the commonly observed glycosylation of natural products.
- Published
- 2023
33. Exploring passengers’ choice of transfer city in air-to-rail intermodal travel using an interpretable ensemble machine learning approach
- Author
-
Ren, Yifeng, Yang, Min, Chen, Enhui, Cheng, Long, and Yuan, Yalong
- Published
- 2024
- Full Text
- View/download PDF
34. Fabrication of poly(vinylidene fluoride)/graphite heterogeneous porous carbon nanofiber composite mat by electrospraying method for efficient oil–water separation
- Author
-
Liu, Yalong, Kong, Fangyuan, Xin, Binjie, Chen, Zhuoming, Xu, Yingqi, Liu, Yan, Li, Lifeng, and Newton, Md All Amin
- Published
- 2024
- Full Text
- View/download PDF
35. Cyclin-dependent Kinase 5 and Neurodegenerative Diseases
- Author
-
Song, Mingxue, Qiang, Yalong, Zhao, Xiulan, and Song, Fuyong
- Published
- 2024
- Full Text
- View/download PDF
36. Clarifying the correlations between hydraulic indicators evaluating the hydraulic performance of free water surface constructed wetlands
- Author
-
Wan, Di, Li, Yalong, Zhu, Qing, Cui, Yuanlai, Shu, Yonghong, and Guo, Changqiang
- Published
- 2024
- Full Text
- View/download PDF
37. Learning and Evaluating Human Preferences for Conversational Head Generation
- Author
-
Zhou, Mohan, Bai, Yalong, Zhang, Wei, Yao, Ting, Zhao, Tiejun, and Mei, Tao
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
A reliable and comprehensive evaluation metric that aligns with manual preference assessments is crucial for conversational head video synthesis methods development. Existing quantitative evaluations often fail to capture the full complexity of human preference, as they only consider limited evaluation dimensions. Qualitative evaluations and user studies offer a solution but are time-consuming and labor-intensive. This limitation hinders the advancement of conversational head generation algorithms and systems. In this paper, we propose a novel learning-based evaluation metric named Preference Score (PS) for fitting human preference according to the quantitative evaluations across different dimensions. PS can serve as a quantitative evaluation without the need for human annotation. Experimental results validate the superiority of Preference Score in aligning with human perception, and also demonstrate robustness and generalizability to unseen data, making it a valuable tool for advancing conversation head generation. We expect this metric could facilitate new advances in conversational head generation. Project Page: https://https://github.com/dc3ea9f/PreferenceScore., Comment: Accepted by ACM Multimedia 2023
- Published
- 2023
- Full Text
- View/download PDF
38. Interactive Conversational Head Generation
- Author
-
Zhou, Mohan, Bai, Yalong, Zhang, Wei, Yao, Ting, and Zhao, Tiejun
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We introduce a new conversation head generation benchmark for synthesizing behaviors of a single interlocutor in a face-to-face conversation. The capability to automatically synthesize interlocutors which can participate in long and multi-turn conversations is vital and offer benefits for various applications, including digital humans, virtual agents, and social robots. While existing research primarily focuses on talking head generation (one-way interaction), hindering the ability to create a digital human for conversation (two-way) interaction due to the absence of listening and interaction parts. In this work, we construct two datasets to address this issue, ``ViCo'' for independent talking and listening head generation tasks at the sentence level, and ``ViCo-X'', for synthesizing interlocutors in multi-turn conversational scenarios. Based on ViCo and ViCo-X, we define three novel tasks targeting the interaction modeling during the face-to-face conversation: 1) responsive listening head generation making listeners respond actively to the speaker with non-verbal signals, 2) expressive talking head generation guiding speakers to be aware of listeners' behaviors, and 3) conversational head generation to integrate the talking/listening ability in one interlocutor. Along with the datasets, we also propose corresponding baseline solutions to the three aforementioned tasks. Experimental results show that our baseline method could generate responsive and vivid agents that can collaborate with real person to fulfil the whole conversation. Project page: https://vico.solutions/., Comment: arXiv admin note: text overlap with arXiv:2112.13548
- Published
- 2023
39. Deep Equilibrium Multimodal Fusion
- Author
-
Ni, Jinhong, Bai, Yalong, Zhang, Wei, Yao, Ting, and Mei, Tao
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Multimedia - Abstract
Multimodal fusion integrates the complementary information present in multiple modalities and has gained much attention recently. Most existing fusion approaches either learn a fixed fusion strategy during training and inference, or are only capable of fusing the information to a certain extent. Such solutions may fail to fully capture the dynamics of interactions across modalities especially when there are complex intra- and inter-modality correlations to be considered for informative multimodal fusion. In this paper, we propose a novel deep equilibrium (DEQ) method towards multimodal fusion via seeking a fixed point of the dynamic multimodal fusion process and modeling the feature correlations in an adaptive and recursive manner. This new way encodes the rich information within and across modalities thoroughly from low level to high level for efficacious downstream multimodal learning and is readily pluggable to various multimodal frameworks. Extensive experiments on BRCA, MM-IMDB, CMU-MOSI, SUN RGB-D, and VQA-v2 demonstrate the superiority of our DEQ fusion. More remarkably, DEQ fusion consistently achieves state-of-the-art performance on multiple multimodal benchmarks. The code will be released.
- Published
- 2023
40. Observation of first-order quantum phase transitions and ferromagnetism in twisted double bilayer graphene
- Author
-
Liu, Le, Lu, Xin, Chu, Yanbang, Yang, Guang, Yuan, Yalong, Wu, Fanfan, Ji, Yiru, Tian, Jinpeng, Watanabe, Kenji, Taniguchi, Takashi, Du, Luojun, Shi, Dongxia, Liu, Jianpeng, Shen, Jie, Lu, Li, Yang, Wei, and Zhang, Guangyu
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Strongly Correlated Electrons - Abstract
Twisted graphene multilayers are highly tunable flatband systems for developing new phases of matter. Thus far, while orbital ferromagnetism has been observed in valley polarized phases, the long-range orders of other correlated phases as well as the quantum phase transitions between different orders mostly remain unknown. Here, we report an observation of Coulomb interaction driven first-order quantum phase transitions and ferromagnetism in twisted double bilayer graphene (TDBG). At zero magnetic field, the transitions are revealed in a series of step-like abrupt resistance jumps with prominent hysteresis loop when either the displacement field (D) or the carrier density (n) is tuned across symmetry-breaking boundary near half filling, indicating a formation of ordered domains. It is worth noting that the good turnability and switching of these states gives a rise to a memory performance with a large on/off ratio. Moreover, when both spin and valley play the roles at finite magnetic field, we observe abundant first-order quantum phase transitions among normal metallic states from charge neutral point, orbital ferromagnetic states from quarter filling, and spin-polarized states from half filling. We interpret these first-order phase transitions in the picture of phase separations and spin domain percolations driven by multi-field tunable Coulomb interactions, in agreement with Lifshitz transition from Hartree-Fock calculations. The observed multi-filed tunable domain structure and its hysteresis resembles the characteristics of multiferroics, revealing intriguing magnetoelectric properties. Our result enriches the correlated phase diagram in TDBG for discovering novel exotic phases and quantum phase transitions, and it would benefit other twisted moir\'e systems as well.
- Published
- 2023
- Full Text
- View/download PDF
41. Visual-Aware Text-to-Speech
- Author
-
Zhou, Mohan, Bai, Yalong, Zhang, Wei, Yao, Ting, Zhao, Tiejun, and Mei, Tao
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Computation and Language ,Computer Science - Sound - Abstract
Dynamically synthesizing talking speech that actively responds to a listening head is critical during the face-to-face interaction. For example, the speaker could take advantage of the listener's facial expression to adjust the tones, stressed syllables, or pauses. In this work, we present a new visual-aware text-to-speech (VA-TTS) task to synthesize speech conditioned on both textual inputs and sequential visual feedback (e.g., nod, smile) of the listener in face-to-face communication. Different from traditional text-to-speech, VA-TTS highlights the impact of visual modality. On this newly-minted task, we devise a baseline model to fuse phoneme linguistic information and listener visual signals for speech synthesis. Extensive experiments on multimodal conversation dataset ViCo-X verify our proposal for generating more natural audio with scenario-appropriate rhythm and prosody., Comment: accepted as oral and top 3% paper by ICASSP 2023
- Published
- 2023
- Full Text
- View/download PDF
42. Quasimaps to quivers with potentials
- Author
-
Cao, Yalong and Zhao, Gufang
- Subjects
Mathematics - Algebraic Geometry ,High Energy Physics - Theory ,Mathematics - Representation Theory ,14N35, 14D23, 20G42 - Abstract
This paper is concerned with a non-compact GIT quotient of a vector space, in the presence of an abelian group action and an equivariant regular function (potential) on the quotient. We define virtual counts of quasimaps from prestable curves to the critical locus of the potential, and prove a gluing formula in the formalism of cohomological field theories. The main examples studied in this paper is when the above setting arises from quivers with potentials, where the above construction gives quantum correction to the equivariant Chow homology of the critical locus. Following similar ideas as in quasimaps to Nakajima quiver varieties studied by the Okounkov school, we analyse vertex functions in several examples, including Hilbert schemes of points on $\mathbb{C}^3$, moduli spaces of perverse coherent systems on the resolved conifold, and a quiver which defines higher $\mathfrak{sl}_2$-spin chains. Bethe equations are calculated in these cases. The construction in the present paper is based on the theory of gauged linear sigma models as well as shifted symplectic geometry of Pantev, To\"en, Vaquie and Vezzosi, and uses the virtual pullback formalism of symmetric obstruction theory of Park, which arises from the recent development of Donaldson-Thomas theory of Calabi-Yau 4-folds., Comment: 87 pages
- Published
- 2023
43. $L^\infty$ estimates for K\'ahler-Ricci flow on K\'ahler-Einstein Fano manifolds: a new derivation
- Author
-
Jian, Wangjian and Shi, Yalong
- Subjects
Mathematics - Differential Geometry - Abstract
Assuming Perelman's estimates, we give a new proof of uniform $L^\infty$ estimate along normalized K\"ahler-Ricci flow on Fano manifolds with K\"ahler-Einstein metrics, using Chen-Cheng's auxiliary Monge-Amp\`ere equation and the Alexandrov-Bakelman-Pucci maximum principle. This proof does not use pluripotential theory., Comment: comments are welcome
- Published
- 2023
44. Distributed Leadership and Teacher Work Engagement: The Mediating Role of Teacher Efficacy and the Moderating Role of Interpersonal Trust
- Author
-
Cai, Yonghong, Liu, Peng, Tang, Runjia, and Bo, Yalong
- Abstract
Teacher work engagement is essential for schools to achieve their educational objectives and student learning outcomes. Among all the influencing factors, supportive workplace resources from diverse sources such as distributed leadership, trust, and self-efficacy have been cited as important factors affecting teachers' engagement in their work. Conservation of resources theory proposes a theoretical mechanism of how supportive resources can facilitate employee engagement; this study aims to further examine and investigate the ways in which teacher work engagement is enhanced by distributed leadership and the role of teacher efficacy and trust in this relationship. The findings of a survey involving 577 Chinese primary school teachers reveal that teacher efficacy completely mediates the positive relationship between distributed leadership and teacher work engagement. Distributed leadership also leads to increased teacher efficacy owing to higher levels of trust. These results suggest that distributed leadership is a highly effective approach that school administrators can adopt and that a workplace climate with a high level of interpersonal trust should be considered to further improve the effectiveness of school leadership and management.
- Published
- 2023
- Full Text
- View/download PDF
45. Research on Surface Tension Tank Technology of Aerospace Cryogenic Propellant
- Author
-
Ma, Tianju, primary, Yu, Bin, additional, Huang, Cheng, additional, Wang, Yalong, additional, Yang, Xuehu, additional, Li, Ganggang, additional, Gu, Sendong, additional, Hou, Yanhui, additional, and Chang, Xin, additional
- Published
- 2024
- Full Text
- View/download PDF
46. Electron-infrared phonon coupling in ABC trilayer graphene
- Author
-
Zan, Xiaozhou, Guo, Xiangdong, Deng, Aolin, Huang, Zhiheng, Liu, Le, Wu, Fanfan, Yuan, Yalong, Zhao, Jiaojiao, Peng, Yalin, Li, Lu, Zhang, Yangkun, Li, Xiuzhen, Zhu, Jundong, Dong, Jingwei, Shi, Dongxia, Yang, Wei, Yang, Xiaoxia, Shi, Zhiwen, Du, Luojun, Dai, Qing, and Zhang, Guangyu
- Subjects
Physics - Optics ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
Stacking order plays a crucial role in determining the crystal symmetry and has significant impacts on electronic, optical, magnetic, and topological properties. Electron-phonon coupling, which is central to a wide range of intriguing quantum phenomena, is expected to be intricately connected with stacking order. Understanding the stacking order-dependent electron-phonon coupling is essential for understanding peculiar physical phenomena associated with electron-phonon coupling, such as superconductivity and charge density waves. In this study, we investigate the effect of stacking order on electron-infrared phonon coupling in graphene trilayers. By using gate-tunable Raman spectroscopy and excitation frequency-dependent near-field infrared nanoscopy, we show that rhombohedral ABC-stacked trilayer graphene has a significantly stronger electron-infrared phonon coupling strength than the Bernal ABA-stacked trilayer graphene. Our findings provide novel insights into the superconductivity and other fundamental physical properties of rhombohedral ABC-stacked trilayer graphene, and can enable nondestructive and high-throughput imaging of trilayer graphene stacking order using Raman scattering.
- Published
- 2023
47. High-efficiency electro-optic modulator on thin-film lithium niobate with high-permittivity cladding
- Author
-
Chen, Nuo, Lou, Kangping, Yu, Yalong, He, Xuanjian, and Chu, Tao
- Subjects
Physics - Optics ,Physics - Applied Physics - Abstract
Thin-film lithium niobate is a promising platform owing to its large electro-optic coefficients and low propagation loss. However, the large footprints of devices limit their application in large-scale integrated optical systems. A crucial challenge is how to maintain the performance advantage given the design space restrictions in this situation. This article proposes and demonstrates a high-efficiency lithium niobate electro-optic (EO) modulator with high-permittivity cladding to improve the electric field strength in waveguides and its overlap with optical fields while maintaining low optical loss and broad bandwidth. The proposed modulator exhibits considerable improvement, featuring a low half-wave voltage-length product of 1.41 Vcm, a low excess loss of 0.5 dB, and a broad 3 dB EO bandwidth of more than 40 GHz. This modulation efficiency is the highest reported for a broadband lithium niobate modulator so far. The design scheme of using high-permittivity cladding may provide a promising solution for improving the integration of photonic devices on the thin-film lithium niobate platform and these devices may serve as fundamental components in large-scale photonic integrated circuits in the future., Comment: 10 pages, 6 figures
- Published
- 2023
48. Texture-Based Input Feature Selection for Action Recognition
- Author
-
Jiang, Yalong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The performance of video action recognition has been significantly boosted by using motion representations within a two-stream Convolutional Neural Network (CNN) architecture. However, there are a few challenging problems in action recognition in real scenarios, e.g., the variations in viewpoints and poses, and the changes in backgrounds. The domain discrepancy between the training data and the test data causes the performance drop. To improve the model robustness, we propose a novel method to determine the task-irrelevant content in inputs which increases the domain discrepancy. The method is based on a human parsing model (HP model) which jointly conducts dense correspondence labelling and semantic part segmentation. The predictions from the HP model also function as re-rendering the human regions in each video using the same set of textures to make humans appearances in all classes be the same. A revised dataset is generated for training and testing and makes the action recognition model exhibit invariance to the irrelevant content in the inputs. Moreover, the predictions from the HP model are used to enrich the inputs to the AR model during both training and testing. Experimental results show that our proposed model is superior to existing models for action recognition on the HMDB-51 dataset and the Penn Action dataset.
- Published
- 2023
49. Learning cross space mapping via DNN using large scale click-through logs
- Author
-
Yu, Wei, Yang, Kuiyuan, Bai, Yalong, Yao, Hongxun, and Rui, Yong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The gap between low-level visual signals and high-level semantics has been progressively bridged by continuous development of deep neural network (DNN). With recent progress of DNN, almost all image classification tasks have achieved new records of accuracy. To extend the ability of DNN to image retrieval tasks, we proposed a unified DNN model for image-query similarity calculation by simultaneously modeling image and query in one network. The unified DNN is named the cross space mapping (CSM) model, which contains two parts, a convolutional part and a query-embedding part. The image and query are mapped to a common vector space via these two parts respectively, and image-query similarity is naturally defined as an inner product of their mappings in the space. To ensure good generalization ability of the DNN, we learn weights of the DNN from a large number of click-through logs which consists of 23 million clicked image-query pairs between 1 million images and 11.7 million queries. Both the qualitative results and quantitative results on an image retrieval evaluation task with 1000 queries demonstrate the superiority of the proposed method., Comment: Accepted by IEEE Transactions on Multimedia 2015
- Published
- 2023
50. Towards an Understanding of Distributed Asymmetric Collaborative Visualization on Problem-solving
- Author
-
Tong, Wai, Xia, Meng, Wong, Kam Kwai, Bowman, Doug A., Pong, Ting-Chuen, Qu, Huamin, and Yang, Yalong
- Subjects
Computer Science - Human-Computer Interaction - Abstract
This paper provided empirical knowledge of the user experience for using collaborative visualization in a distributed asymmetrical setting through controlled user studies. With the ability to access various computing devices, such as Virtual Reality (VR) head-mounted displays, scenarios emerge when collaborators have to or prefer to use different computing environments in different places. However, we still lack an understanding of using VR in an asymmetric setting for collaborative visualization. To get an initial understanding and better inform the designs for asymmetric systems, we first conducted a formative study with 12 pairs of participants. All participants collaborated in asymmetric (PC-VR) and symmetric settings (PC-PC and VR-VR). We then improved our asymmetric design based on the key findings and observations from the first study. Another ten pairs of participants collaborated with enhanced PC-VR and PC-PC conditions in a follow-up study. We found that a well-designed asymmetric collaboration system could be as effective as a symmetric system. Surprisingly, participants using PC perceived less mental demand and effort in the asymmetric setting (PC-VR) compared to the symmetric setting (PC-PC). We provided fine-grained discussions about the trade-offs between different collaboration settings., Comment: 11 pages, 12 figures, accepted at IEEE VR 2023
- Published
- 2023
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.