Author: "Wang, Weiran" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wang, Weiran"' showing total 523 results

Start Over Author "Wang, Weiran"

523 results on '"Wang, Weiran"'

1. Text Injection for Neural Contextual Biasing

Author: Meng, Zhong, Wu, Zelin, Prabhavalkar, Rohit, Peyser, Cal, Wang, Weiran, Chen, Nanxin, Sainath, Tara N., and Ramabhadran, Bhuvana
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Neural contextual biasing effectively improves automatic speech recognition (ASR) for crucial phrases within a speaker's context, particularly those that are infrequent in the training data. This work proposes contextual text injection (CTI) to enhance contextual ASR. CTI leverages not only the paired speech-text data, but also a much larger corpus of unpaired text to optimize the ASR model and its biasing component. Unpaired text is converted into speech-like representations and used to guide the model's attention towards relevant bias phrases. Moreover, we introduce a contextual text-injected (CTI) minimum word error rate (MWER) training, which minimizes the expected WER caused by contextual biasing when unpaired text is injected into the model. Experiments show that CTI with 100 billion text sentences can achieve up to 43.3% relative WER reduction from a strong neural biasing model. CTI-MWER provides a further relative improvement of 23.5%., Comment: 5 pages, 1 figure
Published: 2024

2. Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR

Author: Wu, Zelin, Song, Gan, Li, Christopher, Rondon, Pat, Meng, Zhong, Velez, Xavier, Wang, Weiran, Caseiro, Diamantino, Pundak, Golan, Munkhdalai, Tsendsuren, Chandorkar, Angad, and Prabhavalkar, Rohit
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Contextual biasing enables speech recognizers to transcribe important phrases in the speaker's context, such as contact names, even if they are rare in, or absent from, the training data. Attention-based biasing is a leading approach which allows for full end-to-end cotraining of the recognizer and biasing system and requires no separate inference-time components. Such biasers typically consist of a context encoder; followed by a context filter which narrows down the context to apply, improving per-step inference time; and, finally, context application via cross attention. Though much work has gone into optimizing per-frame performance, the context encoder is at least as important: recognition cannot begin before context encoding ends. Here, we show the lightweight phrase selection pass can be moved before context encoding, resulting in a speedup of up to 16.1 times and enabling biasing to scale to 20K phrases with a maximum pre-decoding delay under 33ms. With the addition of phrase- and wordpiece-level cross-entropy losses, our technique also achieves up to a 37.5% relative WER reduction over the baseline without the losses and lightweight phrase selection pass., Comment: 9 pages, 3 figures, accepted by NAACL 2024 - Industry Track
Published: 2024

3. TransformerFAM: Feedback attention is working memory

Author: Hwang, Dongseong, Wang, Weiran, Huo, Zhuoyuan, Sim, Khe Chai, and Mengibar, Pedro Moreno
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: While Transformers have revolutionized deep learning, their quadratic attention complexity hinders their ability to process infinitely long inputs. We propose Feedback Attention Memory (FAM), a novel Transformer architecture that leverages a feedback loop to enable the network to attend to its own latent representations. This design fosters the emergence of working memory within the Transformer, allowing it to process indefinitely long sequences. TransformerFAM requires no additional weights, enabling seamless integration with pre-trained models. Our experiments show that TransformerFAM significantly improves Transformer performance on long-context tasks across various model sizes (1B, 8B, and 24B). These results showcase the potential to empower Large Language Models (LLMs) to process sequences of unlimited length., Comment: 26 pages, 12 figures, 14 tables
Published: 2024

4. Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

Author: Prabhavalkar, Rohit, Meng, Zhong, Wang, Weiran, Stooke, Adam, Cai, Xingyu, He, Yanzhang, Narayanan, Arun, Hwang, Dongseong, Sainath, Tara N., and Moreno, Pedro J.
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: The accuracy of end-to-end (E2E) automatic speech recognition (ASR) models continues to improve as they are scaled to larger sizes, with some now reaching billions of parameters. Widespread deployment and adoption of these models, however, requires computationally efficient strategies for decoding. In the present work, we study one such strategy: applying multiple frame reduction layers in the encoder to compress encoder outputs into a small number of output frames. While similar techniques have been investigated in previous work, we achieve dramatically more reduction than has previously been demonstrated through the use of multiple funnel reduction layers. Through ablations, we study the impact of various architectural choices in the encoder to identify the most effective strategies. We demonstrate that we can generate one encoder output frame for every 2.56 sec of input speech, without significantly affecting word error rate on a large-scale voice search task, while improving encoder and decoder latencies by 48% and 92% respectively, relative to a strong but computationally expensive baseline., Comment: Accepted to 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)
Published: 2024

5. USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

Author: Ding, Shaojin, Qiu, David, Rim, David, He, Yanzhang, Rybakov, Oleg, Li, Bo, Prabhavalkar, Rohit, Wang, Weiran, Sainath, Tara N., Han, Zhonglin, Li, Jian, Yazdanbakhsh, Amir, and Agrawal, Shivani
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: End-to-end automatic speech recognition (ASR) models have seen revolutionary quality gains with the recent development of large-scale universal speech models (USM). However, deploying these massive USMs is extremely expensive due to the enormous memory usage and computational cost. Therefore, model compression is an important research topic to fit USM-based ASR under budget in real-world scenarios. In this study, we propose a USM fine-tuning approach for ASR, with a low-bit quantization and N:M structured sparsity aware paradigm on the model weights, reducing the model complexity from parameter precision and matrix topology perspectives. We conducted extensive experiments with a 2-billion parameter USM on a large-scale voice search dataset to evaluate our proposed method. A series of ablation studies validate the effectiveness of up to int4 quantization and 2:4 sparsity. However, a single compression technique fails to recover the performance well under extreme setups including int2 quantization and 1:4 sparsity. By contrast, our proposed method can compress the model to have 9.4% of the size, at the cost of only 7.3% relative word error rate (WER) regressions. We also provided in-depth analyses on the results and discussions on the limitations and potential solutions, which would be valuable for future studies., Comment: Accepted by ICASSP 2024. Preprint
Published: 2023

6. Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm

Author: Wang, Weiran, Wu, Zelin, Caseiro, Diamantino, Munkhdalai, Tsendsuren, Sim, Khe Chai, Rondon, Pat, Pundak, Golan, Song, Gan, Prabhavalkar, Rohit, Meng, Zhong, Zhao, Ding, Sainath, Tara, and Mengibar, Pedro Moreno
Subjects: Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Contextual biasing refers to the problem of biasing the automatic speech recognition (ASR) systems towards rare entities that are relevant to the specific user or application scenarios. We propose algorithms for contextual biasing based on the Knuth-Morris-Pratt algorithm for pattern matching. During beam search, we boost the score of a token extension if it extends matching into a set of biasing phrases. Our method simulates the classical approaches often implemented in the weighted finite state transducer (WFST) framework, but avoids the FST language altogether, with careful considerations on memory footprint and efficiency on tensor processing units (TPUs) by vectorization. Without introducing additional model parameters, our method achieves significant word error rate (WER) reductions on biasing test sets by itself, and yields further performance gain when combined with a model-based biasing method.
Published: 2023

7. Massive End-to-end Models for Short Search Queries

Author: Wang, Weiran, Prabhavalkar, Rohit, Hwang, Dongseong, Li, Qiujia, Sim, Khe Chai, Li, Bo, Qin, James, Cai, Xingyu, Stooke, Adam, Meng, Zhong, Zheng, CJ, He, Yanzhang, Sainath, Tara, and Mengibar, Pedro Moreno
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: In this work, we investigate two popular end-to-end automatic speech recognition (ASR) models, namely Connectionist Temporal Classification (CTC) and RNN-Transducer (RNN-T), for offline recognition of voice search queries, with up to 2B model parameters. The encoders of our models use the neural architecture of Google's universal speech model (USM), with additional funnel pooling layers to significantly reduce the frame rate and speed up training and inference. We perform extensive studies on vocabulary size, time reduction strategy, and its generalization performance on long-form test sets. Despite the speculation that, as the model size increases, CTC can be as good as RNN-T which builds label dependency into the prediction, we observe that a 900M RNN-T clearly outperforms a 1.8B CTC and is more tolerant to severe time reduction, although the WER gap can be largely removed by LM shallow fusion.
Published: 2023

8. Augmenting conformers with structured state-space sequence models for online speech recognition

Author: Shan, Haozhe, Gu, Albert, Meng, Zhong, Wang, Weiran, Choromanski, Krzysztof, and Sainath, Tara
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Online speech recognition, where the model only accesses context to the left, is an important and challenging use case for ASR systems. In this work, we investigate augmenting neural encoders for online ASR by incorporating structured state-space sequence models (S4), a family of models that provide a parameter-efficient way of accessing arbitrarily long left context. We performed systematic ablation studies to compare variants of S4 models and propose two novel approaches that combine them with convolutions. We found that the most effective design is to stack a small S4 using real-valued recurrent weights with a local convolution, allowing them to work complementarily. Our best model achieves WERs of 4.01%/8.53% on test sets from Librispeech, outperforming Conformers with extensively tuned convolution., Comment: ICASSP 2024
Published: 2023

9. Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

Author: Huang, Yiling, Wang, Weiran, Zhao, Guanlong, Liao, Hank, Xia, Wei, and Wang, Quan
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Machine Learning, Computer Science - Sound, Statistics - Machine Learning
Abstract: While standard speaker diarization attempts to answer the question "who spoken when", most of relevant applications in reality are more interested in determining "who spoken what". Whether it is the conventional modularized approach or the more recent end-to-end neural diarization (EEND), an additional automatic speech recognition (ASR) model and an orchestration algorithm are required to associate the speaker labels with recognized words. In this paper, we propose Word-level End-to-End Neural Diarization (WEEND) with auxiliary network, a multi-task learning algorithm that performs end-to-end ASR and speaker diarization in the same neural architecture. That is, while speech is being recognized, speaker labels are predicted simultaneously for each recognized word. Experimental results demonstrate that WEEND outperforms the turn-based diarization baseline system on all 2-speaker short-form scenarios and has the capability to generalize to audio lengths of 5 minutes. Although 3+speaker conversations are harder, we find that with enough in-domain training data, WEEND has the potential to deliver high quality diarized text.
Published: 2023

10. The Rise and Potential of Large Language Model Based Agents: A Survey

Author: Xi, Zhiheng, Chen, Wenxiang, Guo, Xin, He, Wei, Ding, Yiwen, Hong, Boyang, Zhang, Ming, Wang, Junzhe, Jin, Senjie, Zhou, Enyu, Zheng, Rui, Fan, Xiaoran, Wang, Xiao, Xiong, Limao, Zhou, Yuhao, Wang, Weiran, Jiang, Changhao, Zou, Yicheng, Liu, Xiangyang, Yin, Zhangyue, Dou, Shihan, Weng, Rongxiang, Cheng, Wensen, Zhang, Qi, Qin, Wenjuan, Zheng, Yongyan, Qiu, Xipeng, Huang, Xuanjing, and Gui, Tao
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are artificial entities that sense their environment, make decisions, and take actions. Many efforts have been made to develop intelligent agents, but they mainly focus on advancement in algorithms or training strategies to enhance specific capabilities or performance on particular tasks. Actually, what the community lacks is a general and powerful model to serve as a starting point for designing AI agents that can adapt to diverse scenarios. Due to the versatile capabilities they demonstrate, large language models (LLMs) are regarded as potential sparks for Artificial General Intelligence (AGI), offering hope for building general AI agents. Many researchers have leveraged LLMs as the foundation to build AI agents and have achieved significant progress. In this paper, we perform a comprehensive survey on LLM-based agents. We start by tracing the concept of agents from its philosophical origins to its development in AI, and explain why LLMs are suitable foundations for agents. Building upon this, we present a general framework for LLM-based agents, comprising three main components: brain, perception, and action, and the framework can be tailored for different applications. Subsequently, we explore the extensive applications of LLM-based agents in three aspects: single-agent scenarios, multi-agent scenarios, and human-agent cooperation. Following this, we delve into agent societies, exploring the behavior and personality of LLM-based agents, the social phenomena that emerge from an agent society, and the insights they offer for human society. Finally, we discuss several key topics and open problems within the field. A repository for the related papers at https://github.com/WooooDyy/LLM-Agent-Paper-List., Comment: 86 pages, 12 figures
Published: 2023

11. Text Injection for Capitalization and Turn-Taking Prediction in Speech Models

Author: Bijwadia, Shaan, Chang, Shuo-yiin, Wang, Weiran, Meng, Zhong, Zhang, Hao, and Sainath, Tara N.
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Text injection for automatic speech recognition (ASR), wherein unpaired text-only data is used to supplement paired audio-text data, has shown promising improvements for word error rate. This study examines the use of text injection for auxiliary tasks, which are the non-ASR tasks often performed by an E2E model. In this work, we use joint end-to-end and internal language model training (JEIT) as our text injection algorithm to train an ASR model which performs two auxiliary tasks. The first is capitalization, which is a de-normalization task. The second is turn-taking prediction, which attempts to identify whether a user has completed their conversation turn in a digital assistant interaction. We show results demonstrating that our text injection method boosts capitalization performance for long-tail data, and improves turn-taking detection recall.
Published: 2023

12. Reversal of gentamicin sulfate resistance in avian pathogenic Escherichia coli by matrine combined with berberine hydrochloride

Author: Meng, Jinwu, Ding, Jinxue, Wang, Weiran, Gu, Bolin, Zhou, Fanting, Wu, Desheng, Fu, Xiang, and Liu, Jiaguo
Published: 2024
Full Text: View/download PDF

13. Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR

Author: Botros, Rami, Gulati, Anmol, Sainath, Tara N., Choromanski, Krzysztof, Pang, Ruoming, Strohman, Trevor, Wang, Weiran, and Yu, Jiahui
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Conformer models maintain a large number of internal states, the vast majority of which are associated with self-attention layers. With limited memory bandwidth, reading these from memory at each inference step can slow down inference. In this paper, we design an optimized conformer that is small enough to meet on-device restrictions and has fast inference on TPUs. We explore various ideas to improve the execution speed, including replacing lower conformer blocks with convolution-only blocks, strategically downsizing the architecture, and utilizing an RNNAttention-Performer. Our optimized conformer can be readily incorporated into a cascaded-encoder setting, allowing a second-pass decoder to operate on its output and improve the accuracy whenever more resources are available. Altogether, we find that these optimizations can reduce latency by a factor of 6.8x, and come at a reasonable trade-off in quality. With the cascaded second-pass, we show that the recognition accuracy is completely recoverable. Thus, our proposed encoder can double as a strong standalone encoder in on device, and as the first part of a high-performance ASR pipeline.
Published: 2023

14. Magnetic NiFe2O4@SiO2@CS-PBTCA nanoparticles for uranium adsorption

Author: Wang, Weiran and Wang, Zhifeng
Published: 2023
Full Text: View/download PDF

15. JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition

Author: Meng, Zhong, Wang, Weiran, Prabhavalkar, Rohit, Sainath, Tara N., Chen, Tongzhou, Variani, Ehsan, Zhang, Yu, Li, Bo, Rosenberg, Andrew, and Ramabhadran, Bhuvana
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound
Abstract: We propose JEIT, a joint end-to-end (E2E) model and internal language model (ILM) training method to inject large-scale unpaired text into ILM during E2E training which improves rare-word speech recognition. With JEIT, the E2E model computes an E2E loss on audio-transcript pairs while its ILM estimates a cross-entropy loss on unpaired text. The E2E model is trained to minimize a weighted sum of E2E and ILM losses. During JEIT, ILM absorbs knowledge from unpaired text while the E2E training serves as regularization. Unlike ILM adaptation methods, JEIT does not require a separate adaptation step and avoids the need for Kullback-Leibler divergence regularization of ILM. We also show that modular hybrid autoregressive transducer (MHAT) performs better than HAT in the JEIT framework, and is much more robust than HAT during ILM adaptation. To push the limit of unpaired text injection, we further propose a combined JEIT and JOIST training (CJJT) that benefits from modality matching, encoder text injection and ILM training. Both JEIT and CJJT can foster a more effective LM fusion. With 100B unpaired sentences, JEIT/CJJT improves rare-word recognition accuracy by up to 16.4% over a model trained without unpaired text., Comment: 5 pages, 3 figures, in ICASSP 2023
Published: 2023

16. JOIST: A Joint Speech and Text Streaming Model For ASR

Author: Sainath, Tara N., Prabhavalkar, Rohit, Bapna, Ankur, Zhang, Yu, Huo, Zhouyuan, Chen, Zhehuai, Li, Bo, Wang, Weiran, and Strohman, Trevor
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We present JOIST, an algorithm to train a streaming, cascaded, encoder end-to-end (E2E) model with both speech-text paired inputs, and text-only unpaired inputs. Unlike previous works, we explore joint training with both modalities, rather than pre-training and fine-tuning. In addition, we explore JOIST using a streaming E2E model with an order of magnitude more data, which are also novelties compared to previous works. Through a series of ablation studies, we explore different types of text modeling, including how to model the length of the text sequence and the appropriate text sub-word unit representation. We find that best text representation for JOIST improves WER across a variety of search and rare-word test sets by 4-14% relative, compared to a model not trained with text. In addition, we quantitatively show that JOIST maintains streaming capabilities, which is important for good user-level experience.
Published: 2022

17. Improving Deliberation by Text-Only and Semi-Supervised Training

Author: Hu, Ke, Sainath, Tara N., He, Yanzhang, Prabhavalkar, Rohit, Strohman, Trevor, Mavandadi, Sepand, and Wang, Weiran
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Text-only and semi-supervised training based on audio-only data has gained popularity recently due to the wide availability of unlabeled text and speech data. In this work, we propose incorporating text-only and semi-supervised training into an attention-based deliberation model. By incorporating text-only data in training a bidirectional encoder representation from transformer (BERT) for the deliberation text encoder, and large-scale text-to-speech and audio-only utterances using joint acoustic and text decoder (JATD) and semi-supervised training, we achieved 4%-12% WER reduction for various tasks compared to the baseline deliberation. Compared to a state-of-the-art language model (LM) rescoring method, the deliberation model reduces the Google Voice Search WER by 11% relative. We show that the deliberation model also achieves a positive human side-by-side evaluation compared to the state-of-the-art LM rescorer with reasonable endpointer latencies., Comment: Accepted by Interspeech 2022
Published: 2022

18. NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

Author: Li, Yawei, Zhang, Kai, Timofte, Radu, Van Gool, Luc, Kong, Fangyuan, Li, Mingxi, Liu, Songwei, Du, Zongcai, Liu, Ding, Zhou, Chenhui, Chen, Jingyi, Han, Qingrui, Li, Zheyuan, Liu, Yingqi, Chen, Xiangyu, Cai, Haoming, Qiao, Yu, Dong, Chao, Sun, Long, Pan, Jinshan, Zhu, Yi, Zong, Zhikai, Liu, Xiaoxiao, Hui, Zheng, Yang, Tao, Ren, Peiran, Xie, Xuansong, Hua, Xian-Sheng, Wang, Yanbo, Ji, Xiaozhong, Lin, Chuming, Luo, Donghao, Tai, Ying, Wang, Chengjie, Zhang, Zhizhong, Xie, Yuan, Cheng, Shen, Luo, Ziwei, Yu, Lei, Wen, Zhihong, Wu1, Qi, Li, Youwei, Fan, Haoqiang, Sun, Jian, Liu, Shuaicheng, Huang, Yuanfei, Jin, Meiguang, Huang, Hua, Liu, Jing, Zhang, Xinjian, Wang, Yan, Long, Lingshun, Li, Gen, Zhang, Yuanfan, Cao, Zuowei, Sun, Lei, Alexander, Panaetov, Wang, Yucong, Cai, Minjie, Wang, Li, Tian, Lu, Wang, Zheyuan, Ma, Hongbing, Liu, Jie, Chen, Chao, Cai, Yidong, Tang, Jie, Wu, Gangshan, Wang, Weiran, Huang, Shirui, Lu, Honglei, Liu, Huan, Wang, Keyan, Chen, Jun, Chen, Shi, Miao, Yuchun, Huang, Zimo, Zhang, Lefei, Ayazoğlu, Mustafa, Xiong, Wei, Xiong, Chengyi, Wang, Fei, Li, Hao, Wen, Ruimian, Yang, Zhijing, Zou, Wenbin, Zheng, Weixin, Ye, Tian, Zhang, Yuncheng, Kong, Xiangzhen, Arora, Aditya, Zamir, Syed Waqas, Khan, Salman, Hayat, Munawar, Khan, Fahad Shahbaz, Ning, Dandan Gaoand Dengwen Zhouand Qian, Tang, Jingzhu, Huang, Han, Wang, Yufei, Peng, Zhangheng, Li, Haobo, Guan, Wenxue, Gong, Shenghua, Li, Xin, Liu, Jun, Wang, Wanjun, Zhou, Dengwen, Zeng, Kun, Lin, Hanjiang, Chen, Xinyu, and Fang, Jinsheng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00dB on DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks including the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The rank of the teams were determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered. And the individual rankings of the two metrics were summed up to determine a final ranking in this track. In sub-track two, all of the five metrics mentioned in the description of the challenge including runtime, parameter count, FLOPs, activations, and memory consumption were considered. Similar to sub-track one, the rankings of five metrics were summed up to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. They gauge the state-of-the-art in efficient single image super-resolution., Comment: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR
Published: 2022

19. Streaming Align-Refine for Non-autoregressive Deliberation

Author: Wang, Weiran, Hu, Ke, and Sainath, Tara N.
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We propose a streaming non-autoregressive (non-AR) decoding algorithm to deliberate the hypothesis alignment of a streaming RNN-T model. Our algorithm facilitates a simple greedy decoding procedure, and at the same time is capable of producing the decoding result at each frame with limited right context, thus enjoying both high efficiency and low latency. These advantages are achieved by converting the offline Align-Refine algorithm to be streaming-compatible, with a novel transformer decoder architecture that performs local self-attentions for both text and audio, and a time-aligned cross-attention at each layer. Furthermore, we perform discriminative training of our model with the minimum word error rate (MWER) criterion, which has not been done in the non-AR decoding literature. Experiments on voice search datasets and Librispeech show that with reasonable right context, our streaming model performs as well as the offline counterpart, and discriminative training leads to further WER gain when the first-pass model has small capacity., Comment: In submission to INTERSPEECH 2022
Published: 2022

20. Improving Rare Word Recognition with LM-aware MWER Training

Author: Wang, Weiran, Chen, Tongzhou, Sainath, Tara N., Variani, Ehsan, Prabhavalkar, Rohit, Huang, Ronny, Ramabhadran, Bhuvana, Gaur, Neeraj, Mavandadi, Sepand, Peyser, Cal, Strohman, Trevor, He, Yanzhang, and Rybach, David
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Language models (LMs) significantly improve the recognition accuracy of end-to-end (E2E) models on words rarely seen during training, when used in either the shallow fusion or the rescoring setups. In this work, we introduce LMs in the learning of hybrid autoregressive transducer (HAT) models in the discriminative training framework, to mitigate the training versus inference gap regarding the use of LMs. For the shallow fusion setup, we use LMs during both hypotheses generation and loss computation, and the LM-aware MWER-trained model achieves 10\% relative improvement over the model trained with standard MWER on voice search test sets containing rare words. For the rescoring setup, we learn a small neural module to generate per-token fusion weights in a data-dependent manner. This model achieves the same rescoring WER as regular MWER-trained model, but without the need for sweeping fusion weights., Comment: To appear in INTERSPEECH 2022
Published: 2022

21. A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Author: Ding, Shaojin, Wang, Weiran, Zhao, Ding, Sainath, Tara N., He, Yanzhang, David, Robert, Botros, Rami, Wang, Xin, Panigrahy, Rina, Liang, Qiao, Hwang, Dongseong, McGraw, Ian, Prabhavalkar, Rohit, and Strohman, Trevor
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Machine Learning, Computer Science - Sound
Abstract: In this paper, we propose a dynamic cascaded encoder Automatic Speech Recognition (ASR) model, which unifies models for different deployment scenarios. Moreover, the model can significantly reduce model size and power consumption without loss of quality. Namely, with the dynamic cascaded encoder model, we explore three techniques to maximally boost the performance of each model size: 1) Use separate decoders for each sub-model while sharing the encoders; 2) Use funnel-pooling to improve the encoder efficiency; 3) Balance the size of causal and non-causal encoders to improve quality and fit deployment constraints. Overall, the proposed large-medium model has 30% smaller size and reduces power consumption by 33%, compared to the baseline cascaded encoder model. The triple-size model that unifies the large, medium, and small models achieves 37% total size reduction with minimal quality loss, while substantially reducing the engineering efforts of having separate models., Comment: Accepted by INTERSPEECH 2022
Published: 2022

22. The cascade influence of grain trade shocks on countries in the context of the Russia-Ukraine conflict

Author: Liu, Linqing, Wang, Weiran, Yan, Xiaofei, Shen, Mengyun, and Chen, Haizhi
Published: 2023
Full Text: View/download PDF

23. Polynucleotide phosphorylase protects against renal tubular injury via blocking mt-dsRNA-PKR-eIF2α axis

Author: Zhu, Yujie, Zhang, Mingchao, Wang, Weiran, Qu, Shuang, Liu, Minghui, Rong, Weiwei, Yang, Wenwen, Liang, Hongwei, Zeng, Caihong, Zhu, Xiaodong, Li, Limin, Liu, Zhihong, and Zen, Ke
Published: 2023
Full Text: View/download PDF

24. mRNA and miRNA expression profiles reveal the potential roles of RLRs signaling pathway and mitophagy in duck hepatitis A virus type 1 infection

Author: Wang, Weiran, Meng, Jinwu, Wu, Desheng, Ding, Jinxue, and Liu, Jiaguo
Published: 2024
Full Text: View/download PDF

25. Deliberation of Streaming RNN-Transducer by Non-autoregressive Decoding

Author: Wang, Weiran, Hu, Ke, and Sainath, Tara
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We propose to deliberate the hypothesis alignment of a streaming RNN-T model with the previously proposed Align-Refine non-autoregressive decoding method and its improved versions. The method performs a few refinement steps, where each step shares a transformer decoder that attends to both text features (extracted from alignments) and audio features, and outputs complete updated alignments. The transformer decoder is trained with the CTC loss which facilitates parallel greedy decoding, and performs full-context attention to capture label dependencies. We improve Align-Refine by introducing cascaded encoder that captures more audio context before refinement, and alignment augmentation which enforces learning label dependency. We show that, conditioned on hypothesis alignments of a streaming RNN-T model, our method obtains significantly more accurate recognition results than the first-pass RNN-T, with only small amount of model parameters.
Published: 2021

26. Contrastively Disentangled Sequential Variational Autoencoder

Author: Bai, Junwen, Wang, Weiran, and Gomes, Carla
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Self-supervised disentangled representation learning is a critical task in sequence modeling. The learnt representations contribute to better model interpretability as well as the data generation, and improve the sample efficiency for downstream tasks. We propose a novel sequence representation learning method, named Contrastively Disentangled Sequential Variational Autoencoder (C-DSVAE), to extract and separate the static (time-invariant) and dynamic (time-variant) factors in the latent space. Different from previous sequential variational autoencoder methods, we use a novel evidence lower bound which maximizes the mutual information between the input and the latent factors, while penalizes the mutual information between the static and dynamic factors. We leverage contrastive estimations of the mutual information terms in training, together with simple yet effective augmentation techniques, to introduce additional inductive biases. Our experiments show that C-DSVAE significantly outperforms the previous state-of-the-art methods on multiple metrics., Comment: Accepted by NeurIPS 2021
Published: 2021

27. The dominant role of extracellular polymeric substances produced by Achromobacter xylosoxidans BP1 in Cr(VI) microbial reduction

Author: Jia, Jianli, Xiao, Bing, Yao, Linying, Zhang, Ben, Ma, Yichi, Wang, Weiran, Han, Yuxin, Lei, Qiushuang, Zhao, Ruofan, Dong, Jingqi, Wei, Nan, and Zhang, Hongzhen
Published: 2024
Full Text: View/download PDF

28. The synergy effect of matrine and berberine hydrochloride on treating colibacillosis caused by an avian highly pathogenic multidrug-resistant Escherichia coli

Author: Meng, Jinwu, Wang, Weiran, Ding, Jinxue, Gu, Bolin, Zhou, Fanting, Wu, Desheng, Fu, Xiang, Qiao, Mingyu, and Liu, Jiaguo
Published: 2024
Full Text: View/download PDF

29. Non-uniform pinching of short-gap intermediate frequency vacuum arc without controlled magnetic field

Author: Jiang, Yuan, Ma, Suliang, Li, Qing, Wang, Weiran, Zhang, Kaiyuan, Cao, Rui, and Wu, Jianwen
Published: 2024
Full Text: View/download PDF

30. Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability Perspective

Author: Lyu, Qi, Fu, Xiao, Wang, Weiran, and Lu, Songtao
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Multiple views of data, both naturally acquired (e.g., image and audio) and artificially produced (e.g., via adding different noise to data samples), have proven useful in enhancing representation learning. Natural views are often handled by multiview analysis tools, e.g., (deep) canonical correlation analysis [(D)CCA], while the artificial ones are frequently used in self-supervised learning (SSL) paradigms, e.g., BYOL and Barlow Twins. Both types of approaches often involve learning neural feature extractors such that the embeddings of data exhibit high cross-view correlations. Although intuitive, the effectiveness of correlation-based neural embedding is mostly empirically validated. This work aims to understand latent correlation maximization-based deep multiview learning from a latent component identification viewpoint. An intuitive generative model of multiview data is adopted, where the views are different nonlinear mixtures of shared and private components. Since the shared components are view/distortion-invariant, representing the data using such components is believed to reveal the identity of the samples effectively and robustly. Under this model, latent correlation maximization is shown to guarantee the extraction of the shared components across views (up to certain ambiguities). In addition, it is further shown that the private information in each view can be provably disentangled from the shared using proper regularization design. A finite sample analysis, which has been rare in nonlinear mixture identifiability study, is also presented. The theoretical results and newly designed regularization are tested on a series of tasks., Comment: Accepted to ICLR 2022 Spotlight, 37 pages, 11 figures
Published: 2021

31. Molecular level toxicity effects of As(V) on Folsomia candida: Integrated transcriptomics and metabolomics analyses

Author: Lin, Xianglong, Wang, Weiran, He, Fei, Hou, Hong, and Guo, Fei
Published: 2024
Full Text: View/download PDF

32. In situ reduction growth Sn-MoS2 on CNFs as advanced separator coating for improved-performance lithium sulfur batteries

Author: Liu, Xiaohong, Chen, Peng, Wang, Weiran, Li, Wenxu, Rao, Yutong, Wang, Yanping, Zhao, Jianxun, Sun, Lianshan, Liu, Wanqiang, and Cheng, Yong
Published: 2024
Full Text: View/download PDF

33. TCM formula for trauma treatment screening and its role of promoting infectious wound coalescence investigating

Author: Li, Siya, Gu, Bolin, Meng, Jinwu, Zhu, Jinyue, Wang, Jinli, Wang, Weiran, Ding, Jinxue, Qiu, Tianxin, Wang, Wenjia, Liu, Jiaguo, Wu, Yi, and Li, Kun
Published: 2024
Full Text: View/download PDF

34. A review of Sustained release materials for remediation of organically contaminated groundwater：Material preparation, applications and prospects for practical application

Author: Wang, Weiran, Jia, Jianli, Zhang, Ben, Xiao, Bing, Yang, Haojun, Zhang, Shuyue, Gao, Xiaolong, Han, Yuxin, Zhang, Shuo, Liu, Zejun, Jin, Shaoyan, and Wu, Yu
Published: 2024
Full Text: View/download PDF

35. Effects of artificial sweetener acesulfame on soil-dwelling earthworms (Eisenia fetida) and its gut microbiota

Author: Lin, Xianglong, Liu, Zhelun, Wang, Weiran, Duan, Guilan, and Zhu, Yongguan
Published: 2024
Full Text: View/download PDF

36. Representation Learning for Sequence Data with Deep Autoencoding Predictive Components

Author: Bai, Junwen, Wang, Weiran, Zhou, Yingbo, and Xiong, Caiming
Subjects: Computer Science - Machine Learning
Abstract: We propose Deep Autoencoding Predictive Components (DAPC) -- a self-supervised representation learning method for sequence data, based on the intuition that useful representations of sequence data should exhibit a simple structure in the latent space. We encourage this latent structure by maximizing an estimate of predictive information of latent feature sequences, which is the mutual information between past and future windows at each time step. In contrast to the mutual information lower bound commonly used by contrastive learning, the estimate of predictive information we adopt is exact under a Gaussian assumption. Additionally, it can be computed without negative sampling. To reduce the degeneracy of the latent space extracted by powerful encoders and keep useful information from the inputs, we regularize predictive information learning with a challenging masked reconstruction loss. We demonstrate that our method recovers the latent space of noisy dynamical systems, extracts predictive features for forecasting tasks, and improves automatic speech recognition when used to pretrain the encoder on large amounts of unlabeled data.
Published: 2020

37. An investigation of phone-based subword units for end-to-end speech recognition

Author: Wang, Weiran, Wang, Guangsen, Bhatnagar, Aadyot, Zhou, Yingbo, Xiong, Caiming, and Socher, Richard
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Phones and their context-dependent variants have been the standard modeling units for conventional speech recognition systems, while characters and subwords have demonstrated their effectiveness for end-to-end recognition systems. We investigate the use of phone-based subwords, in particular, byte pair encoder (BPE), as modeling units for end-to-end speech recognition. In addition, we also developed multi-level language model-based decoding algorithms based on a pronunciation dictionary. Besides the use of the lexicon, which is easily available, our system avoids the need of additional expert knowledge or processing steps from conventional systems. Experimental results show that phone-based BPEs tend to yield more accurate recognition systems than the character-based counterpart. In addition, further improvement can be obtained with a novel one-pass joint beam search decoder, which efficiently combines phone- and character-based BPE systems. For Switchboard, our phone-based BPE system achieves 6.8\%/14.4\% word error rate (WER) on the Switchboard/CallHome portion of the test set while joint decoding achieves 6.3\%/13.3\% WER. On Fisher + Switchboard, joint decoding leads to 4.9\%/9.5\% WER, setting new milestones for telephony speech recognition., Comment: Interspeech 2020 final version. Implementation for reproducing the results can be found at: https://github.com/salesforce/transformerasr
Published: 2020

38. A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification

Author: Kao, Chieh-Chi, Sun, Ming, Wang, Weiran, and Wang, Chao
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: Acoustic event classification (AEC) and acoustic event detection (AED) refer to the task of detecting whether specific target events occur in audios. As long short-term memory (LSTM) leads to state-of-the-art results in various speech related tasks, it is employed as a popular solution for AEC as well. This paper focuses on investigating the dynamics of LSTM model on AEC tasks. It includes a detailed analysis on LSTM memory retaining, and a benchmarking of nine different pooling methods on LSTM models using 1.7M generated mixture clips of multiple events with different signal-to-noise ratios. This paper focuses on understanding: 1) utterance-level classification accuracy; 2) sensitivity to event position within an utterance. The analysis is done on the dataset for the detection of rare sound events from DCASE 2017 Challenge. We find max pooling on the prediction level to perform the best among the nine pooling approaches in terms of classification accuracy and insensitivity to event position within an utterance. To authors' best knowledge, this is the first kind of such work focused on LSTM dynamics for AEC tasks., Comment: Accepted to ICASSP 2020
Published: 2020

39. Unsupervised Pre-training of Bidirectional Speech Encoders via Masked Reconstruction

Author: Wang, Weiran, Tang, Qingming, and Livescu, Karen
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: We propose an approach for pre-training speech representations via a masked reconstruction loss. Our pre-trained encoder networks are bidirectional and can therefore be used directly in typical bidirectional speech recognition models. The pre-trained networks can then be fine-tuned on a smaller amount of supervised data for speech recognition. Experiments with this approach on the LibriSpeech and Wall Street Journal corpora show promising results. We find that the main factors that lead to speech recognition improvements are: masking segments of sufficient width in both time and frequency, pre-training on a much larger amount of unlabeled data than the labeled data, and domain adaptation when the unlabeled and labeled data come from different domains. The gain from pre-training is additive to that of supervised data augmentation., Comment: Final version for ICASSP 2020
Published: 2020

40. Data Techniques For Online End-to-end Speech Recognition

Author: Chen, Yang, Wang, Weiran, Chen, I-Fan, and Wang, Chao
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Practitioners often need to build ASR systems for new use cases in a short amount of time, given limited in-domain data. While recently developed end-to-end methods largely simplify the modeling pipelines, they still suffer from the data sparsity issue. In this work, we explore a few simple-to-implement techniques for building online ASR systems in an end-to-end fashion, with a small amount of transcribed data in the target domain. These techniques include data augmentation in the target domain, domain adaptation using models previously trained on a large source domain, and knowledge distillation on non-transcribed target domain data, using an adapted bi-directional model as the teacher; they are applicable in real scenarios with different types of resources. Our experiments demonstrate that each technique is independently useful in the improvement of the online ASR performance in the target domain., Comment: 5 pages, 1 figure
Published: 2020

41. Semi-supervised ASR by End-to-end Self-training

Author: Chen, Yang, Wang, Weiran, and Wang, Chao
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound
Abstract: While deep learning based end-to-end automatic speech recognition (ASR) systems have greatly simplified modeling pipelines, they suffer from the data sparsity issue. In this work, we propose a self-training method with an end-to-end system for semi-supervised ASR. Starting from a Connectionist Temporal Classification (CTC) system trained on the supervised data, we iteratively generate pseudo-labels on a mini-batch of unsupervised utterances with the current model, and use the pseudo-labels to augment the supervised data for immediate model update. Our method retains the simplicity of end-to-end ASR systems, and can be seen as performing alternating optimization over a well-defined learning objective. We also perform empirical investigations of our method, regarding the effect of data augmentation, decoding beamsize for pseudo-label generation, and freshness of pseudo-labels. On a commonly used semi-supervised ASR setting with the WSJ corpus, our method gives 14.4% relative WER improvement over a carefully-trained base system with data augmentation, reducing the performance gap between the base system and the oracle system by 50%., Comment: Accepted by Interspeech 2020
Published: 2020

42. RNA-seq and microRNA association analysis to explore the pathogenic mechanism of DHAV-1 infection with DEHs

Author: Wang, Weiran, Li, Kun, Zhang, Tao, Dong, Hong, and Liu, Jiaguo
Published: 2023
Full Text: View/download PDF

43. Acoustic scene analysis with multi-head attention networks

Author: Wang, Weimin, Wang, Weiran, Sun, Ming, and Wang, Chao
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Machine Learning, Computer Science - Sound, Statistics - Machine Learning
Abstract: Acoustic Scene Classification (ASC) is a challenging task, as a single scene may involve multiple events that contain complex sound patterns. For example, a cooking scene may contain several sound sources including silverware clinking, chopping, frying, etc. What complicates ASC more is that classes of different activities could have overlapping sounds patterns (e.g. both cooking and dishwashing could have silverware clinking sound). In this paper, we propose a multi-head attention network to model the complex temporal input structures for ASC. The proposed network takes the audio's time-frequency representation as input, and it leverages standard VGG plus LSTM layers to extract high-level feature representation. Further more, it applies multiple attention heads to summarize various patterns of sound events into fixed dimensional representation, for the purpose of final scene classification. The whole network is trained in an end-to-end fashion with back-propagation. Experimental results confirm that our model discovers meaningful sound patterns through the attention mechanism, without using explicit supervision in the alignment. We evaluated our proposed model using DCASE 2018 Task 5 dataset, and achieved competitive performance on par with previous winner's results., Comment: 8 pages, 6 figures
Published: 2019

44. Elicitation with hydrogen peroxide promotes growth, phenolic-enrichment, antioxidant activity and nutritional values of two hydroponic lettuce genotypes

Author: Wang, Weixuan, Lin, Zikun, Wang, Weiran, Shang, Meixin, Lv, Haofeng, Zong, Quanli, Li, Junliang, Liang, Bin, and Zhou, Weiwei
Published: 2023
Full Text: View/download PDF

45. Phosphorylated bush sophora root polysaccharides protect the liver in duck viral hepatitis by preserving mitochondrial function

Author: Qiu, Tianxin, Shi, Yu, He, Miao, Wang, Wenjia, Meng, Jinwu, Ding, Jinxue, Wang, Weiran, Li, Siya, Li, Kun, and Liu, Jiaguo
Published: 2023
Full Text: View/download PDF

46. Tumor mutation burden-assisted risk stratification for papillary thyroid cancer

Author: Chen, Zhijiang, Wang, Weiran, Xu, Jiajie, Song, Yuntao, Zhu, Honglin, Ma, Tonghui, Ge, Minghua, and Guan, Haixia
Published: 2022
Full Text: View/download PDF

47. Multimodal and Multi-view Models for Emotion Recognition

Author: Aguilar, Gustavo, Rozgić, Viktor, Wang, Weiran, and Wang, Chao
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Studies on emotion recognition (ER) show that combining lexical and acoustic information results in more robust and accurate models. The majority of the studies focus on settings where both modalities are available in training and evaluation. However, in practice, this is not always the case; getting ASR output may represent a bottleneck in a deployment pipeline due to computational complexity or privacy-related constraints. To address this challenge, we study the problem of efficiently combining acoustic and lexical modalities during training while still providing a deployable acoustic model that does not require lexical inputs. We first experiment with multimodal models and two attention mechanisms to assess the extent of the benefits that lexical information can provide. Then, we frame the task as a multi-view learning problem to induce semantic information from a multimodal model into our acoustic-only network using a contrastive loss function. Our multimodal model outperforms the previous state of the art on the USC-IEMOCAP dataset reported on lexical and acoustic information. Additionally, our multi-view-trained acoustic network significantly surpasses models that have been exclusively trained with acoustic features., Comment: ACL 2019
Published: 2019

48. Everything old is new again: A multi-view learning approach to learning using privileged information and distillation

Author: Wang, Weiran
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We adopt a multi-view approach for analyzing two knowledge transfer settings---learning using privileged information (LUPI) and distillation---in a common framework. Under reasonable assumptions about the complexities of hypothesis spaces, and being optimistic about the expected loss achievable by the student (in distillation) and a transformed teacher predictor (in LUPI), we show that encouraging agreement between the teacher and the student leads to reduced search space. As a result, improved convergence rate can be obtained with regularized empirical risk minimization.
Published: 2019

49. A review on magnetic biochar for the removal of heavy metals from contaminated soils: Preparation, application, and microbial response

Author: Xiao, Bing, Jia, Jianli, Wang, Weiran, Zhang, Ben, Ming, Huyang, Ma, Shuo, Kang, Yike, and Zhao, Mengjie
Published: 2023
Full Text: View/download PDF

50. Reconstructing 3D Contour Models of General Scenes from RGB-D Sequences

Author: Wang, Weiran, Di, Huijun, Song, Lingxiao, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Þór Jónsson, Björn, editor, Gurrin, Cathal, editor, Tran, Minh-Triet, editor, Dang-Nguyen, Duc-Tien, editor, Hu, Anita Min-Chun, editor, Huynh Thi Thanh, Binh, editor, and Huet, Benoit, editor
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

523 results on '"Wang, Weiran"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources