Your search for "Zhang, Xiao Lei" returned 36 results

Search Constraints

Author: "Zhang, Xiao Lei"; Topic: computer science - sound

Search Results

1. DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model

2. AudioSpa: Spatializing Sound Events with Text

3. UniForm: A Unified Diffusion Transformer for Audio-Video Generation

4. Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR

5. Speaker Contrastive Learning for Source Speaker Tracing

6. Diffusion-Based Adversarial Purification for Speaker Verification

7. Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays

8. Interpretable Spectrum Transformation Attacks to Speaker Recognition

9. Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames

10. LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification

11. Symmetric Saliency-based Adversarial Attack To Speaker Identification

12. WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit

13. Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays

14. End-to-end Two-dimensional Sound Source Localization With Ad-hoc Microphone Arrays

15. Multi-Channel Far-Field Speaker Verification with Large-Scale Ad-hoc Microphone Arrays

16. Conformer-based End-to-end Speech Recognition With Rotary Position Embedding

17. AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data

18. Attention-based multi-channel speaker verification with ad-hoc microphone arrays

19. Efficient conformer-based speech recognition with linear attention

20. Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

21. Scaling sparsemax based channel selection for speech recognition with ad-hoc microphone arrays

22. Minimum-volume Multichannel Nonnegative matrix factorization for blind source separation

23. Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation

24. A comparison of handcrafted, parameterized, and learnable features for speech separation

25. Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention

26. Speech enhancement aided end-to-end multi-task learning for voice activity detection

27. Depthwise Separable Convolutional ResNet with Squeeze-and-Excitation Blocks for Small-footprint Keyword Spotting

28. Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification

29. Multi-channel Speech Separation Using Deep Embedding Model with Multilayer Bootstrap Networks

30. Deep Ad-hoc Beamforming

31. Linear Regression for Speaker Verification

32. An Investigation of Universal Background Sparse Coding Based Speaker Verification on TIMIT

33. Multilayer bootstrap network for unsupervised speaker recognition

34. Denoising Deep Neural Networks Based Voice Activity Detection

35. Deep Learning Based Two-dimensional Speaker Localization With Large Ad-hoc Microphone Arrays

36. A comparison of handcrafted, parameterized, and learnable features for speech separation
