236 results on '"Jiangyan Yi"'
Search Results
2. EmoFake: An Initial Dataset for Emotion Fake Audio Detection.
3. Utilizing Speaker Profiles for Impersonation Audio Detection.
4. MSFNet: Multi-Scale Fusion Network for Brain-Controlled Speaker Extraction.
5. MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.
6. Fewer-Token Neural Speech Codec with Time-Invariant Codes.
7. Multi-Scale Permutation Entropy for Audio Deepfake Detection.
8. NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption.
9. What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection.
10. Dynamic Ensemble Teacher-Student Distillation Framework for Light-Weight Fake Audio Detection.
11. Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection.
12. Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features.
13. TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection.
14. MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning.
15. Learning From Yourself: A Self-Distillation Method For Fake Speech Detection.
16. GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios.
17. Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection.
18. Adaptive Fake Audio Detection with Low-Rank Model Squeezing.
19. ADD 2023: the Second Audio Deepfake Detection Challenge.
20. Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection.
21. TST: Time-Sparse Transducer for Automatic Speech Recognition.
22. WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification.
23. Open-vocabulary Multimodal Emotion Recognition: Dataset, Metric, and Benchmark.
24. ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild.
25. VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing.
26. An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio.
27. Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism.
28. AffectGPT: Dataset and Framework for Explainable Multimodal Emotion Recognition.
29. TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking.
30. Frequency-mix Knowledge Distillation for Fake Speech Detection.
31. RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection.
32. EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark.
33. Transfer knowledge for punctuation prediction via adversarial training.
34. Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings.
35. reducing multilingual context confusion for end-to-end code-switching automatic speech recognition.
36. Fully Automated End-to-End Fake Audio Detection.
37. Singing-Tacotron: Global Duration Control Attention and Dynamic Filter for End-to-end Singing Voice Synthesis.
38. An Initial Investigation for Detecting Vocoder Fingerprints of Fake Audio.
39. Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features.
40. A Robust Deep Audio Splicing Detection Method via Singularity Detection Feature.
41. Context-Aware Mask Prediction Network for End-to-End Text-Based Speech Editing.
42. ADD 2022: the first Audio Deep Synthesis Detection Challenge.
43. The VIBVG Speech Synthesis System for Blizzard Challenge 2023.
44. CFAD: A Chinese dataset for fake audio detection.
45. DGSD: Dynamical graph self-distillation for EEG-based auditory spatial attention detection.
46. Emotion selectable end-to-end text-based speech editing.
47. SceneFake: An initial dataset and benchmarks for scene fake audio detection.
48. Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection.
49. Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition.
50. NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.