Search

Your search for author "Su, Dan" returned 4,286 results.

Search Results

1. Nemotron-4 340B Technical Report

2. Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer

3. Prompt-guided Precise Audio Editing with Diffusion Models

4. Fuse after Align: Improving Face-Voice Association Learning via Multimodal Encoder

5. Nemotron-4 15B Technical Report

6. MM-LLMs: Recent Advances in MultiModal Large Language Models

10. Non-volatile memory based on PZT/FeGa thin film memtranstor

11. A High Fidelity and Low Complexity Neural Audio Coding

12. DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis

13. Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation

17. Model Debiasing via Gradient-based Explanation on Representation

18. Learn What NOT to Learn: Towards Generative Safety in Chatbots

23. A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

24. NusaCrowd: Open Source Initiative for Indonesian NLP Resources

25. TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR

26. UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis

27. Nonlinear‐disturbance‐observer‐based predictive control for trajectory tracking of planar motors

32. Generative Long-form Question Answering: Relevance, Faithfulness and Succinctness

33. Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training

34. Context Generation Improves Open Domain Question Answering

35. The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022

36. Test and analysis of energy characteristics of large vertical submersible pumps

37. Multi-state data storage in a two-dimensional stripy antiferromagnet implemented by magnetoelectric effect

38. Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings

39. Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion

40. Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers

41. End-to-End Voice Conversion with Information Perturbation

45. Time to sputum culture conversion and its associated factors among drug-resistant tuberculosis patients: a systematic review and meta-analysis

46. AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation

47. Towards Answering Open-ended Ethical Quandary Questions

48. FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis

49. 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

50. Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
