Search

Your search keyword '"Jansen, Aren"' showing total 146 results

Search Constraints

Start Over You searched for: Author "Jansen, Aren" Remove constraint Author: "Jansen, Aren"
146 results on '"Jansen, Aren"'

Search Results

1. A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation

2. Dataset balancing can hurt model performance

3. V2Meow: Meowing to the Visual Beat via Video-to-Music Generation

4. MusicLM: Generating Music From Text

5. MAQA: A Multimodal QA Benchmark for Negation

6. MuLan: A Joint Embedding of Music Audio and Natural Language

7. Text-Driven Separation of Arbitrary Sounds

8. Universal Paralinguistic Speech Representations Using Self-Supervised Conformers

9. BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition

10. Attention Bottlenecks for Multimodal Fusion

11. Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation

12. The Benefit Of Temporally-Strong Labels In Audio Event Classification

13. Self-Supervised Learning from Automatically Separated Sound Scenes

14. Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds

15. Addressing Missing Labels in Large-Scale Sound Event Recognition Using a Teacher-Student Framework With Loss Masking

16. Towards Learning a Universal Non-Semantic Representation of Speech

17. Improving Universal Sound Separation Using Sound Classification

18. Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision

19. Unsupervised Learning of Semantic Audio Representations

20. Shared computational principles for language processing in humans and deep language models

22. CNN Architectures for Large-Scale Audio Classification

23. A segmental framework for fully-unsupervised large-vocabulary speech recognition

24. Unsupervised word segmentation and lexicon discovery using acoustic word embeddings

25. Scalable Out-of-Sample Extension of Graph Embeddings Using Deep Neural Networks

26. A Framework for Evaluating Speech Representations

29. V2Meow: Meowing to the Visual Beat via Music Generation

30. BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition

33. MuLan: A Joint Embedding of Music Audio and Natural Language

35. Self-Supervised Learning from Automatically Separated Sound Scenes

40. Thinking ahead: spontaneous prediction in context as a keystone of language in humans and machines

46. SEMANTICALLY MEANINGFUL ATTRIBUTES FROM CO-LISTEN EMBEDDINGS FOR PLAYLIST EXPLORATION AND EXPANSION.

47. Temporal Dynamics of Meaning

Catalog

Books, media, physical & digital resources