Search

Your search keyword '"John R. Hershey"' showing total 83 results

Search Constraints

Start Over You searched for: Author "John R. Hershey" Remove constraint Author: "John R. Hershey" Topic computer science Remove constraint Topic: computer science
83 results on '"John R. Hershey"'

Search Results

1. Phasebook and Friends: Leveraging Discrete Representations for Source Separation

2. Adversarial training and decoding strategies for end-to-end neural conversation models

3. Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis

4. End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings

5. Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes

6. What's All the FUSS About Free Universal Sound Separation Data?

7. Hybrid CTC/Attention Architecture for End-to-End Speech Recognition

8. Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming

9. Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend

10. Prior-based Binary Masking and Discriminative Methods for Reverberant and Noisy Speech Recognition Using Distant Stereo Microphones

11. Improving Universal Sound Separation Using Sound Classification

12. Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement

13. Universal Sound Separation

15. The Phasebook: Building Complex Masks via Discrete Representations for Source Separation

16. SDR - half-baked or well done?

17. VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

18. An End-to-End Language-Tracking Speech Recognizer for Mixed-Language Speech

19. Speaker Adaptation for Multichannel End-to-End Speech Recognition

20. End-to-End Multi-Speaker Speech Recognition

21. Alternative Objective Functions for Deep Clustering

22. Multi-Channel Deep Clustering: Discriminative Spectral and Spatial Embeddings for Speaker-Independent Speech Separation

23. Exploring Tradeoffs in Models for Low-latency Speech Enhancement

24. Differentiable Consistency Constraints for Improved Deep Speech Enhancement

25. End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction

26. A Purely End-to-End System for Multi-speaker Speech Recognition

27. Language independent end-to-end architecture for joint language identification and speech recognition

28. Multi-level language modeling and decoding for open vocabulary end-to-end speech recognition

29. Early and late integration of audio features for automatic video description

30. Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition

31. Attention-Based Multimodal Fusion for Video Description

32. Student-teacher network learning with enhanced features

33. Toolkits for Robust Speech Processing

35. Joint CTC/attention decoding for end-to-end speech recognition

36. Discriminative Beamforming with Phase-Aware Neural Networks for Speech Enhancement and Recognition

37. Novel Deep Architectures in Speech Processing

38. Deep Recurrent Networks for Separation and Recognition of Single-Channel Speech in Nonstationary Background Audio

39. Dialog state tracking with attention-based sequence-to-sequence learning

40. Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs

41. Single-Channel Multi-Speaker Separation Using Deep Clustering

42. Minimum word error training of long short-term memory recurrent neural network language models for speech recognition

43. Deep beamforming networks for multi-channel speech recognition

44. Deep clustering: Discriminative embeddings for segmentation and separation

45. Deep unfolding for multichannel source separation

46. Improved Mvdr Beamforming Using Single-Channel Mask Prediction Networks

47. Deep Clustering and Conventional Networks for Music Separation: Stronger Together

48. Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages

49. Tracking Motion, Deformation, and Texture Using Conditionally Gaussian Processes

50. Monaural speech separation and recognition challenge

Catalog

Books, media, physical & digital resources