Search

Your search keyword '"Electrical Engineering and Systems Science - Audio and Speech Processing"' showing total 41,389 results

Search Constraints

Start Over You searched for: Descriptor "Electrical Engineering and Systems Science - Audio and Speech Processing" Remove constraint Descriptor: "Electrical Engineering and Systems Science - Audio and Speech Processing"
41,389 results on '"Electrical Engineering and Systems Science - Audio and Speech Processing"'

Search Results

101. Perceptual implications of simplifying geometrical acoustics models for Ambisonics-based binaural reverberation

102. EveGuard: Defeating Vibration-based Side-Channel Eavesdropping with Audio Adversarial Perturbations

103. XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection

104. WavChat: A Survey of Spoken Dialogue Models

105. Local deployment of large-scale music AI models on commodity hardware

106. An End-To-End Stuttering Detection Method Based On Conformer And BILSTM

107. ParaLBench: A Large-Scale Benchmark for Computational Paralinguistics over Acoustic Foundation Models

108. Zero-shot Voice Conversion with Diffusion Transformers

109. Re-Parameterization of Lightweight Transformer for On-Device Speech Emotion Recognition

110. EEG-Based Speech Decoding: A Novel Approach Using Multi-Kernel Ensemble Diffusion Models

111. Towards Unified Neural Decoding of Perceived, Spoken and Imagined Speech from EEG Signals

112. Transferable Adversarial Attacks against ASR

113. Improvement and Implementation of a Speech Emotion Recognition Model Based on Dual-Layer LSTM

114. Direct Speech-to-Speech Neural Machine Translation: A Survey

115. A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models

116. State-Space Estimation of Spatially Dynamic Room Impulse Responses using a Room Acoustic Model-based Prior

117. Developing an Effective Training Dataset to Enhance the Performance of AI-based Speaker Separation Systems

118. Robust AI-Synthesized Speech Detection Using Feature Decomposition Learning and Synthesizer Feature Augmentation

119. Language Models for Music Medicine Generation

120. Automatic Album Sequencing

121. Study on Inter and Intra Speaker Variability in Speaker Recognition

122. SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model

123. CJST: CTC Compressor based Joint Speech and Text Training for Decoder-Only ASR

124. Evaluating Synthetic Command Attacks on Smart Voice Assistants

125. PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation

126. On the Role of Speech Data in Reducing Toxicity Detection Bias

127. Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech

128. Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation

129. AuscultaBase: A Foundational Step Towards AI-Powered Body Sound Diagnostics

130. SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with Silhouettes

131. Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models

132. Just Label the Repeats for In-The-Wild Audio-to-Score Alignment

133. Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages

134. AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models

135. NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics

136. Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

137. Mamba-based Decoder-Only Approach with Bidirectional Speech Modeling for Speech Recognition

138. Electroencephalogram-based Multi-class Decoding of Attended Speakers' Direction with Audio Spatial Spectrum

139. DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions

140. Diff-MSTC: A Mixing Style Transfer Prototype for Cubase

141. Debatts: Zero-Shot Debating Text-to-Speech Synthesis

142. CTC-Assisted LLM-Based Contextual ASR

143. PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

144. Acoustic Volume Rendering for Neural Impulse Response Fields

145. Intelligent Fault Diagnosis of Type and Severity in Low-Frequency, Low Bit-Depth Signals

146. Selective State Space Model for Monaural Speech Enhancement

147. Speech-Based Estimation of Schizophrenia Severity Using Feature Fusion

148. A Kalman Filter model for synchronization in musical ensembles

149. Toward Transdisciplinary Approaches to Audio Deepfake Discernment

150. Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

Catalog

Books, media, physical & digital resources