24 results on '"Murphy, Damian T."'
Search Results
2. Speech Intelligibility versus Congruency: User Preferences of the Acoustics of Virtual Reality Game Spaces.
- Author
-
Popp, Constantin and Murphy, Damian T.
- Subjects
INTELLIGIBILITY of speech ,VIRTUAL reality ,SOUND designers ,ARCHITECTURAL acoustics ,ACOUSTICS ,SINGLE-degree-of-freedom systems - Abstract
3D audio spatializers for Virtual Reality (VR) can use the acoustic properties of the surfaces of a visualised game space to calculate a matching reverb. However, this approach could lead to reverbs that impair the tasks performed in such a space, such as listening to speech-based audio. Sound designers would then have to alter the room's acoustic properties independently of its visualisation to improve speech intelligibility, causing audio-visual incongruency. As user expectation of simulated room acoustics regarding speech intelligibility in VR has not been studied, this study asked participants to rate the congruency of reverbs and their visualisations in 6-DoF VR while listening to speech-based audio. The participants compared unaltered, matching reverbs with sound-designed, mismatching reverbs. The latter feature improved D50s and reduced RT60s at the cost of lower audio-visual congruency. Results suggest participants preferred improved reverbs only when the unaltered reverbs had comparatively low D50s or excessive ringing. Otherwise, too dry or too reverberant reverbs were disliked. The range of expected RT60s depended on the surface visualisation. Differences in timbre between the reverbs may not affect preferences as strongly as shorter RT60s. Therefore, sound designers can intervene and prioritise speech intelligibility over audio-visual congruency in acoustically challenging game spaces. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Source excitation strategies for obtaining impulse responses in finite difference time domain room acoustics simulation
- Author
-
Murphy, Damian T., Southern, Alex, and Savioja, Lauri
- Published
- 2014
- Full Text
- View/download PDF
4. The Perception of Formant Tuning in Soprano Voices
- Author
-
Vos, Rebecca R., Murphy, Damian T., Howard, David M., and Daffern, Helena
- Published
- 2018
- Full Text
- View/download PDF
5. Bio-Inspired Evolutionary Oral Tract Shape Modeling for Physical Modeling Vocal Synthesis
- Author
-
Howard, David M., Tyrrell, Andy M., Murphy, Damian T., Cooper, Crispin, and Mullen, Jack
- Published
- 2009
- Full Text
- View/download PDF
6. The relationship between environmental context and attentional engagement in podcast listening experiences.
- Author
-
Harrison, Jay, Archer-Boyd, Alan W., Francombe, Jon, Pike, Chris, and Murphy, Damian T.
- Subjects
EXPLORATORY factor analysis ,ECO-labeling ,LISTENING ,PODCASTING - Abstract
Introduction: Previous research has shown that podcasts are most frequently consumed using mobile listening devices across a wide variety of environmental, situational, and social contexts. To date, no studies have investigated how an individual’s environmental context might influence their attentional engagement in podcast listening experiences. Improving understanding of the contexts in which episodes of listening take place, and how they might aect listener engagement, could be highly valuable to researchers and producers working in the fields of object-based and personalized media. Methods: An online questionnaire on listening habits and behaviors was distributed to a sample of 264 podcast listeners. An exploratory factor analysis was run to identify factors of environmental context that influence attentional engagement in podcast listening experiences. Five aspects of podcast listening engagement were also defined and measured across the sample. Results: The exploratory factor analysis revealed five factors of environmental context labeled as: outdoors, indoors & at home, evenings, soundscape & at work, and exercise. The aspects of podcast listening engagement provided a comprehensive quantitative account of contemporary podcast listening experiences. Discussion: The results presented support the hypothesis that elements of a listener’s environmental context can influence their attentional engagement in podcast listening experiences. The soundscape & at work factor suggests that some listeners actively choose to consume podcasts to mask disturbing stimuli in their surrounding soundscape. Further analysis suggested that the proposed factors of environmental context were positively correlated with the measured aspects of podcast listening engagement. The results are highly pertinent to the fields of podcast studies, mobile listening experiences, and personalized media, and provide a basis for researchers seeking to explore how other forms of listening context might influence attentional engagement. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
7. Creating Audio Object-Focused Acoustic Environments for Room-Scale Virtual Reality.
- Author
-
Popp, Constantin and Murphy, Damian T.
- Subjects
VIRTUAL reality ,SINGLE-degree-of-freedom systems ,ARCHITECTURAL acoustics ,MULTIMODAL user interfaces ,HEADPHONES ,LOUDSPEAKERS ,HEAD-mounted displays ,SOUNDS ,SPATIAL behavior - Abstract
Room-scale virtual reality (VR) affordance in movement and interactivity causes new challenges in creating virtual acoustic environments for VR experiences. Such environments are typically constructed from virtual interactive objects that are accompanied by an Ambisonic bed and an off-screen ("invisible") music soundtrack, with the Ambisonic bed, music, and virtual acoustics describing the aural features of an area. This methodology can become problematic in room-scale VR as the player cannot approach or interact with such background sounds, contradicting the player's motion aurally and limiting interactivity. Written from a sound designer's perspective, the paper addresses these issues by proposing a musically inclusive novel methodology that reimagines an acoustic environment predominately using objects that are governed by multimodal rule-based systems and spatialized in six degrees of freedom using 3D binaural audio exclusively while minimizing the use of Ambisonic beds and non-diegetic music. This methodology is implemented using off-the-shelf, creator-oriented tools and methods and is evaluated through the development of a standalone, narrative, prototype room-scale VR experience. The experience's target platform is a mobile, untethered VR system based on head-mounted displays, inside-out tracking, head-mounted loudspeakers or headphones, and hand-held controllers. The authors apply their methodology to the generation of ambiences based on sound-based music, sound effects, and virtual acoustics. The proposed methodology benefits the interactivity and spatial behavior of virtual acoustic environments but may be constrained by platform and project limitations. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
8. Predicting the Colouration between Binaural Signals.
- Author
-
McKenzie, Thomas, Armstrong, Cal, Ward, Lauren, Murphy, Damian T., and Kearney, Gavin
- Subjects
FAST Fourier transforms ,LOUDNESS - Abstract
Although the difference between the fast Fourier transforms of two audio signals is often used as a basic measure of predicting perceived colouration, these signal measures do not provide information on how relevant the results are from a perceptual point of view. This paper presents a perceptually motivated loudness calculation for predicting the colouration between binaural signals which incorporates equal loudness frequency contouring, relative subjective loudness weighting, cochlea frequency modelling, and an iterative normalisation of input signals. The validation compares the presented model to three other colouration calculations in two ways: using test signals designed to evaluate specific elements of the model, and against the results of a listening test on degraded binaural audio signals. Results demonstrate the presented model is appropriate for predicting the colouration between binaural signals. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
9. Interaural Level Difference Optimization of Binaural Ambisonic Rendering
- Author
-
McKenzie, Thomas, Murphy, Damian T., and Kearney, Gavin
- Subjects
interaural level difference ,lcsh:T ,binaural synthesis ,lcsh:Technology ,lcsh:QC1-999 ,virtual loudspeaker ,Ambisonics ,lcsh:Chemistry ,binaural Ambisonic rendering ,lcsh:Biology (General) ,lcsh:QD1-999 ,lcsh:TA1-2040 ,lcsh:Engineering (General). Civil engineering (General) ,lcsh:QH301-705.5 ,lcsh:Physics - Abstract
Ambisonics is a spatial audio technique appropriate for dynamic binaural rendering due to its sound field rotation and transformation capabilities, which has made it popular for virtual reality applications. An issue with low-order Ambisonics is that interaural level differences (ILDs) are often reproduced with lower values when compared to head-related impulse responses (HRIRs), which reduces lateralization and spaciousness. This paper introduces a method of Ambisonic ILD Optimization (AIO), a pre-processing technique to bring the ILDs produced by virtual loudspeaker binaural Ambisonic rendering closer to those of HRIRs. AIO is evaluated objectively for Ambisonic orders up to fifth order versus a reference dataset of HRIRs for all locations on the sphere via estimated ILD and spectral difference, and perceptually through listening tests using both simple and complex scenes. Results conclude AIO produces an overall improvement for all tested orders of Ambisonics, though the benefits are greatest at first and second order.
- Published
- 2019
10. Auditory Localization in Low-Bitrate Compressed Ambisonic Scenes.
- Author
-
Rudzki, Tomasz, Gomez-Lanzaco, Ignacio, Stubbs, Jessica, Skoglund, Jan, Murphy, Damian T., and Kearney, Gavin
- Subjects
ACOUSTIC localization ,STREAMING audio ,TRANSFER functions ,DIRECTIONAL hearing ,AUDIO codec - Abstract
The increasing popularity of Ambisonics as a spatial audio format for streaming services poses new challenges to existing audio coding techniques. Immersive audio delivered to mobile devices requires an efficient bitrate compression that does not affect the spatial quality of the content. Good localizability of virtual sound sources is one of the key elements that must be preserved. This study was conducted to investigate the localization precision of virtual sound source presentations within Ambisonic scenes encoded with Opus low-bitrate compression at different bitrates and Ambisonic orders (1st, 3rd, and 5th). The test stimuli were reproduced over a 50-channel spherical loudspeaker configuration and binaurally using individually measured and generic Head-Related Transfer Functions (HRTFs). Participants were asked to adjust the position of a virtual acoustic pointer to match the position of virtual sound source within the bitrate-compressed Ambisonic scene. Results show that auditory localization in low-bitrate compressed Ambisonic scenes is not significantly affected by codec parameters. The key factors influencing localization are the rendering method and Ambisonic order truncation. This suggests that efficient perceptual coding might be successfully used for mobile spatial audio delivery. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
11. Boundary absorption approximation in the spatial high-frequency extrapolation method for parametric room impulse response synthesis.
- Author
-
Southern, Alex, Murphy, Damian T., and Savioja, Lauri
- Subjects
- *
IMPULSE response , *FINITE difference time domain method , *EXTRAPOLATION , *VIRTUAL room acoustics , *THEORY of wave motion - Abstract
The spatial high-frequency extrapolation method extrapolates low-frequency band-limited spatial room impulse responses (SRIRs) to higher frequencies based on a frame-by-frame time/frequency analysis that determines directional reflected components within the SRIR. Such extrapolation can be used to extend finite-difference time domain (FDTD) wave propagation simulations, limited to only relatively low frequencies, to the full audio band. For this bandwidth extrapolation, a boundary absorption weighting function is proposed based on a parametric approximation of the energy decay relief of the SRIR used as the input to the algorithm. Results using examples of both measured and FDTD simulated impulse responses demonstrate that this approach can be applied successfully to a range of acoustic spaces. Objective measures show a close approximation to reverberation time and acceptable early decay time values. Results are verified through accompanying auralizations that demonstrate the plausibility of this approach when compared to the original reference case. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
12. Articulatory vocal tract synthesis in Supercollider
- Author
-
Murphy, Damian T., Jani, Mátyás, and Ternström, Sten
- Subjects
Media and Communication Technology ,Medieteknik - Abstract
The APEX system enables vocal tract articulation using a reduced set of user controllable parameters by means of Principal Component Analysis of X-ray tract data. From these articulatory profiles it is then possible to calculate cross-sectional area function data that can be used as input to a number of articulatory based speech synthesis algorithms. In this paper the Kelly-Lochbaum 1-D digital waveguide vocal tract is used, and both APEX control and synthesis engine have been implemented and tested in SuperCollider. Accurate formant synthesis and real-time control are demonstrated, although for multi-parameter speech-like articulation a more direct mapping from tract-to-synthesizer tube sections is needed. SuperCollider provides an excellent framework for the further exploration of this work. QC 20171218
- Published
- 2015
13. Diffuse-Field Equalisation of Binaural Ambisonic Rendering.
- Author
-
McKenzie, Thomas, Murphy, Damian T., and Kearney, Gavin
- Subjects
RENDERING (Computer graphics) ,VIRTUAL reality ,LOGICAL prediction - Abstract
Ambisonics has enjoyed a recent resurgence in popularity due to virtual reality applications. Low order Ambisonic reproduction is inherently inaccurate at high frequencies, which causes poor timbre and height localisation. Diffuse-Field Equalisation (DFE), the theory of removing direction-independent frequency response, is applied to binaural (over headphones) Ambisonic rendering to address high-frequency reproduction. DFE of Ambisonics is evaluated by comparing binaural Ambisonic rendering to direct convolution via head-related impulse responses (HRIRs) in three ways: spectral difference, predicted sagittal plane localisation and perceptual listening tests on timbre. Results show DFE successfully improves frequency reproduction of binaural Ambisonic rendering for the majority of sound source locations, as well as the limitations of the technique, and set the basis for further research in the field. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
14. The modeling of diffuse boundaries in the 2-D digital waveguide mesh
- Author
-
Shelley, Simon and Murphy, Damian T.
- Abstract
The digital waveguide mesh can be used to simulate the propagation of sound waves in an acoustic system. The accurate simulation of the acoustic characteristics of boundaries within such a system is an important part of the model. One significant property of an acoustic boundary is its diffusivity. Previous approaches to simulating diffuse boundaries in a digital waveguide mesh are effective but exhibit limitations and have not been analyzed in detail. An improved technique is presented here that simulates diffusion at boundaries and offers a high degree of control and consistency. This technique works by rotating wavefronts as they pass through a special diffusing layer adjacent to the boundary. The waves are rotated randomly according to a chosen probability function and the model is lossless. This diffusion model is analyzed in detail, and its diffusivity is quantified in the form of frequency dependent diffusion coefficients. The approach used to measuring boundary diffusion is described here in detail for the 2-D digital waveguide mesh and can readily be extended for the 3-D case.
- Published
- 2008
15. The KW-boundary hybrid digital waveguide mesh for room acoustics applications
- Author
-
Murphy, Damian T. and Beeson, Mark
- Abstract
The digital waveguide mesh is a discrete-time simulation used to model acoustic wave propagation through a bounded medium. It can be applied to the simulation of the acoustics of rooms through the generation of impulse responses suitable for auralization purposes. However, large-scale three-dimensional mesh structures are required for high quality results. These structures must therefore be efficient and also capable of flexible boundary implementation in terms of both geometrical layout and the possibility for improved mesh termination algorithms. The general one-dimensional N-port boundary termination is investigated, where N depends on the geometry of the modeled domain and the mesh topology used. The equivalence between physical variable Kirchoff-model, and scattering-based wave-model boundary formulations is proved. This leads to the KW-hybrid one-dimensional N-port boundary-node termination, which is shown to be equivalent to the Kirchoff- and wave-model cases. The KW-hybrid boundary-node is implemented as part of a new hybrid two-dimensional triangular digital waveguide mesh. This is shown to offer the possibility for large-scale, computationally efficient mesh structures for more complex shapes. It proves more accurate than a similar rectilinear mesh in terms of geometrical fit, and offers significant savings in processing time and memory use over a standard wave-based model. The new hybrid mesh also has the potential for improved real-world room boundary simulations through the inclusion of additional mixed modeling algorithms.
- Published
- 2007
16. Single-Frame Discrimination of Non-Stationary Sinusoids.
- Author
-
Wells, Jeremy J. and Murphy, Damian T.
- Published
- 2007
- Full Text
- View/download PDF
17. Diffusion modelling at the boundary of a digital waveguide mesh.
- Author
-
Shelley, Simon and Murphy, Damian T.
- Published
- 2005
18. Room Impulse Response Synthesis and Validation Using a Hybrid Acoustic Model.
- Author
-
Southern, Alex, Siltanen, Samuel, Murphy, Damian T., and Savioja, Lauri
- Subjects
IMPULSE response ,ACOUSTIC models ,THEORY of wave motion ,SOUND design ,WAVELENGTHS - Abstract
Synthesizing the room impulse response (RIR) of an arbitrary enclosure may be performed using a number of alternative acoustic modeling methods, each with their own particular advantages and limitations. This article is concerned with obtaining a hybrid RIR derived from both wave and geometric-acoustics based methods, optimized for use across different regions of time or frequency. Consideration is given to how such RIRs can be matched across modeling domains in terms of both amplitude and boundary behavior and the approach is verified using a number of standardised case studies. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
19. Three-Dimensional Digital Waveguide Mesh Simulation of Cylindrical Vocal Tract Analogs.
- Author
-
Speed, Matt, Murphy, Damian T., and Howard, David M.
- Subjects
WAVEGUIDES ,MESH networks ,SIMULATION methods & models ,VOCAL tract ,ACOUSTIC models ,X-rays - Abstract
3D time-domain acoustic modeling techniques have the potential to produce more accurate simulation of the vocal tract than previously implemented 1D or 2D solutions, although the variability of human voice renders it a problematic benchmark for validation of its resynthesis. This study uses acoustic measurement of acrylic cylindrical vocal tract models derived from X-Ray data to assess the validity of comparable 3D digital waveguide mesh simulations. It is found that for more simple structures the 3D digital waveguide mesh is able to reproduce the acoustic behavior up to 10 kHz with only slight errors in resonant frequencies. As the simulated structures become more geometrically complex, this shifting becomes more severe. [ABSTRACT FROM PUBLISHER]
- Published
- 2013
- Full Text
- View/download PDF
20. Spatial Encoding of Finite Difference Time Domain Acoustic Models for Auralization.
- Author
-
Southern, Alex, Murphy, Damian T., and Savioja, Lauri
- Subjects
FINITE difference time domain method ,ENCODING ,IMPULSE response ,ACOUSTIC models ,COMPUTER simulation ,MICROPHONES ,MATHEMATICAL models - Abstract
A single room impulse response can reveal information about the acoustics of a given space in both objective, and, when used for auralization, subjective terms. However, for additional spatial information, or more accurate and perceptually convincing auralization, multiple impulse responses are needed. Higher order Ambisonics is a robust means of capturing the spatial qualities of an acoustic space over multiple channels for decoding and rendering over many possible speaker layouts. A method for obtaining Nth-order Ambisonic impulse responses from a room acoustic model, based on lower orders using differential microphone techniques is presented. This is tested using a third-order encoding of a 2-D finite difference time domain room acoustic simulation based on multiple circular arrays of receivers. Accurate channel directional profiles are obtained and results are verified in a series of listening tests comparing the localization of a sound source placed within the given simulation to the same source encoded directly. This generic encoding scheme can be applied to any room acoustic simulation technique where it is possible to obtain impulse responses across multiple receiver positions. Although the proposed method encompasses horizontal encoding only, it can also be applied directly in 3-D simulations where height information is not required in the final auralization. [ABSTRACT FROM PUBLISHER]
- Published
- 2012
- Full Text
- View/download PDF
21. A Comparative Evaluation of Techniques for Single-Frame Discrimination of Nonstationary Sinusoids.
- Author
-
Wells, Jeremy J. and Murphy, Damian T.
- Subjects
SPECTRUM analysis ,FOURIER analysis ,MUSIC ,PARAMETER estimation ,CODING theory - Abstract
Many spectral analysis and modification techniques require the separation of sinusoidal from nonsinusoidal signal components of a Fourier spectrum. Techniques exist for the estimation of the parameters of nonstationary sinusoids, and for discriminating these from other components, within a single Fourier frame. We present a comparative study of five methods for sinusoidal discrimination, considering their effectiveness and their computational cost. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
22. Real-Time Dynamic Articulations in the 2-D Waveguide Mesh Vocal Tract Model.
- Author
-
Mullen, Jack, Howard, David M., and Murphy, Damian T.
- Subjects
ARTICULATION (Speech) ,WAVEGUIDES ,VOCAL tract ,SPEECH ,ACOUSTIC impedance ,RESONATORS ,SPEECH synthesis - Abstract
Time domain articulatory vocal tract modeling in one-dimensional (1-D) is well established. Previous studies into two-dimensional (2-D) simulation of wave propagation in the vocal tract have shown it to present accurate static vowel synthesis. However, little has been done to demonstrate how such a model might accommodate the dynamic tract shape changes necessary in modeling speech. Two methods of applying the area function to the 2-D digital waveguide mesh vocal tract model are presented here. First, a method based on mapping the cross-sectional area onto the number of waveguides across the mesh, termed a widthwise mapping approach is detailed. Discontinuity problems associated with the dynamic manipulation of the model are highlighted. Second, a new method is examined that uses a static-shaped rectangular mesh with the area function translated into an impedance map which is then applied to each waveguide. Two approaches for constructing such a map are demonstrated; one using a linear impedance increase to model a constriction to the tract and another using a raised cosine function. Recommendations are made towards the use of the cosine method as it allows for a wider central propagational channel. It is also shown that this impedance mapping approach allows for stable dynamic shape changes and also permits a reduction in sampling frequency leading to real-time interaction with the model. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
23. Waveguide Physical Modeling of Vocal Tract Acoustics: Flexible Formant Bandwidth Control From Increased Model Dimensionality.
- Author
-
Mullen, Jack, Howard, David M., and Murphy, Damian T.
- Subjects
SPEECH synthesis ,VOCAL tract ,SPEECH processing systems ,SPEECH ,WAVEGUIDES ,BANDWIDTHS - Abstract
Digital waveguide physical modeling is often used as an efficient representation of acoustical resonators such as the human vocal tract. Building on the basic one-dimensional (1-D) Kelly--Lochbaum tract model, various speech synthesis techniques demonstrate improvements to the wave scattering mechanisms in order to better approximate wave propagation in the complex vocal system. Some of these techniques are discussed in this paper, with particular reference to an alternative approach in the form of a two-dimensional waveguide mesh model. Emphasis is placed on its ability to produce vowel spectra similar to that which would be present in natural speech, and how it improves upon the 1-D model. Tract area function is accommodated as model width, rather than translated into acoustic impedance, and as such offers extra control as an additional bounding limit to the model. Results show that the two-dimensional (2-D) model introduces approximately linear control over formant bandwidths leading to attainable realistic values across a range of vowels. Similarly, the 2-D model allows for application of theoretical reflection values within the tract, which when applied to the 1-D model result in small formant bandwidths, and, hence, unnatural sounding synthesized vowels. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
24. Diphthong synthesis using the three-dimensional dynamic digital waveguide mesh
- Author
-
Gully, Amelia J. and Murphy, Damian T.
- Subjects
621.38 - Abstract
The human voice is a complex and nuanced instrument, and despite many years of research, no system is yet capable of producing natural-sounding synthetic speech. This affects intelligibility for some groups of listeners, in applications such as automated announcements and screen readers. Furthermore, those who require a computer to speak - due to surgery or a degenerative disease - are limited to unnatural-sounding voices that lack expressive control and may not match the user's gender, age or accent. It is evident that natural, personalised and controllable synthetic speech systems are required. A three-dimensional digital waveguide model of the vocal tract, based on magnetic resonance imaging data, is proposed here in order to address these issues. The model uses a heterogeneous digital waveguide mesh method to represent the vocal tract airway and surrounding tissues, facilitating dynamic movement and hence speech output. The accuracy of the method is validated by comparison with audio recordings of natural speech, and perceptual tests are performed which confirm that the proposed model sounds significantly more natural than simpler digital waveguide mesh vocal tract models. Control of such a model is also considered, and a proof-of-concept study is presented using a deep neural network to control the parameters of a two-dimensional vocal tract model, resulting in intelligible speech output and paving the way for extension of the control system to the proposed three-dimensional vocal tract model. Future improvements to the system are also discussed in detail. This project considers both the naturalness and control issues associated with synthetic speech and therefore represents a significant step towards improved synthetic speech for use across society.
- Published
- 2017
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.