Author: "Anil Palaparthi" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Anil Palaparthi"' showing total 23 results

1. Deep Learning for Neuromuscular Control of Vocal Source for Voice Production

Author: Anil Palaparthi, Rishi K. Alluri, and Ingo R. Titze
Subjects: nonlinear control systems, artificial neural networks, voice production, speech acoustics, TensorFlow, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: A computational neuromuscular control system that generates lung pressure and three intrinsic laryngeal muscle activations (cricothyroid, thyroarytenoid, and lateral cricoarytenoid) to control the vocal source was developed. In the current study, LeTalker, a biophysical computational model of the vocal system was used as the physical plant. In the LeTalker, a three-mass vocal fold model was used to simulate self-sustained vocal fold oscillation. A constant /ə/ vowel was used for the vocal tract shape. The trachea was modeled after MRI measurements. The neuromuscular control system generates control parameters to achieve four acoustic targets (fundamental frequency, sound pressure level, normalized spectral centroid, and signal-to-noise ratio) and four somatosensory targets (vocal fold length, and longitudinal fiber stress in the three vocal fold layers). The deep-learning-based control system comprises one acoustic feedforward controller and two feedback (acoustic and somatosensory) controllers. Fifty thousand steady speech signals were generated using the LeTalker for training the control system. The results demonstrated that the control system was able to generate the lung pressure and the three muscle activations such that the four acoustic and four somatosensory targets were reached with high accuracy. After training, the motor command corrections from the feedback controllers were minimal compared to the feedforward controller except for thyroarytenoid muscle activation.
Published: 2024

2. Examining the Quasi-Steady Airflow Assumption in Irregular Vocal Fold Vibration

Author: Xiaojian Wang, Xudong Zheng, Ingo R. Titze, Anil Palaparthi, and Qian Xue
Subjects: quasi-steady assumption, flow unsteadiness, vocal fold vibration, voice generation, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: The quasi-steady flow assumption (QSFA) is commonly used in the field of biomechanics of phonation. It approximates time-varying glottal flow with steady flow solutions based on frozen glottal shapes, ignoring unsteady flow behaviors and vocal fold motion. This study examined the limitations of QSFA in human phonation using numerical methods by considering factors of phonation frequency, air inertance in the vocal tract, and irregular glottal shapes. Two sets of irregular glottal shapes were examined through dynamic, pseudo-static, and quasi-steady simulations. The differences between dynamic and quasi-steady/pseudo-static simulations were measured for glottal flow rate, glottal wall pressure, and sound spectrum to evaluate the validity of QSFA. The results show that errors in glottal flow rate and wall pressure predicted by QSFA were small at 100 Hz but significant at 500 Hz due to growing flow unsteadiness. Air inertia in the vocal tract worsened predictions when interacting with unsteady glottal flow. Flow unsteadiness also influenced the harmonic energy ratio, which is perceptually important. The effects of glottal shape and glottal wall motion on the validity of QSFA were found to be insignificant.
Published: 2023

3. Vocalization with semi-occluded airways is favorable for optimizing sound production.

Author: Ingo R Titze, Anil Palaparthi, Karin Cox, Amanda Stark, Lynn Maxfield, and Brian Manternach
Subjects: Biology (General), QH301-705.5
Abstract: Vocalization in mammals, birds, reptiles, and amphibians occurs with airways that have wide openings to free-space for efficient sound radiation, but sound is also produced with occluded or semi-occluded airways that have small openings to free-space. It is hypothesized that pressures produced inside the airway with semi-occluded vocalizations have an overall widening effect on the airway. This overall widening then provides more opportunity to produce wide-narrow contrasts along the airway for variation in sound quality and loudness. For human vocalization described here, special emphasis is placed on the epilaryngeal airway, which can be adjusted for optimal aerodynamic power transfer and for optimal acoustic source-airway interaction. The methodology is three-fold, (1) geometric measurement of airway dimensions from CT scans, (2) aerodynamic and acoustic impedance calculation of the airways, and (3) simulation of acoustic signals with a self-oscillating computational model of the sound source and wave propagation.
Published: 2021

4. Mapping Thyroarytenoid and Cricothyroid Activations to Postural and Acoustic Features in a Fiber-Gel Model of the Vocal Folds

Author: Anil Palaparthi, Simeon Smith, and Ingo R. Titze
Subjects: intrinsic laryngeal muscle activations, voice acoustics, vocal fold posturing, voice production, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: Any specific vowel sound that humans produce can be represented in terms of four perceptual features in addition to the vowel category. They are pitch, loudness, brightness, and roughness. Corresponding acoustic features chosen here are fundamental frequency (fo), sound pressure level (SPL), normalized spectral centroid (NSC), and approximate entropy (ApEn). In this study, thyroarytenoid (TA) and cricothyroid (CT) activations were varied computationally to study their relationship with these four specific acoustic features. Additionally, postural and material property variables such as vocal fold length (L) and fiber stress (σ) in the three vocal fold tissue layers were also calculated. A fiber-gel finite element model developed at National Center for Voice and Speech was used for this purpose. Muscle activation plots were generated to obtain the dependency of postural and acoustic features on TA and CT muscle activations. These relationships were compared against data obtained from previous in vivo human larynx studies and from canine laryngeal studies. General trends are that fo and SPL increase with CT activation, while NSC decreases when CT activation is raised above 20%. With TA activation, acoustic features have no uniform trends, except SPL increases uniformly with TA if there is a co-variation with CT activation. Trends for postural variables and material properties are also discussed in terms of activation levels.
Published: 2019

5. Mapping Thyroarytenoid and Cricothyroid Activations to Postural and Acoustic Features in a Fiber-Gel Model of the Vocal Folds

Author: Anil Palaparthi, Simeon L. Smith, and Ingo R. Titze
Subjects: intrinsic laryngeal muscle activations, Spectral centroid, lcsh:Technology, 01 natural sciences, Approximate entropy, Loudness, lcsh:Chemistry, 03 medical and health sciences, 0302 clinical medicine, Vowel, 0103 physical sciences, Stress (linguistics), medicine, General Materials Science, 030223 otorhinolaryngology, Sound pressure, lcsh:QH301-705.5, 010301 acoustics, Instrumentation, Fluid Flow and Transfer Processes, Physics, lcsh:T, Process Chemistry and Technology, voice acoustics, General Engineering, Fundamental frequency, lcsh:QC1-999, Computer Science Applications, voice production, medicine.anatomical_structure, lcsh:Biology (General), lcsh:QD1-999, lcsh:TA1-2040, vocal fold posturing, Vocal folds, lcsh:Engineering (General). Civil engineering (General), lcsh:Physics, Biomedical engineering
Abstract: Any specific vowel sound that humans produce can be represented in terms of four perceptual features in addition to the vowel category. They are pitch, loudness, brightness, and roughness. Corresponding acoustic features chosen here are fundamental frequency (fo), sound pressure level (SPL), normalized spectral centroid (NSC), and approximate entropy (ApEn). In this study, thyroarytenoid (TA) and cricothyroid (CT) activations were varied computationally to study their relationship with these four specific acoustic features. Additionally, postural and material property variables such as vocal fold length (L) and fiber stress (σ) in the three vocal fold tissue layers were also calculated. A fiber-gel finite element model developed at National Center for Voice and Speech was used for this purpose. Muscle activation plots were generated to obtain the dependency of postural and acoustic features on TA and CT muscle activations. These relationships were compared against data obtained from previous in vivo human larynx studies and from canine laryngeal studies. General trends are that fo and SPL increase with CT activation, while NSC decreases when CT activation is raised above 20%. With TA activation, acoustic features have no uniform trends, except SPL increases uniformly with TA if there is a co-variation with CT activation. Trends for postural variables and material properties are also discussed in terms of activation levels.
Published: 2022

6. Analysis of glottal inverse filtering in the presence of source-filter interaction

Author: Anil Palaparthi and Ingo R. Titze
Subjects: Linguistics and Language, Mean squared error, Communication, Acoustics, Ripple, Inverse filter, 020206 networking & telecommunications, 02 engineering and technology, Filter (signal processing), 01 natural sciences, Article, Language and Linguistics, Computer Science Applications, Formant, Amplitude, Computer Science::Sound, Modeling and Simulation, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Computer Vision and Pattern Recognition, 010301 acoustics, Software, Impulse response, Vocal tract, Mathematics
Abstract: The validity of glottal inverse filtering (GIF) to obtain a glottal flow waveform from radiated pressure signal in the presence and absence of source-filter interaction was studied systematically. A driven vocal fold surface model of vocal fold vibration was used to generate source signals. A one-dimensional wave reflection algorithm was used to solve for acoustic pressures in the vocal tract. Several test signals were generated with and without source-filter interaction at various fundamental frequencies and vowels. Linear Predictive Coding (LPC), Quasi Closed Phase (QCP), and Quadratic Programming (QPR) based algorithms, along with supraglottal impulse response, were used to inverse filter the radiated pressure signals to obtain the glottal flow pulses. The accuracy of each algorithm was tested for its recovery of maximum flow declination rate (MFDR), peak glottal flow, open phase ripple factor, closed phase ripple factor, and mean squared error. The algorithms were also tested for their absolute relative errors of the Normalized Amplitude Quotient, the Quasi-Open Quotient, and the Harmonic Richness Factor. The results indicated that the mean squared error decreased with increase in source-filter interaction level suggesting that the inverse filtering algorithms perform better in the presence of source-filter interaction. All glottal inverse filtering algorithms predicted the open phase ripple factor better than the closed phase ripple factor of a glottal flow waveform, irrespective of the source-filter interaction level. Major prediction errors occurred in the estimation of the closed phase ripple factor, MFDR, peak glottal flow, normalized amplitude quotient, and Quasi-Open Quotient. Feedback-related nonlinearity (source-filter interaction) affected the recovered signal primarily when fo was well below the first formant frequency of a vowel. The prediction error increased when fo was close to the first formant frequency due to the difficulty of estimating the precise value of resonance frequencies, which was exacerbated by nonlinear kinetic losses in the vocal tract.
Published: 2020

7. Vocal Tradeoffs in Anterior Glottoplasty for Voice Feminization

Author: Anil Palaparthi, Ted Mau, and Ingo R. Titze
Subjects: Male, Glottis, Voice Quality, medicine.medical_treatment, Transgender Persons, Article, 030507 speech-language pathology & audiology, 03 medical and health sciences, Sex Factors, 0302 clinical medicine, otorhinolaryngologic diseases, Humans, Medicine, Computer Simulation, Thyroarytenoid muscle, 030223 otorhinolaryngology, Sound pressure, Reduction (orthopedic surgery), Fixation (histology), Laryngoscopy, business.industry, Gender Identity, Anatomy, respiratory system, Voice production, Glottal flow, Treatment Outcome, medicine.anatomical_structure, Otorhinolaryngology, Ligament, Female, 0305 other medical science, business
Abstract: OBJECTIVES/HYPOTHESIS Anterior (Wendler) glottoplasty has become a popular surgery for voice feminization. However, there has been some discrepancy between its theoretical pitch-raising potential and what is actually achievable, and downsides to shortening the glottis have not been fully explored. In addition, descriptions of the surgery are inconsistent in their treatment of the vocal ligament. This study aimed to determine 1) how fundamental frequency (fo) is expected to vary with length of anterior glottic fixation, 2) the impact of glottic shortening on sound pressure level (SPL), and 3) the effect of including the ligament in fixation. STUDY DESIGN Computational simulation. METHODS Voice production was simulated in a fiber-gel finite element computational model using canonical male vocal fold geometry incorporating a three-layer vocal fold composition (superficial lamina propria, vocal ligament, and thyroarytenoid muscle). Progressive anterior glottic fixation (0, 1/8, 2/8, 3/8, etc. up to 7/8 of membranous vocal fold length) was simulated. Outcome measures were fo, SPL, and glottal flow waveforms. RESULTS fo increased from 110 Hz to 164 Hz when the anterior one-half vocal fold was fixed and continued to progressively rise with further fixation. SPL progressively decreased beyond 1/8 to 1/4 fixation. Inclusion of the vocal ligament in fixation did not further increase fo. Any fixation increased aperiodicity in the acoustic signal. CONCLUSIONS The optimal length of fixation is a compromise between pitch elevation and reduction in output acoustic power. The simulation also provided a potential explanation for vocal roughness that is sometimes noted after anterior glottoplasty. LEVEL OF EVIDENCE NA Laryngoscope, 131:1081-1087, 2021.
Published: 2020

8. A computational study of depth of vibration into vocal fold tissues

Author: Simeon L. Smith, Ted Mau, Ingo R. Titze, and Anil Palaparthi
Subjects: Speech Communication, Acoustics and Ultrasonics, Finite Element Analysis, Geometry, Vocal Cords, Models, Biological, Vibration, Layered structure, 030507 speech-language pathology & audiology, 03 medical and health sciences, 0302 clinical medicine, Arts and Humanities (miscellaneous), otorhinolaryngologic diseases, medicine, Humans, 030223 otorhinolaryngology, Fold (geology), Dysphonia, Effective depth, Finite element method, medicine.anatomical_structure, Cardinal point, Vocal folds, Coronal plane, Larynx, 0305 other medical science, Geology
Abstract: The effective depth of vocal fold vibration is self-regulated and generally not known a priori in vocalization. In this study, the effective depth was quantified systematically under various phonatory conditions using a fiber-gel finite element vocal fold model. The horizontal and vertical excursions of each finite element nodal point trajectory were recorded to compute trajectory areas. The extent of vibration was then studied based on the variation of trajectory radii as a function of depth in several coronal sections along the anterior-posterior direction. The results suggested that the vocal fold nodal trajectory excursions decrease systematically as a function of depth but are affected by the layered structure of the vocal folds. The effective depth of vibration was found to range between 15 and 55% of the total anatomical depth across all phonatory conditions. The nodal trajectories from the current study were compared qualitatively with the results from excised human hemi-larynx experiments published in Döllinger and Berry [(2006). J. Voice. 20(3), 401-413]. An estimate of the effective mass of a one-mass vocal fold model was also computed based on the effective depth of vibration observed in this study under various phonatory conditions.
Published: 2019

9. Individualized Patient Vocal Priorities for Tailored Therapy

Author: Anil Palaparthi, Tobias Riede, Linda S. Hynan, Laura E. Toles, Ingo R. Titze, Amy Hamilton, and Ted Mau
Subjects: Adult, Male, Linguistics and Language, medicine.medical_specialty, Tailored therapy, Voice Quality, Computer science, Loudness Perception, Test validity, Speech Therapy, Speech Acoustics, Language and Linguistics, 030507 speech-language pathology & audiology, 03 medical and health sciences, Speech and Hearing, 0302 clinical medicine, Speech Production Measurement, Rating scale, Surveys and Questionnaires, otorhinolaryngologic diseases, medicine, Humans, Speech, Medical physics, Precision Medicine, 030223 otorhinolaryngology, Aged, Voice Disorders, Reproducibility of Results, respiratory system, Middle Aged, Needs assessment, Female, Factor Analysis, Statistical, 0305 other medical science, psychological phenomena and processes
Abstract: Purpose The purposes of this study are to introduce the concept of vocal priorities based on acoustic correlates, to develop an instrument to determine these vocal priorities, and to analyze the pattern of vocal priorities in patients with voice disorders. Method Questions probing the importance of 5 vocal attributes (vocal clarity, loudness, mean speaking pitch, pitch range, vocal endurance) were generated from consensus conference involving speech-language pathologists, laryngologists, and voice scientists, as well as patient feedback. The responses to the preliminary items from 213 subjects were subjected to exploratory factor analysis, which confirmed 4 of the predefined domains. The final instrument consisted of a 16-item Vocal Priority Questionnaire probing the relative importance of clarity, loudness, mean speaking pitch, and pitch range. Results The Vocal Priority Questionnaire had high reliability (Cronbach's α = .824) and good construct validity. A majority of the cohort (61%) ranked vocal clarity as their highest vocal priority, and 20%, 12%, and 7% ranked loudness, mean speaking pitch, and pitch range, respectively, as their highest priority. The frequencies of the highest ranked priorities did not differ by voice diagnosis or by sex. Considerable individual variation in vocal priorities existed within these large trends. Conclusions A patient's vocal priorities can be identified and taken into consideration in planning behavioral or surgical intervention for a voice disorder. Inclusion of vocal priorities in treatment planning empowers the patient in shared decision making, helps the clinician tailor treatment, and may also improve therapy compliance.
Published: 2018
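
Note on the reliability statistic in entry 9: Cronbach's α is the standard internal-consistency measure, α = k/(k-1) · (1 - Σ item variances / variance of total scores) for k items. The following is a minimal sketch of that textbook formula only; the function name and the demo responses are hypothetical and are not taken from the study or its questionnaire data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's summed score
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 3 respondents x 2 perfectly consistent items
demo = [[1, 1], [2, 2], [3, 3]]
print(cronbach_alpha(demo))
```

When every item ranks respondents identically, as in the demo, the statistic reaches its upper bound of 1; less consistent items pull it toward 0.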

10. How auditory selectivity for sound timing arises: the diverse roles of GABAergic inhibition in shaping the excitation to interval-selective midbrain neurons

Author: Kyphuong Luong, Christopher J. Leary, Anil Palaparthi, Rishi K. Alluri, Jessica L. Hanson, Gustavo A. Vasquez-Opazo, Gary J. Rose, and Jalina A. Graham
Subjects: 0301 basic medicine, Neurons, Patch-Clamp Techniques, Chemistry, General Neuroscience, media_common.quotation_subject, Illusion, Sensory system, Article, Midbrain, 03 medical and health sciences, 030104 developmental biology, 0302 clinical medicine, Acoustic Stimulation, Mesencephalon, Gabazine, medicine, Selectivity, Neuroscience, 030217 neurology & neurosurgery, Shunting inhibition, Intracellular, Excitation, media_common, medicine.drug
Abstract: Across sensory systems, temporal frequency information is progressively transformed along ascending central pathways. Despite considerable effort to elucidate the mechanistic basis of these transformations, they remain poorly understood. Here we used a novel constellation of approaches, including whole-cell recordings and focal pharmacological manipulation, in vivo, and new computational algorithms that identify conductances resulting from excitation, inhibition and active membrane properties, to elucidate the mechanisms underlying the selectivity of midbrain auditory neurons for long temporal intervals. Surprisingly, we found that stimulus-driven excitation can be increased and its selectivity decreased following attenuation of inhibition with gabazine or intracellular delivery of fluoride. We propose that this nonlinear interaction is due to shunting inhibition. The rate-dependence of this inhibition results in the illusion that excitation to a cell shows greater temporal selectivity than is actually the case. We also show that rate-dependent depression of excitation, an important component of long-interval selectivity, can be decreased after attenuating inhibition. These novel findings indicate that nonlinear shunting inhibition plays a key role in shaping the amplitude and interval selectivity of excitation. Our findings provide a major advance in understanding how the brain decodes intervals and may explain paradoxical temporal selectivity of excitation to midbrain neurons reported previously.
Published: 2020

11. Vocal Loudness Variation With Spectral Slope

Author: Ingo R. Titze and Anil Palaparthi
Subjects: Speech Acoustics, Linguistics and Language, Sound Spectrography, Acoustics, Loudness Perception, Fundamental frequency, Octave (electronics), Sensitivity and Specificity, Language and Linguistics, Loudness, Speech and Hearing, Sound, Reference Values, Spectral slope, Range (statistics), Pressure, Voice, Speech, Humans, Sound pressure, Sensitivity (electronics), Mathematics, Retrospective Studies
Abstract: Objective This investigation addresses the loudness variations in sones achievable with spectral slope variations (higher harmonic energy) in human vocalization and compares it to the sound pressure level (SPL) variations typically reported in the voice range profile (VRP). Method The primary methodology was computational. The ISO standard 226 was used to convert SPL values to sones for a 125- to 1000-Hz range of fundamental frequency and a –3 dB/octave to –12 dB/octave range of spectral slope. In addition, a retrospective analysis of human subjects' VRPs was conducted, and the experimental results were compared to the theoretical results. Results A very small range of SPL variation (less than 5 dB) in the VRP can produce a large range of loudness. The sensitivity can be on the order of 4 sones per dB SPL change. Conclusion For vocalization in the modal register, loudness variation is not well described by SPL change in dB, especially at high fundamental frequencies where the SPL range in the VRP becomes very small but sizeable loudness variations are still possible.
Published: 2020
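
Note on the loudness units in entry 11: the sone scale is conventionally tied to loudness level in phons by Stevens' relation, in which 40 phons corresponds to 1 sone and each additional 10 phons doubles the loudness. The sketch below shows only that relation, under the assumption that loudness level is at or above roughly 40 phons. For a 1 kHz pure tone, phons equal dB SPL by definition; other frequencies and complex spectra require the equal-loudness contours of ISO 226, which are not reproduced here. The function name is hypothetical and this is not the authors' computation.

```python
def phon_to_sone(phon):
    """Loudness in sones from loudness level in phons (Stevens' relation).

    Assumes phon >= ~40; the relation is not accurate for very soft sounds.
    """
    return 2.0 ** ((phon - 40.0) / 10.0)


# For a 1 kHz tone, loudness level in phons equals SPL in dB
for level in (40, 50, 60):
    print(level, "phon ->", phon_to_sone(level), "sone")
```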

12. Radiation efficiency for long-range vocal communication in mammals and birds

Author: Anil Palaparthi and Ingo R. Titze
Subjects: 0106 biological sciences, Vocal communication, Acoustics and Ultrasonics, Acoustics, Reactance, Intelligibility (communication), 010603 evolutionary biology, 01 natural sciences, Birds, Species Specificity, Arts and Humanities (miscellaneous), Animals, Humans, 0501 psychology and cognitive sciences, 050102 behavioral science & comparative psychology, Mathematics, Mammals, Radiation impedance, Communication, 05 social sciences, Lagomorpha, Fundamental frequency, Animal Bioacoustics, Antenna efficiency, Sound, Beak, Beam direction, Vocalization, Animal
Abstract: Long-distance vocal communication by birds and mammals, including humans, is facilitated largely by radiation efficiency from the mouth or beak. Here, this efficiency is defined and quantified. It depends on frequency content of vocalization, mouth opening, head and upper body geometry, and directionality. Each of these factors is described mathematically with a piston-in-a-sphere model. While this model is considered a classic, never before has the high frequency solution been applied in detail to vocalization. Results indicate that frequency content in the 1–50 kHz range can be radiated with nearly 100% efficiency if a reactance peak in the radiation impedance is utilized with adjustments of head size, mouth opening, and beam direction. Without these adjustments, radiation efficiency is generally below 1%, especially in human speech where a high fundamental frequency is a disadvantage for intelligibility. Thus, two distinct modes of vocal communication are identified, (1) short range with optimized information transfer and (2) long range with maximum efficiency for release of acoustic power.
Published: 2018

13. Comparison of a fiber-gel finite element model of vocal fold vibration to a transversely isotropic stiffness model

Author: Ingo R. Titze, Fariborz Alipour, Anil Palaparthi, and Douglas Blake
Subjects: Speech Communication, Acoustics and Ultrasonics, Acoustics, Finite Element Analysis, Vocal Cords, Models, Biological, Vibration, 01 natural sciences, Shear modulus, 03 medical and health sciences, 0302 clinical medicine, Arts and Humanities (miscellaneous), Normal mode, Transverse isotropy, 0103 physical sciences, medicine, Humans, Computer Simulation, 030223 otorhinolaryngology, 010301 acoustics, Plane stress, Physics, Mathematical analysis, Stiffness, Elasticity, Finite element method, Transverse plane, medicine.symptom, Gels
Abstract: A fiber-gel vocal fold model is compared to a transversely isotropic stiffness model in terms of normal mode vibration. The fiber-gel finite element model (FG-FEM) consists of a series of gel slices, each with a two-dimensional finite element mesh, in a plane transverse to the tissue fibers. The gel slices are coupled with fibers under tension in the anterior-posterior dimension. No vibrational displacement in the fiber-length direction is allowed, resulting in a plane strain state. This is consistent with the assumption of transverse displacement of a simple string, offering a wide range of natural frequencies (well into the kHz region) with variable tension. For low frequencies, the results compare favorably with the natural frequencies of a transversely isotropic elastic stiffness model (TISM) in which the shear modulus in the longitudinal plane is used to approximate the effect of fiber tension. For high frequencies, however, the natural frequencies do not approach the string mode frequencies unless plane strain is imposed on the TISM model. The simplifying assumption of plane strain, as well as the use of analytical closed-form shape functions, allow for substantial savings in computational time, which is important in clinical and exploratory applications of the FG-FEM model.
Published: 2017
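
Note on the string analogy in entry 13: for an ideal string fixed at both ends, the n-th transverse mode has frequency f_n = (n / 2L) · sqrt(σ / ρ), where L is length, σ is the tensile stress in the fibers, and ρ is density. The sketch below evaluates that textbook formula to show how mode frequencies scale with tension. The function name and all numeric values (length, stress, tissue density) are illustrative assumptions, not parameters from the paper.

```python
import math


def string_mode_frequency(n, length_m, stress_pa, density_kg_m3):
    """Frequency (Hz) of the n-th transverse mode of an ideal fixed-fixed string."""
    wave_speed = math.sqrt(stress_pa / density_kg_m3)  # transverse wave speed, m/s
    return n * wave_speed / (2.0 * length_m)


# Assumed illustrative values: 16 mm length, 1040 kg/m^3 density
for stress in (10e3, 40e3):  # fiber stress in Pa (hypothetical)
    f1 = string_mode_frequency(1, 0.016, stress, 1040.0)
    print(f"stress {stress:.0f} Pa -> first mode {f1:.1f} Hz")
```

Because frequency grows with the square root of stress, quadrupling the stress doubles every mode frequency, which is why variable fiber tension gives access to a wide frequency range.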

14. Sensitivity of Source–Filter Interaction to Specific Vocal Tract Shapes

Author: Ingo R. Titze and Anil Palaparthi
Subjects: Physics, Glottis, Conus elasticus, Acoustics and Ultrasonics, Laryngeal vestibule, Acoustics, respiratory system, Article, 030507 speech-language pathology & audiology, 03 medical and health sciences, Computational Mathematics, 0302 clinical medicine, medicine.anatomical_structure, Formant, Vocal folds, otorhinolaryngologic diseases, Computer Science (miscellaneous), medicine, Phonation, Sensitivity (control systems), Electrical and Electronic Engineering, 030223 otorhinolaryngology, 0305 other medical science, Vocal tract
Abstract: A systematic variation of length and cross-sectional area of specific segments of the vocal tract (trachea to lips) was conducted computationally to quantify the effects of source–filter interaction. A one-dimensional Navier–Stokes (transmission line) solution was used to compute peak glottal airflow, maximum flow declination rate, and formant ripple on glottal flow for Level 1 (aero-acoustic) interactions. For Level 2 (tissue movement) interaction, peak glottal area, phonation threshold pressure, and deviation in fo were quantified. Results show that the ventricle, the false-fold glottis, the conus elasticus entry, and the laryngeal vestibule are the regions to which acoustic variables are most sensitive. Generally, any narrow section of the vocal tract increases the degree of interaction, both in terms of its length and its cross-sectional area. The closer the narrow section is to the vocal folds, the greater the effect.
Published: 2016

15. Effect of resection depth of early glottic cancer on vocal outcome: An optimized finite element simulation

Author: Ingo R. Titze, Ted Mau, Tobias Riede, and Anil Palaparthi
Subjects: medicine.medical_specialty, Glottis, medicine.diagnostic_test, business.industry, medicine.medical_treatment, Laryngoscopy, Surgical planning, Resection, Surgery, Laryngectomy, medicine.anatomical_structure, Otorhinolaryngology, Glottic cancer, otorhinolaryngologic diseases, medicine, Ligament, Cordectomy, business
Abstract: Objectives/Hypothesis To test the hypothesis that subligamental cordectomy produces superior acoustic outcome than subepithelial cordectomy for early (T1–2) glottic cancer that requires complete removal of the superficial lamina propria but does not involve the vocal ligament. Study Design Computer simulation. Methods A computational tool for vocal fold surgical planning and simulation (the National Center for Voice and Speech Phonosurgery Optimizer-Simulator) was used to evaluate the acoustic output of alternative vocal fold morphologies. Four morphologies were simulated: normal, subepithelial cordectomy, subligamental cordectomy, and transligamental cordectomy (partial ligament resection). The primary outcome measure was the range of fundamental frequency (F0) and sound pressure level (SPL). A more restricted F0-SPL range was considered less favorable because of reduced acoustic possibilities given the same range of driving subglottic pressure and identical vocal fold posturing. Results Subligamental cordectomy generated solutions covering an F0-SPL range 82% of normal for a rectangular vocal fold. In contrast, transligamental and subepithelial cordectomies produced significantly smaller F0-SPL ranges, 57% and 19% of normal, respectively. Conclusion This study illustrates the use of the Phonosurgery Optimizer-Simulator to test a specific hypothesis regarding the merits of two surgical alternatives. These simulation results provide theoretical support for vocal ligament excision with maximum muscle preservation when superficial lamina propria resection is necessary but the vocal ligament can be spared on oncological grounds. The resection of more tissue may paradoxically allow the eventual recovery of a better speaking voice, assuming glottal width is restored. Application of this conclusion to surgical practice will require confirmatory clinical data. Level of Evidence N/A. Laryngoscope, 125:1892–1899, 2015
Published: 2015

16. Estimation of Source-Filter Interaction Regions Based on Electroglottography

Author: Lynn Maxfield, Ingo R. Titze, and Anil Palaparthi
Subjects: Male, Glottis, Sound Spectrography, Time Factors, Voice Quality, Acoustics, Vocal Cords, 01 natural sciences, Vibration, Article, 03 medical and health sciences, Speech and Hearing, 0302 clinical medicine, Sex Factors, Phonation, 0103 physical sciences, medicine, Pressure, Humans, 030223 otorhinolaryngology, 010301 acoustics, Electroglottograph, Mathematics, Electrodiagnosis, Fundamental frequency, respiratory system, LPN and LVN, medicine.anatomical_structure, Formant, Otorhinolaryngology, Vocal folds, Harmonics, Female, sense organs, Contact area, Vocal tract, Algorithms
Abstract: Source-filter interaction is a phenomenon in which acoustic airway pressures influence the glottal airflow at the source (level 1) and the vibration pattern of the vocal folds (level 2). This interaction is most significant when dominant source harmonics are near airway resonances. The influence of acoustic airway pressures on vocal fold vibration (level 2) was studied systematically by changing the supraglottal vocal tract length in human subjects with tube extensions. The subjects were asked to perform fundamental frequency (fo) glides while phonating through tubes of various lengths. An algorithm was developed using the quasi-open quotient extracted from the electroglottograph. Regions of sudden vocal fold vibration pattern change due to source-filter interaction were inferred from contact area changes. The algorithm correctly identified 89% of male and 84.8% of female quantal changes in contact pattern associated with interactions between source harmonics and formants during ascending glides. During the descending glides, the algorithm correctly identified 84% of male and 81.1% of female quantal changes in contact pattern. These results are in comparison with those obtained from the fo-based algorithm (Maxfield et al).
Published: 2017

17. The accuracy of a voice vote

Author: Ingo R. Titze and Anil Palaparthi
Subjects: Adult, Male, Sound Spectrography, Speech perception, Acoustics and Ultrasonics, Voice Quality, Loudness Perception, Energy (esotericism), Acoustics, Speech Acoustics, Loudness, Judgment, Discrimination, Psychological, Arts and Humanities (miscellaneous), Phonetics, Pressure, Humans, Speech Production [70], Control (linguistics), Reproducibility of Results, Signal Processing, Computer-Assisted, Middle Aged, Speech processing, Group Processes, Sound, Speech Perception, Sound energy, Female, Psychology, Perceptual Masking, Word (group theory), Cognitive psychology
Abstract: The accuracy of a voice vote was addressed by systematically varying group size, individual voter loudness, and words that are typically used to express agreement or disagreement. Five judges rated the loudness of two competing groups in A-B comparison tasks. Acoustic analysis was performed to determine the sound energy level of each word uttered by each group. Results showed that individual voter differences in energy level can grossly alter group loudness and bias the vote. Unless some control is imposed on the sound level of individual voters, it is difficult to establish even a two-thirds majority, much less a simple majority. There is no symmetry in the bias created by unequal sound production of individuals. Soft voices do not bias the group loudness much, but loud voices do. The phonetic balance of the two words chosen (e.g., “yea” and “nay” as opposed to “aye” and “no”) seems to be less of an issue.
Published: 2014

18. Comparison of two laryngeal tissue fiber constitutive models

Author: Eric Hunter, Anil Palaparthi, Roger W. Chan, and Thomas Siegmund
Subjects: Engineering, Deformation (mechanics), business.industry, Mechanical Engineering, General Chemical Engineering, Aerospace Engineering, Mechanical engineering, Shear modulus, Nonlinear system, Viscosity, Ultimate tensile strength, Solid mechanics, Cylinder stress, General Materials Science, Sensitivity (control systems), business, Biological system
Abstract: Biological tissues are complex time-dependent materials, and the best choice of the appropriate time-dependent constitutive description is not evident. This report reviews two constitutive models (a modified Kelvin model and a two-network Ogden–Boyce model) in the characterization of the passive stress–strain properties of laryngeal tissue under tensile deformation. The two models are compared, as are the automated methods for parameterization of tissue stress–strain data (a brute force vs. a common optimization method). Sensitivity (error curves) of parameters from both models and the optimized parameter set are calculated and contrasted by optimizing to the same tissue stress–strain data. Both models adequately characterized empirical stress–strain datasets and could be used to recreate a good likeness of the data. Nevertheless, parameters in both models were sensitive to measurement errors or uncertainties in stress–strain, which would greatly hinder the confidence in those parameters. The modified Kelvin model emerges as a potentially better choice for phonation models which use a tissue model as one component, or for general comparisons of the mechanical properties of one type of tissue to another (e.g., axial stress nonlinearity). In contrast, the Ogden–Boyce model would be more appropriate to provide a basic understanding of the tissue’s mechanical response with better insights into the tissue’s physical characteristics in terms of standard engineering metrics such as shear modulus and viscosity.
Published: 2013

19. New evidence that nonlinear source-filter coupling affects harmonic intensity and fo stability during instances of harmonics crossing formants

Author: Anil Palaparthi, Ingo R. Titze, and Lynn Maxfield
Subjects: Adult, Male, Glottis, Sound Spectrography, Voice Quality, Acoustics, Vocal Cords, 01 natural sciences, Models, Biological, Article, Speech Acoustics, 03 medical and health sciences, Speech and Hearing, Young Adult, 0302 clinical medicine, Phonation, Speech Production Measurement, 0103 physical sciences, Humans, 030223 otorhinolaryngology, 010301 acoustics, Physics, Linear system, Filter (signal processing), Fundamental frequency, Middle Aged, LPN and LVN, Nonlinear system, Formant, Otorhinolaryngology, Nonlinear Dynamics, Harmonics, Harmonic, Female, Vocal tract
Abstract: The traditional source-filter theory of voice production describes a linear relationship between the source (glottal flow pulse) and the filter (vocal tract). Such a linear relationship does not allow for nor explain how changes in the filter may impact the stability and regularity of the source. The objective of this experiment was to examine what effect unpredictable changes to vocal tract dimensions could have on fo stability and individual harmonic intensities in situations in which low frequency harmonics cross formants in a fundamental frequency glide. To determine these effects, eight human subjects (five male, three female) were recorded producing fo glides while their vocal tracts were artificially lengthened by a section of vinyl tubing inserted into the mouth. It was hypothesized that if the source and filter operated as a purely linear system, harmonic intensities would increase and decrease at nearly the same rates as they passed through a formant bandwidth, resulting in a relatively symmetric peak on an intensity-time contour. Additionally, fo stability should not be predictably perturbed by formant/harmonic crossings in a linear system. Acoustic analysis of these recordings, however, revealed that harmonic intensity peaks were asymmetric in 76% of cases, and that 85% of fo instabilities aligned with a crossing of one of the first four harmonics with the first three formants. These results provide further evidence that nonlinear dynamics in the source-filter relationship can impact fo stability as well as harmonic intensities as harmonics cross through formant bandwidths.
Published: 2016

20. AN ORAL PRESSURE CONVERSION RATIO AS A PREDICTOR OF VOCAL EFFICIENCY

Author: Lynn Maxfield, Anil Palaparthi, and Ingo R. Titze
Subjects: Male, Glottis, Aerodynamic power, Acoustics, Airflow, Lung pressure, 01 natural sciences, Article, 03 medical and health sciences, Speech and Hearing, 0302 clinical medicine, Phonation, 0103 physical sciences, Pressure, Transducers, Pressure, Medicine, Humans, 030223 otorhinolaryngology, 010301 acoustics, Lung, Mouth, business.industry, Electrodiagnosis, Acoustic energy, Reproducibility of Results, Equipment Design, respiratory system, Models, Theoretical, LPN and LVN, Voice production, Sound power, Biomechanical Phenomena, Trachea, Otorhinolaryngology, Energy Transfer, Vocal effort, Voice, Female, business, Energy (signal processing)
Abstract: Voice production is an inefficient process in terms of energy expended versus acoustic energy produced. A traditional efficiency measure, glottal efficiency, relates acoustic power radiated from the mouth to aerodynamic power produced in the trachea. This efficiency ranges between 0.0001% and 1.0%. It involves lung pressure, and hence would appear to be a useful effort measure for a given acoustic output. Difficulty in the combined measurement of lung pressure and tracheal airflow, however, has impeded clinical application of glottal efficiency. This paper utilizes the large database from Schutte (1980) and a few new measurements to validate a pressure conversion ratio (PCR) as a substitute for glottal efficiency. PCR has the potential for wide application due to low cost and ease of use in clinics and vocal studios.
Published: 2015

21. Effect of resection depth of early glottic cancer on vocal outcome: an optimized finite element simulation

Author: Ted Mau, Anil Palaparthi, Tobias Riede, and Ingo R. Titze
Subjects: Glottis, Treatment Outcome, Laryngoscopy, Voice Quality, Finite Element Analysis, Humans, Computer Simulation, Laryngectomy, Laser Therapy, Vocal Cords, Laryngeal Neoplasms, Article, Neoplasm Staging
Abstract: Objectives/Hypothesis: To test the hypothesis that subligamental cordectomy produces superior acoustic outcome than subepithelial cordectomy for early (T1-2) glottic cancer that requires complete removal of the superficial lamina propria but does not involve the vocal ligament. Study Design: Computer simulation. Methods: A computational tool for vocal fold surgical planning and simulation (the National Center for Voice and Speech Phonosurgery Optimizer-Simulator) was used to evaluate the acoustic output of alternative vocal fold morphologies. Four morphologies were simulated: normal, subepithelial cordectomy, subligamental cordectomy, and transligamental cordectomy (partial ligament resection). The primary outcome measure was the range of fundamental frequency (F0) and sound pressure level (SPL). A more restricted F0-SPL range was considered less favorable because of reduced acoustic possibilities given the same range of driving subglottic pressure and identical vocal fold posturing. Results: Subligamental cordectomy generated solutions covering an F0-SPL range 82% of normal for a rectangular vocal fold. In contrast, transligamental and subepithelial cordectomies produced significantly smaller F0-SPL ranges, 57% and 19% of normal, respectively. Conclusion: This study illustrates the use of the Phonosurgery Optimizer-Simulator to test a specific hypothesis regarding the merits of two surgical alternatives. These simulation results provide theoretical support for vocal ligament excision with maximum muscle preservation when superficial lamina propria resection is necessary but the vocal ligament can be spared on oncological grounds. The resection of more tissue may paradoxically allow the eventual recovery of a better speaking voice, assuming glottal width is restored. Application of this conclusion to surgical practice will require confirmatory clinical data. Level of Evidence: N/A.
Published: 2015

22. Benchmarks for time-domain simulation of sound propagation in soft-walled airways: steady configurations

Author: Ingo R. Titze, Anil Palaparthi, and Simeon L. Smith
Subjects: Speech production, Acoustics and Ultrasonics, Computer science, Acoustics, Computational fluid dynamics, Arts and Humanities (miscellaneous), Phonation, Speech Production Measurement, Humans, Computer Simulation, Time domain, Speech Production [70], Navier–Stokes equations, business.industry, Bandwidth (signal processing), Models, Theoretical, Benchmarking, Formant, Sound, Computer Science::Sound, Aeroacoustics, Respiratory Physiological Phenomena, business, Pulmonary Ventilation
Abstract: Time-domain computer simulation of sound production in airways is a widely used tool, both for research and synthetic speech production technology. Speed of computation is generally the rationale for one-dimensional approaches to sound propagation and radiation. Transmission line and wave-reflection (scattering) algorithms are used to produce formant frequencies and bandwidths for arbitrarily shaped airways. Some benchmark graphs and tables are provided for formant frequencies and bandwidth calculations based on specific mathematical terms in the one-dimensional Navier–Stokes equation. Some rules are provided here for temporal and spatial discretization in terms of desired accuracy and stability of the solution. Kinetic losses, which have been difficult to quantify in frequency-domain simulations, are quantified here on the basis of the measurements of Scherer, Torkaman, Kucinschi, and Afjeh [(2010). J. Acoust. Soc. Am. 128(2), 828–838].
Published: 2014

23. Combining multiobjective optimization and cluster analysis to study vocal fold functional morphology

Author: Anil Palaparthi, Ingo R. Titze, and Tobias Riede
Subjects: Engineering, Linear programming, business.industry, Crossover, Biomedical Engineering, Evolutionary algorithm, Brute-force search, Reproducibility of Results, Fundamental frequency, Vocal Cords, Multi-objective optimization, Models, Biological, Vibration, Speech Acoustics, Article, medicine.anatomical_structure, Vocal folds, medicine, Cluster Analysis, Humans, Artificial intelligence, Sound pressure, Biological system, business, Algorithms
Abstract: Morphological design and the relationship between form and function have great influence on the functionality of a biological organ. However, the simultaneous investigation of morphological diversity and function is difficult in complex natural systems. We have developed a multiobjective optimization (MOO) approach in association with cluster analysis to study the form-function relation in vocal folds. An evolutionary algorithm (NSGA-II) was used to integrate MOO with an existing finite element model of the laryngeal sound source. Vocal fold morphology parameters served as decision variables and acoustic requirements (fundamental frequency, sound pressure level) as objective functions. A two-layer and a three-layer vocal fold configuration were explored to produce the targeted acoustic requirements. The mutation and crossover parameters of the NSGA-II algorithm were chosen to maximize a hypervolume indicator. The results were expressed using cluster analysis and were validated against a brute force method. Results from the MOO and the brute force approaches were comparable. The MOO approach demonstrated greater resolution in the exploration of the morphological space. In association with cluster analysis, MOO can efficiently explore vocal fold functional morphology.
Published: 2014


