Author: "Mariño Acebal, José Bernardo" / Topic: speech processing systems - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mariño Acebal, José Bernardo"' showing total 17 results

Start Over Author "Mariño Acebal, José Bernardo" Topic speech processing systems

17 results on '"Mariño Acebal, José Bernardo"'

1. The TALP-UPC phrase-based translation systems for WMT13: system combination with morphology generation, domain adaptation and corpus filtering

Author: Formiga Fanals, Lluís, Ruiz Costa-Jussà, Marta, Mariño Acebal, José Bernardo, Rodríguez Fonollosa, José Adrián, Barrón-Cedeño, Alberto, Màrquez Villodre, Lluís, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, Universitat Politècnica de Catalunya. Departament de Ciències de la Computació, Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla, and Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
Subjects: Processament de la parla, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Speech processing systems
Abstract: This paper describes the TALP participation in the WMT13 evaluation campaign. Our participation is based on the combination of several statistical machine translation systems: based on standard hrasebased Moses systems. Variations include techniques such as morphology generation, training sentence filtering, and domain adaptation through unit derivation. The results show a coherent improvement on TER, METEOR, NIST, and BLEU scores when compared to our baseline system.
Published: 2013

2. The TALP & I2R SMT Systems for IWSLT 2008

Author: Li, H., Aw, A., Zhang, Ming, Khalilov, Maxim, Ruiz Costa-Jussà, Marta, Henríquez Quintana, Carlos Alberto, Rodríguez Fonollosa, José Adrián, Hernández, A., Mariño Acebal, José Bernardo, Banchs Martínez, Rafael Enrique, Chen, B., Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Senyal, Teoria del (Telecomunicació), Processament de la parla, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Signal theory (Telecommunication), Speech processing systems, Machine translation
Abstract: This paper gives a description of the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Polit`ecnica de Catalunya) for our participation in the IWSLT’08 evaluation campaign. We present Ngram-based (TALPtuples) and phrase-based (TALPphrases) SMT systems. The paper explains the 2008 systems’ architecture and outlines translation schemes we have used, mainly focusing on the new techniques that are challenged to improve speech-to-speech translation quality. The novelties we have introduced are: improved reordering method, linear combination of translation and reordering models and new technique dealing with punctuation marks insertion for a phrase-based SMT system. This year we focus on the Arabic-English, Chinese-Spanish and pivot Chinese-(English)-Spanish translation tasks.
Published: 2008

3. Técnicas robustas de reconocimiento del habla en ambientes adversos

Author: Hernando Pericás, Francisco Javier, Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Etiquetado múltiple, Predicción lineal de la parte causal de la autocorrelación, Processament de la parla, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Filtrado de parámetros espectrales, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC], Reconocimiento del habla
Abstract: El comportamiento de los sistemas actuales de reconocimiento del habla se degrada rápidamente en presencia de ruido de fondo. Recientemente, se ha propuesto una técnica de representación de la señal de voz basada en la predicción lineal de la parte causal de la autocorrelación (OSALPC) que ha mostrado ser atractiva para el reconocimiento de habla ruidosa debido a sus altas prestaciones con respecto a la predicción lineal (LPC) convencional en condiciones severas de ruido blanco aditivo y a su simplicidad computacional. El propósito de este artículo es doble: 1) mostrar que la técnica OSALPC obtiene también buenas prestaciones en un entorno ruidoso real (ruido de coche), y 2) explorar su combinación con varias técnicas robustas de medida de similitud, mostrando que sus prestaciones mejoran aún más filtrando convenientemente los parámetros espectrales y realizando un etiquetado múltiple de los mismos. | The performance of the existing speech recognition systems degrades rapidly in the presence of background noise. A novel representation of the speech signal, which is based on Linear Prediction of the One-Sided Autocorrelation sequence (OSALPC), has shown to be attractive to speech recognition because of both its high recognition performance with respect to the standard LPC in severe conditions of additive white noise and its computational simplicity. The aim of this work is twofold: 1) to show that OSALPC also achieves good performance in a case of real noisy speech (in a car environment), and 2) to explore its combination with several robust similarity measuring techniques, showing that its performance even improves by filtering and multilabeling conveniently the spectral parameters.
Published: 1997

4. Modelado de la señal en reconocimiento de habla ruidosa

Author: Pascual, E, Hernando Pericás, Francisco Javier|||0000-0002-1730-8154, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Gustavo, H, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Abstract: Conventional modelling techniques of speech suffer a very big performance degradation in adverse noisy environments. So, it is necessary to research for more robust representations of speech signal. This paper presents new models that have succeeded in adverse environments. They are hybrid models of the classical parametrizations techniques used so far that have demonstrated being very useful in order to obtain good results in different noisy environments. In order to prove the their performance we have used white and machine noise in our experiments.
Published: 1996

5. A comparative study of techniques for HMM-based noisy speech recognition in noisy car environment

Author: Hernando Pericás, Francisco Javier, Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: ComputingMethodologies_PATTERNRECOGNITION, Computer Science::Sound, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Abstract: The performance of existing speech recognition systems degrades rapidly in the presence of background noise when training and testing cannot be done under the same ambient conditions. The aim of this paper is to report the application of several robust techniques on a system based on the HMM (Hidden Markov Models) and VQ (Vector Quantization) approaches for speech recognition in noisy car environment: parameterization based on the linear prediction of the causal part of the autocorrelation sequence (OSALPC) - proposed by the authors in [1] [2]-, optimization of spectral model order and cepstral lifter, cepstral projection distance measure, dynamic information and multilabeling.
Published: 1993

6. Multiple multilabeling applied to HMM-based noisy speech recognition

Author: Hernando Pericás, Francisco Javier, Mariño Acebal, José Bernardo, Moreno Bilbao, M. Asunción, Nadeu Camprubí, Climent, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Computer Science::Sound, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Abstract: The performance of existing speech recognition systems degrades rapidly in the presence of background noise when training and testing cannot be done under the same ambient conditions. The aim of this paper is to propose the application of a simple multilabeling method, instead of the standard vector quantization -so called labeling-, as the front end for a speech recognizer based on the Vector Quantization (VQ) and Hidden Markov Models (HMM) approaches in order to increase its robustness to noise. Furthermore, not only cepstrum but also other features such as energy and dynamic parameters are evaluated and quantized independently in the multilabeling stage to represent more accurately characteristics of speech. The result of this process is a multiple multilabeling. Experimental results in the presence of additive white noise clearly demonstrate its good performance in isolated word recognition in noisy environments.
Published: 1993

7. Multiple multilabeling to improve HMM-based speech recognition in noise

Author: Hernando Pericás, Francisco Javier, Mariño Acebal, José Bernardo, Nadeu Camprubí, Climent, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Noisy speech recognition, Markov, Processos de, Computer Science::Sound, Markov processes, Processament de la parla, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Speech processing systems, Hidden Markov models, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC], Vector quantization
Abstract: The performance of existing speech recognition systems degrades rapidly in the presence of background noise when training and testing cannot be done under the same ambient conditions. The aim of this paper is to propose the application of a simple multilabeling method, instead of the standard vector quantization -so called labeling-, as the front end for a speech recognizer based on the Vector Quantization (VQ) and Hidden Markov Models (HMM) approaches in order to increase its robustness to noise. Furthermore, not only cepstrum but also other features such as energy and dynamic parameters are evaluated and quantized independently in the multilabeling stage to represent more accurately characteristics of speech. The result of this process is a multiple multilabeling. Experimental results in the presence of additive white noise and car environment clearly demonstrate its good performance in isolated word recognition in noisy environments.
Published: 1993

8. The normalized backpropagation and some experiments on speech recognition

Author: Monte Moreno, Enrique, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Processament de la parla, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Abstract: In the paper we present the theoretical development of the normalized backpropagation, and we compare it with other algorithms that have been presented in the literature. The algorithm that we propose is based on the idea of normalizing the adaptation step in the gradient search by the variance of the input. This algorithm is simple and gives good results in comparison with other algorithms that accelerate the learning and has the additional advantage that the parameters are calculated by the algorithm, so the user does not have to make several trials in order to trim the adaptation step and the momentum until the best combination is found. The task which we have designed in order to compare the algorithms is the recognition of digits in the Catalan language, with a data base of 1000 items, spoken by 10 speakers. The algorithms that we have compared with the normalized back propagation are: D.E.Rumelhart and J .L. McCielland, Franzini, Suddhard, Fahlman, Monte.
Published: 1990

9. Selección de caracterísitcas en el reconocimiento automático del habla

Author: Lleida Solano, Eduardo, Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Processament de la parla, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Abstract: In this paper we investigate the use of a feature selection step in a isolated word recognition system. The feature selection step tries lo modal the correlalion among dajacent feature vectors and the variabilily of the speech. Thus, the feature selection is performed in two steps. The first step takes into account the temporal correlation among feature vectors in order to obtain a new set of feature vectors which are uncorrelated. This step gives a new template of M feature vectors, being M<
Published: 1990

10. Recognition of numbers by using demisyllables and hidden Markov models

Author: Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Bonafonte Cávez, Antonio|||0000-0002-6240-9915, Moreno Bilbao, M. Asunción|||0000-0002-1823-5970, Lleida Solano, Eduardo, Nadeu Camprubí, Climent|||0000-0002-5863-0983, Monte Moreno, Enrique|||0000-0002-4907-0494, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Markov, Processos de, Markov processes, education, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC]
Published: 1990

11. Reconocimiento de los numeros catalanes mediante semisilabas y con

Author: Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: education, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Published: 1990

12. Subband splitting, adaptive scalar prediction and vector quantization for speech coding

Author: Masgrau Gómez, Enrique José, Rodríguez Fonollosa, José Adrián, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Processament de la parla, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Abstract: This paper describes a new coding structure based on the combination of Vector Quantizati.on, Linear Prediction l)nd Subband Splitting that achieves high guality speech at rates below 10 Kbit/sec. In this scheme, a vector is formed with one sanple of the normalized prediction error of each band and then a vector quanti.zer is applied to it. This guantization of the prediction error a.llows to use scalar adaptive predictors while conserving the advantages of the vector guantization. The necessary noise shap~ for achjev:ing high subjective quality is obtained by the use of a Freguency-Weighted distance in thc vecto r guantizer
Published: 1988

13. Adaptive prediction and bit-assignment in subband coding of speech

Author: Mariño Acebal, José Bernardo, Martí Ros, Jaume, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Signal processing, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC], Tractament del senyal
Abstract: The combination of time-domain harmonic scaling (TDHS) and sub-band coding (SBC) provides an encoding approach which allows 9.6 Kb/s speech encoding with good communication quality. Starting from this structure, this paper focuses the improvement of earlier designs. It is shown that adaptive prediction and bit-assigment enhances the subband signal coding and, hence, the performance of the overall system. The prediction is realized by an adaptive lattice, the algorithm being GAL2. The dynamic bit allocation takes place from the step-sizes of the backward adaptive quantizers (Jayant) in each sub-band. Improvements as high as 5 dB can be achieved for the average segmented signal to noise ratio.

14. Reconocimiento de los números castellanos mediante semisílabas

Author: Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Nadeu Camprubí, Climent|||0000-0002-5863-0983, Moreno, Antonio, Lleida Solano, Eduardo, Hernaez, I., Monte Moreno, Enrique|||0000-0002-4907-0494, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Speech processing systems, Tractament del senyal, Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC]
Abstract: En esta comunicación se describe el uso de la semisílaba en el reconocimiento de habla continua para una aplicación específica: el reconocimiento de los números enteros castellanos del cero al mil. Tras una breve descripción de la arquitectura del sistema de reconocimiento, se detalla la inferencia de la gramática de estados finitos que representa los números en términos de semisílabas, y se indica el procedimiento seguido para la generación de las referencias de las mismas. Finalmente, se presentan los resultados alcanzados en dos experimentos: en el primero el sistema de reconocimiento es entrenado para un locutor y las referencias utilizadas para las semisílabas son patrones de características frecuenciales; en el segundo, el entrenamiento es multilocutor y las semisílabas son representadas mediante Modelos Ocultos de Markov. En ambos casos la tasa de reconocimiento del sistema es excelente.

15. Diseño de medidas de calidad para codificadores de voz

Author: Roser, Miquel, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Signal processing, Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Tractament del senyal, Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC]
Abstract: In this paper the voice quality of several 32-9.6 Kbit/s coders is assesed by a subjective test and by objective measures. A subjective test called Degradation Pairs Test (TDP) has been designed and the performance of objective measures in predicting the subjective quality is studied.

16. Adaptive vector predictive speech coding with sample by sample update at 16 Kbps

Author: Masgrau Gómez, Enrique José|||0000-0002-6413-3203, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Subjects: Signal processing, Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC], Tractament del senyal, Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC]
Abstract: A vectorial generalization of the ADPCM system is introduced. Once the speech signal is grouped in vectors, they are coded using a vector predictor (VP) and a vector quantizer (VQ). Both subsystems are continously adaptive; the VQ is matched the signal power; two algorithms (VLMS and VGAL) are considered to adapt the predictor. At the bit stream corresponding to 2 bits per signal sample, the coding system rends a segmented signal to noise ratio equivalent to other existing coders.

17. Simbad: a tool for speech analysis and synthesis

Author: Nadeu Camprubí, Climent|||0000-0002-5863-0983, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Oliveras Vergés, Albert|||0000-0003-1574-5622, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla, and Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
Subjects: Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC], Processament de la parla, Speech processing systems, Enginyeria de la telecomunicació [Àrees temàtiques de la UPC]
Abstract: SIMBAD has been developed to facilitate the tasks involved in the design of a concatenative speech synthesis system: 1) building a dictionary of parameterized speech units, and 2) obtaining a set of rules to concatenate these units and to model prosodics. It is an interactive, menu-driven and graphical software tool that allows both automatic processing and graphical editing of speech with high flexibility.

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

17 results on '"Mariño Acebal, José Bernardo"'

1. The TALP-UPC phrase-based translation systems for WMT13: system combination with morphology generation, domain adaptation and corpus filtering

2. The TALP & I2R SMT Systems for IWSLT 2008

3. Técnicas robustas de reconocimiento del habla en ambientes adversos

4. Modelado de la señal en reconocimiento de habla ruidosa

5. A comparative study of techniques for HMM-based noisy speech recognition in noisy car environment

6. Multiple multilabeling applied to HMM-based noisy speech recognition

7. Multiple multilabeling to improve HMM-based speech recognition in noise

8. The normalized backpropagation and some experiments on speech recognition

9. Selección de caracterísitcas en el reconocimiento automático del habla

10. Recognition of numbers by using demisyllables and hidden Markov models

11. Reconocimiento de los numeros catalanes mediante semisilabas y con

12. Subband splitting, adaptive scalar prediction and vector quantization for speech coding

13. Adaptive prediction and bit-assignment in subband coding of speech

14. Reconocimiento de los números castellanos mediante semisílabas

15. Diseño de medidas de calidad para codificadores de voz

16. Adaptive vector predictive speech coding with sample by sample update at 16 Kbps

17. Simbad: a tool for speech analysis and synthesis

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

Publisher

17 results on '"Mariño Acebal, José Bernardo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources