17 results on '"Mariño Acebal, José Bernardo"'
Search Results
2. The TALP & I2R SMT Systems for IWSLT 2008
- Author
-
Li, H., Aw, A., Zhang, Ming, Khalilov, Maxim, Ruiz Costa-Jussà, Marta, Henríquez Quintana, Carlos Alberto, Rodríguez Fonollosa, José Adrián, Hernández, A., Mariño Acebal, José Bernardo, Banchs Martínez, Rafael Enrique, Chen, B., Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Senyal, Teoria del (Telecomunicació) ,Processament de la parla ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Signal theory (Telecommunication) ,Speech processing systems ,Machine translation - Abstract
This paper gives a description of the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Polit`ecnica de Catalunya) for our participation in the IWSLT’08 evaluation campaign. We present Ngram-based (TALPtuples) and phrase-based (TALPphrases) SMT systems. The paper explains the 2008 systems’ architecture and outlines translation schemes we have used, mainly focusing on the new techniques that are challenged to improve speech-to-speech translation quality. The novelties we have introduced are: improved reordering method, linear combination of translation and reordering models and new technique dealing with punctuation marks insertion for a phrase-based SMT system. This year we focus on the Arabic-English, Chinese-Spanish and pivot Chinese-(English)-Spanish translation tasks.
- Published
- 2008
3. Técnicas robustas de reconocimiento del habla en ambientes adversos
- Author
-
Hernando Pericás, Francisco Javier, Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Etiquetado múltiple ,Predicción lineal de la parte causal de la autocorrelación ,Processament de la parla ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Filtrado de parámetros espectrales ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] ,Reconocimiento del habla - Abstract
El comportamiento de los sistemas actuales de reconocimiento del habla se degrada rápidamente en presencia de ruido de fondo. Recientemente, se ha propuesto una técnica de representación de la señal de voz basada en la predicción lineal de la parte causal de la autocorrelación (OSALPC) que ha mostrado ser atractiva para el reconocimiento de habla ruidosa debido a sus altas prestaciones con respecto a la predicción lineal (LPC) convencional en condiciones severas de ruido blanco aditivo y a su simplicidad computacional. El propósito de este artículo es doble: 1) mostrar que la técnica OSALPC obtiene también buenas prestaciones en un entorno ruidoso real (ruido de coche), y 2) explorar su combinación con varias técnicas robustas de medida de similitud, mostrando que sus prestaciones mejoran aún más filtrando convenientemente los parámetros espectrales y realizando un etiquetado múltiple de los mismos. | The performance of the existing speech recognition systems degrades rapidly in the presence of background noise. A novel representation of the speech signal, which is based on Linear Prediction of the One-Sided Autocorrelation sequence (OSALPC), has shown to be attractive to speech recognition because of both its high recognition performance with respect to the standard LPC in severe conditions of additive white noise and its computational simplicity. The aim of this work is twofold: 1) to show that OSALPC also achieves good performance in a case of real noisy speech (in a car environment), and 2) to explore its combination with several robust similarity measuring techniques, showing that its performance even improves by filtering and multilabeling conveniently the spectral parameters.
- Published
- 1997
4. Modelado de la señal en reconocimiento de habla ruidosa
- Author
-
Pascual, E, Hernando Pericás, Francisco Javier|||0000-0002-1730-8154, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Gustavo, H, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Abstract
Conventional modelling techniques of speech suffer a very big performance degradation in adverse noisy environments. So, it is necessary to research for more robust representations of speech signal. This paper presents new models that have succeeded in adverse environments. They are hybrid models of the classical parametrizations techniques used so far that have demonstrated being very useful in order to obtain good results in different noisy environments. In order to prove the their performance we have used white and machine noise in our experiments.
- Published
- 1996
5. A comparative study of techniques for HMM-based noisy speech recognition in noisy car environment
- Author
-
Hernando Pericás, Francisco Javier, Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
ComputingMethodologies_PATTERNRECOGNITION ,Computer Science::Sound ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Abstract
The performance of existing speech recognition systems degrades rapidly in the presence of background noise when training and testing cannot be done under the same ambient conditions. The aim of this paper is to report the application of several robust techniques on a system based on the HMM (Hidden Markov Models) and VQ (Vector Quantization) approaches for speech recognition in noisy car environment: parameterization based on the linear prediction of the causal part of the autocorrelation sequence (OSALPC) - proposed by the authors in [1] [2]-, optimization of spectral model order and cepstral lifter, cepstral projection distance measure, dynamic information and multilabeling.
- Published
- 1993
6. Multiple multilabeling applied to HMM-based noisy speech recognition
- Author
-
Hernando Pericás, Francisco Javier, Mariño Acebal, José Bernardo, Moreno Bilbao, M. Asunción, Nadeu Camprubí, Climent, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Computer Science::Sound ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Abstract
The performance of existing speech recognition systems degrades rapidly in the presence of background noise when training and testing cannot be done under the same ambient conditions. The aim of this paper is to propose the application of a simple multilabeling method, instead of the standard vector quantization -so called labeling-, as the front end for a speech recognizer based on the Vector Quantization (VQ) and Hidden Markov Models (HMM) approaches in order to increase its robustness to noise. Furthermore, not only cepstrum but also other features such as energy and dynamic parameters are evaluated and quantized independently in the multilabeling stage to represent more accurately characteristics of speech. The result of this process is a multiple multilabeling. Experimental results in the presence of additive white noise clearly demonstrate its good performance in isolated word recognition in noisy environments.
- Published
- 1993
7. Multiple multilabeling to improve HMM-based speech recognition in noise
- Author
-
Hernando Pericás, Francisco Javier, Mariño Acebal, José Bernardo, Nadeu Camprubí, Climent, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Noisy speech recognition ,Markov, Processos de ,Computer Science::Sound ,Markov processes ,Processament de la parla ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Speech processing systems ,Hidden Markov models ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] ,Vector quantization - Abstract
The performance of existing speech recognition systems degrades rapidly in the presence of background noise when training and testing cannot be done under the same ambient conditions. The aim of this paper is to propose the application of a simple multilabeling method, instead of the standard vector quantization -so called labeling-, as the front end for a speech recognizer based on the Vector Quantization (VQ) and Hidden Markov Models (HMM) approaches in order to increase its robustness to noise. Furthermore, not only cepstrum but also other features such as energy and dynamic parameters are evaluated and quantized independently in the multilabeling stage to represent more accurately characteristics of speech. The result of this process is a multiple multilabeling. Experimental results in the presence of additive white noise and car environment clearly demonstrate its good performance in isolated word recognition in noisy environments.
- Published
- 1993
8. The normalized backpropagation and some experiments on speech recognition
- Author
-
Monte Moreno, Enrique, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Processament de la parla ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Abstract
In the paper we present the theoretical development of the normalized backpropagation, and we compare it with other algorithms that have been presented in the literature. The algorithm that we propose is based on the idea of normalizing the adaptation step in the gradient search by the variance of the input. This algorithm is simple and gives good results in comparison with other algorithms that accelerate the learning and has the additional advantage that the parameters are calculated by the algorithm, so the user does not have to make several trials in order to trim the adaptation step and the momentum until the best combination is found. The task which we have designed in order to compare the algorithms is the recognition of digits in the Catalan language, with a data base of 1000 items, spoken by 10 speakers. The algorithms that we have compared with the normalized back propagation are: D.E.Rumelhart and J .L. McCielland, Franzini, Suddhard, Fahlman, Monte.
- Published
- 1990
9. Selección de caracterísitcas en el reconocimiento automático del habla
- Author
-
Lleida Solano, Eduardo, Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Processament de la parla ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Abstract
In this paper we investigate the use of a feature selection step in a isolated word recognition system. The feature selection step tries lo modal the correlalion among dajacent feature vectors and the variabilily of the speech. Thus, the feature selection is performed in two steps. The first step takes into account the temporal correlation among feature vectors in order to obtain a new set of feature vectors which are uncorrelated. This step gives a new template of M feature vectors, being M<
- Published
- 1990
10. Recognition of numbers by using demisyllables and hidden Markov models
- Author
-
Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Bonafonte Cávez, Antonio|||0000-0002-6240-9915, Moreno Bilbao, M. Asunción|||0000-0002-1823-5970, Lleida Solano, Eduardo, Nadeu Camprubí, Climent|||0000-0002-5863-0983, Monte Moreno, Enrique|||0000-0002-4907-0494, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Markov, Processos de ,Markov processes ,education ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC] - Published
- 1990
11. Reconocimiento de los numeros catalanes mediante semisilabas y con
- Author
-
Nadeu Camprubí, Climent, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
education ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Published
- 1990
12. Subband splitting, adaptive scalar prediction and vector quantization for speech coding
- Author
-
Masgrau Gómez, Enrique José, Rodríguez Fonollosa, José Adrián, Mariño Acebal, José Bernardo, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Processament de la parla ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Abstract
This paper describes a new coding structure based on the combination of Vector Quantizati.on, Linear Prediction l)nd Subband Splitting that achieves high guality speech at rates below 10 Kbit/sec. In this scheme, a vector is formed with one sanple of the normalized prediction error of each band and then a vector quanti.zer is applied to it. This guantization of the prediction error a.llows to use scalar adaptive predictors while conserving the advantages of the vector guantization. The necessary noise shap~ for achjev:ing high subjective quality is obtained by the use of a Freguency-Weighted distance in thc vecto r guantizer
- Published
- 1988
13. Adaptive prediction and bit-assignment in subband coding of speech
- Author
-
Mariño Acebal, José Bernardo, Martí Ros, Jaume, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Signal processing ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] ,Tractament del senyal - Abstract
The combination of time-domain harmonic scaling (TDHS) and sub-band coding (SBC) provides an encoding approach which allows 9.6 Kb/s speech encoding with good communication quality. Starting from this structure, this paper focuses the improvement of earlier designs. It is shown that adaptive prediction and bit-assigment enhances the subband signal coding and, hence, the performance of the overall system. The prediction is realized by an adaptive lattice, the algorithm being GAL2. The dynamic bit allocation takes place from the step-sizes of the backward adaptive quantizers (Jayant) in each sub-band. Improvements as high as 5 dB can be achieved for the average segmented signal to noise ratio.
14. Reconocimiento de los números castellanos mediante semisílabas
- Author
-
Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Nadeu Camprubí, Climent|||0000-0002-5863-0983, Moreno, Antonio, Lleida Solano, Eduardo, Hernaez, I., Monte Moreno, Enrique|||0000-0002-4907-0494, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Speech processing systems ,Tractament del senyal ,Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC] - Abstract
En esta comunicación se describe el uso de la semisílaba en el reconocimiento de habla continua para una aplicación específica: el reconocimiento de los números enteros castellanos del cero al mil. Tras una breve descripción de la arquitectura del sistema de reconocimiento, se detalla la inferencia de la gramática de estados finitos que representa los números en términos de semisílabas, y se indica el procedimiento seguido para la generación de las referencias de las mismas. Finalmente, se presentan los resultados alcanzados en dos experimentos: en el primero el sistema de reconocimiento es entrenado para un locutor y las referencias utilizadas para las semisílabas son patrones de características frecuenciales; en el segundo, el entrenamiento es multilocutor y las semisílabas son representadas mediante Modelos Ocultos de Markov. En ambos casos la tasa de reconocimiento del sistema es excelente.
15. Diseño de medidas de calidad para codificadores de voz
- Author
-
Roser, Miquel, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Signal processing ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Tractament del senyal ,Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC] - Abstract
In this paper the voice quality of several 32-9.6 Kbit/s coders is assesed by a subjective test and by objective measures. A subjective test called Degradation Pairs Test (TDP) has been designed and the performance of objective measures in predicting the subjective quality is studied.
16. Adaptive vector predictive speech coding with sample by sample update at 16 Kbps
- Author
-
Masgrau Gómez, Enrique José|||0000-0002-6413-3203, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, and Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
- Subjects
Signal processing ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] ,Tractament del senyal ,Enginyeria de la telecomunicació::Processament del senyal [Àrees temàtiques de la UPC] - Abstract
A vectorial generalization of the ADPCM system is introduced. Once the speech signal is grouped in vectors, they are coded using a vector predictor (VP) and a vector quantizer (VQ). Both subsystems are continously adaptive; the VQ is matched the signal power; two algorithms (VLMS and VGAL) are considered to adapt the predictor. At the bit stream corresponding to 2 bits per signal sample, the coding system rends a segmented signal to noise ratio equivalent to other existing coders.
17. Simbad: a tool for speech analysis and synthesis
- Author
-
Nadeu Camprubí, Climent|||0000-0002-5863-0983, Mariño Acebal, José Bernardo|||0000-0002-9471-8675, Oliveras Vergés, Albert|||0000-0003-1574-5622, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla, and Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
- Subjects
Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,Processament de la parla ,Speech processing systems ,Enginyeria de la telecomunicació [Àrees temàtiques de la UPC] - Abstract
SIMBAD has been developed to facilitate the tasks involved in the design of a concatenative speech synthesis system: 1) building a dictionary of parameterized speech units, and 2) obtaining a set of rules to concatenate these units and to model prosodics. It is an interactive, menu-driven and graphical software tool that allows both automatic processing and graphical editing of speech with high flexibility.
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.