Author: "Popescu-Belis, Andrei" / Publisher: idiap - Searchworks@Jio Institute Digital Library Search Results

1. Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT)

Author: Miculicich Werlen, Lesly Sadiht and Popescu-Belis, Andrei
Abstract: In this paper, we define and assess a reference-based metric to evaluate the accuracy of pronoun translation (APT). The metric automatically aligns a candidate and a reference translation using GIZA++ augmented with specific heuristics, and then counts the number of identical or different pronouns, with provision for legitimate variations and omitted pronouns. All counts are then combined into one score. The metric is applied to the results of seven systems (including the baseline) that participated in the DiscoMT 2015 shared task on pronoun translation from English to French. The APT metric reaches around 0.993-0.999 Pearson correlation with human judges (depending on the parameters of APT), while other automatic metrics such as BLEU, METEOR, or those specific to pronouns used at DiscoMT 2015 reach only 0.972-0.986 Pearson correlation.
Published: 2016

2. Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings

Author: Mahdabi, Parvaz and Popescu-Belis, Andrei
Subjects: InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL
Abstract: This report presents a study on assisting users in building queries to perform real-time searches in a news and social media monitoring system. The system accepts complex queries, and we assist the user by suggesting related keywords or entities. We do this by leveraging two different word representations: (1) probabilistic topic models, and (2) unsupervised word embeddings. We compare the vector representations obtained by these two approaches to find related keywords (i.e. suggestions) with respect to specific queries, taken from the query log of a commercial system. Through crowdsourcing we solicited relevance judgments and compared the two methods. Our results show that word embeddings outperform topic models for keyword suggestion.

3. When Users Meet Technology: The Meeting Browser Development Helix

Author: Popescu-Belis, Andrei, Lalanne, Denis, and Bourlard, Hervé

4. Comparing meeting browsers using a task-based evaluation method

Author: Popescu-Belis, Andrei
Abstract: Information access within meeting recordings, potentially transcribed and augmented with other media, is facilitated by the use of meeting browsers. To evaluate their performance through a shared benchmark task, users are asked to discriminate between true and false parallel statements about facts in meetings, using different browsers. This paper offers a review of the results obtained so far with five types of meeting browsers, using similar sets of statements over the same meeting recordings. The results indicate that state-of-the-art speed for true/false question answering is 1.5-2 minutes per question, and precision is 70%-80% (vs. 50% random guess). The use of ASR compared to manual transcripts, or the use of audio signals only, lead to a perceptible though not dramatic decrease in performance scores.

5. Finding Information in Multimedia Records of Meetings

Author: Popescu-Belis, Andrei, Lalanne, Denis, and Bourlard, Hervé
Abstract: This paper overviews the work carried out within two large consortia on improving the access to records of human meetings using multimodal interfaces. The design of meeting browsers has emerged as an important goal, with both theoretical interest and practical applications. Meeting browsers are assistance tools that help humans navigate through multimedia records of meetings (audio, video, documents, and metadata), in order to obtain a general idea about what happened in a meeting or to find specific pieces of information, for discovery or verification. To explain the importance that meeting browsers have gained in time, the paper summarizes findings of user studies, discusses features of meeting browser prototypes, and outlines the main evaluation protocol proposed. Reference scores are provided for future benchmarking. These achievements in meeting browsing constitute an iterative software process, from user studies to prototypes and then to products.

6. User Interface Design in a Just-in-time Retrieval System for Meetings

Author: Popescu-Belis, Andrei, Poller, Peter, Kilgour, Jonathan, Flynn, Mike, Germesin, Sebastian, Nanchen, Alexandre, and Yazdani, Majid
Abstract: The Automatic Content Linking Device (ACLD) is a just-in-time multimedia retrieval system that monitors and supports the conversation among a small group of people within a meeting. The ACLD retrieves from a repository, at regular intervals, information that might be relevant to the group's activity, and presents it through a graphical user interface (GUI). The repository contains documents from past meetings such as slides or reports along with processed meeting recordings; in parallel, Web searches are run as well. The acceptance by users of such a system depends considerably on the GUI, along with the performance of retrieval. The trade-off between informativeness and unobtrusiveness is studied here through the design of a series of GUIs. The requirements and feedback collected while demonstrating the successive versions show that users vary considerably in their preferences for a given style of interface. After studying two extreme options, a widget vs. a wide-screen UI, we conclude that a modular UI, which can be flexibly structured and resized by users, is the most sensible design for a just-in-time multimedia retrieval system.

7. Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph

Author: Bhatt, Chidansh A. and Popescu-Belis, Andrei
Subjects: InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
Abstract: In this paper, we present an approach for topic-level video snippet-based extractive summarization, which relies on con tent-based recommendation techniques. We identify topic-level snippets using transcripts of all videos in the dataset and indexed these snippets globally in a word vector space. Generate snippet cosine similarity scores matrix, which are then utilized to compute top snippets to be utilized for summarization. We also compare the snippet similarity globally across all video snippets and locally within a video snippets. This approach has performed well on the AMI meeting corpus, in terms of ROUGE scores compare to state-of-the-art methods. Experiments showed that corpus like AMI meeting has large overlap between global and local snippet similarity of 80% and the ROUGE scores are comparable. Moreover, we applied proposed TopS summarizer in dierent scenarios on Video Lectures, to emphasize the merits of ease in utilizing summarizer with such content-based recommendation technique.

8. From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval

Author: Popescu-Belis, Andrei, Habibi, Maryam, Garner, Philip N., and Li, Nan
Abstract: This paper presents a series of tests that were performed on a state-of-the-art real-time automatic speech recognition system for English, in a single-computer implementation. As the intention is to use the system for speech-based query-free document retrieval in conversations, several parameters were varied: text type, microphone quality, computing power, speaker fluency, and pace of the speech. Word accuracy over various word counts, including a restriction to content words, varied in the 30%-70% range. The paper compares results over many conditions, and concludes that the ASR system is acceptable for the intended use only if all the parameters are in optimal conditions. If more than two parameters are suboptimal, then its output becomes too noisy for document retrieval.

9. Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links

Author: Yazdani, Majid and Popescu-Belis, Andrei

10. Evaluating Attention Networks for Anaphora Resolution

Author: Pilault, Jonathan, Pappas, Nikolaos, Miculicich Werlen, Lesly, and Popescu-Belis, Andrei
Abstract: In this paper, we evaluate the results of using inter and intra attention mechanisms from two architectures, a Deep Attention Long Short-Term Memory-Network (LSTM-N) (Cheng et al., 2016) and a Decomposable Attention model (Parikh et al., 2016), for anaphora resolution, i.e. detecting coreference relations between a pronoun and a noun (its antecedent). The models are adapted from an entailment task, to address the pronominal coreference resolution task by comparing two pairs of sentences: one with the original sentences containing the antecedent and the pronoun, and another one with the pronoun replaced with a correct or an incorrect antecedent. The goal is thus to detect the correct replacements, assuming the original sentence pair entails the one with the correct replacement, but not one with an incorrect replacement. We use the CoNLL-2012 English dataset (Pradhan et al., 2012) to train the models and evaluate the ability to recognize correct and incorrect pronoun replacements in sentence pairs. We find that the Decomposable Attention Model performs better, while using a much simpler architecture. Furthermore, we focus on two previous studies that use intra- and inter-attention mechanisms, discuss how they relate to each other, and examine how these advances work to identify correct antecedent replacements.

11. Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models

Author: Mrini, Khalil, Pappas, Nikolaos, and Popescu-Belis, Andrei
Subjects: multilingual hierarchical networks, document labeling
Abstract: Cross-lingual transfer has been shown to increase the performance of a text classification model thanks to the use of Multilingual Hierarchical Attention Networks (MHAN), on which this work is based. Firstly, we compared the performance of monolingual and mulitilingual HANs with three types of bag-of-words models. We found that the Binary Unigram model outperforms the HAN model with Dense encoders on the full vocabulary in 6 out of 8 languages, and ties against MHAN with the Dense encoders, when it uses the full vocabulary i.e. many more parameters than neural models. However, this is not true when we limit the number of parameters and (or) we increase the sophistication of the neural encoders to GRU or biGRU. Secondly, new configurations of parameter sharing were tested. We found that sharing attention at the sentence level was the best configuration by a small margin when transferring from 5 out of 7 languages to English, as well as for cross-lingual transfer between English and Spanish, Russian, and Arabic. The tests were performed on the Deutsche Welle news corpus with 8 languages and 600k documents.

12. The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites

Author: Popescu-Belis, Andrei, Kilgour, Jonathan, Nanchen, Alexandre, and Poller, Peter
Abstract: The Automatic Content Linking Device (ACLD) is a just-in-time retrieval system that monitors an ongoing conversation or a monologue and enriches it with potentially related documents, including transcripts of past meetings, from local repositories or from the Internet. The linked content is displayed in real-time to the participants in the conversation, or to users watching a recorded conversation or talk. The system can be demonstrated in both settings, using real-time automatic speech recognition (ASR) or replaying offline ASR, via a flexible user interface that displays results and provides access to the content of past meetings and documents.

13. Annotation of face detection: description of XML format and files

Author: Marcel, Sébastien, Rodriguez, Yann, Guillemot, Maël, and Popescu-Belis, Andrei

14. Topic and Sentiment in Phrase-Based Statistical Machine Translation

Author: Habibi, Maryam, Pappas, Nikolaos, and Popescu-Belis, Andrei
Subjects: topic models, Machine Translation, Sentiment Analysis

15. Finding without searching

Author: Popescu-Belis, Andrei

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

15 results on '"Popescu-Belis, Andrei"'

1. Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT)

2. Explicit Suggestion of Query Terms for News Search using Topic Models and Word Embeddings

3. When Users Meet Technology: The Meeting Browser Development Helix

4. Comparing meeting browsers using a task-based evaluation method

5. Finding Information in Multimedia Records of Meetings

6. User Interface Design in a Just-in-time Retrieval System for Meetings

7. Topic-Level Extractive Summarization of Lectures and Meetings Using a Snippet Similarity Graph

8. From Research to Reality: Evaluation of a Single-Computer Real-Time LVCSR System for Speech-Based Retrieval

9. Joint Similarity Learning for Predicting Links in Networks with Multiple-type Links

10. Evaluating Attention Networks for Anaphora Resolution

11. Cross-lingual Transfer for News Article Labeling: Benchmarking Statistical and Neural Models

12. The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites

13. Annotation of face detection: description of XML format and files

14. Topic and Sentiment in Phrase-Based Statistical Machine Translation

15. Finding without searching

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

15 results on '"Popescu-Belis, Andrei"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources