Author: "Facebook AI Research (FAIR)" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Facebook AI Research (FAIR)"' showing total 7 results

Start Over Author "Facebook AI Research (FAIR)"

7 results on '"Facebook AI Research (FAIR)"'

1. Memory-Augmented Reinforcement Learning for Image-Goal Navigation

Author: Lina Mezghani, Sainbayar Sukhbaatar, Thibaut Lavril, Oleksandr Maksymets, Dhruv Batra, Piotr Bojanowski, Karteek Alahari, Apprentissage de modèles à partir de données massives (Thoth), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Jean Kuntzmann (LJK), Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Meta AI, Georgia Institute of Technology [Atlanta], ANR-18-CE23-0011, ANR-18-CE23-0011,AVENUE,Réseau de mémoire visuelle pour l'interprétation de scènes(2018), Facebook AI Research [Paris] (FAIR), Facebook, and Facebook AI Research (FAIR)
Subjects: FOS: Computer and information sciences, Computer Science - Robotics, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Robotics (cs.RO)
Abstract: In this work, we address the problem of image-goal navigation in the context of visually-realistic 3D environments. This task involves navigating to a location indicated by a target image in a previously unseen environment. Earlier attempts, including RL-based and SLAM-based approaches, have either shown poor generalization performance, or are heavily-reliant on pose/depth sensors. We present a novel method that leverages a cross-episode memory to learn to navigate. We first train a state-embedding network in a self-supervised fashion, and then use it to embed previously-visited states into a memory. In order to avoid overfitting, we propose to use data augmentation on the RGB input during training. We validate our approach through extensive evaluations, showing that our data-augmented memory-based model establishes a new state of the art on the image-goal navigation task in the challenging Gibson dataset. We obtain this competitive performance from RGB input only, without access to additional sensors such as position or depth.
Published: 2022

2. Lightweight Adapter Tuning for Multilingual Speech Translation

Author: Jiatao Gu, Hang Le, Changhan Wang, Didier Schwab, Laurent Besacier, Juan Pino, Groupe d’Étude en Traduction Automatique/Traitement Automatisé des Langues et de la Parole (GETALP), Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Facebook AI Research (FAIR), Naver Labs Europe [Meylan], and ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019)
Subjects: FOS: Computer and information sciences, Task (computing), Computer Science - Computation and Language, Machine translation, Computer science, Adapter (computing), Transfer (computing), Speech recognition, Speech translation, computer.software_genre, computer, Computation and Language (cs.CL), [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Abstract: Adapter modules were recently introduced as an efficient alternative to fine-tuning in NLP. Adapter tuning consists in freezing pretrained parameters of a model and injecting lightweight modules between layers, resulting in the addition of only a small number of task-specific trainable parameters. While adapter tuning was investigated for multilingual neural machine translation, this paper proposes a comprehensive analysis of adapters for multilingual speech translation (ST). Starting from different pre-trained models (a multilingual ST trained on parallel data or a multilingual BART (mBART) trained on non-parallel multilingual data), we show that adapters can be used to: (a) efficiently specialize ST to specific language pairs with a low extra cost in terms of parameters, and (b) transfer from an automatic speech recognition (ASR) task and an mBART pre-trained model to a multilingual ST task. Experiments show that adapter tuning offer competitive results to full fine-tuning, while being much more parameter-efficient., Comment: Accepted at ACL-IJCNLP 2021
Published: 2021

3. Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation

Author: Jiatao Gu, Juan Pino, Didier Schwab, Changhan Wang, Laurent Besacier, Hang Le, Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Facebook AI Research (FAIR), Groupe d’Étude en Traduction Automatique/Traitement Automatisé des Langues et de la Parole (GETALP), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Institut Universitaire de France (IUF), Ministère de l'Education nationale, de l’Enseignement supérieur et de la Recherche (M.E.N.E.S.R.), and ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019)
Subjects: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Computer science, Speech recognition, 02 engineering and technology, Computer Science - Sound, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], 03 medical and health sciences, 0302 clinical medicine, Audio and Speech Processing (eess.AS), Dual-decoder Transformer, multilingual speech translation, Speech translation, FOS: Electrical engineering, electronic engineering, information engineering, 030221 ophthalmology & optometry, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Computation and Language (cs.CL), Transformer (machine learning model), Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We introduce dual-decoder Transformer, a new model architecture that jointly performs automatic speech recognition (ASR) and multilingual speech translation (ST). Our models are based on the original Transformer architecture (Vaswani et al., 2017) but consist of two decoders, each responsible for one task (ASR or ST). Our major contribution lies in how these decoders interact with each other: one decoder can attend to different information sources from the other via a dual-attention mechanism. We propose two variants of these architectures corresponding to two different levels of dependencies between the decoders, called the parallel and cross dual-decoder Transformers, respectively. Extensive experiments on the MuST-C dataset show that our models outperform the previously-reported highest translation performance in the multilingual settings, and outperform as well bilingual one-to-one results. Furthermore, our parallel models demonstrate no trade-off between ASR and ST compared to the vanilla multi-task architecture. Our code and pre-trained models are available at https://github.com/formiel/speech-translation., Accepted at COLING 2020 (Oral)
Published: 2020
Full Text: View/download PDF

4. A Memory Efficient Baseline for Open Domain Question Answering

Author: Izacard, Gautier, Petroni, Fabio, Hosseini, Lucas, De Cao, Nicola, Riedel, Sebastian, Grave, Edouard, Facebook AI Research (FAIR), University of Amsterdam [Amsterdam] (UvA), and University College of London [London] (UCL)
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL), [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Abstract: Recently, retrieval systems based on dense representations have led to important improvements in open-domain question answering, and related tasks. While very effective, this approach is also memory intensive, as the dense vectors for the whole knowledge source need to be kept in memory. In this paper, we study how the memory footprint of dense retriever-reader systems can be reduced. We consider three strategies to reduce the index size: dimension reduction, vector quantization and passage filtering. We evaluate our approach on two question answering benchmarks: TriviaQA and NaturalQuestions, showing that it is possible to get competitive systems using less than 6Gb of memory.
Published: 2020
Full Text: View/download PDF

5. RUBi: Reducing Unimodal Biases for Visual Question Answering

Author: Cadene, Remi, Dancette, Corentin, Ben-younes, Hedi, Cord, Matthieu, Parikh, Devi, Dancette, Corentin, Machine Learning and Information Access (MLIA), LIP6, Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Facebook AI Research (FAIR), and Georgia Institute of Technology [Atlanta]
Subjects: [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG]
Abstract: International audience; Visual Question Answering (VQA) is the task of answering questions about an image. Some VQA models often exploit unimodal biases to provide the correct answer without using the image information. As a result, they suffer from a huge drop in performance when evaluated on data outside their training set distribution. This critical issue makes them unsuitable for real-world settings. We propose RUBi, a new learning strategy to reduce biases in any VQA model. It reduces the importance of the most biased examples, i.e. examples that can be correctly classified without looking at the image. It implicitly forces the VQA model to use the two input modalities instead of relying on statistical regularities between the question and the answer. We leverage a question-only model that captures the language biases by identifying when these unwanted regularities are used. It prevents the base VQA model from learning them by influencing its predictions. This leads to dynamically adjusting the loss in order to compensate for biases. We validate our contributions by surpassing the current state-of-the-art results on VQA-CP v2. This dataset is specifically designed to assess the robustness of VQA models when exposed to different question biases at test time than what was seen during training. Our code is available: github.com/cdancette/rubi.bootstrap.pytorch
Published: 2019

6. Ultrafast homomorphic encryption models enable secure outsourcing of genotype imputation

Author: Xiaoqian Jiang, Miran Kim, Lucila Ohno-Machado, Jung Hee Cheon, Yongsoo Song, Sergiu Carpov, Juan Ramón Troncoso-Pastoriza, Heidi J. Sofia, Jean-Pierre Hubaux, Kristin E. Lauter, Jean-Philippe Bossuat, Seungwan Hong, David Froelicher, Arif Harmanci, Ilaria Chillotti, Mariya Georgieva, Yongha Son, Nicolas Gama, Wonhee Cho, Duhyeong Kim, Yiping Ma, University of Ulsan, University of Texas Health Science Center, The University of Texas Health Science Center at Houston (UTHealth), Ecole Polytechnique Fédérale de Lausanne (EPFL), Département Systèmes et Circuits Intégrés Numériques (DSCIN), Laboratoire d'Intégration des Systèmes et des Technologies (LIST (CEA)), Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA), Seoul National University [Seoul] (SNU), Zama, IMEC (IMEC), Catholic University of Leuven - Katholieke Universiteit Leuven (KU Leuven), EPFL Innovation Park, Gene Predictis SA, University of California [San Diego] (UC San Diego), University of California (UC), University of Pennsylvania, West Coast Head of Research Science, Facebook AI Research (FAIR), [Seattle], National Human Genome Research Institute (NHGRI), and ERC Advanced Grant ERC-2015-AdG-IMPaCT
Subjects: Service (systems architecture), Histology, Source code, Computer science, [SDV]Life Sciences [q-bio], media_common.quotation_subject, Genomic data, computer.software_genre, Pathology and Forensic Medicine, Outsourcing, scale, Genotype, Imputation (statistics), media_common, Genotype imputation, business.industry, Homomorphic encryption, snp, Cell Biology, sequence, rare, loci, genome-wide association, Data mining, business, computer, metaanalysis
Abstract: Summary Genotype imputation is a fundamental step in genomic data analysis, where missing variant genotypes are predicted using the existing genotypes of nearby “tag” variants. Although researchers can outsource genotype imputation, privacy concerns may prohibit genetic data sharing with an untrusted imputation service. Here, we developed secure genotype imputation using efficient homomorphic encryption (HE) techniques. In HE-based methods, the genotype data are secure while it is in transit, at rest, and in analysis. It can only be decrypted by the owner. We compared secure imputation with three state-of-the-art non-secure methods and found that HE-based methods provide genetic data security with comparable accuracy for common variants. HE-based methods have time and memory requirements that are comparable or lower than those for the non-secure methods. Our results provide evidence that HE-based methods can practically perform resource-intensive computations for high-throughput genetic data analysis. The source code is freely available for download at https://github.com/K-miran/secure-imputation .
Full Text: View/download PDF

7. Ultrafast homomorphic encryption models enable secure outsourcing of genotype imputation.

Author: Kim M, Harmanci AO, Bossuat JP, Carpov S, Cheon JH, Chillotti I, Cho W, Froelicher D, Gama N, Georgieva M, Hong S, Hubaux JP, Kim D, Lauter K, Ma Y, Ohno-Machado L, Sofia H, Son Y, Song Y, Troncoso-Pastoriza J, and Jiang X
Subjects: Computer Security, Genome-Wide Association Study, Genotype, Privacy, Outsourced Services
Abstract: Genotype imputation is a fundamental step in genomic data analysis, where missing variant genotypes are predicted using the existing genotypes of nearby "tag" variants. Although researchers can outsource genotype imputation, privacy concerns may prohibit genetic data sharing with an untrusted imputation service. Here, we developed secure genotype imputation using efficient homomorphic encryption (HE) techniques. In HE-based methods, the genotype data are secure while it is in transit, at rest, and in analysis. It can only be decrypted by the owner. We compared secure imputation with three state-of-the-art non-secure methods and found that HE-based methods provide genetic data security with comparable accuracy for common variants. HE-based methods have time and memory requirements that are comparable or lower than those for the non-secure methods. Our results provide evidence that HE-based methods can practically perform resource-intensive computations for high-throughput genetic data analysis. The source code is freely available for download at https://github.com/K-miran/secure-imputation., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2021 The Authors. Published by Elsevier Inc. All rights reserved.)
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

7 results on '"Facebook AI Research (FAIR)"'

1. Memory-Augmented Reinforcement Learning for Image-Goal Navigation

2. Lightweight Adapter Tuning for Multilingual Speech Translation

3. Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation

4. A Memory Efficient Baseline for Open Domain Question Answering

5. RUBi: Reducing Unimodal Biases for Visual Question Answering

6. Ultrafast homomorphic encryption models enable secure outsourcing of genotype imputation

7. Ultrafast homomorphic encryption models enable secure outsourcing of genotype imputation.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

Publisher

7 results on '"Facebook AI Research (FAIR)"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources