Search

Your search keyword '"Ranzato, Marc'Aurelio"' showing total 253 results

Search Constraints

Start Over You searched for: Author "Ranzato, Marc'Aurelio" Remove constraint Author: "Ranzato, Marc'Aurelio"
253 results on '"Ranzato, Marc'Aurelio"'

Search Results

1. DiPaCo: Distributed Path Composition

2. Asynchronous Local-SGD Training for Language Modeling

3. DiLoCo: Distributed Low-Communication Training of Language Models

4. Towards Robust and Efficient Continual Language Learning

5. Towards Compute-Optimal Transfer Learning

6. NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

7. Multi-step Planning for Automated Hyperparameter Optimization with OptFormer

8. Towards Learning Universal Hyperparameter Optimizers with Transformers

10. On Anytime Learning at Macroscale

11. The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

12. Efficient Continual Learning with Modular Networks and Task-Driven Priors

13. Few-shot Sequence Learning with Transformers

14. Multi-scale Transformer Language Models

15. Residual Energy-Based Models for Text Generation

16. Residual Energy-Based Models for Text

17. Facebook AI's WAT19 Myanmar-English Translation Task Submission

18. Revisiting Self-Training for Neural Sequence Generation

19. The Source-Target Domain Mismatch Problem in Machine Translation

20. On The Evaluation of Machine Translation Systems Trained With Back-Translation

21. Large Memory Layers with Product Keys

22. Real or Fake? Learning to Discriminate Machine from Human Generated Text

23. Task-Driven Modular Networks for Zero-Shot Compositional Learning

24. On Tiny Episodic Memories in Continual Learning

25. Mixture Models for Diverse Machine Translation: Tricks of the Trade

26. The FLoRes Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English

27. Efficient Lifelong Learning with A-GEM

28. Multiple-Attribute Text Style Transfer

29. Phrase-Based & Neural Unsupervised Machine Translation

30. Lightweight Adaptive Mixture of Neural and N-gram Language Models

31. Analyzing Uncertainty in Neural Machine Translation

32. Classical Structured Prediction Losses for Sequence to Sequence Learning

33. Unsupervised Machine Translation Using Monolingual Corpora Only

34. Word Translation Without Parallel Data

35. Gradient Episodic Memory for Continual Learning

36. Fader Networks: Manipulating Images by Sliding Attributes

37. Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

38. Training Language Models Using Target-Propagation

39. Transformation-Based Models of Video Sequences

40. Learning through Dialogue Interactions by Asking Questions

41. Dialogue Learning With Human-In-The-Loop

42. Sequence Level Training with Recurrent Neural Networks

43. Convolutional networks and learning invariant to homogeneous multiplicative scalings

44. Learning Longer Memory in Recurrent Neural Networks

45. Ensemble of Generative and Discriminative Techniques for Sentiment Analysis of Movie Reviews

46. Web-Scale Training for Face Identification

47. On Learning Where To Look

48. Multi-GPU Training of ConvNets

49. Learning Factored Representations in a Deep Mixture of Experts

50. PANDA: Pose Aligned Networks for Deep Attribute Modeling

Catalog

Books, media, physical & digital resources