Author: "Chitale, Pranjal A." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chitale, Pranjal A."' showing total 12 results

Start Over Author "Chitale, Pranjal A."

12 results on '"Chitale, Pranjal A."'

1. Towards Inducing Long-Context Abilities in Multilingual Neural Machine Translation Models

Author: Gumma, Varun, Chitale, Pranjal A., and Bali, Kalika
Subjects: Computer Science - Computation and Language
Abstract: Neural Machine Translation (NMT) models have traditionally used Sinusoidal Positional Embeddings (PEs), which often struggle to capture long-range dependencies and are inefficient for handling extended context or document-level translation tasks. This work addresses the challenge of transitioning pre-trained NMT models from absolute Sinusoidal PEs to Relative PEs, such as RoPE and ALiBi, without compromising performance. We demonstrate that parameter-efficient fine-tuning, using only a small amount of high-quality data, can successfully facilitate this transition. Experimental results indicate that switching from Sinusoidal to Relative PEs results in competitive translation quality on sentence-level evaluation benchmarks. Additionally, models trained with RoPE consistently outperform those using ALiBi and Sinusoidal PEs on document-level benchmarks across both string-based metrics and qualitative evaluations. Moreover, we find that a small amount of long-context data in a few languages is sufficient for cross-lingual length generalization, thereby inducing long-context capabilities., Comment: Accepted at NAACL 2025
Published: 2024

2. CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Author: Romero, David, Lyu, Chenyang, Wibowo, Haryo Akbarianto, Lynn, Teresa, Hamed, Injy, Kishore, Aditya Nanda, Mandal, Aishik, Dragonetti, Alina, Abzaliev, Artem, Tonja, Atnafu Lambebo, Balcha, Bontu Fufa, Whitehouse, Chenxi, Salamea, Christian, Velasco, Dan John, Adelani, David Ifeoluwa, Meur, David Le, Villa-Cueva, Emilio, Koto, Fajri, Farooqui, Fauzan, Belcavello, Frederico, Batnasan, Ganzorig, Vallejo, Gisela, Caulfield, Grainne, Ivetta, Guido, Song, Haiyue, Ademtew, Henok Biadglign, Maina, Hernán, Lovenia, Holy, Azime, Israel Abebe, Cruz, Jan Christian Blaise, Gala, Jay, Geng, Jiahui, Ortiz-Barajas, Jesus-German, Baek, Jinheon, Dunstan, Jocelyn, Alemany, Laura Alonso, Nagasinghe, Kumaranage Ravindu Yasas, Benotti, Luciana, D'Haro, Luis Fernando, Viridiano, Marcelo, Estecha-Garitagoitia, Marcos, Cabrera, Maria Camila Buitrago, Rodríguez-Cantelar, Mario, Jouitteau, Mélanie, Mihaylov, Mihail, Imam, Mohamed Fazli Mohamed, Adilazuarda, Muhammad Farid, Gochoo, Munkhjargal, Otgonbold, Munkh-Erdene, Etori, Naome, Niyomugisha, Olivier, Silva, Paula Mónica, Chitale, Pranjal, Dabre, Raj, Chevi, Rendi, Zhang, Ruochen, Diandaru, Ryandito, Cahyawijaya, Samuel, Góngora, Santiago, Jeong, Soyeong, Purkayastha, Sukannya, Kuribayashi, Tatsuki, Clifford, Teresa, Jayakumar, Thanmay, Torrent, Tiago Timponi, Ehsan, Toqeer, Araujo, Vladimir, Kementchedjhieva, Yova, Burzo, Zara, Lim, Zheng Wei, Yong, Zheng Xin, Ignat, Oana, Nwatu, Joan, Mihalcea, Rada, Solorio, Thamar, and Aji, Alham Fikri
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recent efforts have tried to increase the number of languages covered on VQA datasets, they still lack diversity in low-resource languages. More importantly, although these datasets often extend their linguistic range via translation or some other approaches, they usually keep images the same, resulting in narrow cultural representation. To address these limitations, we construct CVQA, a new Culturally-diverse multilingual Visual Question Answering benchmark, designed to cover a rich set of languages and cultures, where we engage native speakers and cultural experts in the data collection process. As a result, CVQA includes culturally-driven images and questions from across 30 countries on four continents, covering 31 languages with 13 scripts, providing a total of 10k questions. We then benchmark several Multimodal Large Language Models (MLLMs) on CVQA, and show that the dataset is challenging for the current state-of-the-art models. This benchmark can serve as a probing evaluation suite for assessing the cultural capability and bias of multimodal models and hopefully encourage more research efforts toward increasing cultural awareness and linguistic diversity in this field., Comment: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks
Published: 2024

3. An Empirical Study of In-context Learning in LLMs for Machine Translation

Author: Chitale, Pranjal A., Gala, Jay, and Dabre, Raj
Subjects: Computer Science - Computation and Language
Abstract: Recent interest has surged in employing Large Language Models (LLMs) for machine translation (MT) via in-context learning (ICL) (Vilar et al., 2023). Most prior studies primarily focus on optimizing translation quality, with limited attention to understanding the specific aspects of ICL that influence the said quality. To this end, we perform the first of its kind, an exhaustive study of in-context learning for machine translation. We first establish that ICL is primarily example-driven and not instruction-driven. Following this, we conduct an extensive exploration of various aspects of the examples to understand their influence on downstream performance. Our analysis includes factors such as quality and quantity of demonstrations, spatial proximity, and source versus target originality. Further, we also investigate challenging scenarios involving indirectness and misalignment of examples to understand the limits of ICL. While we establish the significance of the quality of the target distribution over the source distribution of demonstrations, we further observe that perturbations sometimes act as regularizers, resulting in performance improvements. Surprisingly, ICL does not necessitate examples from the same task, and a related task with the same target distribution proves sufficient. We hope that our study acts as a guiding resource for considerations in utilizing ICL for MT. Our code is available on https://github.com/PranjalChitale/in-context-mt-analysis., Comment: Accepted to ACL 2024 Findings
Published: 2024

4. IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

Author: Gala, Jay, Chitale, Pranjal A., AK, Raghavan, Gumma, Varun, Doddapaneni, Sumanth, Kumar, Aswanth, Nawale, Janki, Sujatha, Anupama, Puduppully, Ratish, Raghavan, Vivek, Kumar, Pratyush, Khapra, Mitesh M., Dabre, Raj, and Kunchukuttan, Anoop
Subjects: Computer Science - Computation and Language
Abstract: India has a rich linguistic landscape with languages from 4 major language families spoken by over a billion people. 22 of these languages are listed in the Constitution of India (referred to as scheduled languages) are the focus of this work. Given the linguistic diversity, high-quality and accessible Machine Translation (MT) systems are essential in a country like India. Prior to this work, there was (i) no parallel training data spanning all 22 languages, (ii) no robust benchmarks covering all these languages and containing content relevant to India, and (iii) no existing translation models which support all the 22 scheduled languages of India. In this work, we aim to address this gap by focusing on the missing pieces required for enabling wide, easy, and open access to good machine translation systems for all 22 scheduled Indian languages. We identify four key areas of improvement: curating and creating larger training datasets, creating diverse and high-quality benchmarks, training multilingual models, and releasing models with open access. Our first contribution is the release of the Bharat Parallel Corpus Collection (BPCC), the largest publicly available parallel corpora for Indic languages. BPCC contains a total of 230M bitext pairs, of which a total of 126M were newly added, including 644K manually translated sentence pairs created as part of this work. Our second contribution is the release of the first n-way parallel benchmark covering all 22 Indian languages, featuring diverse domains, Indian-origin content, and source-original test sets. Next, we present IndicTrans2, the first model to support all 22 languages, surpassing existing models on multiple existing and new benchmarks created as a part of this work. Lastly, to promote accessibility and collaboration, we release our models and associated data with permissive licenses at https://github.com/AI4Bharat/IndicTrans2., Comment: Accepted at TMLR
Published: 2023

5. Improving Image-Based Dialog by Reducing Modality Biases

Author: Gala, Jay, Shenai, Hrishikesh, Chitale, Pranjal, Kekre, Kaustubh, Kanani, Pratik, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Singh, Mayank, editor, Tyagi, Vipin, editor, Gupta, P. K., editor, Flusser, Jan, editor, Ören, Tuncer, editor, and Sonawane, V. R., editor
Published: 2021
Full Text: View/download PDF

6. Combating COVID-19 using object detection techniques for next-generation autonomous systems

Author: Shenai, Hrishikesh, primary, Gala, Jay, additional, Kekre, Kaustubh, additional, Chitale, Pranjal, additional, and Karani, Ruhina, additional
Published: 2022
Full Text: View/download PDF

7. IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

Author: AI4Bharat, Gala, Jay, Chitale, Pranjal A., AK, Raghavan, Doddapaneni, Sumanth, Gumma, Varun, Kumar, Aswanth, Nawale, Janki, Sujatha, Anupama, Puduppully, Ratish, Raghavan, Vivek, Kumar, Pratyush, Khapra, Mitesh M., Dabre, Raj, and Kunchukuttan, Anoop
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)
Abstract: India has a rich linguistic landscape with languages from 4 major language families spoken by over a billion people. 22 of these languages are listed in the Constitution of India (referred to as scheduled languages) are the focus of this work. Given the linguistic diversity, high-quality and accessible Machine Translation (MT) systems are essential in a country like India. Prior to this work, there was (i) no parallel training data spanning all the 22 languages, (ii) no robust benchmarks covering all these languages and containing content relevant to India, and (iii) no existing translation models which support all the 22 scheduled languages of India. In this work, we aim to address this gap by focusing on the missing pieces required for enabling wide, easy, and open access to good machine translation systems for all 22 scheduled Indian languages. We identify four key areas of improvement: curating and creating larger training datasets, creating diverse and high-quality benchmarks, training multilingual models, and releasing models with open access. Our first contribution is the release of the Bharat Parallel Corpus Collection (BPCC), the largest publicly available parallel corpora for Indic languages. BPCC contains a total of 230M bitext pairs, of which a total of 126M were newly added, including 644K manually translated sentence pairs created as part of this work. Our second contribution is the release of the first n-way parallel benchmark covering all 22 Indian languages, featuring diverse domains, Indian-origin content, and source-original test sets. Next, we present IndicTrans2, the first model to support all 22 languages, surpassing existing models on multiple existing and new benchmarks created as a part of this work. Lastly, to promote accessibility and collaboration, we release our models and associated data with permissive licenses at https://github.com/ai4bharat/IndicTrans2.
Published: 2023

8. NICT-AI4B’s Submission to the Indic MT Shared Task in WMT 2023

Author: Dabre, Raj, primary, Gala, Jay, additional, and Chitale, Pranjal, additional
Published: 2023
Full Text: View/download PDF

9. Developing State-Of-The-Art Massively Multilingual Machine Translation Systems for Related Languages

Author: Gala, Jay, primary, Chitale, Pranjal A., additional, and Dabre, Raj, additional
Published: 2023
Full Text: View/download PDF

10. Chapter 4 - Combating COVID-19 using object detection techniques for next-generation autonomous systems

Author: Shenai, Hrishikesh, Gala, Jay, Kekre, Kaustubh, Chitale, Pranjal, and Karani, Ruhina
Published: 2022
Full Text: View/download PDF

11. Pothole Detection and Dimension Estimation System using Deep Learning (YOLO) and Image Processing

Author: Chitale, Pranjal A., primary, Kekre, Kaustubh Y., additional, Shenai, Hrishikesh R., additional, Karani, Ruhina, additional, and Gala, Jay P., additional
Published: 2020
Full Text: View/download PDF

12. List of contributors

Author: Agarwal, Basant, Ahlawat, Chanchal, Amritesh, Banerjee, Jyoti Sekhar, Bhaskar, Priyanka, Chakraborty, Arpita, Chatterjee, Avik, Chaube, Nandita, Chitale, Pranjal, Das, Abhijit, Das, Barshan, Das, Basabdatta, Gala, Jay, Gothwal, Pushpa, Guhathakurata, Soham, Harjule, Priyanka, Javeri, Yash Tejas, Jeyakumar, Vijay, Karani, Ruhina, Kekre, Kaustubh, Krishnamurthi, Rajalakshmi, Kumar, Krishan, Kundu, Souvik, Luqman, Nadeem, Malik, Ananya, Mangrulkar, Ramchandra, Nanda, Shweta, Negi, Alok, Nirmala, K., Padmanabhan, Mani, Pandya, Abhijit S., Patel, Riki, Rao, Sunita, Saha, Sayak, Sanghvi, Harshal, Sarate, Sachin G., Saxena, Sumit, Shah, Manav, Sharma, Ashish, Sharma, Bosky Dharmendra, Sharma, Neha V., Sharma, Saurabh, Sharma, Vaishnavi, Shenai, Hrishikesh, Tiwari, Vinita, and Yadav, Narendra Singh
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

12 results on '"Chitale, Pranjal A."'

1. Towards Inducing Long-Context Abilities in Multilingual Neural Machine Translation Models

2. CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

3. An Empirical Study of In-context Learning in LLMs for Machine Translation

4. IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

5. Improving Image-Based Dialog by Reducing Modality Biases

6. Combating COVID-19 using object detection techniques for next-generation autonomous systems

7. IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

8. NICT-AI4B’s Submission to the Indic MT Shared Task in WMT 2023

9. Developing State-Of-The-Art Massively Multilingual Machine Translation Systems for Related Languages

10. Chapter 4 - Combating COVID-19 using object detection techniques for next-generation autonomous systems

11. Pothole Detection and Dimension Estimation System using Deep Learning (YOLO) and Image Processing

12. List of contributors

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

12 results on '"Chitale, Pranjal A."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources