Author: "Kordi, Yeganeh" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Kordi, Yeganeh"' showing total 6 results

Start Over Author "Kordi, Yeganeh"

6 results on '"Kordi, Yeganeh"'

1. Tur[k]ingBench: A Challenge Benchmark for Web Agents

Author: Xu, Kevin, Kordi, Yeganeh, Nayak, Tanay, Asija, Ado, Wang, Yizhong, Sanders, Kate, Byerly, Adam, Zhang, Jingyu, Van Durme, Benjamin, and Khashabi, Daniel
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction
Abstract: Can advanced multi-modal models effectively tackle complex web-based tasks? Such tasks are often found on crowdsourcing platforms, where crowdworkers engage in challenging micro-tasks within web-based environments. Building on this idea, we present TurkingBench, a benchmark consisting of tasks presented as web pages with textual instructions and multi-modal contexts. Unlike previous approaches that rely on artificially synthesized web pages, our benchmark uses natural HTML pages originally designed for crowdsourcing workers to perform various annotation tasks. Each task's HTML instructions are instantiated with different values derived from crowdsourcing tasks, creating diverse instances. This benchmark includes 32.2K instances spread across 158 tasks. To support the evaluation of TurkingBench, we have developed a framework that links chatbot responses to actions on web pages (e.g., modifying a text box, selecting a radio button). We assess the performance of cutting-edge private and open-source models, including language-only and vision-language models (such as GPT4 and InternVL), on this benchmark. Our results show that while these models outperform random chance, there is still significant room for improvement. We hope that this benchmark will drive progress in the evaluation and development of web-based agents.
Published: 2024

2. Self-Instruct: Aligning Language Models with Self-Generated Instructions

Author: Wang, Yizhong, Kordi, Yeganeh, Mishra, Swaroop, Liu, Alisa, Smith, Noah A., Khashabi, Daniel, and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off their own generations. Our pipeline generates instructions, input, and output samples from a language model, then filters invalid or similar ones before using them to finetune the original model. Applying our method to the vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT-001, which was trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT-001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning. Our code and data are available at https://github.com/yizhongw/self-instruct., Comment: ACL 2023 camera ready, 23 pages, 9 figures, 11 tables
Published: 2022

3. Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Author: Wang, Yizhong, Mishra, Swaroop, Alipoormolabashi, Pegah, Kordi, Yeganeh, Mirzaei, Amirreza, Arunkumar, Anjana, Ashok, Arjun, Dhanasekaran, Arut Selvan, Naik, Atharva, Stap, David, Pathak, Eshaan, Karamanolakis, Giannis, Lai, Haizhi Gary, Purohit, Ishan, Mondal, Ishani, Anderson, Jacob, Kuznia, Kirby, Doshi, Krima, Patel, Maitreya, Pal, Kuntal Kumar, Moradshahi, Mehrad, Parmar, Mihir, Purohit, Mirali, Varshney, Neeraj, Kaza, Phani Rohitha, Verma, Pulkit, Puri, Ravsehaj Singh, Karia, Rushang, Sampat, Shailaja Keyur, Doshi, Savan, Mishra, Siddhartha, Reddy, Sujan, Patro, Sumanta, Dixit, Tanay, Shen, Xudong, Baral, Chitta, Choi, Yejin, Smith, Noah A., Hajishirzi, Hannaneh, and Khashabi, Daniel
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting, and text composition. This large and diverse collection of tasks enables rigorous benchmarking of cross-task generalization under instructions -- training models to follow instructions on a subset of tasks and evaluating them on the remaining unseen ones. Furthermore, we build Tk-Instruct, a transformer model trained to follow a variety of in-context instructions (plain language task definitions or k-shot examples). Our experiments show that Tk-Instruct outperforms existing instruction-following models such as InstructGPT by over 9% on our benchmark despite being an order of magnitude smaller. We further analyze generalization as a function of various scaling parameters, such as the number of observed tasks, the number of instances per task, and model sizes. We hope our dataset and model facilitate future progress towards more general-purpose NLP models., Comment: Accepted to EMNLP 2022, 25 pages
Published: 2022

4. UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training

Author: Khashabi, Daniel, Kordi, Yeganeh, and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We present UnifiedQA-v2, a QA model built with the same process as UnifiedQA, except that it utilizes more supervision -- roughly 3x the number of datasets used for UnifiedQA. This generally leads to better in-domain and cross-domain results.
Published: 2022

5. Self-Instruct: Aligning Language Models with Self-Generated Instructions

Author: Wang, Yizhong, primary, Kordi, Yeganeh, additional, Mishra, Swaroop, additional, Liu, Alisa, additional, Smith, Noah A., additional, Khashabi, Daniel, additional, and Hajishirzi, Hannaneh, additional
Published: 2023
Full Text: View/download PDF

6. Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Author: Wang, Yizhong, primary, Mishra, Swaroop, additional, Alipoormolabashi, Pegah, additional, Kordi, Yeganeh, additional, Mirzaei, Amirreza, additional, Naik, Atharva, additional, Ashok, Arjun, additional, Dhanasekaran, Arut Selvan, additional, Arunkumar, Anjana, additional, Stap, David, additional, Pathak, Eshaan, additional, Karamanolakis, Giannis, additional, Lai, Haizhi, additional, Purohit, Ishan, additional, Mondal, Ishani, additional, Anderson, Jacob, additional, Kuznia, Kirby, additional, Doshi, Krima, additional, Pal, Kuntal Kumar, additional, Patel, Maitreya, additional, Moradshahi, Mehrad, additional, Parmar, Mihir, additional, Purohit, Mirali, additional, Varshney, Neeraj, additional, Kaza, Phani Rohitha, additional, Verma, Pulkit, additional, Puri, Ravsehaj Singh, additional, Karia, Rushang, additional, Doshi, Savan, additional, Sampat, Shailaja Keyur, additional, Mishra, Siddhartha, additional, Reddy A, Sujan, additional, Patro, Sumanta, additional, Dixit, Tanay, additional, and Shen, Xudong, additional
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"Kordi, Yeganeh"'

1. Tur[k]ingBench: A Challenge Benchmark for Web Agents

2. Self-Instruct: Aligning Language Models with Self-Generated Instructions

3. Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

4. UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training

5. Self-Instruct: Aligning Language Models with Self-Generated Instructions

6. Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

6 results on '"Kordi, Yeganeh"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources