Author: "Orr, Laurel" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Orr, Laurel"' showing total 40 results


1. Holistic Evaluation of Language Models

Author: Liang, Percy, Bommasani, Rishi, Lee, Tony, Tsipras, Dimitris, Soylu, Dilara, Yasunaga, Michihiro, Zhang, Yian, Narayanan, Deepak, Wu, Yuhuai, Kumar, Ananya, Newman, Benjamin, Yuan, Binhang, Yan, Bobby, Zhang, Ce, Cosgrove, Christian, Manning, Christopher D., Ré, Christopher, Acosta-Navas, Diana, Hudson, Drew A., Zelikman, Eric, Durmus, Esin, Ladhak, Faisal, Rong, Frieda, Ren, Hongyu, Yao, Huaxiu, Wang, Jue, Santhanam, Keshav, Orr, Laurel, Zheng, Lucia, Yuksekgonul, Mert, Suzgun, Mirac, Kim, Nathan, Guha, Neel, Chatterji, Niladri, Khattab, Omar, Henderson, Peter, Huang, Qian, Chi, Ryan, Xie, Sang Michael, Santurkar, Shibani, Ganguli, Surya, Hashimoto, Tatsunori, Icard, Thomas, Zhang, Tianyi, Chaudhary, Vishrav, Wang, William, Li, Xuechen, Mai, Yifan, Zhang, Yuhui, and Koreeda, Yuta
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Language models (LMs) are becoming the foundation for almost all major language technologies, but their capabilities, limitations, and risks are not well understood. We present Holistic Evaluation of Language Models (HELM) to improve the transparency of language models. First, we taxonomize the vast space of potential scenarios (i.e. use cases) and metrics (i.e. desiderata) that are of interest for LMs. Then we select a broad subset based on coverage and feasibility, noting what's missing or underrepresented (e.g. question answering for neglected English dialects, metrics for trustworthiness). Second, we adopt a multi-metric approach: We measure 7 metrics (accuracy, calibration, robustness, fairness, bias, toxicity, and efficiency) for each of 16 core scenarios when possible (87.5% of the time). This ensures metrics beyond accuracy don't fall to the wayside, and that trade-offs are clearly exposed. We also perform 7 targeted evaluations, based on 26 targeted scenarios, to analyze specific aspects (e.g. reasoning, disinformation). Third, we conduct a large-scale evaluation of 30 prominent language models (spanning open, limited-access, and closed models) on all 42 scenarios, 21 of which were not previously used in mainstream LM evaluation. Prior to HELM, models on average were evaluated on just 17.9% of the core HELM scenarios, with some prominent models not sharing a single scenario in common. We improve this to 96.0%: now all 30 models have been densely benchmarked on the same core scenarios and metrics under standardized conditions. Our evaluation surfaces 25 top-level findings. For full transparency, we release all raw model prompts and completions publicly for further analysis, as well as a general modular toolkit. We intend for HELM to be a living benchmark for the community, continuously updated with new scenarios, metrics, and models.
Comment: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Project page: https://crfm.stanford.edu/helm/v1.0
Published: 2022

2. Ask Me Anything: A simple strategy for prompting language models

Author: Arora, Simran, Narayan, Avanika, Chen, Mayee F., Orr, Laurel, Guha, Neel, Bhatia, Kush, Chami, Ines, Sala, Frederic, and Ré, Christopher
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) transfer well to new tasks out-of-the-box simply given a natural language prompt that demonstrates how to perform the task and no additional training. Prompting is a brittle process wherein small modifications to the prompt can cause large variations in the model predictions, and therefore significant effort is dedicated towards designing a painstakingly "perfect prompt" for a task. To mitigate the high degree of effort involved in prompt-design, we instead ask whether producing multiple effective, yet imperfect, prompts and aggregating them can lead to a high quality prompting strategy. Our observations motivate our proposed prompting method, ASK ME ANYTHING (AMA). We first develop an understanding of the effective prompt formats, finding that question-answering (QA) prompts, which encourage open-ended generation ("Who went to the park?") tend to outperform those that restrict the model outputs ("John went to the park. Output True or False."). Our approach recursively uses the LLM itself to transform task inputs to the effective QA format. We apply the collected prompts to obtain several noisy votes for the input's true label. We find that the prompts can have very different accuracies and complex dependencies and thus propose to use weak supervision, a procedure for combining the noisy predictions, to produce the final predictions for the inputs. We evaluate AMA across open-source model families (e.g., EleutherAI, BLOOM, OPT, and T0) and model sizes (125M-175B parameters), demonstrating an average performance lift of 10.2% over the few-shot baseline. This simple strategy enables the open-source GPT-J-6B model to match and exceed the performance of few-shot GPT3-175B on 15 of 20 popular benchmarks. Averaged across these tasks, the GPT-J-6B model outperforms few-shot GPT3-175B. We release our code here: https://github.com/HazyResearch/ama_prompting
Published: 2022

3. Can Foundation Models Wrangle Your Data?

Author: Narayan, Avanika, Chami, Ines, Orr, Laurel, Arora, Simran, and Ré, Christopher
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Databases
Abstract: Foundation Models (FMs) are models trained on large corpora of data that, at very large scale, can generalize to new tasks without any task-specific finetuning. As these models continue to grow in size, innovations continue to push the boundaries of what these models can do on language and image tasks. This paper aims to understand an underexplored area of FMs: classical data tasks like cleaning and integration. As a proof-of-concept, we cast five data cleaning and integration tasks as prompting tasks and evaluate the performance of FMs on these tasks. We find that large FMs generalize and achieve SoTA performance on data cleaning and integration tasks, even though they are not trained for these data tasks. We identify specific research challenges and opportunities that these models present, including challenges with private and domain specific data, and opportunities to make data management systems more accessible to non-experts. We make our code and experiments publicly available at: https://github.com/HazyResearch/fm_data_tasks
Comment: 12 pages, 5 figures; additional experiments, typo corrections, modifications to Section 5 (Research Agenda)
Published: 2022

4. Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

Author: Varma, Maya, Orr, Laurel, Wu, Sen, Leszczynski, Megan, Ling, Xiao, and Ré, Christopher
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Named entity disambiguation (NED), which involves mapping textual mentions to structured entities, is particularly challenging in the medical domain due to the presence of rare entities. Existing approaches are limited by the presence of coarse-grained structural resources in biomedical knowledge bases as well as the use of training datasets that provide low coverage over uncommon resources. In this work, we address these issues by proposing a cross-domain data integration method that transfers structural knowledge from a general text knowledge base to the medical domain. We utilize our integration scheme to augment structural resources and generate a large biomedical NED dataset for pretraining. Our pretrained model with injected structural knowledge achieves state-of-the-art performance on two benchmark medical NED datasets: MedMentions and BC5CDR. Furthermore, we improve disambiguation of rare entities by up to 57 accuracy points.
Comment: Accepted to Findings of EMNLP 2021
Published: 2021

5. On the Opportunities and Risks of Foundation Models

Author: Bommasani, Rishi, Hudson, Drew A., Adeli, Ehsan, Altman, Russ, Arora, Simran, von Arx, Sydney, Bernstein, Michael S., Bohg, Jeannette, Bosselut, Antoine, Brunskill, Emma, Brynjolfsson, Erik, Buch, Shyamal, Card, Dallas, Castellon, Rodrigo, Chatterji, Niladri, Chen, Annie, Creel, Kathleen, Davis, Jared Quincy, Demszky, Dora, Donahue, Chris, Doumbouya, Moussa, Durmus, Esin, Ermon, Stefano, Etchemendy, John, Ethayarajh, Kawin, Fei-Fei, Li, Finn, Chelsea, Gale, Trevor, Gillespie, Lauren, Goel, Karan, Goodman, Noah, Grossman, Shelby, Guha, Neel, Hashimoto, Tatsunori, Henderson, Peter, Hewitt, John, Ho, Daniel E., Hong, Jenny, Hsu, Kyle, Huang, Jing, Icard, Thomas, Jain, Saahil, Jurafsky, Dan, Kalluri, Pratyusha, Karamcheti, Siddharth, Keeling, Geoff, Khani, Fereshte, Khattab, Omar, Koh, Pang Wei, Krass, Mark, Krishna, Ranjay, Kuditipudi, Rohith, Kumar, Ananya, Ladhak, Faisal, Lee, Mina, Lee, Tony, Leskovec, Jure, Levent, Isabelle, Li, Xiang Lisa, Li, Xuechen, Ma, Tengyu, Malik, Ali, Manning, Christopher D., Mirchandani, Suvir, Mitchell, Eric, Munyikwa, Zanele, Nair, Suraj, Narayan, Avanika, Narayanan, Deepak, Newman, Ben, Nie, Allen, Niebles, Juan Carlos, Nilforoshan, Hamed, Nyarko, Julian, Ogut, Giray, Orr, Laurel, Papadimitriou, Isabel, Park, Joon Sung, Piech, Chris, Portelance, Eva, Potts, Christopher, Raghunathan, Aditi, Reich, Rob, Ren, Hongyu, Rong, Frieda, Roohani, Yusuf, Ruiz, Camilo, Ryan, Jack, Ré, Christopher, Sadigh, Dorsa, Sagawa, Shiori, Santhanam, Keshav, Shih, Andy, Srinivasan, Krishnan, Tamkin, Alex, Taori, Rohan, Thomas, Armin W., Tramèr, Florian, Wang, Rose E., Wang, William, Wu, Bohan, Wu, Jiajun, Wu, Yuhuai, Xie, Sang Michael, Yasunaga, Michihiro, You, Jiaxuan, Zaharia, Matei, Zhang, Michael, Zhang, Tianyi, Zhang, Xikun, Zhang, Yuhui, Zheng, Lucia, Zhou, Kaitlyn, and Liang, Percy
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles (e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent capabilities, and their effectiveness across so many tasks incentivizes homogenization. Homogenization provides powerful leverage but demands caution, as the defects of the foundation model are inherited by all the adapted models downstream. Despite the impending widespread deployment of foundation models, we currently lack a clear understanding of how they work, when they fail, and what they are even capable of due to their emergent properties. To tackle these questions, we believe much of the critical research on foundation models will require deep interdisciplinary collaboration commensurate with their fundamentally sociotechnical nature.
Comment: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html
Published: 2021

6. Managing ML Pipelines: Feature Stores and the Coming Wave of Embedding Ecosystems

Author: Orr, Laurel, Sanyal, Atindriyo, Ling, Xiao, Goel, Karan, and Leszczynski, Megan
Subjects: Computer Science - Machine Learning, Computer Science - Databases
Abstract: The industrial machine learning pipeline requires iterating on model features, training and deploying models, and monitoring deployed models at scale. Feature stores were developed to manage and standardize the engineer's workflow in this end-to-end pipeline, focusing on traditional tabular feature data. In recent years, however, model development has shifted towards using self-supervised pretrained embeddings as model features. Managing these embeddings and the downstream systems that use them introduces new challenges with respect to managing embedding training data, measuring embedding quality, and monitoring downstream models that use embeddings. These challenges are largely unaddressed in standard feature stores. Our goal in this tutorial is to introduce the feature store system and discuss the challenges and current solutions to managing these new embedding-centric pipelines.
Published: 2021

7. Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation

Author: Orr, Laurel, Leszczynski, Megan, Arora, Simran, Wu, Sen, Guha, Neel, Ling, Xiao, and Ré, Christopher
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: A challenge for named entity disambiguation (NED), the task of mapping textual mentions to entities in a knowledge base, is how to disambiguate entities that appear rarely in the training data, termed tail entities. Humans use subtle reasoning patterns based on knowledge of entity facts, relations, and types to disambiguate unfamiliar entities. Inspired by these patterns, we introduce Bootleg, a self-supervised NED system that is explicitly grounded in reasoning patterns for disambiguation. We define core reasoning patterns for disambiguation, create a learning procedure to encourage the self-supervised model to learn the patterns, and show how to use weak supervision to enhance the signals in the training data. Encoding the reasoning patterns in a simple Transformer architecture, Bootleg meets or exceeds state-of-the-art on three NED benchmarks. We further show that the learned representations from Bootleg successfully transfer to other non-disambiguation tasks that require entity-based knowledge: we set a new state-of-the-art in the popular TACRED relation extraction task by 1.0 F1 points and demonstrate up to 8% performance lift in highly optimized production search and assistant tasks at a major technology company.
Published: 2020

8. Sample Debiasing in the Themis Open World Database System (Extended Version)

Author: Orr, Laurel, Balazinska, Magda, and Suciu, Dan
Subjects: Computer Science - Databases
Abstract: Open world database management systems assume tuples not in the database still exist and are becoming an increasingly important area of research. We present Themis, the first open world database that automatically rebalances arbitrarily biased samples to approximately answer queries as if they were issued over the entire population. We leverage a priori population aggregate information to develop and combine two different approaches for automatic debiasing: sample reweighting and Bayesian network probabilistic modeling. We build a prototype of Themis and demonstrate that Themis achieves higher query accuracy than the default AQP approach, an alternative sample reweighting technique, and a variety of Bayesian network models while maintaining interactive query response times. We also show that Themis is robust to differences in the support between the sample and population, a key use case when using social media samples.
Comment: SIGMOD 2020
Published: 2020

9. Mosaic: A Sample-Based Database System for Open World Query Processing

Author: Orr, Laurel, Ainsworth, Samuel, Cai, Walter, Jamieson, Kevin, Balazinska, Magda, and Suciu, Dan
Subjects: Computer Science - Databases, Computer Science - Machine Learning
Abstract: Data scientists have relied on samples to analyze populations of interest for decades. Recently, with the increase in the number of public data repositories, sample data has become easier to access. It has not, however, become easier to analyze. This sample data is arbitrarily biased with an unknown sampling probability, meaning data scientists must manually debias the sample with custom techniques to avoid inaccurate results. In this vision paper, we propose Mosaic, a database system that treats samples as first-class citizens and allows users to ask questions over populations represented by these samples. Answering queries over biased samples is non-trivial as there is no existing, standard technique to answer population queries when the sampling probability is unknown. In this paper, we show how our envisioned system solves this problem by having a unique sample-based data model with extensions to the SQL language. We propose how to perform population query answering using biased samples and give preliminary results for one of our novel query answering techniques.
Comment: CIDR 2020
Published: 2019

10. EntropyDB: A Probabilistic Approach to Approximate Query Processing

Author: Orr, Laurel, Balazinska, Magdalena, and Suciu, Dan
Subjects: Computer Science - Databases
Abstract: We present EntropyDB, an interactive data exploration system that uses a probabilistic approach to generate a small, query-able summary of a dataset. Departing from traditional summarization techniques, we use the Principle of Maximum Entropy to generate a probabilistic representation of the data that can be used to give approximate query answers. We develop the theoretical framework and formulation of our probabilistic representation and show how to use it to answer queries. We then present solving techniques, give two critical optimizations to improve preprocessing time and query execution time, and explore methods to reduce query error. Lastly, we experimentally evaluate our work using a 5 GB dataset of flights within the United States and a 210 GB dataset from an astronomy particle simulation. While our current work only supports linear queries, we show that our technique can successfully answer queries faster than sampling while introducing, on average, no more error than sampling and can better distinguish between rare and nonexistent values. We also discuss extensions that can allow for data updates and linear queries over joins.
Comment: arXiv admin note: text overlap with arXiv:1703.03856
Published: 2019

11. Data-induced predicates for sideways information passing in query optimizers

Author: Kandula, Srikanth, Orr, Laurel, and Chaudhuri, Surajit
Published: 2022

12. Probabilistic Database Summarization for Interactive Data Exploration

Author: Orr, Laurel, Balazinska, Magda, and Suciu, Dan
Subjects: Computer Science - Databases
Abstract: We present a probabilistic approach to generate a small, query-able summary of a dataset for interactive data exploration. Departing from traditional summarization techniques, we use the Principle of Maximum Entropy to generate a probabilistic representation of the data that can be used to give approximate query answers. We develop the theoretical framework and formulation of our probabilistic representation and show how to use it to answer queries. We then present solving techniques and give three critical optimizations to improve preprocessing time and query accuracy. Lastly, we experimentally evaluate our work using a 5 GB dataset of flights within the United States and a 210 GB dataset from an astronomy particle simulation. While our current work only supports linear queries, we show that our technique can successfully answer queries faster than sampling while introducing, on average, no more error than sampling and can better distinguish between rare and nonexistent values.
Comment: To appear VLDB 2017
Published: 2017

13. EntropyDB: a probabilistic approach to approximate query processing

Author: Orr, Laurel, Balazinska, Magdalena, and Suciu, Dan
Published: 2020

14. Can Foundation Models Wrangle Your Data?

Author: Narayan, Avanika, primary, Chami, Ines, additional, Orr, Laurel, additional, and Ré, Christopher, additional
Published: 2022

15. Data-induced predicates for sideways information passing in query optimizers

Author: Kandula, Srikanth, primary, Orr, Laurel, additional, and Chaudhuri, Surajit, additional
Published: 2021

16. Managing ML pipelines

Author: Orr, Laurel, primary, Sanyal, Atindriyo, additional, Ling, Xiao, additional, Goel, Karan, additional, and Leszczynski, Megan, additional
Published: 2021

17. Goodwill Hunting: Analyzing and Repurposing Off-the-Shelf Named Entity Linking Systems

Author: Goel, Karan, primary, Orr, Laurel, additional, Rajani, Nazneen Fatema, additional, Vig, Jesse, additional, and Ré, Christopher, additional
Published: 2021

18. Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

Author: Varma, Maya, primary, Orr, Laurel, additional, Wu, Sen, additional, Leszczynski, Megan, additional, Ling, Xiao, additional, and Ré, Christopher, additional
Published: 2021

19. Sample Debiasing in the Themis Open World Database System

Author: Orr, Laurel, primary, Balazinska, Magdalena, additional, and Suciu, Dan, additional
Published: 2020

20. EntropyDB: a probabilistic approach to approximate query processing

Author: Orr, Laurel, primary, Balazinska, Magdalena, additional, and Suciu, Dan, additional
Published: 2019

21. Pushing data-induced predicates through joins in big-data clusters

Author: Kandula, Srikanth, primary, Orr, Laurel, additional, and Chaudhuri, Surajit, additional
Published: 2019

22. High performance graphics processor based computed tomography reconstruction algorithms for nuclear and other large scale applications.

Author: Jimenez, Edward, primary, Orr, Laurel, additional, and Thompson, Kyle, additional
Published: 2013

23. Probabilistic database summarization for interactive data exploration

Author: Orr, Laurel, primary, Balazinska, Magdalena, additional, and Suciu, Dan, additional
Published: 2017

24. Explaining query answers with explanation-ready databases

Author: Roy, Sudeepa, primary, Orr, Laurel, additional, and Suciu, Dan, additional
Published: 2015

25. Cluster-based approach to a multi-GPU CT reconstruction algorithm

Author: Orr, Laurel J., primary, Jimenez, Edward S., additional, and Thompson, Kyle R., additional
Published: 2014

26. Object composition identification via mediated-reality supplemented radiographs

Author: Jimenez, Edward S., primary, Orr, Laurel J., additional, and Thompson, Kyle R., additional
Published: 2014

27. Exploring mediated reality to approximate x-ray attenuation coefficients from radiographs

Author: Jimenez, Edward S., additional, Orr, Laurel J., additional, Morgan, Megan L., additional, and Thompson, Kyle R., additional
Published: 2014

28. Irregular large-scale computed tomography on multiple graphics processors improves energy-efficiency metrics for industrial applications

Author: Jimenez, Edward S., additional, Goodman, Eric L., additional, Park, Ryeojin, additional, Orr, Laurel J., additional, and Thompson, Kyle R., additional
Published: 2014

29. Big-Data Management Use-Case

Author: Loebman, Sarah, primary, Ortiz, Jennifer, additional, Choo, Lee Lee, additional, Orr, Laurel, additional, Anderson, Lauren, additional, Halperin, Daniel, additional, Balazinska, Magdalena, additional, Quinn, Thomas, additional, and Governato, Fabio, additional
Published: 2014

30. Preparing for the 100-megapixel detector: reconstructing a multi-terabyte computed-tomography dataset

Author: Orr, Laurel J., additional and Jimenez, Edward S., additional
Published: 2013

31. Rethinking the union of computed tomography reconstruction and GPGPU computing

Author: Jimenez, Edward S., additional and Orr, Laurel J., additional
Published: 2013

32. An Irregular Approach to Large-Scale Computed Tomography on Multiple Graphics Processors Improves Voxel Processing Throughput

Author: Jimenez, Edward S., primary, Orr, Laurel J., additional, and Thompson, Kyle R., additional
Published: 2012