Author: "null Mausam" - Searchworks@Jio Institute Digital Library Search Results

1. Welcome back!

Author: Hemangee Kalpesh Kapoor, null Mausam, and Venkatesh Raman
Subjects: General Computer Science
Published: 2022

2. ToolTango: Common sense Generalization in Predicting Sequential Tool Interactions for Robot Plan Synthesis

Author: Shreshth Tuli, Rajas Bansal, Rohan Paul, and null Mausam
Subjects: FOS: Computer and information sciences, Computer Science - Robotics, Artificial Intelligence, Robotics (cs.RO)
Abstract: Robots assisting us in environments such as factories or homes must learn to make use of objects as tools to perform tasks, for instance using a tray to carry objects. We consider the problem of learning commonsense knowledge of when a tool may be useful and how its use may be composed with other tools to accomplish a high-level task instructed by a human. Specifically, we introduce a novel neural model, termed TOOLTANGO, that first predicts the next tool to be used, and then uses this information to predict the next action. We show that this joint model can inform learning of a fine-grained policy enabling the robot to use a particular tool in sequence and adds a significant value in making the model more accurate. TOOLTANGO encodes the world state, comprising objects and symbolic relationships between them, using a graph neural network and is trained using demonstrations from human teachers instructing a virtual robot in a physics simulator. The model learns to attend over the scene using knowledge of the goal and the action history, finally decoding the symbolic action to execute. Crucially, we address generalization to unseen environments where some known tools are missing, but alternative unseen tools are present. We show that by augmenting the representation of the environment with pre-trained embeddings derived from a knowledge-base, the model can generalize effectively to novel environments. Experimental results show at least 48.8-58.1% absolute improvement over the baselines in predicting successful symbolic plans for a simulated mobile manipulator in novel environments with unseen objects. This work takes a step in the direction of enabling robots to rapidly synthesize robust plans for complex tasks, particularly in novel settings, Accepted in Journal of AI Research. arXiv admin note: substantial text overlap with arXiv:2105.04556
Published: 2022

3. Focused Topological Value Iteration

Author: Peng Dai, null Mausam, and Daniel Weld
Abstract: Topological value iteration (TVI) is an effective algorithm for solving Markov decision processes (MDPs) optimally, which 1) divides an MDP into strongly-connected components, and 2) solves these components sequentially. Yet, TVI’s usefulness tends to degrade if an MDP has large components, because the cost of the division process isn’t offset by gains during solution. This paper presents a new algorithm to solve MDPs optimally, focused topological value iteration (FTVI). FTVI addresses TVI’s limitations by restricting its attention to connected components that are relevant for solving the MDP. Specifically, FTVI uses a small amount of heuristic search to eliminate provably sub-optimal actions; this pruning allows FTVI to find smaller connected components, thus running faster. We demonstrate that our new algorithm outperforms TVI by an order of magnitude, averaged across several domains. Surprisingly, FTVI also significantly outperforms popular ‘heuristically-informed’ MDP algorithms such as LAO*, LRTDP, and BRTDP in many domains, sometimes by as much as two orders of magnitude. Finally, we characterize the type of domains where FTVI excels — suggesting a way to an informed choice of solver.
Published: 2021

4. MatSciBERT: A materials domain language model for text mining and information extraction

Author: Tanishq Gupta, Mohd Zaki, N. M. Anoop Krishnan, and null Mausam
Subjects: Mechanics of Materials, Modeling and Simulation, General Materials Science, Computer Science Applications
Abstract: A large amount of materials science knowledge is generated and stored as text published in peer-reviewed scientific literature. While recent developments in natural language processing, such as Bidirectional Encoder Representations from Transformers (BERT) models, provide promising information extraction tools, these models may yield suboptimal results when applied on materials domain since they are not trained in materials science specific notations and jargons. Here, we present a materials-aware language model, namely, MatSciBERT, trained on a large corpus of peer-reviewed materials science publications. We show that MatSciBERT outperforms SciBERT, a language model trained on science corpus, and establish state-of-the-art results on three downstream tasks, named entity recognition, relation classification, and abstract classification. We make the pre-trained weights of MatSciBERT publicly accessible for accelerated materials discovery and information extraction from materials science texts.
Published: 2022

5. Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions

Author: Danish Contractor, Barun Patra, null Mausam, and Parag Singla
Subjects: Conditional random field, Linguistics and Language, Vocabulary, Parsing, Computer science, business.industry, media_common.quotation_subject, 02 engineering and technology, computer.software_genre, Language and Linguistics, Task (project management), Domain (software engineering), Artificial Intelligence, 020204 information systems, Schema (psychology), 0202 electrical engineering, electronic engineering, information engineering, Question answering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Software, Sentence, Natural language processing, media_common
Abstract: We present the novel task of understanding multi-sentenceentity-seekingquestions (MSEQs), that is, the questions that may be expressed in multiple sentences, and that expect one or more entities as an answer. We formulate the problem of understanding MSEQs as a semantic labeling task over an open representation that makes minimal assumptions about schema or ontology-specific semantic vocabulary. At the core of our model, we use a BiLSTM (bidirectional LSTM) conditional random field (CRF), and to overcome the challenges of operating with low training data, we supplement it by using BERT embeddings, hand-designed features, as well as hard and soft constraints spanning multiple sentences. We find that this results in a 12–15 points gain over a vanilla BiLSTM CRF. We demonstrate the strengths of our work using the novel task of answering real-world entity-seeking questions from the tourism domain. The use of our labels helps answer 36% more questions with 35% more (relative) accuracy as compared to baselines. We also demonstrate how our framework can rapidly enable the parsing of MSEQs in an entirely new domain with small amounts of training data and little change in the semantic representation.
Published: 2020

6. Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence

Author: Guy Barash, Mauricio Castillo-Effen, Niyati Chhaya, Peter Clark, Huáscar Espinoza, Eitan Farchi, Christopher Geib, Odd Erik Gundersen, Seán HÉigeartaigh, José Hernández-Orallo, Chiori Hori, Xiaowei Huang, Kokil Jaidka, Pavan Kapanipathi, Sarah Keren, Seokhwan Kim, Marc Lanctot, Danny Lange, Julian McAuley, David Martinez, Marwan Mattar, null Mausam, Martin Michalowski, Reuth Mirsky, Roozbeh Mottaghi, Joseph Osborn, Julien Perolat, Martin Schmid, Arash Shaban-Nejad, Onn Shehory, Biplav Srivastava, William Streilein, Kartik Talamadupula, Julian Togelius, Koichiro Yoshino, Quanshi Zhang, and Imed Zitouni
Subjects: Computer science, business.industry, Deep learning, Robotics, Plan (drawing), Recommender system, computer.software_genre, Knowledge extraction, Artificial Intelligence, Reinforcement learning, Artificial intelligence, Dialog system, business, computer, Agile software development
Abstract: The workshop program of the Association for the Advancement of Artificial Intelligence’s 33rd Conference on Artificial Intelligence (AAAI-19) was held in Honolulu, Hawaii, on Sunday and Monday, January 27–28, 2019. There were fifteen workshops in the program: Affective Content Analysis: Modeling Affect-in-Action, Agile Robotics for Industrial Automation Competition, Artificial Intelligence for Cyber Security, Artificial Intelligence Safety, Dialog System Technology Challenge, Engineering Dependable and Secure Machine Learning Systems, Games and Simulations for Artificial Intelligence, Health Intelligence, Knowledge Extraction from Games, Network Interpretability for Deep Learning, Plan, Activity, and Intent Recognition, Reasoning and Learning for Human-Machine Dialogues, Reasoning for Complex Question Answering, Recommender Systems Meet Natural Language Processing, Reinforcement Learning in Games, and Reproducible AI. This report contains brief summaries of the all the workshops that were held.
Published: 2019

7. Joint Spatio-Textual Reasoning for Answering Tourism Questions

Author: Danish Contractor, Shashank Goel, null Mausam, and Parag Singla
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Information retrieval, Computer Science - Artificial Intelligence, business.industry, Computer science, Spatial intelligence, 02 engineering and technology, Semantic reasoner, Modular design, Task (project management), Artificial Intelligence (cs.AI), 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Question answering, 020201 artificial intelligence & image processing, Joint (building), business, Computation and Language (cs.CL), Tourism
Abstract: Our goal is to answer real-world tourism questions that seek Points-of-Interest (POI) recommendations. Such questions express various kinds of spatial and non-spatial constraints, necessitating a combination of textual and spatial reasoning. In response, we develop the first joint spatio-textual reasoning model, which combines geo-spatial knowledge with information in textual corpora to answer questions. We first develop a modular spatial-reasoning network that uses geo-coordinates of location names mentioned in a question, and of candidate answer POIs, to reason over only spatial constraints. We then combine our spatial-reasoner with a textual reasoner in a joint model and present experiments on a real world POI recommendation task. We report substantial improvements over existing models with-out joint spatio-textual reasoning., Updated version
Published: 2021

8. Constraint based Knowledge Base Distillation in End-to-End Task Oriented Dialogs

Author: Dinesh Raghu, Atishya Jain, null Mausam, and Sachindra Joshi
Subjects: FOS: Computer and information sciences, Structure (mathematical logic), Computer Science - Machine Learning, Multiset, Computer Science - Computation and Language, Correctness, business.industry, Computer science, Filter (signal processing), computer.software_genre, Machine Learning (cs.LG), Constraint (information theory), Knowledge base, Metric (mathematics), Artificial intelligence, Dialog box, business, Computation and Language (cs.CL), computer, Natural language processing
Abstract: End-to-End task-oriented dialogue systems generate responses based on dialog history and an accompanying knowledge base (KB). Inferring those KB entities that are most relevant for an utterance is crucial for response generation. Existing state of the art scales to large KBs by softly filtering over irrelevant KB information. In this paper, we propose a novel filtering technique that consists of (1) a pairwise similarity based filter that identifies relevant information by respecting the n-ary structure in a KB record. and, (2) an auxiliary loss that helps in separating contextually unrelated KB information. We also propose a new metric -- multiset entity F1 which fixes a correctness issue in the existing entity F1 metric. Experimental results on three publicly available task-oriented dialog datasets show that our proposed approach outperforms existing state-of-the-art models., D. Raghu and A. Jain contributed equally to this work
Published: 2021

9. OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction

Author: Keshav Kolluru, Vaibhav Adlakha, Samarth Aggarwal, null Mausam, and Soumen Chakrabarti
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computer science, 05 social sciences, 010501 environmental sciences, Grid, computer.software_genre, 01 natural sciences, Bridge (nautical), Sequence labeling, Task (project management), Information extraction, Computer engineering, 0502 economics and business, State (computer science), 050207 economics, Computation and Language (cs.CL), computer, 0105 earth and related environmental sciences
Abstract: A recent state-of-the-art neural open information extraction (OpenIE) system generates extractions iteratively, requiring repeated encoding of partial outputs. This comes at a significant computational cost. On the other hand, sequence labeling approaches for OpenIE are much faster, but worse in extraction quality. In this paper, we bridge this trade-off by presenting an iterative labeling-based system that establishes a new state of the art for OpenIE, while extracting 10x faster. This is achieved through a novel Iterative Grid Labeling (IGL) architecture, which treats OpenIE as a 2-D grid labeling task. We improve its performance further by applying coverage (soft) constraints on the grid at training time. Moreover, on observing that the best OpenIE systems falter at handling coordination structures, our OpenIE system also incorporates a new coordination analyzer built with the same IGL architecture. This IGL based coordination analyzer helps our OpenIE system handle complicated coordination structures, while also establishing a new state of the art on the task of coordination analysis, with a 12.3 pts improvement in F1 over previous analyzers. Our OpenIE system, OpenIE6, beats the previous systems by as much as 4 pts in F1, while being much faster., EMNLP 2020 (Long)
Published: 2020

10. IMoJIE: Iterative Memory-Based Joint Open Information Extraction

Author: Keshav Kolluru, Samarth Aggarwal, Vipul Rathore, null Mausam, and Soumen Chakrabarti
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computer science, business.industry, 05 social sciences, Pattern recognition, Extension (predicate logic), 010501 environmental sciences, computer.software_genre, 01 natural sciences, Information extraction, Task (computing), 0502 economics and business, Redundancy (engineering), State (computer science), Noise (video), Artificial intelligence, 050207 economics, Tuple, business, Computation and Language (cs.CL), computer, Sentence, 0105 earth and related environmental sciences
Abstract: While traditional systems for Open Information Extraction were statistical and rule-based, recently neural models have been introduced for the task. Our work builds upon CopyAttention, a sequence generation OpenIE model (Cui et. al., 2018). Our analysis reveals that CopyAttention produces a constant number of extractions per sentence, and its extracted tuples often express redundant information. We present IMoJIE, an extension to CopyAttention, which produces the next extraction conditioned on all previously extracted tuples. This approach overcomes both shortcomings of CopyAttention, resulting in a variable number of diverse extractions per sentence. We train IMoJIE on training data bootstrapped from extractions of several non-neural systems, which have been automatically filtered to reduce redundancy and noise. IMoJIE outperforms CopyAttention by about 18 F1 pts, and a BERT-based strong baseline by 2 F1 pts, establishing a new state of the art for the task.
Published: 2020

11. Temporal Knowledge Base Completion: New Algorithms and Evaluation Protocols

Author: Prachi Jain, Sushant Rathi, null Mausam, and Soumen Chakrabarti
Subjects: Social and Information Networks (cs.SI), FOS: Computer and information sciences, Computer Science - Machine Learning, business.industry, Computer science, 010102 general mathematics, Computer Science - Social and Information Networks, 010501 environmental sciences, Space (commercial competition), Machine learning, computer.software_genre, 01 natural sciences, Machine Learning (cs.LG), Knowledge base, Artificial intelligence, 0101 mathematics, business, Link (knot theory), computer, 0105 earth and related environmental sciences
Abstract: Temporal knowledge bases associate relational (s,r,o) triples with a set of times (or a single time instant) when the relation is valid. While time-agnostic KB completion (KBC) has witnessed significant research, temporal KB completion (TKBC) is in its early days. In this paper, we consider predicting missing entities (link prediction) and missing time intervals (time prediction) as joint TKBC tasks where entities, relations, and time are all embedded in a uniform, compatible space. We present TIMEPLEX, a novel time-aware KBC method, that also automatically exploits the recurrent nature of some relations and temporal interactions between pairs of relations. TIMEPLEX achieves state-of-the-art performance on both prediction tasks. We also find that existing TKBC models heavily overestimate link prediction performance due to imperfect evaluation mechanisms. In response, we propose improved TKBC evaluation protocols for both link and time prediction tasks, dealing with subtle issues that arise from the partial overlap of time intervals in gold instances and system predictions.
Published: 2020

12. Unsupervised Learning of KB Queries in Task-Oriented Dialogs

Author: Dinesh Raghu, Nikhil Gupta, and null Mausam
Subjects: FOS: Computer and information sciences, Linguistics and Language, Computer Science - Machine Learning, Information retrieval, Computer Science - Computation and Language, Computer science, business.industry, Communication, Machine Learning (stat.ML), Machine Learning (cs.LG), Computer Science Applications, Human-Computer Interaction, Annotation, Task (computing), Knowledge base, Artificial Intelligence, Statistics - Machine Learning, Task oriented, Reinforcement learning, Unsupervised learning, State (computer science), Dialog box, business, Computation and Language (cs.CL)
Abstract: Task-oriented dialog (TOD) systems often need to formulate knowledge base (KB) queries corresponding to the user intent and use the query results to generate system responses. Existing approaches require dialog datasets to explicitly annotate these KB queries -- these annotations can be time consuming, and expensive. In response, we define the novel problems of predicting the KB query and training the dialog agent, without explicit KB query annotation. For query prediction, we propose a reinforcement learning (RL) baseline, which rewards the generation of those queries whose KB results cover the entities mentioned in subsequent dialog. Further analysis reveals that correlation among query attributes in KB can significantly confuse memory augmented policy optimization (MAPO), an existing state of the art RL agent. To address this, we improve the MAPO baseline with simple but important modifications suited to our task. To train the full TOD system for our setting, we propose a pipelined approach: it independently predicts when to make a KB query (query position predictor), then predicts a KB query at the predicted position (query predictor), and uses the results of predicted query in subsequent dialog (next response predictor). Overall, our work proposes first solutions to our novel problem, and our analysis highlights the research challenges in training TOD systems without query annotation., Presented at ACL 2021
Published: 2020

13. Why and when should you pool? Analyzing Pooling in Recurrent Architectures

Author: Pratyush Maini, Keshav Kolluru, Danish Pruthi, and null Mausam
Subjects: FOS: Computer and information sciences, Sequence, Computer Science - Machine Learning, Computer Science - Computation and Language, Computer science, business.industry, 05 social sciences, Pooling, 010501 environmental sciences, Machine learning, computer.software_genre, 01 natural sciences, Machine Learning (cs.LG), 0502 economics and business, Artificial intelligence, 050207 economics, business, computer, Computation and Language (cs.CL), 0105 earth and related environmental sciences
Abstract: Pooling-based recurrent neural architectures consistently outperform their counterparts without pooling. However, the reasons for their enhanced performance are largely unexamined. In this work, we examine three commonly used pooling techniques (mean-pooling, max-pooling, and attention), and propose max-attention, a novel variant that effectively captures interactions among predictive tokens in a sentence. We find that pooling-based architectures substantially differ from their non-pooling equivalents in their learning ability and positional biases--which elucidate their performance benefits. By analyzing the gradient propagation, we discover that pooling facilitates better gradient flow compared to BiLSTMs. Further, we expose how BiLSTMs are positionally biased towards tokens in the beginning and the end of a sequence. Pooling alleviates such biases. Consequently, we identify settings where pooling offers large benefits: (i) in low resource scenarios, and (ii) when important words lie towards the middle of the sentence. Among the pooling techniques studied, max-attention is the most effective, resulting in significant performance gains on several text classification tasks., Accepted to Findings of EMNLP 2020, to be presented at BlackBoxNLP. Updated Version
Published: 2020

14. A Simple Yet Strong Pipeline for HotpotQA

Author: Dirk Groeneveld, Tushar Khot, null Mausam, and Ashish Sabharwal
Subjects: FOS: Computer and information sciences, Theoretical computer science, Computer Science - Computation and Language, Computer science, Context (language use), 02 engineering and technology, computer.software_genre, Pipeline (software), 03 medical and health sciences, Identification (information), 0302 clinical medicine, Named-entity recognition, 030221 ophthalmology & optometry, 0202 electrical engineering, electronic engineering, information engineering, Question answering, Graph (abstract data type), 020201 artificial intelligence & image processing, Language model, Computation and Language (cs.CL), computer, Sentence
Abstract: State-of-the-art models for multi-hop question answering typically augment large-scale language models like BERT with additional, intuitively useful capabilities such as named entity recognition, graph-based reasoning, and question decomposition. However, does their strong performance on popular multi-hop datasets really justify this added design complexity? Our results suggest that the answer may be no, because even our simple pipeline based on BERT, named \model, performs surprisingly well. Specifically, on HotpotQA, Quark outperforms these models on both question answering and support identification (and achieves performance very close to a RoBERTa model). Our pipeline has three steps: 1) use BERT to identify potentially relevant sentences \emph{independently} of each other; 2) feed the set of selected sentences as context into a standard BERT span prediction model to choose an answer; and 3) use the sentence selection model, now with the chosen answer, to produce supporting sentences. The strong performance of Quark resurfaces the importance of carefully exploring simple model designs before using popular benchmarks to justify the value of complex techniques.
Published: 2020

15. [Untitled]

Author: Dinesh Raghu, Nikhil Gupta, and null Mausam
Subjects: Measure (data warehouse), Artificial neural network, Computer science, business.industry, 02 engineering and technology, 010501 environmental sciences, computer.software_genre, 01 natural sciences, Boss, Knowledge base, 0202 electrical engineering, electronic engineering, information engineering, Task oriented, 020201 artificial intelligence & image processing, Language model, Artificial intelligence, business, computer, Natural language processing, 0105 earth and related environmental sciences, Interpretability
Abstract: The Knowledge Base (KB) used for real-world applications, such as booking a movie or restaurant reservation, keeps changing over time. End-to-end neural networks trained for these task-oriented dialogs are expected to be immune to any changes in the KB. However, existing approaches breakdown when asked to handle such changes. We propose an encoder-decoder architecture (BoSsNet) with a novel Bag-of-Sequences (BoSs) memory, which facilitates the disentangled learning of the response’s language model and its knowledge incorporation. Consequently, the KB can be modified with new knowledge without a drop in interpretability. We find that BoSsNeT outperforms state-of-the-art models, with considerable improvements (>10%) on bAbI OOV test sets and other human-human datasets. We also systematically modify existing datasets to measure disentanglement and show BoSsNeT to be robust to KB modifications.
Published: 2019

16. Sprout

Author: Jonathan Bragg, null Mausam, and Daniel S. Weld
Subjects: Iterative and incremental development, Data collection, Process (engineering), business.industry, Computer science, 05 social sciences, Job design, 020207 software engineering, 02 engineering and technology, Crowdsourcing, Task (project management), Workflow, Human–computer interaction, 0202 electrical engineering, electronic engineering, information engineering, 0501 psychology and cognitive sciences, business, Set (psychology), 050107 human factors
Abstract: While crowdsourcing enables data collection at scale, ensuring high-quality data remains a challenge. In particular, effective task design underlies nearly every reported crowdsourcing success, yet remains difficult to accomplish. Task design is hard because it involves a costly iterative process: identifying the kind of work output one wants, conveying this information to workers, observing worker performance, understanding what remains ambiguous, revising the instructions, and repeating the process until the resulting output is satisfactory. To facilitate this process, we propose a novel meta-workflow that helps requesters optimize crowdsourcing task designs and Sprout, our open-source tool, which implements this workflow. Sprout improves task designs by (1) eliciting points of confusion from crowd workers, (2) enabling requesters to quickly understand these misconceptions and the overall space of questions, and (3) guiding requesters to improve the task design in response. We report the results of a user study with two labeling tasks demonstrating that requesters strongly prefer Sprout and produce higher-rated instructions compared to current best practices for creating gated instructions (instructions plus a workflow for training and testing workers). We also offer a set of design recommendations for future tools that support crowdsourcing task design.
Published: 2018

17. Type-Sensitive Knowledge Base Inference Without Explicit Type Supervision

Author: Prachi Jain, Pankaj Kumar, null Mausam, and Soumen Chakrabarti
Subjects: Relation (database), Computer science, business.industry, 010401 analytical chemistry, Inference, 02 engineering and technology, Type (model theory), Base (topology), Machine learning, computer.software_genre, 01 natural sciences, 0104 chemical sciences, Knowledge base, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer
Abstract: State-of-the-art knowledge base completion (KBC) models predict a score for every known or unknown fact via a latent factorization over entity and relation embeddings. We observe that when they fail, they often make entity predictions that are incompatible with the type required by the relation. In response, we enhance each base factorization with two type-compatibility terms between entity-relation pairs, and combine the signals in a novel manner. Without explicit supervision from a type catalog, our proposed modification obtains up to 7% MRR gains over base models, and new state-of-the-art results on several datasets. Further analysis reveals that our models better represent the latent types of entities and their embeddings also predict supervised types better than the embeddings fitted by baseline models.
Published: 2018

18. Coarse-to-Fine Lifted MAP Inference in Computer Vision

Author: Haroun Habeeb, Ankit Anand, null Mausam, and Parag Singla
Subjects: FOS: Computer and information sciences, Pixel, Unary operation, Lift (data mining), business.industry, Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Inference, Image segmentation, Line (geometry), Node (circuits), Computer vision, Artificial intelligence, Graphical model, business
Abstract: There is a vast body of theoretical research on lifted inference in probabilistic graphical models (PGMs). However, few demonstrations exist where lifting is applied in conjunction with top of the line applied algorithms. We pursue the applicability of lifted inference for computer vision (CV), with the insight that a globally optimal (MAP) labeling will likely have the same label for two symmetric pixels. The success of our approach lies in efficiently handling a distinct unary potential on every node (pixel), typical of CV applications. This allows us to lift the large class of algorithms that model a CV problem via PGM inference. We propose a generic template for coarse-to-fine (C2F) inference in CV, which progressively refines an initial coarsely lifted PGM for varying quality-time trade-offs. We demonstrate the performance of C2F inference by developing lifted versions of two near state-of-the-art CV algorithms for stereo vision and interactive image segmentation. We find that, against flat algorithms, the lifted versions have a much superior anytime performance, without any loss in final solution quality., Published in IJCAI 2017
Published: 2017

19. Bootstrapping for Numerical Open IE

Author: Swarnadeep Saha, Harinder Pal, and null Mausam
Subjects: Phrase, Dependency (UML), Relation (database), Computer science, business.industry, Bootstrapping (linguistics), Context (language use), 02 engineering and technology, computer.software_genre, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, Tuple, business, computer, Natural language processing, Sentence, Word (computer architecture)
Abstract: We design and release BONIE, the first open numerical relation extractor, for extracting Open IE tuples where one of the arguments is a number or a quantity-unit phrase. BONIE uses bootstrapping to learn the specific dependency patterns that express numerical relations in a sentence. BONIE’s novelty lies in task-specific customizations, such as inferring implicit relations, which are clear due to context such as units (for e.g., ‘square kilometers’ suggests area, even if the word ‘area’ is missing in the sentence). BONIE obtains 1.5x yield and 15 point precision gain on numerical facts over a state-of-the-art Open IE system.
Published: 2017

20. Modeling Missing Data in Distant Supervision for Information Extraction

Author: Alan Ritter, Luke Zettlemoyer, null Mausam, and Oren Etzioni
Subjects: Linguistics and Language, Unary operation, Computer science, business.industry, Communication, Inference, Binary number, Missing data, computer.software_genre, Machine learning, Relationship extraction, Computer Science Applications, Human-Computer Interaction, Information extraction, Artificial Intelligence, Artificial intelligence, Precision and recall, Heuristics, business, computer
Abstract: Distant supervision algorithms learn information extraction models given only large readily available databases and text collections. Most previous work has used heuristics for generating labeled data, for example assuming that facts not contained in the database are not mentioned in the text, and facts in the database must be mentioned at least once. In this paper, we propose a new latent-variable approach that models missing data. This provides a natural way to incorporate side information, for instance modeling the intuition that text will often mention rare entities which are likely to be missing in the database. Despite the added complexity introduced by reasoning about missing data, we demonstrate that a carefully designed local search approach to inference is very accurate and scales to large datasets. Experiments demonstrate improved performance for binary and unary relation extraction when compared to learning with heuristic labels, including on average a 27% increase in area under the precision recall curve in the binary case.
Published: 2013

21. POMDP-based control of workflows for crowdsourcing

Author: Peng Dai, Christopher H. Lin, null Mausam, and Daniel S. Weld
Subjects: Linguistics and Language, business.industry, Process (engineering), Computer science, Partially observable Markov decision process, Bayesian network, Crowdsourcing, Machine learning, computer.software_genre, Language and Linguistics, Task (project management), Outsourcing, Workflow, Artificial Intelligence, Markov decision process, Artificial intelligence, business, computer
Abstract: Crowdsourcing, outsourcing of tasks to a crowd of unknown people ("workers") in an open call, is rapidly rising in popularity. It is already being heavily used by numerous employers ("requesters") for solving a wide variety of tasks, such as audio transcription, content screening, and labeling training data for machine learning. However, quality control of such tasks continues to be a key challenge because of the high variability in worker quality. In this paper we show the value of decision-theoretic techniques for the problem of optimizing workflows used in crowdsourcing. In particular, we design AI agents that use Bayesian network learning and inference in combination with Partially-Observable Markov Decision Processes (POMDPs) for obtaining excellent cost-quality tradeoffs. We use these techniques for three distinct crowdsourcing scenarios: (1) control of voting to answer a binary-choice question, (2) control of an iterative improvement workflow, and (3) control of switching between alternate workflows for a task. In each scenario, we design a Bayes net model that relates worker competency, task difficulty and worker response quality. We also design a POMDP for each task, whose solution provides the dynamic control policy. We demonstrate the usefulness of our models and agents in live experiments on Amazon Mechanical Turk. We consistently achieve superior quality results than non-adaptive controllers, while incurring equal or less cost.
Published: 2013

22. Knowledge-Guided Linguistic Rewrites for Inference Rule Verification

Author: Prachi Jain and null Mausam
Subjects: Point (typography), Relation (database), Computer science, media_common.quotation_subject, 05 social sciences, 010501 environmental sciences, 01 natural sciences, Linguistics, 0502 economics and business, Quality (business), 050207 economics, Rule of inference, 0105 earth and related environmental sciences, media_common
Abstract: A corpus of inference rules between a pair of relation phrases is typically generated using the statistical overlap of argument-pairs associated with the relations (e.g., PATTY, CLEAN). We investigate knowledge-guided linguistic rewrites as a secondary source of evidence and find that they can vastly improve the quality of inference rule corpora, obtaining 27 to 33 point precision improvement while retaining substantial recall. The facts inferred using cleaned inference rules are 29-32 points more accurate.
Published: 2016

23. Entity-balanced Gaussian pLSA for Automated Comparison

Author: Danish Contractor, Parag Singla, and null Mausam
Subjects: Probabilistic latent semantic analysis, Computer science, business.industry, Gaussian, Pattern recognition, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, symbols.namesake, 0202 electrical engineering, electronic engineering, information engineering, symbols, 020201 artificial intelligence & image processing, Artificial intelligence, business, 0105 earth and related environmental sciences
Published: 2016

24. Planning with Markov Decision Processes: An AI Perspective

Author: null Mausam and Andrey Kolobov
Subjects: Artificial Intelligence
Published: 2012

25. Panlingual lexical translation via probabilistic inference

Author: null Mausam, Stephen Soderland, Oren Etzioni, Daniel S. Weld, Kobi Reiter, Michael Skinner, Marcus Sammer, and Jeff Bilmes
Subjects: Linguistics and Language, Computer science, business.industry, Transitive inference, Inference, Probabilistic inference, computer.software_genre, Lexical translation, Language and Linguistics, Rule-based machine translation, Multilinguality, Artificial Intelligence, Graph (abstract data type), Pairwise comparison, Compiler, Artificial intelligence, Precision and recall, business, computer, Natural language processing
Abstract: This paper introduces a novel approach to the task of lexical translation between languages for which no translation dictionaries are available. We build a massive translation graph, automatically constructed from over 630 machine-readable dictionaries and Wiktionaries. In this graph each node denotes a word in some language and each edge (vi,vj) denotes a word sense shared by vi and vj. Our current graph contains over 10,000,000 nodes and expresses more than 60,000,000 pairwise translations.The composition of multiple translation dictionaries leads to a transitive inference problem: if word A translates to word B which in turn translates to word C, what is the probability that C is a translation of A? The paper describes a series of probabilistic inference algorithms that solve this problem at varying precision and recall levels. All algorithms enable us to quantify our confidence in a translation derived from the graph, and thus trade precision for recall.We compile the results of our best inference algorithm to yield PanDictionary, a novel multilingual dictionary. PanDictionary contains more than four times as many translations as in the largest Wiktionary at precision 0.90 and over 200,000,000 pairwise translations in over 200,000 language pairs at precision 0.8.
Published: 2010
Full Text: View/download PDF

26. Planning with Durative Actions in Stochastic Domains

Author: D. S. Weld and null Mausam
Subjects: Mathematical optimization, business.industry, Closeness, Probabilistic logic, Action (philosophy), Artificial Intelligence, Key (cryptography), Pruning (decision trees), Markov decision process, Artificial intelligence, State (computer science), Heuristics, business, Mathematics
Abstract: Probabilistic planning problems are typically modeled as a Markov Decision Process (MDP). MDPs, while an otherwise expressive model, allow only for sequential, non-durative actions. This poses severe restrictions in modeling and solving a real world planning problem. We extend the MDP model to incorporate 1) simultaneous action execution, 2) durative actions, and 3) stochastic durations. We develop several algorithms to combat the computational explosion introduced by these features. The key theoretical ideas used in building these algorithms are -- modeling a complex problem as an MDP in extended state/action space, pruning of irrelevant actions, sampling of relevant actions, using informed heuristics to guide the search, hybridizing different planners to achieve benefits of both, approximating the problem and replanning. Our empirical evaluation illuminates the different merits in using various algorithms, viz., optimality, empirical closeness to optimality, theoretical error bounds, and speed.
Published: 2008

27. Hierarchical Summarization: Scaling Up Multi-Document Summarization

Author: Janara Christensen, Stephen Soderland, Gagan Bansal, and null Mausam
Subjects: Hierarchy, Information retrieval, Salient, Computer science, Multi-document summarization, Timeline, Coherence (statistics), Automatic summarization, Scaling
Abstract: Multi-document summarization (MDS) systems have been designed for short, unstructured summaries of 10-15 documents, and are inadequate for larger document collections. We propose a new approach to scaling up summarization called hierarchical summarization, and present the first implemented system, SUMMA. SUMMA produces a hierarchy of relatively short summaries, in which the top level provides a general overview and users can navigate the hierarchy to drill down for more details on topics of interest. SUMMA optimizes for coherence as well as coverage of salient information. In an Amazon Mechanical Turk evaluation, users prefered SUMMA ten times as often as flat MDS and three times as often as timelines.
Published: 2014

28. Open domain event extraction from twitter

Author: Alan Ritter, null Mausam, Oren Etzioni, and Sam Clark
Subjects: Information extraction, Categorization, Computer science, business.industry, Event (computing), Aggregate (data warehouse), Open domain, Artificial intelligence, business, computer.software_genre, Machine learning, computer, Natural language processing
Abstract: Tweets are the most up-to-date and inclusive stream of in- formation and commentary on current events, but they are also fragmented and noisy, motivating the need for systems that can extract, aggregate and categorize important events. Previous work on extracting structured representations of events has focused largely on newswire text; Twitter's unique characteristics present new challenges and opportunities for open-domain event extraction. This paper describes TwiCal-- the first open-domain event-extraction and categorization system for Twitter. We demonstrate that accurately extracting an open-domain calendar of significant events from Twitter is indeed feasible. In addition, we present a novel approach for discovering important event categories and classifying extracted events based on latent variable models. By leveraging large volumes of unlabeled data, our approach achieves a 14% increase in maximum F1 over a supervised baseline. A continuously updating demonstration of our system can be viewed at http://statuscalendar.com; Our NLP tools are available at http://github.com/aritter/ twitter_nlp.
Published: 2012

29. Planning with Markov Decision Processes

Author: null Mausam and Andrey Kolobov
Published: 2012

30. Execution control for crowdsourcing

Author: Daniel S. Weld, null Mausam, and Peng Dai
Subjects: Computer science, business.industry, Decision theory, media_common.quotation_subject, Partially observable Markov decision process, Machine learning, computer.software_genre, Crowdsourcing, Task (project management), Set (abstract data type), Range (mathematics), Workflow, Quality (business), Artificial intelligence, business, computer, media_common
Abstract: Crowdsourcing marketplaces enable a wide range of applications, but constructing any new application is challenging - usually requiring a complex, self-managing workflow in order to guarantee quality results. We report on the CLOWDER project, which uses machine learning to continually refine models of worker performance and task difficulty. We present decision-theoretic optimization techniques that can select the best parameters for a range of workflows. Initial experiments show our optimized workflows are significantly more economical than with manually set parameters.
Published: 2011

31. An analysis of open information extraction based on semantic role labeling

Author: Janara Christensen, null Mausam, Stephen Soderland, and Oren Etzioni
Subjects: Vocabulary, Parsing, Exploit, Computer science, business.industry, media_common.quotation_subject, computer.software_genre, PropBank, Information extraction, Task (computing), Semantic role labeling, Isolation (database systems), Artificial intelligence, business, computer, Natural language processing, media_common
Abstract: Open Information Extraction extracts relations from text without requiring a pre-specified domain or vocabulary. While existing techniques have used only shallow syntactic features, we investigate the use of semantic role labeling techniques for the task of Open IE. Semantic role labeling (SRL) and Open IE, although developed mostly in isolation, are quite related. We compare SRL-based open extractors, which perform computationally expensive, deep syntactic analysis, with TextRunner, an open extractor, which uses shallow syntactic analysis but is able to analyze many more sentences in a fixed amount of time and thus exploit corpus-level statistics. Our evaluation answers questions regarding these systems, including, can SRL extractors, which are trained on PropBank, cope with heterogeneous text found on the Web? Which extractor attains better precision, recall, f-measure, or running time? How does extractor performance vary for binary, n-ary and nested relations? How much do we gain by running multiple extractors? How do we select the optimal extractor given amount of data, available time, types of extractions desired?
Published: 2011

32. Adversarial classification

Author: Nilesh Dalvi, Pedro Domingos, null Mausam, Sumit Sanghai, and Deepak Verma
Subjects: business.industry, Computer science, Intrusion detection system, Adversary, Machine learning, computer.software_genre, Naive Bayes classifier, ComputingMethodologies_PATTERNRECOGNITION, Margin classifier, Artificial intelligence, Data mining, business, computer, Game theory, Classifier (UML), Adversary model
Abstract: Essentially all data mining algorithms assume that the data-generating process is independent of the data miner's activities. However, in many domains, including spam detection, intrusion detection, fraud detection, surveillance and counter-terrorism, this is far from the case: the data is actively manipulated by an adversary seeking to make the classifier produce false negatives. In these domains, the performance of a classifier can degrade rapidly after it is deployed, as the adversary learns to defeat it. Currently the only solution to this is repeated, manual, ad hoc reconstruction of the classifier. In this paper we develop a formal framework and algorithms for this problem. We view classification as a game between the classifier and the adversary, and produce a classifier that is optimal given the adversary's optimal strategy. Experiments in a spam detection domain show that this approach can greatly outperform a classifier learned in the standard way, and (within the parameters of the problem) automatically adapt the classifier to the adversary's evolving manipulations.
Published: 2004

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

32 results on '"null Mausam"'

1. Welcome back!

2. ToolTango: Common sense Generalization in Predicting Sequential Tool Interactions for Robot Plan Synthesis

3. Focused Topological Value Iteration

4. MatSciBERT: A materials domain language model for text mining and information extraction

5. Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions

6. Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence

7. Joint Spatio-Textual Reasoning for Answering Tourism Questions

8. Constraint based Knowledge Base Distillation in End-to-End Task Oriented Dialogs

9. OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction

10. IMoJIE: Iterative Memory-Based Joint Open Information Extraction

11. Temporal Knowledge Base Completion: New Algorithms and Evaluation Protocols

12. Unsupervised Learning of KB Queries in Task-Oriented Dialogs

13. Why and when should you pool? Analyzing Pooling in Recurrent Architectures

14. A Simple Yet Strong Pipeline for HotpotQA

15. [Untitled]

16. Sprout

17. Type-Sensitive Knowledge Base Inference Without Explicit Type Supervision

18. Coarse-to-Fine Lifted MAP Inference in Computer Vision

19. Bootstrapping for Numerical Open IE

20. Modeling Missing Data in Distant Supervision for Information Extraction

21. POMDP-based control of workflows for crowdsourcing

22. Knowledge-Guided Linguistic Rewrites for Inference Rule Verification

23. Entity-balanced Gaussian pLSA for Automated Comparison

24. Planning with Markov Decision Processes: An AI Perspective

25. Panlingual lexical translation via probabilistic inference

26. Planning with Durative Actions in Stochastic Domains

27. Hierarchical Summarization: Scaling Up Multi-Document Summarization

28. Open domain event extraction from twitter

29. Planning with Markov Decision Processes

30. Execution control for crowdsourcing

31. An analysis of open information extraction based on semantic role labeling

32. Adversarial classification

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

32 results on '"null Mausam"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources