32 results on '"null Mausam"'
Search Results
2. ToolTango: Common sense Generalization in Predicting Sequential Tool Interactions for Robot Plan Synthesis
- Author
-
Shreshth Tuli, Rajas Bansal, Rohan Paul, and null Mausam
- Subjects
FOS: Computer and information sciences ,Computer Science - Robotics ,Artificial Intelligence ,Robotics (cs.RO) - Abstract
Robots assisting us in environments such as factories or homes must learn to make use of objects as tools to perform tasks, for instance using a tray to carry objects. We consider the problem of learning commonsense knowledge of when a tool may be useful and how its use may be composed with other tools to accomplish a high-level task instructed by a human. Specifically, we introduce a novel neural model, termed TOOLTANGO, that first predicts the next tool to be used, and then uses this information to predict the next action. We show that this joint model can inform learning of a fine-grained policy enabling the robot to use a particular tool in sequence and adds a significant value in making the model more accurate. TOOLTANGO encodes the world state, comprising objects and symbolic relationships between them, using a graph neural network and is trained using demonstrations from human teachers instructing a virtual robot in a physics simulator. The model learns to attend over the scene using knowledge of the goal and the action history, finally decoding the symbolic action to execute. Crucially, we address generalization to unseen environments where some known tools are missing, but alternative unseen tools are present. We show that by augmenting the representation of the environment with pre-trained embeddings derived from a knowledge-base, the model can generalize effectively to novel environments. Experimental results show at least 48.8-58.1% absolute improvement over the baselines in predicting successful symbolic plans for a simulated mobile manipulator in novel environments with unseen objects. This work takes a step in the direction of enabling robots to rapidly synthesize robust plans for complex tasks, particularly in novel settings, Accepted in Journal of AI Research. arXiv admin note: substantial text overlap with arXiv:2105.04556
- Published
- 2022
3. Focused Topological Value Iteration
- Author
-
Peng Dai, null Mausam, and Daniel Weld
- Abstract
Topological value iteration (TVI) is an effective algorithm for solving Markov decision processes (MDPs) optimally, which 1) divides an MDP into strongly-connected components, and 2) solves these components sequentially. Yet, TVI’s usefulness tends to degrade if an MDP has large components, because the cost of the division process isn’t offset by gains during solution. This paper presents a new algorithm to solve MDPs optimally, focused topological value iteration (FTVI). FTVI addresses TVI’s limitations by restricting its attention to connected components that are relevant for solving the MDP. Specifically, FTVI uses a small amount of heuristic search to eliminate provably sub-optimal actions; this pruning allows FTVI to find smaller connected components, thus running faster. We demonstrate that our new algorithm outperforms TVI by an order of magnitude, averaged across several domains. Surprisingly, FTVI also significantly outperforms popular ‘heuristically-informed’ MDP algorithms such as LAO*, LRTDP, and BRTDP in many domains, sometimes by as much as two orders of magnitude. Finally, we characterize the type of domains where FTVI excels — suggesting a way to an informed choice of solver.
- Published
- 2021
4. MatSciBERT: A materials domain language model for text mining and information extraction
- Author
-
Tanishq Gupta, Mohd Zaki, N. M. Anoop Krishnan, and null Mausam
- Subjects
Mechanics of Materials ,Modeling and Simulation ,General Materials Science ,Computer Science Applications - Abstract
A large amount of materials science knowledge is generated and stored as text published in peer-reviewed scientific literature. While recent developments in natural language processing, such as Bidirectional Encoder Representations from Transformers (BERT) models, provide promising information extraction tools, these models may yield suboptimal results when applied on materials domain since they are not trained in materials science specific notations and jargons. Here, we present a materials-aware language model, namely, MatSciBERT, trained on a large corpus of peer-reviewed materials science publications. We show that MatSciBERT outperforms SciBERT, a language model trained on science corpus, and establish state-of-the-art results on three downstream tasks, named entity recognition, relation classification, and abstract classification. We make the pre-trained weights of MatSciBERT publicly accessible for accelerated materials discovery and information extraction from materials science texts.
- Published
- 2022
5. Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions
- Author
-
Danish Contractor, Barun Patra, null Mausam, and Parag Singla
- Subjects
Conditional random field ,Linguistics and Language ,Vocabulary ,Parsing ,Computer science ,business.industry ,media_common.quotation_subject ,02 engineering and technology ,computer.software_genre ,Language and Linguistics ,Task (project management) ,Domain (software engineering) ,Artificial Intelligence ,020204 information systems ,Schema (psychology) ,0202 electrical engineering, electronic engineering, information engineering ,Question answering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Software ,Sentence ,Natural language processing ,media_common - Abstract
We present the novel task of understanding multi-sentenceentity-seekingquestions (MSEQs), that is, the questions that may be expressed in multiple sentences, and that expect one or more entities as an answer. We formulate the problem of understanding MSEQs as a semantic labeling task over an open representation that makes minimal assumptions about schema or ontology-specific semantic vocabulary. At the core of our model, we use a BiLSTM (bidirectional LSTM) conditional random field (CRF), and to overcome the challenges of operating with low training data, we supplement it by using BERT embeddings, hand-designed features, as well as hard and soft constraints spanning multiple sentences. We find that this results in a 12–15 points gain over a vanilla BiLSTM CRF. We demonstrate the strengths of our work using the novel task of answering real-world entity-seeking questions from the tourism domain. The use of our labels helps answer 36% more questions with 35% more (relative) accuracy as compared to baselines. We also demonstrate how our framework can rapidly enable the parsing of MSEQs in an entirely new domain with small amounts of training data and little change in the semantic representation.
- Published
- 2020
6. Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence
- Author
-
Guy Barash, Mauricio Castillo-Effen, Niyati Chhaya, Peter Clark, Huáscar Espinoza, Eitan Farchi, Christopher Geib, Odd Erik Gundersen, Seán HÉigeartaigh, José Hernández-Orallo, Chiori Hori, Xiaowei Huang, Kokil Jaidka, Pavan Kapanipathi, Sarah Keren, Seokhwan Kim, Marc Lanctot, Danny Lange, Julian McAuley, David Martinez, Marwan Mattar, null Mausam, Martin Michalowski, Reuth Mirsky, Roozbeh Mottaghi, Joseph Osborn, Julien Perolat, Martin Schmid, Arash Shaban-Nejad, Onn Shehory, Biplav Srivastava, William Streilein, Kartik Talamadupula, Julian Togelius, Koichiro Yoshino, Quanshi Zhang, and Imed Zitouni
- Subjects
Computer science ,business.industry ,Deep learning ,Robotics ,Plan (drawing) ,Recommender system ,computer.software_genre ,Knowledge extraction ,Artificial Intelligence ,Reinforcement learning ,Artificial intelligence ,Dialog system ,business ,computer ,Agile software development - Abstract
The workshop program of the Association for the Advancement of Artificial Intelligence’s 33rd Conference on Artificial Intelligence (AAAI-19) was held in Honolulu, Hawaii, on Sunday and Monday, January 27–28, 2019. There were fifteen workshops in the program: Affective Content Analysis: Modeling Affect-in-Action, Agile Robotics for Industrial Automation Competition, Artificial Intelligence for Cyber Security, Artificial Intelligence Safety, Dialog System Technology Challenge, Engineering Dependable and Secure Machine Learning Systems, Games and Simulations for Artificial Intelligence, Health Intelligence, Knowledge Extraction from Games, Network Interpretability for Deep Learning, Plan, Activity, and Intent Recognition, Reasoning and Learning for Human-Machine Dialogues, Reasoning for Complex Question Answering, Recommender Systems Meet Natural Language Processing, Reinforcement Learning in Games, and Reproducible AI. This report contains brief summaries of the all the workshops that were held.
- Published
- 2019
7. Joint Spatio-Textual Reasoning for Answering Tourism Questions
- Author
-
Danish Contractor, Shashank Goel, null Mausam, and Parag Singla
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Information retrieval ,Computer Science - Artificial Intelligence ,business.industry ,Computer science ,Spatial intelligence ,02 engineering and technology ,Semantic reasoner ,Modular design ,Task (project management) ,Artificial Intelligence (cs.AI) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Question answering ,020201 artificial intelligence & image processing ,Joint (building) ,business ,Computation and Language (cs.CL) ,Tourism - Abstract
Our goal is to answer real-world tourism questions that seek Points-of-Interest (POI) recommendations. Such questions express various kinds of spatial and non-spatial constraints, necessitating a combination of textual and spatial reasoning. In response, we develop the first joint spatio-textual reasoning model, which combines geo-spatial knowledge with information in textual corpora to answer questions. We first develop a modular spatial-reasoning network that uses geo-coordinates of location names mentioned in a question, and of candidate answer POIs, to reason over only spatial constraints. We then combine our spatial-reasoner with a textual reasoner in a joint model and present experiments on a real world POI recommendation task. We report substantial improvements over existing models with-out joint spatio-textual reasoning., Updated version
- Published
- 2021
8. Constraint based Knowledge Base Distillation in End-to-End Task Oriented Dialogs
- Author
-
Dinesh Raghu, Atishya Jain, null Mausam, and Sachindra Joshi
- Subjects
FOS: Computer and information sciences ,Structure (mathematical logic) ,Computer Science - Machine Learning ,Multiset ,Computer Science - Computation and Language ,Correctness ,business.industry ,Computer science ,Filter (signal processing) ,computer.software_genre ,Machine Learning (cs.LG) ,Constraint (information theory) ,Knowledge base ,Metric (mathematics) ,Artificial intelligence ,Dialog box ,business ,Computation and Language (cs.CL) ,computer ,Natural language processing - Abstract
End-to-End task-oriented dialogue systems generate responses based on dialog history and an accompanying knowledge base (KB). Inferring those KB entities that are most relevant for an utterance is crucial for response generation. Existing state of the art scales to large KBs by softly filtering over irrelevant KB information. In this paper, we propose a novel filtering technique that consists of (1) a pairwise similarity based filter that identifies relevant information by respecting the n-ary structure in a KB record. and, (2) an auxiliary loss that helps in separating contextually unrelated KB information. We also propose a new metric -- multiset entity F1 which fixes a correctness issue in the existing entity F1 metric. Experimental results on three publicly available task-oriented dialog datasets show that our proposed approach outperforms existing state-of-the-art models., D. Raghu and A. Jain contributed equally to this work
- Published
- 2021
9. OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction
- Author
-
Keshav Kolluru, Vaibhav Adlakha, Samarth Aggarwal, null Mausam, and Soumen Chakrabarti
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Computer science ,05 social sciences ,010501 environmental sciences ,Grid ,computer.software_genre ,01 natural sciences ,Bridge (nautical) ,Sequence labeling ,Task (project management) ,Information extraction ,Computer engineering ,0502 economics and business ,State (computer science) ,050207 economics ,Computation and Language (cs.CL) ,computer ,0105 earth and related environmental sciences - Abstract
A recent state-of-the-art neural open information extraction (OpenIE) system generates extractions iteratively, requiring repeated encoding of partial outputs. This comes at a significant computational cost. On the other hand, sequence labeling approaches for OpenIE are much faster, but worse in extraction quality. In this paper, we bridge this trade-off by presenting an iterative labeling-based system that establishes a new state of the art for OpenIE, while extracting 10x faster. This is achieved through a novel Iterative Grid Labeling (IGL) architecture, which treats OpenIE as a 2-D grid labeling task. We improve its performance further by applying coverage (soft) constraints on the grid at training time. Moreover, on observing that the best OpenIE systems falter at handling coordination structures, our OpenIE system also incorporates a new coordination analyzer built with the same IGL architecture. This IGL based coordination analyzer helps our OpenIE system handle complicated coordination structures, while also establishing a new state of the art on the task of coordination analysis, with a 12.3 pts improvement in F1 over previous analyzers. Our OpenIE system, OpenIE6, beats the previous systems by as much as 4 pts in F1, while being much faster., EMNLP 2020 (Long)
- Published
- 2020
10. IMoJIE: Iterative Memory-Based Joint Open Information Extraction
- Author
-
Keshav Kolluru, Samarth Aggarwal, Vipul Rathore, null Mausam, and Soumen Chakrabarti
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Computer science ,business.industry ,05 social sciences ,Pattern recognition ,Extension (predicate logic) ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,Information extraction ,Task (computing) ,0502 economics and business ,Redundancy (engineering) ,State (computer science) ,Noise (video) ,Artificial intelligence ,050207 economics ,Tuple ,business ,Computation and Language (cs.CL) ,computer ,Sentence ,0105 earth and related environmental sciences - Abstract
While traditional systems for Open Information Extraction were statistical and rule-based, recently neural models have been introduced for the task. Our work builds upon CopyAttention, a sequence generation OpenIE model (Cui et. al., 2018). Our analysis reveals that CopyAttention produces a constant number of extractions per sentence, and its extracted tuples often express redundant information. We present IMoJIE, an extension to CopyAttention, which produces the next extraction conditioned on all previously extracted tuples. This approach overcomes both shortcomings of CopyAttention, resulting in a variable number of diverse extractions per sentence. We train IMoJIE on training data bootstrapped from extractions of several non-neural systems, which have been automatically filtered to reduce redundancy and noise. IMoJIE outperforms CopyAttention by about 18 F1 pts, and a BERT-based strong baseline by 2 F1 pts, establishing a new state of the art for the task.
- Published
- 2020
11. Temporal Knowledge Base Completion: New Algorithms and Evaluation Protocols
- Author
-
Prachi Jain, Sushant Rathi, null Mausam, and Soumen Chakrabarti
- Subjects
Social and Information Networks (cs.SI) ,FOS: Computer and information sciences ,Computer Science - Machine Learning ,business.industry ,Computer science ,010102 general mathematics ,Computer Science - Social and Information Networks ,010501 environmental sciences ,Space (commercial competition) ,Machine learning ,computer.software_genre ,01 natural sciences ,Machine Learning (cs.LG) ,Knowledge base ,Artificial intelligence ,0101 mathematics ,business ,Link (knot theory) ,computer ,0105 earth and related environmental sciences - Abstract
Temporal knowledge bases associate relational (s,r,o) triples with a set of times (or a single time instant) when the relation is valid. While time-agnostic KB completion (KBC) has witnessed significant research, temporal KB completion (TKBC) is in its early days. In this paper, we consider predicting missing entities (link prediction) and missing time intervals (time prediction) as joint TKBC tasks where entities, relations, and time are all embedded in a uniform, compatible space. We present TIMEPLEX, a novel time-aware KBC method, that also automatically exploits the recurrent nature of some relations and temporal interactions between pairs of relations. TIMEPLEX achieves state-of-the-art performance on both prediction tasks. We also find that existing TKBC models heavily overestimate link prediction performance due to imperfect evaluation mechanisms. In response, we propose improved TKBC evaluation protocols for both link and time prediction tasks, dealing with subtle issues that arise from the partial overlap of time intervals in gold instances and system predictions.
- Published
- 2020
12. Unsupervised Learning of KB Queries in Task-Oriented Dialogs
- Author
-
Dinesh Raghu, Nikhil Gupta, and null Mausam
- Subjects
FOS: Computer and information sciences ,Linguistics and Language ,Computer Science - Machine Learning ,Information retrieval ,Computer Science - Computation and Language ,Computer science ,business.industry ,Communication ,Machine Learning (stat.ML) ,Machine Learning (cs.LG) ,Computer Science Applications ,Human-Computer Interaction ,Annotation ,Task (computing) ,Knowledge base ,Artificial Intelligence ,Statistics - Machine Learning ,Task oriented ,Reinforcement learning ,Unsupervised learning ,State (computer science) ,Dialog box ,business ,Computation and Language (cs.CL) - Abstract
Task-oriented dialog (TOD) systems often need to formulate knowledge base (KB) queries corresponding to the user intent and use the query results to generate system responses. Existing approaches require dialog datasets to explicitly annotate these KB queries -- these annotations can be time consuming, and expensive. In response, we define the novel problems of predicting the KB query and training the dialog agent, without explicit KB query annotation. For query prediction, we propose a reinforcement learning (RL) baseline, which rewards the generation of those queries whose KB results cover the entities mentioned in subsequent dialog. Further analysis reveals that correlation among query attributes in KB can significantly confuse memory augmented policy optimization (MAPO), an existing state of the art RL agent. To address this, we improve the MAPO baseline with simple but important modifications suited to our task. To train the full TOD system for our setting, we propose a pipelined approach: it independently predicts when to make a KB query (query position predictor), then predicts a KB query at the predicted position (query predictor), and uses the results of predicted query in subsequent dialog (next response predictor). Overall, our work proposes first solutions to our novel problem, and our analysis highlights the research challenges in training TOD systems without query annotation., Presented at ACL 2021
- Published
- 2020
13. Why and when should you pool? Analyzing Pooling in Recurrent Architectures
- Author
-
Pratyush Maini, Keshav Kolluru, Danish Pruthi, and null Mausam
- Subjects
FOS: Computer and information sciences ,Sequence ,Computer Science - Machine Learning ,Computer Science - Computation and Language ,Computer science ,business.industry ,05 social sciences ,Pooling ,010501 environmental sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Machine Learning (cs.LG) ,0502 economics and business ,Artificial intelligence ,050207 economics ,business ,computer ,Computation and Language (cs.CL) ,0105 earth and related environmental sciences - Abstract
Pooling-based recurrent neural architectures consistently outperform their counterparts without pooling. However, the reasons for their enhanced performance are largely unexamined. In this work, we examine three commonly used pooling techniques (mean-pooling, max-pooling, and attention), and propose max-attention, a novel variant that effectively captures interactions among predictive tokens in a sentence. We find that pooling-based architectures substantially differ from their non-pooling equivalents in their learning ability and positional biases--which elucidate their performance benefits. By analyzing the gradient propagation, we discover that pooling facilitates better gradient flow compared to BiLSTMs. Further, we expose how BiLSTMs are positionally biased towards tokens in the beginning and the end of a sequence. Pooling alleviates such biases. Consequently, we identify settings where pooling offers large benefits: (i) in low resource scenarios, and (ii) when important words lie towards the middle of the sentence. Among the pooling techniques studied, max-attention is the most effective, resulting in significant performance gains on several text classification tasks., Accepted to Findings of EMNLP 2020, to be presented at BlackBoxNLP. Updated Version
- Published
- 2020
14. A Simple Yet Strong Pipeline for HotpotQA
- Author
-
Dirk Groeneveld, Tushar Khot, null Mausam, and Ashish Sabharwal
- Subjects
FOS: Computer and information sciences ,Theoretical computer science ,Computer Science - Computation and Language ,Computer science ,Context (language use) ,02 engineering and technology ,computer.software_genre ,Pipeline (software) ,03 medical and health sciences ,Identification (information) ,0302 clinical medicine ,Named-entity recognition ,030221 ophthalmology & optometry ,0202 electrical engineering, electronic engineering, information engineering ,Question answering ,Graph (abstract data type) ,020201 artificial intelligence & image processing ,Language model ,Computation and Language (cs.CL) ,computer ,Sentence - Abstract
State-of-the-art models for multi-hop question answering typically augment large-scale language models like BERT with additional, intuitively useful capabilities such as named entity recognition, graph-based reasoning, and question decomposition. However, does their strong performance on popular multi-hop datasets really justify this added design complexity? Our results suggest that the answer may be no, because even our simple pipeline based on BERT, named \model, performs surprisingly well. Specifically, on HotpotQA, Quark outperforms these models on both question answering and support identification (and achieves performance very close to a RoBERTa model). Our pipeline has three steps: 1) use BERT to identify potentially relevant sentences \emph{independently} of each other; 2) feed the set of selected sentences as context into a standard BERT span prediction model to choose an answer; and 3) use the sentence selection model, now with the chosen answer, to produce supporting sentences. The strong performance of Quark resurfaces the importance of carefully exploring simple model designs before using popular benchmarks to justify the value of complex techniques.
- Published
- 2020
15. [Untitled]
- Author
-
Dinesh Raghu, Nikhil Gupta, and null Mausam
- Subjects
Measure (data warehouse) ,Artificial neural network ,Computer science ,business.industry ,02 engineering and technology ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,Boss ,Knowledge base ,0202 electrical engineering, electronic engineering, information engineering ,Task oriented ,020201 artificial intelligence & image processing ,Language model ,Artificial intelligence ,business ,computer ,Natural language processing ,0105 earth and related environmental sciences ,Interpretability - Abstract
The Knowledge Base (KB) used for real-world applications, such as booking a movie or restaurant reservation, keeps changing over time. End-to-end neural networks trained for these task-oriented dialogs are expected to be immune to any changes in the KB. However, existing approaches breakdown when asked to handle such changes. We propose an encoder-decoder architecture (BoSsNet) with a novel Bag-of-Sequences (BoSs) memory, which facilitates the disentangled learning of the response’s language model and its knowledge incorporation. Consequently, the KB can be modified with new knowledge without a drop in interpretability. We find that BoSsNeT outperforms state-of-the-art models, with considerable improvements (>10%) on bAbI OOV test sets and other human-human datasets. We also systematically modify existing datasets to measure disentanglement and show BoSsNeT to be robust to KB modifications.
- Published
- 2019
16. Sprout
- Author
-
Jonathan Bragg, null Mausam, and Daniel S. Weld
- Subjects
Iterative and incremental development ,Data collection ,Process (engineering) ,business.industry ,Computer science ,05 social sciences ,Job design ,020207 software engineering ,02 engineering and technology ,Crowdsourcing ,Task (project management) ,Workflow ,Human–computer interaction ,0202 electrical engineering, electronic engineering, information engineering ,0501 psychology and cognitive sciences ,business ,Set (psychology) ,050107 human factors - Abstract
While crowdsourcing enables data collection at scale, ensuring high-quality data remains a challenge. In particular, effective task design underlies nearly every reported crowdsourcing success, yet remains difficult to accomplish. Task design is hard because it involves a costly iterative process: identifying the kind of work output one wants, conveying this information to workers, observing worker performance, understanding what remains ambiguous, revising the instructions, and repeating the process until the resulting output is satisfactory. To facilitate this process, we propose a novel meta-workflow that helps requesters optimize crowdsourcing task designs and Sprout, our open-source tool, which implements this workflow. Sprout improves task designs by (1) eliciting points of confusion from crowd workers, (2) enabling requesters to quickly understand these misconceptions and the overall space of questions, and (3) guiding requesters to improve the task design in response. We report the results of a user study with two labeling tasks demonstrating that requesters strongly prefer Sprout and produce higher-rated instructions compared to current best practices for creating gated instructions (instructions plus a workflow for training and testing workers). We also offer a set of design recommendations for future tools that support crowdsourcing task design.
- Published
- 2018
17. Type-Sensitive Knowledge Base Inference Without Explicit Type Supervision
- Author
-
Prachi Jain, Pankaj Kumar, null Mausam, and Soumen Chakrabarti
- Subjects
Relation (database) ,Computer science ,business.industry ,010401 analytical chemistry ,Inference ,02 engineering and technology ,Type (model theory) ,Base (topology) ,Machine learning ,computer.software_genre ,01 natural sciences ,0104 chemical sciences ,Knowledge base ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer - Abstract
State-of-the-art knowledge base completion (KBC) models predict a score for every known or unknown fact via a latent factorization over entity and relation embeddings. We observe that when they fail, they often make entity predictions that are incompatible with the type required by the relation. In response, we enhance each base factorization with two type-compatibility terms between entity-relation pairs, and combine the signals in a novel manner. Without explicit supervision from a type catalog, our proposed modification obtains up to 7% MRR gains over base models, and new state-of-the-art results on several datasets. Further analysis reveals that our models better represent the latent types of entities and their embeddings also predict supervised types better than the embeddings fitted by baseline models.
- Published
- 2018
18. Coarse-to-Fine Lifted MAP Inference in Computer Vision
- Author
-
Haroun Habeeb, Ankit Anand, null Mausam, and Parag Singla
- Subjects
FOS: Computer and information sciences ,Pixel ,Unary operation ,Lift (data mining) ,business.industry ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,Inference ,Image segmentation ,Line (geometry) ,Node (circuits) ,Computer vision ,Artificial intelligence ,Graphical model ,business - Abstract
There is a vast body of theoretical research on lifted inference in probabilistic graphical models (PGMs). However, few demonstrations exist where lifting is applied in conjunction with top of the line applied algorithms. We pursue the applicability of lifted inference for computer vision (CV), with the insight that a globally optimal (MAP) labeling will likely have the same label for two symmetric pixels. The success of our approach lies in efficiently handling a distinct unary potential on every node (pixel), typical of CV applications. This allows us to lift the large class of algorithms that model a CV problem via PGM inference. We propose a generic template for coarse-to-fine (C2F) inference in CV, which progressively refines an initial coarsely lifted PGM for varying quality-time trade-offs. We demonstrate the performance of C2F inference by developing lifted versions of two near state-of-the-art CV algorithms for stereo vision and interactive image segmentation. We find that, against flat algorithms, the lifted versions have a much superior anytime performance, without any loss in final solution quality., Published in IJCAI 2017
- Published
- 2017
19. Bootstrapping for Numerical Open IE
- Author
-
Swarnadeep Saha, Harinder Pal, and null Mausam
- Subjects
Phrase ,Dependency (UML) ,Relation (database) ,Computer science ,business.industry ,Bootstrapping (linguistics) ,Context (language use) ,02 engineering and technology ,computer.software_genre ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Tuple ,business ,computer ,Natural language processing ,Sentence ,Word (computer architecture) - Abstract
We design and release BONIE, the first open numerical relation extractor, for extracting Open IE tuples where one of the arguments is a number or a quantity-unit phrase. BONIE uses bootstrapping to learn the specific dependency patterns that express numerical relations in a sentence. BONIE’s novelty lies in task-specific customizations, such as inferring implicit relations, which are clear due to context such as units (for e.g., ‘square kilometers’ suggests area, even if the word ‘area’ is missing in the sentence). BONIE obtains 1.5x yield and 15 point precision gain on numerical facts over a state-of-the-art Open IE system.
- Published
- 2017
20. Modeling Missing Data in Distant Supervision for Information Extraction
- Author
-
Alan Ritter, Luke Zettlemoyer, null Mausam, and Oren Etzioni
- Subjects
Linguistics and Language ,Unary operation ,Computer science ,business.industry ,Communication ,Inference ,Binary number ,Missing data ,computer.software_genre ,Machine learning ,Relationship extraction ,Computer Science Applications ,Human-Computer Interaction ,Information extraction ,Artificial Intelligence ,Artificial intelligence ,Precision and recall ,Heuristics ,business ,computer - Abstract
Distant supervision algorithms learn information extraction models given only large readily available databases and text collections. Most previous work has used heuristics for generating labeled data, for example assuming that facts not contained in the database are not mentioned in the text, and facts in the database must be mentioned at least once. In this paper, we propose a new latent-variable approach that models missing data. This provides a natural way to incorporate side information, for instance modeling the intuition that text will often mention rare entities which are likely to be missing in the database. Despite the added complexity introduced by reasoning about missing data, we demonstrate that a carefully designed local search approach to inference is very accurate and scales to large datasets. Experiments demonstrate improved performance for binary and unary relation extraction when compared to learning with heuristic labels, including on average a 27% increase in area under the precision recall curve in the binary case.
- Published
- 2013
21. POMDP-based control of workflows for crowdsourcing
- Author
-
Peng Dai, Christopher H. Lin, null Mausam, and Daniel S. Weld
- Subjects
Linguistics and Language ,business.industry ,Process (engineering) ,Computer science ,Partially observable Markov decision process ,Bayesian network ,Crowdsourcing ,Machine learning ,computer.software_genre ,Language and Linguistics ,Task (project management) ,Outsourcing ,Workflow ,Artificial Intelligence ,Markov decision process ,Artificial intelligence ,business ,computer - Abstract
Crowdsourcing, outsourcing of tasks to a crowd of unknown people ("workers") in an open call, is rapidly rising in popularity. It is already being heavily used by numerous employers ("requesters") for solving a wide variety of tasks, such as audio transcription, content screening, and labeling training data for machine learning. However, quality control of such tasks continues to be a key challenge because of the high variability in worker quality. In this paper we show the value of decision-theoretic techniques for the problem of optimizing workflows used in crowdsourcing. In particular, we design AI agents that use Bayesian network learning and inference in combination with Partially-Observable Markov Decision Processes (POMDPs) for obtaining excellent cost-quality tradeoffs. We use these techniques for three distinct crowdsourcing scenarios: (1) control of voting to answer a binary-choice question, (2) control of an iterative improvement workflow, and (3) control of switching between alternate workflows for a task. In each scenario, we design a Bayes net model that relates worker competency, task difficulty and worker response quality. We also design a POMDP for each task, whose solution provides the dynamic control policy. We demonstrate the usefulness of our models and agents in live experiments on Amazon Mechanical Turk. We consistently achieve superior quality results than non-adaptive controllers, while incurring equal or less cost.
- Published
- 2013
22. Knowledge-Guided Linguistic Rewrites for Inference Rule Verification
- Author
-
Prachi Jain and null Mausam
- Subjects
Point (typography) ,Relation (database) ,Computer science ,media_common.quotation_subject ,05 social sciences ,010501 environmental sciences ,01 natural sciences ,Linguistics ,0502 economics and business ,Quality (business) ,050207 economics ,Rule of inference ,0105 earth and related environmental sciences ,media_common - Abstract
A corpus of inference rules between a pair of relation phrases is typically generated using the statistical overlap of argument-pairs associated with the relations (e.g., PATTY, CLEAN). We investigate knowledge-guided linguistic rewrites as a secondary source of evidence and find that they can vastly improve the quality of inference rule corpora, obtaining 27 to 33 point precision improvement while retaining substantial recall. The facts inferred using cleaned inference rules are 29-32 points more accurate.
- Published
- 2016
23. Entity-balanced Gaussian pLSA for Automated Comparison
- Author
-
Danish Contractor, Parag Singla, and null Mausam
- Subjects
Probabilistic latent semantic analysis ,Computer science ,business.industry ,Gaussian ,Pattern recognition ,02 engineering and technology ,010501 environmental sciences ,01 natural sciences ,symbols.namesake ,0202 electrical engineering, electronic engineering, information engineering ,symbols ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,0105 earth and related environmental sciences - Published
- 2016
24. Planning with Markov Decision Processes: An AI Perspective
- Author
-
null Mausam and Andrey Kolobov
- Subjects
Artificial Intelligence - Published
- 2012
25. Panlingual lexical translation via probabilistic inference
- Author
-
null Mausam, Stephen Soderland, Oren Etzioni, Daniel S. Weld, Kobi Reiter, Michael Skinner, Marcus Sammer, and Jeff Bilmes
- Subjects
Linguistics and Language ,Computer science ,business.industry ,Transitive inference ,Inference ,Probabilistic inference ,computer.software_genre ,Lexical translation ,Language and Linguistics ,Rule-based machine translation ,Multilinguality ,Artificial Intelligence ,Graph (abstract data type) ,Pairwise comparison ,Compiler ,Artificial intelligence ,Precision and recall ,business ,computer ,Natural language processing - Abstract
This paper introduces a novel approach to the task of lexical translation between languages for which no translation dictionaries are available. We build a massive translation graph, automatically constructed from over 630 machine-readable dictionaries and Wiktionaries. In this graph each node denotes a word in some language and each edge (vi,vj) denotes a word sense shared by vi and vj. Our current graph contains over 10,000,000 nodes and expresses more than 60,000,000 pairwise translations.The composition of multiple translation dictionaries leads to a transitive inference problem: if word A translates to word B which in turn translates to word C, what is the probability that C is a translation of A? The paper describes a series of probabilistic inference algorithms that solve this problem at varying precision and recall levels. All algorithms enable us to quantify our confidence in a translation derived from the graph, and thus trade precision for recall.We compile the results of our best inference algorithm to yield PanDictionary, a novel multilingual dictionary. PanDictionary contains more than four times as many translations as in the largest Wiktionary at precision 0.90 and over 200,000,000 pairwise translations in over 200,000 language pairs at precision 0.8.
- Published
- 2010
- Full Text
- View/download PDF
26. Planning with Durative Actions in Stochastic Domains
- Author
-
D. S. Weld and null Mausam
- Subjects
Mathematical optimization ,business.industry ,Closeness ,Probabilistic logic ,Action (philosophy) ,Artificial Intelligence ,Key (cryptography) ,Pruning (decision trees) ,Markov decision process ,Artificial intelligence ,State (computer science) ,Heuristics ,business ,Mathematics - Abstract
Probabilistic planning problems are typically modeled as a Markov Decision Process (MDP). MDPs, while an otherwise expressive model, allow only for sequential, non-durative actions. This poses severe restrictions in modeling and solving a real world planning problem. We extend the MDP model to incorporate 1) simultaneous action execution, 2) durative actions, and 3) stochastic durations. We develop several algorithms to combat the computational explosion introduced by these features. The key theoretical ideas used in building these algorithms are -- modeling a complex problem as an MDP in extended state/action space, pruning of irrelevant actions, sampling of relevant actions, using informed heuristics to guide the search, hybridizing different planners to achieve benefits of both, approximating the problem and replanning. Our empirical evaluation illuminates the different merits in using various algorithms, viz., optimality, empirical closeness to optimality, theoretical error bounds, and speed.
- Published
- 2008
27. Hierarchical Summarization: Scaling Up Multi-Document Summarization
- Author
-
Janara Christensen, Stephen Soderland, Gagan Bansal, and null Mausam
- Subjects
Hierarchy ,Information retrieval ,Salient ,Computer science ,Multi-document summarization ,Timeline ,Coherence (statistics) ,Automatic summarization ,Scaling - Abstract
Multi-document summarization (MDS) systems have been designed for short, unstructured summaries of 10-15 documents, and are inadequate for larger document collections. We propose a new approach to scaling up summarization called hierarchical summarization, and present the first implemented system, SUMMA. SUMMA produces a hierarchy of relatively short summaries, in which the top level provides a general overview and users can navigate the hierarchy to drill down for more details on topics of interest. SUMMA optimizes for coherence as well as coverage of salient information. In an Amazon Mechanical Turk evaluation, users prefered SUMMA ten times as often as flat MDS and three times as often as timelines.
- Published
- 2014
28. Open domain event extraction from twitter
- Author
-
Alan Ritter, null Mausam, Oren Etzioni, and Sam Clark
- Subjects
Information extraction ,Categorization ,Computer science ,business.industry ,Event (computing) ,Aggregate (data warehouse) ,Open domain ,Artificial intelligence ,business ,computer.software_genre ,Machine learning ,computer ,Natural language processing - Abstract
Tweets are the most up-to-date and inclusive stream of in- formation and commentary on current events, but they are also fragmented and noisy, motivating the need for systems that can extract, aggregate and categorize important events. Previous work on extracting structured representations of events has focused largely on newswire text; Twitter's unique characteristics present new challenges and opportunities for open-domain event extraction. This paper describes TwiCal-- the first open-domain event-extraction and categorization system for Twitter. We demonstrate that accurately extracting an open-domain calendar of significant events from Twitter is indeed feasible. In addition, we present a novel approach for discovering important event categories and classifying extracted events based on latent variable models. By leveraging large volumes of unlabeled data, our approach achieves a 14% increase in maximum F1 over a supervised baseline. A continuously updating demonstration of our system can be viewed at http://statuscalendar.com; Our NLP tools are available at http://github.com/aritter/ twitter_nlp.
- Published
- 2012
29. Planning with Markov Decision Processes
- Author
-
null Mausam and Andrey Kolobov
- Published
- 2012
30. Execution control for crowdsourcing
- Author
-
Daniel S. Weld, null Mausam, and Peng Dai
- Subjects
Computer science ,business.industry ,Decision theory ,media_common.quotation_subject ,Partially observable Markov decision process ,Machine learning ,computer.software_genre ,Crowdsourcing ,Task (project management) ,Set (abstract data type) ,Range (mathematics) ,Workflow ,Quality (business) ,Artificial intelligence ,business ,computer ,media_common - Abstract
Crowdsourcing marketplaces enable a wide range of applications, but constructing any new application is challenging - usually requiring a complex, self-managing workflow in order to guarantee quality results. We report on the CLOWDER project, which uses machine learning to continually refine models of worker performance and task difficulty. We present decision-theoretic optimization techniques that can select the best parameters for a range of workflows. Initial experiments show our optimized workflows are significantly more economical than with manually set parameters.
- Published
- 2011
31. An analysis of open information extraction based on semantic role labeling
- Author
-
Janara Christensen, null Mausam, Stephen Soderland, and Oren Etzioni
- Subjects
Vocabulary ,Parsing ,Exploit ,Computer science ,business.industry ,media_common.quotation_subject ,computer.software_genre ,PropBank ,Information extraction ,Task (computing) ,Semantic role labeling ,Isolation (database systems) ,Artificial intelligence ,business ,computer ,Natural language processing ,media_common - Abstract
Open Information Extraction extracts relations from text without requiring a pre-specified domain or vocabulary. While existing techniques have used only shallow syntactic features, we investigate the use of semantic role labeling techniques for the task of Open IE. Semantic role labeling (SRL) and Open IE, although developed mostly in isolation, are quite related. We compare SRL-based open extractors, which perform computationally expensive, deep syntactic analysis, with TextRunner, an open extractor, which uses shallow syntactic analysis but is able to analyze many more sentences in a fixed amount of time and thus exploit corpus-level statistics. Our evaluation answers questions regarding these systems, including, can SRL extractors, which are trained on PropBank, cope with heterogeneous text found on the Web? Which extractor attains better precision, recall, f-measure, or running time? How does extractor performance vary for binary, n-ary and nested relations? How much do we gain by running multiple extractors? How do we select the optimal extractor given amount of data, available time, types of extractions desired?
- Published
- 2011
32. Adversarial classification
- Author
-
Nilesh Dalvi, Pedro Domingos, null Mausam, Sumit Sanghai, and Deepak Verma
- Subjects
business.industry ,Computer science ,Intrusion detection system ,Adversary ,Machine learning ,computer.software_genre ,Naive Bayes classifier ,ComputingMethodologies_PATTERNRECOGNITION ,Margin classifier ,Artificial intelligence ,Data mining ,business ,computer ,Game theory ,Classifier (UML) ,Adversary model - Abstract
Essentially all data mining algorithms assume that the data-generating process is independent of the data miner's activities. However, in many domains, including spam detection, intrusion detection, fraud detection, surveillance and counter-terrorism, this is far from the case: the data is actively manipulated by an adversary seeking to make the classifier produce false negatives. In these domains, the performance of a classifier can degrade rapidly after it is deployed, as the adversary learns to defeat it. Currently the only solution to this is repeated, manual, ad hoc reconstruction of the classifier. In this paper we develop a formal framework and algorithms for this problem. We view classification as a game between the classifier and the adversary, and produce a classifier that is optimal given the adversary's optimal strategy. Experiments in a spam detection domain show that this approach can greatly outperform a classifier learned in the standard way, and (within the parameters of the problem) automatically adapt the classifier to the adversary's evolving manipulations.
- Published
- 2004
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.