Author: "Kleiman-Weiner, Max" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Kleiman-Weiner, Max"' showing total 202 results

Start Over Author "Kleiman-Weiner, Max"

202 results on '"Kleiman-Weiner, Max"'

1. Value Internalization: Learning and Generalizing from Social Reward

Author: Rong, Frieda and Kleiman-Weiner, Max
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems
Abstract: Social rewards shape human behavior. During development, a caregiver guides a learner's behavior towards culturally aligned goals and values. How do these behaviors persist and generalize when the caregiver is no longer present, and the learner must continue autonomously? Here, we propose a model of value internalization where social feedback trains an internal social reward (ISR) model that generates internal rewards when social rewards are unavailable. Through empirical simulations, we show that an ISR model prevents agents from unlearning socialized behaviors and enables generalization in out-of-distribution tasks. We characterize the implications of incomplete internalization, akin to "reward hacking" on the ISR. Additionally, we show that our model internalizes prosocial behavior in a multi-agent environment. Our work provides a foundation for understanding how humans acquire and generalize values and offers insights for aligning AI with human values., Comment: Reinforcement Learning Conference (RLC) 2024 & Cognitive Science Conference Oral
Published: 2024

2. Multilingual Trolley Problems for Language Models

Author: Jin, Zhijing, Levine, Sydney, Kleiman-Weiner, Max, Piatti, Giorgio, Liu, Jiarui, Adauto, Fernando Gonzalez, Ortu, Francesco, Strausz, András, Sachan, Mrinmaya, Mihalcea, Rada, Choi, Yejin, and Schölkopf, Bernhard
Subjects: Computer Science - Computation and Language
Abstract: As large language models (LLMs) are deployed in more and more real-world situations, it is crucial to understand their decision-making when faced with moral dilemmas. Inspired by a large-scale cross-cultural study of human moral preferences, "The Moral Machine Experiment", we set up the same set of moral choices for LLMs. We translate 1K vignettes of moral dilemmas, parametrically varied across key axes, into 100+ languages, and reveal the preferences of LLMs in each of these languages. We then compare the responses of LLMs to that of human speakers of those languages, harnessing a dataset of 40 million human moral judgments. We discover that LLMs are more aligned with human preferences in languages such as English, Korean, Hungarian, and Chinese, but less aligned in languages such as Hindi and Somali (in Africa). Moreover, we characterize the explanations LLMs give for their moral choices and find that fairness is the most dominant supporting reason behind GPT-4's decisions and utilitarianism by GPT-3. We also discover "language inequality" (which we define as the model's different development levels in different languages) in a series of meta-properties of moral decision making.
Published: 2024

3. Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents

Author: Piatti, Giorgio, Jin, Zhijing, Kleiman-Weiner, Max, Schölkopf, Bernhard, Sachan, Mrinmaya, and Mihalcea, Rada
Subjects: Computer Science - Computation and Language
Abstract: As AI systems pervade human life, ensuring that large language models (LLMs) make safe decisions remains a significant challenge. We introduce the Governance of the Commons Simulation (GovSim), a generative simulation platform designed to study strategic interactions and cooperative decision-making in LLMs. In GovSim, a society of AI agents must collectively balance exploiting a common resource with sustaining it for future use. This environment enables the study of how ethical considerations, strategic planning, and negotiation skills impact cooperative outcomes. We develop an LLM-based agent architecture and test it with the leading open and closed LLMs. We find that all but the most powerful LLM agents fail to achieve a sustainable equilibrium in GovSim, with the highest survival rate below 54%. Ablations reveal that successful multi-agent communication between agents is critical for achieving cooperation in these cases. Furthermore, our analyses show that the failure to achieve sustainable cooperation in most LLMs stems from their inability to formulate and analyze hypotheses about the long-term effects of their actions on the equilibrium of the group. Finally, we show that agents that leverage "Universalization"-based reasoning, a theory of moral thinking, are able to achieve significantly better sustainability. Taken together, GovSim enables us to study the mechanisms that underlie sustainable self-government with specificity and scale. We open source the full suite of our research results, including the simulation environment, agent prompts, and a comprehensive web interface., Comment: Revised version
Published: 2024

4. CLadder: Assessing Causal Reasoning in Language Models

Author: Jin, Zhijing, Chen, Yuen, Leeb, Felix, Gresele, Luigi, Kamal, Ojasv, Lyu, Zhiheng, Blin, Kevin, Adauto, Fernando Gonzalez, Kleiman-Weiner, Max, Sachan, Mrinmaya, and Schölkopf, Bernhard
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The ability to perform causal reasoning is widely considered a core feature of intelligence. In this work, we investigate whether large language models (LLMs) can coherently reason about causality. Much of the existing work in natural language processing (NLP) focuses on evaluating commonsense causal reasoning in LLMs, thus failing to assess whether a model can perform causal inference in accordance with a set of well-defined formal rules. To address this, we propose a new NLP task, causal inference in natural language, inspired by the "causal inference engine" postulated by Judea Pearl et al. We compose a large dataset, CLadder, with 10K samples: based on a collection of causal graphs and queries (associational, interventional, and counterfactual), we obtain symbolic questions and ground-truth answers, through an oracle causal inference engine. These are then translated into natural language. We evaluate multiple LLMs on our dataset, and we introduce and evaluate a bespoke chain-of-thought prompting strategy, CausalCoT. We show that our task is highly challenging for LLMs, and we conduct an in-depth analysis to gain deeper insights into the causal reasoning abilities of LLMs. Our data is open-sourced at https://huggingface.co/datasets/causalNLP/cladder, and our code can be found at https://github.com/causalNLP/cladder., Comment: NeurIPS 2023; updated with CLadder dataset v1.5
Published: 2023

5. Computational Principles of Caregiving

Author: Kleiman-Weiner, Max
Subjects: Artificial Intelligence, Philosophy, Psychology, Cognitive development, Learning, Machine learning, Social cognition, Theory of Mind, Computational Modeling, Mathematical modeling
Abstract: I formalize the problem of care in the mathematical language of sequential decision-making. Drawing upon insights from developmental psychology, robotics, and computational cognitive modeling, I conceptualize care as a dynamic interplay between the caregiver ('one-caring') and the care recipient ('cared-for'). Caring actions maximize the utility of the cared-for at a future point when they are required to act autonomously. Since this quantity cannot be directly optimized, the focus is on enabling increasing levels of autonomy through environmental shaping, risk reduction, and safe exploration. I distinguish caregiving from helping and teaching by care's focus on exploration and autonomy that increase capacity over time. In the context of elderly care, the emphasis shifts towards preserving rather than enhancing capacity. Finally, I consider the role of caregiving in the development of moral values and the possibility of artificially intelligent agents that might someday care for us.
Published: 2024

6. When is it acceptable to break the rules? Knowledge representation of moral judgements based on empirical data

Author: Awad, Edmond, Levine, Sydney, Loreggia, Andrea, Mattei, Nicholas, Rahwan, Iyad, Rossi, Francesca, Talamadupula, Kartik, Tenenbaum, Joshua, and Kleiman-Weiner, Max
Published: 2024
Full Text: View/download PDF

7. When rules are over-ruled: Virtual bargaining as a contractualist method of moral judgment

Author: Levine, Sydney, Kleiman-Weiner, Max, Chater, Nick, Cushman, Fiery, and Tenenbaum, Joshua B.
Published: 2024
Full Text: View/download PDF

8. Learning Intuitive Policies Using Action Features

Author: Ma, Mingwei, Liu, Jizhou, Sokota, Samuel, Kleiman-Weiner, Max, and Foerster, Jakob
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems
Abstract: An unaddressed challenge in multi-agent coordination is to enable AI agents to exploit the semantic relationships between the features of actions and the features of observations. Humans take advantage of these relationships in highly intuitive ways. For instance, in the absence of a shared language, we might point to the object we desire or hold up our fingers to indicate how many objects we want. To address this challenge, we investigate the effect of network architecture on the propensity of learning algorithms to exploit these semantic relationships. Across a procedurally generated coordination task, we find that attention-based architectures that jointly process a featurized representation of observations and actions have a better inductive bias for learning intuitive policies. Through fine-grained evaluation and scenario analysis, we show that the resulting policies are human-interpretable. Moreover, such agents coordinate with people without training on any human data., Comment: ICML 2023
Published: 2022

9. When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgement Based on Empirical Data

Author: Awad, Edmond, Levine, Sydney, Loreggia, Andrea, Mattei, Nicholas, Rahwan, Iyad, Rossi, Francesca, Talamadupula, Kartik, Tenenbaum, Joshua, and Kleiman-Weiner, Max
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: One of the most remarkable things about the human moral mind is its flexibility. We can make moral judgments about cases we have never seen before. We can decide that pre-established rules should be broken. We can invent novel rules on the fly. Capturing this flexibility is one of the central challenges in developing AI systems that can interpret and produce human-like moral judgment. This paper details the results of a study of real-world decision makers who judge whether it is acceptable to break a well-established norm: ``no cutting in line.'' We gather data on how human participants judge the acceptability of line-cutting in a range of scenarios. Then, in order to effectively embed these reasoning capabilities into a machine, we propose a method for modeling them using a preference-based structure, which captures a novel modification to standard ``dual process'' theories of moral judgment.
Published: 2022
Full Text: View/download PDF

10. Most people do not “value the struggle”: Tempted agents are judged as less virtuous than those who were never tempted

Author: McManus, Ryan M., Padilla Fong, Helen, Kleiman-Weiner, Max, and Young, Liane
Published: 2024
Full Text: View/download PDF

11. Modeling Communication to Coordinate Perspectives in Cooperation

Author: Stacy, Stephanie, Li, Chenfei, Zhao, Minglu, Yun, Yiling, Zhao, Qingyi, Kleiman-Weiner, Max, and Gao, Tao
Subjects: Computer Science - Artificial Intelligence
Abstract: Communication is highly overloaded. Despite this, even young children are good at leveraging context to understand ambiguous signals. We propose a computational account of overloaded signaling from a shared agency perspective which we call the Imagined We for Communication. Under this framework, communication helps cooperators coordinate their perspectives, allowing them to act together to achieve shared goals. We assume agents are rational cooperators, which puts constraints on how signals can be sent and interpreted. We implement this model in a set of simulations demonstrating this model's success under increasing ambiguity as well as increasing layers of reasoning. Our model is capable of improving performance with deeper recursive reasoning; however, it outperforms comparison baselines at even the shallowest level, highlighting how shared knowledge and cooperative logic can do much of the heavy-lifting in language.
Published: 2021

12. Overloaded Communication as Paternalistic Helping

Author: Stacy, Stephanie, Parab, Aishni, Kleiman-Weiner, Max, and Gao, Tao
Subjects: Language understanding, Pragmatics, Reasoning, Theory of Mind, Computational Modeling
Abstract: Even simple, ambiguous signals can have a rich interpretation when viewed in the context of an interaction in a shared environment. We create a model called Paternalistic Communication by combining an existing model of overloaded language -- Rational Speech Acts (RSA) -- with a full agent model of Theory of Mind (ToM). This integration allows signals to be processed in conjunction with common ground in a principled manner dependent on task-dependent action utilities. This modeling perspective treats communication as a way to coordinate diverging perspectives in a cooperative setting. Under Paternalistic Communication, a speaker decides what to say by predicting their partner’s reaction based on the information in common ground and then evaluates those reactions using their own mind which may contain additional information. We demonstrate the flexibility and performance of Paternalistic Communication in a case study with ambiguous signaling through a set of simulations.
Published: 2022

13. Does big data serve policy? Not without context. An experiment with in silico social science

Author: Graziul, Chris, Belikov, Alexander, Chattopadyay, Ishanu, Chen, Ziwen, Fang, Hongbo, Girdhar, Anuraag, Jia, Xiaoshuang, Krafft, P. M., Kleiman-Weiner, Max, Lewis, Candice, Liang, Chen, Muchovej, John, Vientós, Alejandro, Young, Meg, and Evans, James
Published: 2023
Full Text: View/download PDF

14. Antarjami: Exploring psychometric evaluation through a computer-based game

Author: Lahiri, Anirban, Mitra, Utanko, Sen, Sunreeta, Chakraborty, Mrinal, Kleiman-Weiner, Max, Guha, Rajlakshmi, Mitra, Pabitra, Basu, Anupam, and Chakraborty, Partha Pratim
Subjects: Computer Science - Human-Computer Interaction
Abstract: A number of questionnaire based psychometric testing frameworks are globally for example OCEAN (Five factor) indicator, MBTI (Myers Brigg Type Indicator) etc. However, questionnaire based psychometric tests have some known shortcomings. This work explores whether these shortcomings can be mitigated through computer-based gaming platforms for evaluating psychometric parameters. A computer based psychometric game framework called Antarjami has been developed for evaluating OCEAN (Five factor) indicators. It investigates the feasibility of extracting psychometric parameters through computer-based games, utilizing underlying improvements in the area of modern artificial intelligence. The candidates for the test are subjected to a number scenarios as part of the computer based game and their reactions/responses are used to evaluate their psychometric parameters. As part of the study, the parameters obtained from the game were compared with those evaluated using paper based tests and scores given by a panel of psychologists. The achieved results were very promising., Comment: Submitted to CogSci 2020
Published: 2020

15. Too many cooks: Bayesian inference for coordinating multi-agent collaboration

Author: Wang, Rose E., Wu, Sarah A., Evans, James A., Tenenbaum, Joshua B., Parkes, David C., and Kleiman-Weiner, Max
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: Collaboration requires agents to coordinate their behavior on the fly, sometimes cooperating to solve a single task together and other times dividing it up into sub-tasks to work on in parallel. Underlying the human ability to collaborate is theory-of-mind, the ability to infer the hidden mental states that drive others to act. Here, we develop Bayesian Delegation, a decentralized multi-agent learning mechanism with these abilities. Bayesian Delegation enables agents to rapidly infer the hidden intentions of others by inverse planning. We test Bayesian Delegation in a suite of multi-agent Markov decision processes inspired by cooking problems. On these tasks, agents with Bayesian Delegation coordinate both their high-level plans (e.g. what sub-task they should work on) and their low-level actions (e.g. avoiding getting in each other's way). In a self-play evaluation, Bayesian Delegation outperforms alternative algorithms. Bayesian Delegation is also a capable ad-hoc collaborator and successfully coordinates with other agent types even in the absence of prior experience. Finally, in a behavioral experiment, we show that Bayesian Delegation makes inferences similar to human observers about the intent of others. Together, these results demonstrate the power of Bayesian Delegation for decentralized multi-agent collaboration., Comment: Rose E. Wang and Sarah A. Wu contributed equally
Published: 2020

16. Unpacking the computations of human spatial search under uncertainty: noisy utility maximization, discounting, and probability warping

Author: Yu, Suhyoun, Kryven, Marta, Tenenbaum, Josh, and Kleiman-Weiner, Max
Subjects: cognitive science
Abstract: Humans navigate daily decision-making by flexibly choosing appropriate approximations of what ought to be done. Which mental algorithms do people use, and when? We use behavioural experiments and modelling to investigate three computational principles known to influence decision making: noisy utility maximization, discounting, and the probability warping principle of Prospect Theory.While these principles have been shown to separately influence human behaviour in simple laboratory tasks, such as bandits and gambles, we evaluate their combined use in the context of a naturalistic spatial search that required sequential decision-making. We found that while aggregate human behaviour can be reasonably well explained by an optimal planner with noisy utility maximization, individual-level behaviour exhibits consistent irregularities, that deviate from expected utility theory.We show that model-based prediction of individual-level behaviours in our experiment is significantly improved by combining the three computational principles, and benefits particularly strongly from probability warping. Furthermore, our results suggest that probability warping may be a common factor of human decision making, that generalizes beyond the gambles explored in Prospect Theory, to natural human behaviours such as spatial search and navigation.
Published: 2021

17. Finding Friend and Foe in Multi-Agent Games

Author: Serrino, Jack, Kleiman-Weiner, Max, Parkes, David C., and Tenenbaum, Joshua B.
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems, Statistics - Machine Learning
Abstract: Recent breakthroughs in AI for multi-agent games like Go, Poker, and Dota, have seen great strides in recent years. Yet none of these games address the real-life challenge of cooperation in the presence of unknown and uncertain teammates. This challenge is a key game mechanism in hidden role games. Here we develop the DeepRole algorithm, a multi-agent reinforcement learning agent that we test on The Resistance: Avalon, the most popular hidden role game. DeepRole combines counterfactual regret minimization (CFR) with deep value networks trained through self-play. Our algorithm integrates deductive reasoning into vector-form CFR to reason about joint beliefs and deduce partially observable actions. We augment deep value networks with constraints that yield interpretable representations of win probabilities. These innovations enable DeepRole to scale to the full Avalon game. Empirical game-theoretic methods show that DeepRole outperforms other hand-crafted and learned agents in five-player Avalon. DeepRole played with and against human players on the web in hybrid human-agent teams. We find that DeepRole outperforms human players as both a cooperator and a competitor., Comment: Jack Serrino and Max Kleiman-Weiner contributed equally
Published: 2019

18. Theory of Minds: Understanding Behavior in Groups Through Inverse Planning

Author: Shum, Michael, Kleiman-Weiner, Max, Littman, Michael L., and Tenenbaum, Joshua B.
Subjects: Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems
Abstract: Human social behavior is structured by relationships. We form teams, groups, tribes, and alliances at all scales of human life. These structures guide multi-agent cooperation and competition, but when we observe others these underlying relationships are typically unobservable and hence must be inferred. Humans make these inferences intuitively and flexibly, often making rapid generalizations about the latent relationships that underlie behavior from just sparse and noisy observations. Rapid and accurate inferences are important for determining who to cooperate with, who to compete with, and how to cooperate in order to compete. Towards the goal of building machine-learning algorithms with human-like social intelligence, we develop a generative model of multi-agent action understanding based on a novel representation for these latent relationships called Composable Team Hierarchies (CTH). This representation is grounded in the formalism of stochastic games and multi-agent reinforcement learning. We use CTH as a target for Bayesian inference yielding a new algorithm for understanding behavior in groups that can both infer hidden relationships as well as predict future actions for multiple agents interacting together. Our algorithm rapidly recovers an underlying causal model of how agents relate in spatial stochastic games from just a few observations. The patterns of inference made by this algorithm closely correspond with human judgments and the algorithm makes the same rapid generalizations that people do., Comment: published in AAAI 2019; Michael Shum and Max Kleiman-Weiner contributed equally
Published: 2019

19. Too many cooks: Coordinating multi-agent collaboration through inverse planning

Author: Wu, Sarah A., Wang, Rose E., Evans, James A., Tenenbaum, Joshua B., Parkes, David C., and Kleiman-Weiner, Max
Subjects: coordination, social learning, inverse planning, Bayesian inference
Abstract: Collaboration requires agents to coordinate their behavior onthe fly, sometimes cooperating to solve a single task togetherand other times dividing it up into sub-tasks to work on in par-allel. Underlying the human ability to collaborate is theory-of-mind, the ability to infer the hidden mental states that driveothers to act. Here, we develop Bayesian Delegation, a decen-tralized multi-agent learning mechanism with these abilities.Bayesian Delegation enables agents to rapidly infer the hid-den intentions of others by inverse planning. These inferencesenable agents to flexibly decide in the absence of communi-cation when to cooperate on the same sub-task and when towork on different sub-tasks in parallel. We test this model ina suite of multi-agent Markov decision processes inspired bycooking problems. To succeed, agents must coordinate boththeir high-level plans (e.g., what sub-task they should work on)and their low-level actions (e.g., avoiding collisions). BayesianDelegation bridges these two levels and rapidly aligns agents’beliefs about who should work on what. Finally, we testedBayesian Delegation in a behavioral experiment where partici-pants made sub-task inferences from sparse observations of co-operative behavior. Bayesian Delegation outperformed heuris-tic models and was closely aligned with human judgments.
Published: 2020

20. Downloading Culture.zip: Social learning by program induction

Author: Kleiman-Weiner, Max, Sosa, Felix, Thompson, Bill, Opheusden, Bas van, Griffiths, Thomas L., Gershman, Samuel, and Cushman, Fiery
Subjects: social learning, program induction, Bayesianmodeling, imitation learning, theory of mind
Abstract: Cumulative culture depends on the fidelity of learning be-tween successive generations, and the robustness with whichthe lessons of one generation apply to the problems of the next.How do humans accomplish these twin goals? We formalizesocial learning as a kind of program induction, and provide anexperimental test of a key prediction. To do this, we exploit akey fact: When humans learn from others, in addition to ob-serving inputs and outputs we often observe the process thatled to that output. For instance, when preparing a meal, wedon’t just observe a pile of vegetables and then a ratatouille.Instead, we observe a causal process that transforms those in-gredients into a finished food. Here, we use probabilistic pro-grams to represent causal processes and show that the observa-tion of an execution trace speeds up program induction, evenwhen learning from only a single example. This model pre-dicts that the inferences and behavior of people will be struc-tured by these execution traces. In two behavioral experiments,we show that human judgments and behavior are affected bythe execution trace in the systematic ways predicted by our for-mal model. These findings shed light on the mechanisms thatunderlie high fidelity social learning in humans, and unify therole of emulation and imitation in social learning.
Published: 2020

21. Intuitive Signaling Through an ”Imagined We’”

Author: Stacy, Stephanie, Zhao, Qingyi, Zhao, Minglu, Kleiman-Weiner, Max, and Gao, Tao
Abstract: Communication is highly overloaded. Despite this, even young children are good at leveraging context to understandambiguous signals. We propose a computational shared agency account of signaling that we call the Imagined We (IW)framework. We leverage Bayesian Theory of Mind to provide mechanisms for rational action planning and inverse actioninterpretation. In order to expand this framework for communication, we first treat signals as rational actions. We thenincorporate our rich understanding of intuitive utilities to constrain the scope of affordable actions. Finally, we treatcommunication as a cooperative act, subject to constraints of maximizing a shared utility. We implement this modelin two completely different behavioral psychology works to demonstrate the generality of the IW under different typesof uncertainty in cooperative communication. Additionally, we demonstrate that the IW outperforms multiple baselinemodels in a novel task across a series of simulation conditions.
Published: 2020

22. Approximate Planning in Spatial Search

Author: Kryven, Marta, primary, Yu, Suhyoun, additional, Kleiman-Weiner, Max, additional, Ullman, Tomer David, additional, and Tenenbaum, Joshua, additional
Published: 2024
Full Text: View/download PDF

23. Towards Formal Definitions of Blameworthiness, Intention, and Moral Responsibility

Author: Halpern, Joseph Y. and Kleiman-Weiner, Max
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: We provide formal definitions of degree of blameworthiness and intention relative to an epistemic state (a probability over causal models and a utility function on outcomes). These, together with a definition of actual causality, provide the key ingredients for moral responsibility judgments. We show that these definitions give insight into commonsense intuitions in a variety of puzzling cases from the literature., Comment: Appears in AAAI-18
Published: 2018

24. Learning to Share and Hide Intentions using Information Regularization

Author: Strouse, DJ, Kleiman-Weiner, Max, Tenenbaum, Josh, Botvinick, Matt, and Schwab, David
Subjects: Computer Science - Artificial Intelligence, Computer Science - Information Theory, Computer Science - Machine Learning, Computer Science - Multiagent Systems, Statistics - Machine Learning
Abstract: Learning to cooperate with friends and compete with foes is a key component of multi-agent reinforcement learning. Typically to do so, one requires access to either a model of or interaction with the other agent(s). Here we show how to learn effective strategies for cooperation and competition in an asymmetric information game with no such model or interaction. Our approach is to encourage an agent to reveal or hide their intentions using an information-theoretic regularizer. We consider both the mutual information between goal and action given state, as well as the mutual information between goal and state. We show how to optimize these regularizers in a way that is easy to integrate with policy gradient reinforcement learning. Finally, we demonstrate that cooperative (competitive) policies learned with our approach lead to more (less) reward for a second agent in two simple asymmetric information games., Comment: Presented at the 32nd Conference on Neural Information Processing Systems (NIPS 2018)
Published: 2018

25. Blaming humans in autonomous vehicle accidents: Shared responsibility across levels of automation

Author: Awad, Edmond, Levine, Sydney, Kleiman-Weiner, Max, Dsouza, Sohan, Tenenbaum, Joshua B., Shariff, Azim, Bonnefon, Jean-François, and Rahwan, Iyad
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: When a semi-autonomous car crashes and harms someone, how are blame and causal responsibility distributed across the human and machine drivers? In this article, we consider cases in which a pedestrian was hit and killed by a car being operated under shared control of a primary and a secondary driver. We find that when only one driver makes an error, that driver receives the blame and is considered causally responsible for the harm, regardless of whether that driver is a machine or a human. However, when both drivers make errors in cases of shared control between a human and a machine, the blame and responsibility attributed to the machine is reduced. This finding portends a public under-reaction to the malfunctioning AI components of semi-autonomous cars and therefore has a direct policy implication: a bottom-up regulatory scheme (which operates through tort law that is adjudicated through the jury system) could fail to properly regulate the safety of shared-control vehicles; instead, a top-down scheme (enacted through federal laws) may be called for.
Published: 2018

26. A Computational Model of Commonsense Moral Decision Making

Author: Kim, Richard, Kleiman-Weiner, Max, Abeliuk, Andres, Awad, Edmond, Dsouza, Sohan, Tenenbaum, Josh, and Rahwan, Iyad
Subjects: Computer Science - Artificial Intelligence
Abstract: We introduce a new computational model of moral decision making, drawing on a recent theory of commonsense moral learning via social dynamics. Our model describes moral dilemmas as a utility function that computes trade-offs in values over abstract moral dimensions, which provide interpretable parameter values when implemented in machine-led ethical decision-making. Moreover, characterizing the social structures of individuals and groups as a hierarchical Bayesian model, we show that a useful description of an individual's moral values - as well as a group's shared values - can be inferred from a limited amount of observed data. Finally, we apply and evaluate our approach to data from the Moral Machine, a web application that collects human judgments on moral dilemmas involving autonomous vehicles.
Published: 2018

27. What if everybody did that?: Universalization as a mechanism of moraldecision-making

Author: Levine, Sydney, Kleiman-Weiner, Max, Schulz, Laura, Tenenbaum, Josh, and Cushman, Fiery
Abstract: We describe a cognitive mechanism of moral judgment, universalization, that has received little attention up to now. Underuniversalization, an action’s moral permissibility is determined by calculating what the outcome would be if all people whoare similarly situated to the actor also acted in that way. This mechanism is particularly well-suited to capture our moraljudgments of free-rider cases, where one person doing the action increases utility but many people doing it decreasesutility. Universalization fits into an agreement-based (contractualist) theory of moral cognition, and explains properties ofour moral judgments that an outcome-based or rule-based approach cannot. We show patterns of universalization reasoningin young children as well as adults.
Published: 2019

28. Emotion attributions echo the structure of people’s intuitive theory of psychology

Author: Houlihan, Sean, Kleiman-Weiner, Max, Tenenbaum, Josh, and Saxe, Rebecca
Abstract: We present a generative model of how observers think about the emotions experienced by players in a socially-chargedgame: a public, high-stakes, one-shot Prisoner’s Dilemma. The model extends inverse planning frameworks to captureobservers’ judgments about players’ reactions to hypothetical events. Observers attribute different beliefs and values toplayers based on what decisions the players make. We model how observers’ noisy inferences of players’ mental contentsbias emotion predictions. Incorporation of non-monetary features into forward planning enables us to model emotions thatreflect complex social concerns (e.g. Embarrassment depends on how much players think others will infer that they tried totake advantage of their opponents). In addition to matching the intensities of twenty attributed emotions, the model reflectshow observers’ emotion judgments covary within single stimuli, indicating that the model captures important aspects ofthe generative process underlying humans’ emotion attributions in this game.
Published: 2019

29. Downloading Culture.zip: Social learning by program induction with executiontraces

Author: Kleiman-Weiner, Max, Sosa, Felix, Gershman, Samuel, and Cushman, Fiery
Abstract: Cumulative culture ultimately depends on the fidelity of learning between successive generations. When humans learnfrom others in addition to observing inputs and outputs we often observe the process which led to that output. Forinstance, when preparing a meal we don’t just observe a pile of vegetables and then a ratatouille. Instead, we observe acausal process by which those ingredients are transformed. Here we use programs to represent a cultural process and showthat the observation of an execution trace speeds up program induction even when learning from only a single example.This mechanism could account for (1) the high fidelity of social learning which leads to cumulative culture in humans(2) unify the role of emulation and imitation in social learning and (3) account for aspects of moral learning such asritualization.
Published: 2019

30. Heirarchical Drift-Diffusion Model for Moral Dilemma: Understanding Reaction Times and Choices

Author: Kim, Richard, Pescetelli, Niccolo, Kleiman-Weiner, Max, Awad, Edmond, Dsouza, Sohan, Tenenbaum, Josh, and Rahwan, Iyad
Abstract: Discrete choice models (e.g. logistic regression) are popular models in the economics literature that describe choices between twoor more discrete alternatives. These models have been successfully used to model value-based decisions, e.g. decisions in moraldilemmas, although temporal components of a decision, such as reaction times and changes of mind are not included. In cognitivesciences, another class of decision models, namely sequential-sampling models, has gained popularity in modelling choice accuracy,reaction time and decision uncertainty (e.g. confidence judgments). Here, we model decisions in moral dilemmas using a variant ofa hierarchical drift-diffusion model, factor drift diffusion, that combines the value-based approach with that of evidence accumulationmechanism by sequential-sampling. Specifically, we model the evidence accumulation process as resulting from a subjective weightingof abstract moral dimensions (factors). We train our model on a data set of 6500 moral decisions by 500 respondents on a popularweb platform (MoralMachine.mit.edu) and separately infer different sources of uncertainty in moral decisions. We show that the modelsuccessfully predicts reaction times and choices in moral dilemmas, while also leading to unexpected results
Published: 2018

31. A generative model of people’s intuitive theory of emotions: inverse planning inrich social games

Author: Houlihan, Sean, Kleiman-Weiner, Max, Tenenbaum, Josh, and Saxe, Rebecca
Abstract: We propose a formal model of humans’ intuitive theories of others’ emotions. From a single choice in a social interaction(e.g. the choice to cooperate in a Prisoner’s Dilemma game), human observers can infer a player’s complex values, such asprosocial preferences and reputational concerns. When the player then experiences a new situation (the game’s outcome),observers infer the player’s reaction to the event based on the mental state likely to have produced the player’s action. Herewe capture this process by inverting a richly structured generative model of social gameplay, including social equity andreputational dimensions, and translate players’ subjective motivations, expectations, and prediction errors into forwardpredictions of the emotional experiences of the players. Our model infers players’ values and expectations, generatespatterns of play that match observers’ intuitions, and supports formally generated emotion predictions with substantiallyextended breadth and nuance.
Published: 2018

32. The logic of universalization guides moral judgment

Author: Levine, Sydney, Kleiman-Weiner, Max, Schulz, Laura, Tenenbaum, Joshua, and Cushman, Fiery
Published: 2020

33. Preschoolers and Infants Calibrate Persistence from Adult Models

Author: Leonard, Julia, Kleiman-Weiner, Max, Lee, Yuna, Tenenbaum, Josh, and Schulz, Laura
Abstract: Perseverance, above and beyond IQ, predicts academic outcomes in school age children, however, little is knownabout what factors affect persistence in early childhood. Here, we propose a formal Bayesian model of how children mightlearn how to calibrate effort from observing adult models and then explore this idea behaviorally across two experiments inchildren and infants. Results from Experiment 1 show that preschoolers persist more after watching an adult persist, but onlyif the adult is successful at reaching their goal. Experiment 2 and a pre-registered replication extend these findings, showingthat even infants use adult models to modulate their persistence, and can generalize this inference to novel situations. Theseresults suggest that both preschoolers and infants are sensitive to adult persistence and use it to calibrate their own effort infar-reaching ways.
Published: 2017

34. Cooperative Social Intelligence:Understanding and Acting with Others

Author: Kleiman-Weiner, Max , Zhao, Yibiao, and Tenenbaum, Joshua B.
Subjects: multi-agent, Cooperation, Communication, coordi-nation, theory-of-mind, social learning
Published: 2017

35. Constructing Social Preferences From Anticipated Judgments:When Impartial Inequity is Fair and Why?

Author: Kleiman-Weiner, Max, Shaw, Alex, and Tenenbaum, Joshua B.
Subjects: fairness, social cognition, theory-of-mind, deci-sion making, Bayesian models
Abstract: Successful and repeated cooperation requires fairly sharingthe spoils of joint endeavors. Fair distribution is often doneaccording to preferences for equitable outcomes even thoughstrictly equitable outcomes can lead to inefficient waste. In ad-dition to preferences about the outcome itself, decision makersare also sensitive to the attributions others might make aboutthem as a result of their choice. We develop a novel mathemat-ical model where decision makers turn their capacity to inferlatent desires and beliefs from the behavior of others (theory-of-mind) towards themselves, anticipating the judgments oth-ers will make about them. Using this model we can construct apreference to be seen as impartial and integrate it with prefer-ences for equitable and efficient outcomes. We test this modelin two studies where the anticipated attribution of impartialityis ambiguous: when one agent is more deserving than the otherand when unbiased procedures for distribution are made avail-able. This model explains both participants’ judgments aboutthe partiality of others and their hypothetical decisions. Ourmodel argues that people avoid inequity not only because theyfind it inherently undesirable, they also want to avoid beingjudged as partial.
Published: 2017

36. Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents

Author: Piatti, Giorgio, Jin, Zhijing, Kleiman-Weiner, Max, Schölkopf, Bernhard, Sachan, Mrinmaya, Mihalcea, Rada, Piatti, Giorgio, Jin, Zhijing, Kleiman-Weiner, Max, Schölkopf, Bernhard, Sachan, Mrinmaya, and Mihalcea, Rada
Abstract: As AI systems pervade human life, ensuring that large language models (LLMs) make safe decisions is a significant challenge. This paper introduces the Governance of the Commons Simulation (GovSim), a generative simulation platform designed to study strategic interactions and cooperative decision-making in LLMs. Using GovSim, we investigate the dynamics of sustainable resource sharing in a society of AI agents. This environment allows us to study the influence of ethical considerations, strategic planning, and negotiation skills on cooperative outcomes for AI agents. We develop an LLM-based agent architecture designed for these social dilemmas and test it with a variety of LLMs. We find that all but the most powerful LLM agents fail to achieve a sustainable equilibrium in GovSim. Ablations reveal that successful multi-agent communication between agents is critical for achieving cooperation in these cases. Furthermore, our analyses show that the failure to achieve sustainable cooperation in most LLMs stems from their inability to formulate and analyze hypotheses about the long-term effects of their actions on the equilibrium of the group. Finally, we show that agents that leverage ``Universalization''-based reasoning, a theory of moral thinking, are able to achieve significantly greater sustainability. Taken together, GovSim enables us to study the mechanisms that underlie sustainable self-government with significant specificity and scale. We open source the full suite of our research results, including the simulation environment, agent prompts, and a comprehensive web interface., Comment: Revised version
Published: 2024

37. Too Many cooks: Bayesian inference for coordinating Multi-agent Collaboration

Author: Wang, Rose E., primary, Wu, Sarah A., additional, Evans, James A., additional, Parkes, David C., additional, Tenenbaum, Joshua B., additional, and Kleiman-Weiner, Max, additional
Published: 2021
Full Text: View/download PDF

38. Feature-based Joint Planning and Norm Learning in Collaborative Games

Author: Ho, Mark, MacGlashan, James, Greenwald, Amy, Littman, Michael, Hilliard, Elizabeth, Trimbach, Carl, Brawner, Stephen, Tenenbaum, Joshua, Kleiman-Weiner, Max, and Austerweil, Joseph
Subjects: joint intentionality, norms, team reasoning, reinforcement learning, features, computational modeling
Abstract: People often use norms to coordinate behavior andaccomplish shared goals. But how do people learn andrepresent norms? Here, we formalize the process by whichcollaborating individuals (1) reason about group plans duringinteraction, and (2) use task features to abstractly representnorms. In Experiment 1, we test the assumptions of our modelin a gridworld that requires coordination and contrast it with a“best response” model. In Experiment 2, we use our model totest whether group members’ joint planning relies more onstate features independent of other agents (landmark-basedfeatures) or state features determined by the configuration ofagents (agent-relative features).
Published: 2016

39. Coordinate to cooperate or compete:Abstract goals and joint intentions in social interaction

Author: Kleiman-Weiner, Max, Ho, Mark K., Austerweil, Joseph L., Littman, Michael L., and Tenenbaum, Joshua B.
Subjects: joint intention, Cooperation, coordination, rein-forcement learning, teams
Abstract: Successfully navigating the social world requires reasoningabout both high-level strategic goals, such as whether to co-operate or compete, as well as the low-level actions neededto achieve those goals. While previous work in experimentalgame theory has examined the former and work on multi-agentsystems has examined the later, there has been little work in-vestigating behavior in environments that require simultaneousplanning and inference across both levels. We develop a hierar-chical model of social agency that infers the intentions of otheragents, strategically decides whether to cooperate or competewith them, and then executes either a cooperative or competi-tive planning program. Learning occurs across both high-levelstrategic decisions and low-level actions leading to the emer-gence of social norms. We test predictions of this model inmulti-agent behavioral experiments using rich video-game likeenvironments. By grounding strategic behavior in a formalmodel of planning, we develop abstract notions of both co-operation and competition and shed light on the computationalnature of joint intentionality.
Published: 2016

40. Go fishing! Responsibility judgments when cooperation breaks down

Author: Allen, Kelsey, Jara-Ettinger, Julian, Gerstenberg, Tobias, Kleiman-Weiner, Max, and Tenenbaum, Joshua Bonnie
Subjects: responsibility attribution, theory of mind, recursivereasoning, multi-agent coordination
Abstract: Many social judgments hinge on assigning responsibility to individualsfor their role in a group‚Äôs success or failure. Often thegroup‚Äôs success depends on every team member acting in a rationalway. When someone does not conform to what othersexpect of them, cooperation breaks down. We present a computationalmodel of responsibility judgments for individualsin a cooperative setting. We test the model in two behavioralexperiments where participants were asked to evaluate agentsacting in a cooperative, one-shot game. In Experiment 1, weshow that participants‚Äô action predictions are consistent with arecursive reasoning model. In Experiment 2, we show that people‚Äôsassignments of blame are influenced by both an agent‚Äôspresumed rationality, or adherence to an expected policy, aswell as the pivotality of the agent‚Äôs actions, or how close thesituation was to one in which the action would have made adifference to the outcome.
Published: 2015

41. Inference of Intention and Permissibility in Moral Decision Making

Author: Kleiman-Weiner, Max, Gerstenberg, Tobias, Levine, Sydney, and Tenenbaum, Joshua B
Subjects: moral judgment, social cognition, intention, theoryof mind, influence diagrams, counterfactuals
Abstract: The actions of a rational agent reveal information about itsmental states. These inferred mental states, particularly theagent‚Äôs intentions, play an important role in the evaluation ofmoral permissibility. While previous computational modelshave shown that beliefs and desires can be inferred from behaviorunder the assumption of rational action they have criticallylacked a third mental state, intentions. In this work, wedevelop a novel formalism for intentions and show how theycan be inferred as counterfactual contrasts over influence diagrams.This model is used to quantitatively explain judgmentsabout intention and moral permissibility in classic and noveltrolley problems.
Published: 2015

42. Wins above replacement: Responsibility attributions as counterfactual replacements

Author: Gerstenberg, Tobias, Ullman, Tomer, Kleiman-Weiner, Max, Lagnado, David, and Tenenbaum, Josh
Published: 2014

43. Similar failures of consideration arise in human and machine planning

Author: Zhang, Alice, primary, Langenkamp, Max, additional, Kleiman-Weiner, Max, additional, Oikarinen, Tuomas, additional, and Cushman, Fiery Andrews, additional
Published: 2023
Full Text: View/download PDF

44. Non-parametric Bayesian inference of strategies in repeated games

Author: Kleiman-Weiner, Max, Tenenbaum, Joshua B., and Zhou, Penghui
Published: 2018
Full Text: View/download PDF

45. Lucky or clever? From expectations to responsibility judgments

Author: Gerstenberg, Tobias, Ullman, Tomer D., Nagel, Jonas, Kleiman-Weiner, Max, Lagnado, David A., and Tenenbaum, Joshua B.
Published: 2018
Full Text: View/download PDF

46. Emotion prediction as computation over a generative theory of mind

Author: Houlihan, Sean Dae, primary, Kleiman-Weiner, Max, additional, Hewitt, Luke B., additional, Tenenbaum, Joshua B., additional, and Saxe, Rebecca, additional
Published: 2023
Full Text: View/download PDF

47. Learning a commonsense moral theory

Author: Kleiman-Weiner, Max, Saxe, Rebecca, and Tenenbaum, Joshua B.
Published: 2017
Full Text: View/download PDF

48. Assessing and dissociating virtues from the 'bottom up': A case study of generosity vs. fairness.

Author: Kraft-Todd, Gordon T., Kleiman-Weiner, Max, and Young, Liane
Subjects: *ETHICS, *CHARITY, *ALTRUISM, *NATURAL language processing, *MOTIVATION (Psychology), *SOCIAL justice, *COOPERATIVENESS, *CASE studies, *RESEARCH funding, *EMPIRICAL research
Abstract: The empirical study of virtue is plagued by imprecise definitions and assessment. Here we propose a three-stage, data-driven ('bottom-up') method to differentiate lay perceptions of virtues. Employing two virtues – generosity (as cooperation) and fairness (as impartiality) – as a case study, we present findings utilizing data from three studies (total N = 2,667). First, natural language processing of free-response data indicated that participants used different 'topics' (i.e. clusters of words) to describe behaviours representing generosity (topics: 'charity' and 'kindness') and fairness ('equality'). Second, participants in a survey experiment rated behaviours expressing generosity and fairness differently across 6 out of 9 underlying features measured. Third, participants perceive that actors in vignette-based experiments engaging in behaviours expressing generosity versus fairness were motivated differently on 5 out of 6 motivations measured. Our findings support the distinction of the virtues of generosity (as cooperation) and fairness (as impartiality) and indicate the utility of our bottom-up method for assessing and distinguishing virtues. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

49. Does big data serve policy? Not without context. An experiment with in silico social science

Author: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, Center for Brains, Minds, and Machines, Graziul, Chris, Belikov, Alexander, Chattopadyay, Ishanu, Chen, Ziwen, Fang, Hongbo, Girdhar, Anuraag, Jia, Xiaoshuang, Krafft, P. M., Kleiman-Weiner, Max, Lewis, Candice, Liang, Chen, Muchovej, John, Vientós, Alejandro, Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, Center for Brains, Minds, and Machines, Graziul, Chris, Belikov, Alexander, Chattopadyay, Ishanu, Chen, Ziwen, Fang, Hongbo, Girdhar, Anuraag, Jia, Xiaoshuang, Krafft, P. M., Kleiman-Weiner, Max, Lewis, Candice, Liang, Chen, Muchovej, John, and Vientós, Alejandro
Abstract: The DARPA Ground Truth project sought to evaluate social science by constructing four varied simulated social worlds with hidden causality and unleashed teams of scientists to collect data, discover their causal structure, predict their future, and prescribe policies to create desired outcomes. This large-scale, long-term experiment of in silico social science, about which the ground truth of simulated worlds was known, but not by us, reveals the limits of contemporary quantitative social science methodology. First, problem solving without a shared ontology—in which many world characteristics remain existentially uncertain—poses strong limits to quantitative analysis even when scientists share a common task, and suggests how they could become insurmountable without it. Second, data labels biased the associations our analysts made and assumptions they employed, often away from the simulated causal processes those labels signified, suggesting limits on the degree to which analytic concepts developed in one domain may port to others. Third, the current standard for computational social science publication is a demonstration of novel causes, but this limits the relevance of models to solve problems and propose policies that benefit from the simpler and less surprising answers associated with most important causes, or the combination of all causes. Fourth, most singular quantitative methods applied on their own did not help to solve most analytical challenges, and we explored a range of established and emerging methods, including probabilistic programming, deep neural networks, systems of predictive probabilistic finite state machines, and more to achieve plausible solutions. However, despite these limitations common to the current practice of computational social science, we find on the positive side that even imperfect knowledge can be sufficient to identify robust prediction if a more pluralistic approach is applied. Applying competing approaches by distinct
Published: 2023

50. Virtue Discounting: Observability Reduces Moral Actors’ Perceived Virtue

Author: Kraft-Todd, Gordon T., primary, Kleiman-Weiner, Max, additional, and Young, Liane, additional
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

202 results on '"Kleiman-Weiner, Max"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources