Author: "Song-Chun Zhu" - Searchworks@Jio Institute Digital Library Search Results

Your search for "Song-Chun Zhu" returned 789 results.

1. The Tong Test: Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions

Author: Yujia Peng, Jiaheng Han, Zhenliang Zhang, Lifeng Fan, Tengyu Liu, Siyuan Qi, Xue Feng, Yuxi Ma, Yizhou Wang, and Song-Chun Zhu
Subjects: Artificial general intelligence, Artificial intelligence benchmark, Artificial intelligence evaluation, Embodied artificial intelligence, Value alignment, Turing test, Engineering (General). Civil engineering (General), TA1-2040
Abstract: The release of the generative pre-trained transformer (GPT) series has brought artificial general intelligence (AGI) to the forefront of the artificial intelligence (AI) field once again. However, the questions of how to define and evaluate AGI remain unclear. This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions (DEPSI). More specifically, we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system. The Tong test describes a value- and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI, allowing for infinite task generation. We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized, quantitative, and objective benchmarks and evaluation of AGI.
Published: 2024
Full Text: View/download PDF

2. A Reconfigurable Data Glove for Reconstructing Physical and Virtual Grasps

Author: Hangxin Liu, Zeyu Zhang, Ziyuan Jiao, Zhenliang Zhang, Minchen Li, Chenfanfu Jiang, Yixin Zhu, and Song-Chun Zhu
Subjects: Data glove, Tactile sensing, Virtual reality, Physics-based simulation, Engineering (General). Civil engineering (General), TA1-2040
Abstract: In this work, we present a reconfigurable data glove design to capture different modes of human hand–object interactions, which are critical in training embodied artificial intelligence (AI) agents for fine manipulation tasks. To achieve various downstream tasks with distinct features, our reconfigurable data glove operates in three modes sharing a unified backbone design that reconstructs hand gestures in real time. In the tactile-sensing mode, the glove system aggregates manipulation force via customized force sensors made from a soft and thin piezoresistive material; this design minimizes interference during complex hand movements. The virtual reality (VR) mode enables real-time interaction in a physically plausible fashion: A caging-based approach is devised to determine stable grasps by detecting collision events. Leveraging a state-of-the-art finite element method, the simulation mode collects data on fine-grained four-dimensional manipulation events comprising hand and object motions in three-dimensional space and how the object’s physical properties (e.g., stress and energy) change in accordance with manipulation over time. Notably, the glove system presented here is the first to use high-fidelity simulation to investigate the unobservable physical and causal factors behind manipulation actions. In a series of experiments, we characterize our data glove in terms of individual sensors and the overall system. More specifically, we evaluate the system’s three modes by ① recording hand gestures and associated forces, ② improving manipulation fluency in VR, and ③ producing realistic simulation effects of various tool uses, respectively. Based on these three modes, our reconfigurable data glove collects and reconstructs fine-grained human grasp data in both physical and virtual environments, thereby opening up new avenues for the learning of manipulation skills for embodied AI agents.
Published: 2024
Full Text: View/download PDF

3. Communicative Learning: A Unified Learning Formalism

Author: Luyao Yuan and Song-Chun Zhu
Subjects: Artificial intelligence, Cooperative communication, Machine learning, Pedagogy, Theory of mind, Engineering (General). Civil engineering (General), TA1-2040
Abstract: In this article, we propose a communicative learning (CL) formalism that unifies existing machine learning paradigms, such as passive learning, active learning, algorithmic teaching, and so forth, and facilitates the development of new learning methods. Arising from human cooperative communication, this formalism poses learning as a communicative process and combines pedagogy with the burgeoning field of machine learning. The pedagogical insight facilitates the adoption of alternative information sources in machine learning besides randomly sampled data, such as intentional messages given by a helpful teacher. More specifically, in CL, a teacher and a student exchange information with each other collaboratively to transmit and acquire certain knowledge. Each agent has a mind, which includes the agent’s knowledge, utility, and mental dynamics. To establish effective communication, each agent also needs an estimation of its partner’s mind. We define expressive mental representations and learning formulation sufficient for such recursive modeling, which endows CL with human-comparable learning efficiency. We demonstrate the application of CL to several prototypical collaboration tasks and illustrate that this formalism allows learning protocols to go beyond Shannon’s communication limit. Finally, we present our contribution to the foundations of learning by putting forth hierarchies in learning and defining the halting problem of learning.
Published: 2023
Full Text: View/download PDF

4. Artificial Social Intelligence: A Comparative and Holistic View

Author: Lifeng Fan, Manjie Xu, Zhihao Cao, Yixin Zhu, and Song-Chun Zhu
Subjects: social intelligence, theory of mind (tom), communication, human-machine teaming, Electronic computers. Computer science, QA75.5-76.95
Abstract: In addition to a physical comprehension of the world, humans possess a high social intelligence—the intelligence that senses social events, infers the goals and intents of others, and facilitates social interaction. Notably, humans are distinguished from their closest primate cousins by their social cognitive skills as opposed to their physical counterparts. We believe that artificial social intelligence (ASI) will play a crucial role in shaping the future of artificial intelligence (AI). This article begins with a review of ASI from a cognitive science standpoint, including social perception, theory of mind (ToM), and social interaction. Next, we examine the recently emerged computational counterpart in the AI community. Finally, we provide an in-depth discussion on topics related to ASI.
Published: 2022
Full Text: View/download PDF

5. Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense

Author: Yixin Zhu, Tao Gao, Lifeng Fan, Siyuan Huang, Mark Edmonds, Hangxin Liu, Feng Gao, Chi Zhang, Siyuan Qi, Ying Nian Wu, Joshua B. Tenenbaum, and Song-Chun Zhu
Subjects: Engineering (General). Civil engineering (General), TA1-2040
Abstract: Recent progress in deep learning is essentially based on a “big data for small tasks” paradigm, under which massive amounts of data are used to train a classifier for a single narrow task. In this paper, we call for a shift that flips this paradigm upside down. Specifically, we propose a “small data for big tasks” paradigm, wherein a single artificial intelligence (AI) system is challenged to develop “common sense,” enabling it to solve a wide range of tasks with little training data. We illustrate the potential power of this new paradigm by reviewing models of common sense that synthesize recent breakthroughs in both machine and human vision. We identify functionality, physics, intent, causality, and utility (FPICU) as the five core domains of cognitive AI with humanlike common sense. When taken as a unified concept, FPICU is concerned with the questions of “why” and “how,” beyond the dominant “what” and “where” framework for understanding vision. They are invisible in terms of pixels but nevertheless drive the creation, maintenance, and development of visual scenes. We therefore coin them the “dark matter” of vision. Just as our universe cannot be understood by merely studying observable matter, we argue that vision cannot be understood without studying FPICU. We demonstrate the power of this perspective to develop cognitive AI systems with humanlike common sense by showing how to observe and apply FPICU with little training data to solve a wide range of challenging tasks, including tool use, planning, utility inference, and social learning. In summary, we argue that the next generation of AI must embrace “dark” humanlike common sense for solving novel tasks.
Keywords: Computer vision, Artificial intelligence, Causality, Intuitive physics, Functionality, Perceived intent, Utility
Published: 2020
Full Text: View/download PDF

6. CX-ToM: Counterfactual explanations with theory-of-mind for enhancing human trust in image recognition models

Author: Arjun R. Akula, Keze Wang, Changsong Liu, Sari Saba-Sadiya, Hongjing Lu, Sinisa Todorovic, Joyce Chai, and Song-Chun Zhu
Subjects: Computer science, Artificial intelligence, Human-computer interaction, Science
Abstract: We propose CX-ToM, short for counterfactual explanations with theory-of-mind, a new explainable AI (XAI) framework for explaining decisions made by a deep convolutional neural network (CNN). In contrast to the current methods in XAI that generate explanations as a single-shot response, we pose explanation as an iterative communication process, i.e., dialogue between the machine and human user. More concretely, our CX-ToM framework generates a sequence of explanations in a dialogue by mediating the differences between the minds of the machine and human user. To do this, we use Theory of Mind (ToM), which helps us in explicitly modeling the human’s intention, the machine’s mind as inferred by the human, as well as the human’s mind as inferred by the machine. Moreover, most state-of-the-art XAI frameworks provide attention (or heat map) based explanations. In our work, we show that these attention-based explanations are not sufficient for increasing human trust in the underlying CNN model. In CX-ToM, we instead use counterfactual explanations called fault-lines, which we define as follows: given an input image I for which a CNN classification model M predicts class c_pred, a fault-line identifies the minimal semantic-level features (e.g., stripes on zebra), referred to as explainable concepts, that need to be added to or deleted from I to alter the classification category of I by M to another specified class c_alt. Extensive experiments verify our hypotheses, demonstrating that our CX-ToM significantly outperforms the state-of-the-art XAI models.
Published: 2022
Full Text: View/download PDF

7. Patching interpretable And‐Or‐Graph knowledge representation using augmented reality

Author: Hangxin Liu, Yixin Zhu, and Song‐Chun Zhu
Subjects: augmented reality (AR), explainable artificial intelligence (XAI), robot learning, Electronic computers. Computer science, QA75.5-76.95
Abstract: We present a novel augmented reality (AR) interface to provide effective means to diagnose a robot's erroneous behaviors, endow it with new skills, and patch its knowledge structure represented by an And‐Or‐Graph (AOG). Specifically, an AOG representation of opening medicine bottles is learned from human demonstration and yields a hierarchical structure that captures the spatiotemporal compositional nature of the given task, which is highly interpretable for the users. Through a series of psychological experiments, we demonstrate that the explanations of a robotic system, inherited from and produced by the AOG, can better foster human trust compared to other forms of explanations. Moreover, by visualizing the knowledge structure and robot states, the AR interface allows human users to intuitively understand what the robot knows, supervise the robot's task planner, and interactively teach the robot with new actions. Together, users can quickly identify the reasons for failures and conveniently patch the current knowledge structure to prevent future errors. This capability demonstrates the interpretability of our knowledge representation and the new forms of interactions afforded by the proposed AR interface.
Published: 2021
Full Text: View/download PDF

8. CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update.

Author: Zhi Gao, Yuntao Du 0005, Xintong Zhang, Xiaojian Ma, Wenjuan Han, Song-Chun Zhu, and Qing Li 0003
Published: 2024
Full Text: View/download PDF

9. MindDial: Enhancing Conversational Agents with Theory-of-Mind for Common Ground Alignment and Negotiation.

Author: Shuwen Qiu, Mingdian Liu, Hengli Li, Song-Chun Zhu, and Zilong Zheng
Published: 2024

10. RulE: Knowledge Graph Reasoning with Rule Embedding.

Author: Xiaojuan Tang, Song-Chun Zhu, Yitao Liang, and Muhan Zhang
Published: 2024
Full Text: View/download PDF

11. LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments.

Author: Zixia Jia, Mengmeng Wang, Baichen Tong, Song-Chun Zhu, and Zilong Zheng
Published: 2024
Full Text: View/download PDF

12. On the Emergence of Symmetrical Reality.

Author: Zhenliang Zhang 0002, Zeyu Zhang 0001, Ziyuan Jiao, Yao Su 0001, Hangxin Liu, Wei Wang 0115, and Song-Chun Zhu
Published: 2024
Full Text: View/download PDF

13. ProAgent: Building Proactive Cooperative Agents with Large Language Models.

Author: Ceyao Zhang, Kaijie Yang, Siyi Hu, Zihao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, and Yaodong Yang 0001
Published: 2024
Full Text: View/download PDF

14. An Embodied Generalist Agent in 3D World.

Author: Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li 0003, Song-Chun Zhu, Baoxiong Jia, and Siyuan Huang 0001
Published: 2024

15. Fast Peer Adaptation with Context-aware Exploration.

Author: Long Ma, Yuanfei Wang, Fangwei Zhong, Song-Chun Zhu, and Yizhou Wang 0001
Published: 2024

16. Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning.

Author: Yizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang 0001, Song-Chun Zhu, and Xue Feng
Published: 2024

17. Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World.

Author: Rujie Wu, Xiaojian Ma, Zhenliang Zhang 0002, Wei Wang 0115, Qing Li 0003, Song-Chun Zhu, and Yizhou Wang 0001
Published: 2024

18. Neural-Symbolic Recursive Machine for Systematic Generalization.

Author: Qing Li 0003, Yixin Zhu 0001, Yitao Liang, Ying Nian Wu, Song-Chun Zhu, and Siyuan Huang 0001
Published: 2024

19. CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents.

Author: Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Yaodong Yang 0001, and Song-Chun Zhu
Published: 2024

20. ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes.

Author: Ran Gong, Jiangyong Huang, Yizhou Zhao, Haoran Geng, Xiaofeng Gao 0002, Qingyang Wu, Wensi Ai, Ziheng Zhou, Demetri Terzopoulos, Song-Chun Zhu, Baoxiong Jia, and Siyuan Huang 0001
Published: 2023
Full Text: View/download PDF

21. X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events.

Author: Bo Dai 0025, Linge Wang, Baoxiong Jia, Zeyu Zhang 0001, Song-Chun Zhu, Chi Zhang 0017, and Yixin Zhu 0001
Published: 2023
Full Text: View/download PDF

22. Learning a Causal Transition Model for Object Cutting.

Author: Zeyu Zhang 0001, Muzhi Han, Baoxiong Jia, Ziyuan Jiao, Yixin Zhu 0001, Song-Chun Zhu, and Hangxin Liu
Published: 2023
Full Text: View/download PDF

23. Part-level Scene Reconstruction Affords Robot Interaction.

Author: Zeyu Zhang 0001, Lexing Zhang, Zaijin Wang, Ziyuan Jiao, Muzhi Han, Yixin Zhu 0001, Song-Chun Zhu, and Hangxin Liu
Published: 2023
Full Text: View/download PDF

24. Diffusion-based Generation, Optimization, and Planning in 3D Scenes.

Author: Siyuan Huang 0001, Zan Wang, Puhao Li, Baoxiong Jia, Tengyu Liu, Yixin Zhu 0001, Wei Liang 0008, and Song-Chun Zhu
Published: 2023
Full Text: View/download PDF

25. Sim2Plan: Robot Motion Planning via Message Passing Between Simulation and Reality.

Author: Yizhou Zhao, Yuanhong Zeng, Qian Long, Ying Nian Wu, and Song-Chun Zhu
Published: 2023
Full Text: View/download PDF

26. On the Complexity of Bayesian Generalization.

Author: Yu-Zhe Shi, Manjie Xu, John E. Hopcroft, Kun He 0001, Joshua B. Tenenbaum, Song-Chun Zhu, Ying Nian Wu, Wenjuan Han, and Yixin Zhu 0001
Published: 2023

27. Rearrange Indoor Scenes for Human-Robot Co-Activity.

Author: Weiqi Wang, Zihang Zhao, Ziyuan Jiao, Yixin Zhu 0001, Song-Chun Zhu, and Hangxin Liu
Published: 2023
Full Text: View/download PDF

28. Sequential Manipulation Planning on Scene Graph.

Author: Ziyuan Jiao, Yida Niu, Zeyu Zhang 0001, Song-Chun Zhu, Yixin Zhu 0001, and Hangxin Liu
Published: 2022
Full Text: View/download PDF

29. Towards Socially Intelligent Agents with Mental State Transition and Human Value.

Author: Liang Qiu 0001, Yizhou Zhao, Yuan Liang 0001, Pan Lu, Weiyan Shi, Zhou Yu, and Song-Chun Zhu
Published: 2022
Full Text: View/download PDF

30. Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning.

Author: Chi Zhang 0017, Sirui Xie, Baoxiong Jia, Ying Nian Wu, Song-Chun Zhu, and Yixin Zhu 0001
Published: 2022
Full Text: View/download PDF

31. Latent Diffusion Energy-Based Model for Interpretable Text Modelling.

Author: Peiyu Yu, Sirui Xie, Xiaojian Ma, Baoxiong Jia, Bo Pang 0004, Ruiqi Gao, Yixin Zhu 0001, Song-Chun Zhu, and Ying Nian Wu
Published: 2022

32. COAT: Measuring Object Compositionality in Emergent Representations.

Author: Sirui Xie, Ari S. Morcos, Song-Chun Zhu, and Ramakrishna Vedantam
Published: 2022

33. Learning V1 Simple Cells with Vector Representation of Local Content and Matrix Representation of Local Motion.

Author: Ruiqi Gao, Jianwen Xie, Siyuan Huang 0001, Yufan Ren, Song-Chun Zhu, and Ying Nian Wu
Published: 2022
Full Text: View/download PDF

34. ValueNet: A New Dataset for Human Value Driven Dialogue System.

Author: Liang Qiu 0001, Yizhou Zhao, Jinchao Li, Pan Lu, Baolin Peng, Jianfeng Gao 0001, and Song-Chun Zhu
Published: 2022
Full Text: View/download PDF

35. Learning from the Tangram to Solve Mini Visual Tasks.

Author: Yizhou Zhao, Liang Qiu 0001, Pan Lu, Feng Shi 0006, Tian Han 0001, and Song-Chun Zhu
Published: 2022
Full Text: View/download PDF

36. SQA3D: Situated Question Answering in 3D Scenes.

Author: Xiaojian Ma, Silong Yong, Zilong Zheng, Qing Li 0003, Yitao Liang, Song-Chun Zhu, and Siyuan Huang 0001
Published: 2023

37. A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics.

Author: Qing Li 0003, Siyuan Huang 0001, Yining Hong, Yixin Zhu 0001, Ying Nian Wu, and Song-Chun Zhu
Published: 2023

38. Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning.

Author: Pan Lu, Liang Qiu 0001, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Tanmay Rajpurohit, Peter Clark, and Ashwin Kalyan
Published: 2023

39. Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models.

Author: Pan Lu, Baolin Peng, Hao Cheng 0002, Michel Galley, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, and Jianfeng Gao 0001
Published: 2023

40. Learning non-Markovian Decision-Making from State-only Sequences.

Author: Aoyang Qin, Feng Gao 0013, Qing Li 0003, Song-Chun Zhu, and Sirui Xie
Published: 2023

41. Learning Energy-Based Prior Model with Diffusion-Amortized MCMC.

Author: Peiyu Yu, Yaxuan Zhu, Sirui Xie, Xiaojian Ma, Ruiqi Gao, Song-Chun Zhu, and Ying Nian Wu
Published: 2023

42. Evaluating and Inducing Personality in Pre-trained Language Models.

Author: Guangyuan Jiang, Manjie Xu, Song-Chun Zhu, Wenjuan Han, Chi Zhang 0017, and Yixin Zhu 0001
Published: 2023

43. Diplomat: A Dialogue Dataset for Situated PragMATic Reasoning.

Author: Hengli Li, Song-Chun Zhu, and Zilong Zheng
Published: 2023

44. Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds.

Author: Siyuan Huang 0001, Yichen Xie 0002, Song-Chun Zhu, and Yixin Zhu 0001
Published: 2021
Full Text: View/download PDF

45. YouRefIt: Embodied Reference Understanding with Language and Gesture.

Author: Yixin Chen 0003, Qing Li 0003, Deqian Kong, Yik Lun Kei, Song-Chun Zhu, Tao Gao 0004, Yixin Zhu 0001, and Siyuan Huang 0001
Published: 2021
Full Text: View/download PDF

46. VLGrammar: Grounded Grammar Induction of Vision and Language.

Author: Yining Hong, Qing Li 0003, Song-Chun Zhu, and Siyuan Huang 0001
Published: 2021
Full Text: View/download PDF

47. Mind the Context: The Impact of Contextualization in Neural Module Networks for Grounding Visual Referring Expressions.

Author: Arjun R. Akula, Spandana Gella, Keze Wang, Song-Chun Zhu, and Siva Reddy
Published: 2021
Full Text: View/download PDF

48. CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization.

Author: Arjun R. Akula, Soravit Changpinyo, Boqing Gong, Piyush Sharma, Song-Chun Zhu, and Radu Soricut
Published: 2021
Full Text: View/download PDF

49. Consolidating Kinematic Models to Promote Coordinated Mobile Manipulations.

Author: Ziyuan Jiao, Zeyu Zhang 0001, Xin Jiang, David Han, Song-Chun Zhu, Yixin Zhu 0001, and Hangxin Liu
Published: 2021
Full Text: View/download PDF

50. Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective.

Author: Ziyuan Jiao, Zeyu Zhang 0001, Weiqi Wang, David Han, Song-Chun Zhu, Yixin Zhu 0001, and Hangxin Liu
Published: 2021
Full Text: View/download PDF
