1. A neural integrator model for planning and value-based decision making of a robotics assistant
- Author
-
Weronika Wojtak, Flora Ferreira, Wolfram Erlhagen, Luís Louro, Estela Bicho, Paulo Vicente, and Universidade do Minho
- Subjects
0209 industrial biotechnology ,Computer science ,Process (engineering) ,Dynamic field theory ,Sevice robotics ,02 engineering and technology ,Sequence learning ,Task (project management) ,020901 industrial engineering & automation ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Value-based decision making ,Learning ,Neural integrator ,Ciências Naturais::Matemáticas ,Science & Technology ,business.industry ,Action planning ,Robotics ,Mobile robot ,Object (computer science) ,Dynamic Neural Field ,Integrator ,Robot ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Assembly robot ,Software ,Matemáticas [Ciências Naturais] - Abstract
Modern manufacturing and assembly environments are characterized by a high variability in the built process which challenges human–robot cooperation. To reduce the cognitive workload of the operator, the robot should not only be able to learn from experience but also to plan and decide autonomously. Here, we present an approach based on Dynamic Neural Fields that apply brain-like computations to endow a robot with these cognitive functions. A neural integrator is used to model the gradual accumulation of sensory and other evidence as time-varying persistent activity of neural populations. The decision to act is modeled by a competitive dynamics between neural populations linked to different motor behaviors. They receive the persistent activation pattern of the integrators as input. In the first experiment, a robot learns rapidly by observation the sequential order of object transfers between an assistant and an operator to subsequently substitute the assistant in the joint task. The results show that the robot is able to proactively plan the series of handovers in the correct order. In the second experiment, a mobile robot searches at two different workbenches for a specific object to deliver it to an operator. The object may appear at the two locations in a certain time period with independent probabilities unknown to the robot. The trial-by-trial decision under uncertainty is biased by the accumulated evidence of past successes and choices. The choice behavior over a longer period reveals that the robot achieves a high search efficiency in stationary as well as dynamic environments., The work received financial support from FCT through the PhD fellowships PD/BD/128183/2016 and SFRH/BD/124912/2016, the project “Neurofield” (PTDC/MAT-APL/31393/2017) and the research centre CMAT within the project UID/MAT/00013/2013.
- Published
- 2020