Abstract
One of the main challenges in robotics is the development of systems that can adapt to their environment and achieve autonomous behavior. Current approaches typically aim to achieve this by increasing the complexity of the centralized controller by, e.g., direct modeling of their behavior, or implementing machine learning. In contrast, we simplify the controller using a decentralized and modular approach, with the aim of finding specific requirements needed for a robust and scalable learning strategy in robots. To achieve this, we conducted experiments and simulations on a specific robotic platform assembled from identical autonomous units that continuously sense their environment and react to it. By letting each unit adapt its behavior independently using a basic Monte Carlo scheme, the assembled system is able to learn and maintain optimal behavior in a dynamic environment as long as its memory is representative of the current environment, even when incurring damage. We show that the physical connection between the units is enough to achieve learning, and no additional communication or centralized information is required. As a result, such a distributed learning approach can be easily scaled to larger assemblies, blurring the boundaries between materials and robots, paving the way for a new class of modular “robotic matter” that can autonomously learn to thrive in dynamic or unfamiliar situations, for example, encountered by soft robots or self-assembled (micro)robots in various environments spanning from the medical realm to space explorations.
| Original language | English |
|---|---|
| Article number | e2017015118 |
| Number of pages | 6 |
| Journal | Proceedings of the National Academy of Sciences of the United States of America |
| Volume | 118 |
| Issue number | 21 |
| DOIs | |
| Publication status | Published - May 2021 |
| Externally published | Yes |
Bibliographical note
Funding Information:ACKNOWLEDGMENTS. This work is part of the Dutch Research Council (NWO) and was performed at the research institute of AMOLF. This project is part of the research program Innovational Research Incentives Scheme Veni from NWO with project number 15868 (NWO) and has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement 767195.
Funding Information:
This work is part of the Dutch Research Council (NWO) and was performed at the research institute of AMOLF. This project is part of the research program Innovational Research Incentives Scheme Veni from NWO with project number 15868 (NWO) and has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement 767195.
Funding
ACKNOWLEDGMENTS. This work is part of the Dutch Research Council (NWO) and was performed at the research institute of AMOLF. This project is part of the research program Innovational Research Incentives Scheme Veni from NWO with project number 15868 (NWO) and has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement 767195. This work is part of the Dutch Research Council (NWO) and was performed at the research institute of AMOLF. This project is part of the research program Innovational Research Incentives Scheme Veni from NWO with project number 15868 (NWO) and has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement 767195.
Keywords
- Dynamic environment |
- Emergent behavior
- Modular robot
- Reinforced learning
Fingerprint
Dive into the research topics of 'Continuous learning of emergent behavior in robotic matter'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver