Abstract
Despite significant advancements in the fields of vision, language, and robotics, integrating these capabilities to create an autonomous robot assistant remains a challenge. This paper presents ViLaBot (Vision and Language roBot), a system designed to aid humans in daily activities at home. ViLaBot combines a language model with a library of basic visuomotor skills to understand human needs, create action plans, and execute them. The system relies solely on onboard visual and proprioceptive sensing, eliminating the need for pre-built maps or precise object locations and facilitating real-world deployment in a variety of environments. Experimental validation in the Habitat simulator, conducted in 11 realistic home environments featuring simulated human agents, indicated that ViLaBot can achieve promising results when using ground-truth image segmentation, yet exhibits modest performance in scenarios involving imperfect visual perception. The results support the validity of the proposed pipeline and highlight the critical components of the system that should be improved to increase its overall success rate and reliability.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering, MetroXRAINE 2024 - Proceedings |
Publisher | IEEE |
Pages | 1206-1211 |
Number of pages | 6 |
ISBN (Electronic) | 9798350378009 |
Publication status | Published - 2024 |
Event | 3rd IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering, MetroXRAINE 2024, St Albans, United Kingdom. Duration: 21 Oct 2024 → 23 Oct 2024. https://metroxraine.org/metroxraine2024/index |
Publication series
Series | IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering, MetroXRAINE - Proceedings |
---|
Conference
Conference | 3rd IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering, MetroXRAINE 2024 |
---|---|
Abbreviated title | MetroXRAINE 2024 |
Country/Territory | United Kingdom |
City | St Albans |
Period | 21/10/24 → 23/10/24 |
Internet address | https://metroxraine.org/metroxraine2024/index |
Keywords
- assistive tasks
- human-robot interaction
- navigation and manipulation
- task planning