Abstract
Collaboration between human and robot requires effective modes of communication to assign robot tasks and coordinate activities. As communication can utilize different modalities, a multi-modal approach can be more expressive than single modal models alone. In this work we propose a co-speech gesture model that can assign robot tasks for human-robot collaboration. Human gestures and speech, detected by computer vision and speech recognition, can thus refer to objects in the scene and apply robot actions to them. We present an experimental evaluation of the multi-modal co-speech model with a real-world industrial use case. Results demonstrate that multi-modal communication is easy to achieve and can provide benefits for collaboration with respect to single modal tools.
Original language | English |
---|---|
Title of host publication | IEEE International Conference on Robotic Computing (IRC) |
Publisher | IEEE |
Pages | 110-114 |
ISBN (Electronic) | 979-8-3503-9574-7 |
DOIs | |
Publication status | Published - 30 Nov 2023 |
Publication type | A4 Article in conference proceedings |
Event | IEEE International Conference on Robotic Computing (IRC) - Laguna Hills, California, United States Duration: 11 Dec 2023 → 13 Dec 2023 |
Conference
Conference | IEEE International Conference on Robotic Computing (IRC) |
---|---|
Country/Territory | United States |
City | Laguna Hills, California |
Period | 11/12/23 → 13/12/23 |
Keywords
- cs.RO
Publication forum classification
- Publication forum level 1