Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)
15 Downloads (Pure)

Abstract

Robot control for tactile feedback based manip-ulation can be difficult due to modeling of physical contacts, partial observability of the environment, and noise in perception and control. This work focuses on solving partial observability of contact-rich manipulation tasks as a Sequence-to-Sequence (Seq2Seq) Imitation Learning (IL) problem. The proposed Seq2Seq model first produces a robot-environment interaction sequence to estimate the partially observable environment state variables, and then, the observed interaction sequence is transformed to a control sequence for the task itself. The proposed Seq2Seq IL for tactile feedback based manipulation is experimentally validated on a door-open task in a simulated environment and a snap-on insertion task with a real robot. The model is able to learn both tasks from only 50 expert demonstrations while state-of-the-art reinforcement learning and imitation learning methods fail.

Original languageEnglish
Title of host publicationProceedings - ICRA 2023
Subtitle of host publicationIEEE International Conference on Robotics and Automation
PublisherIEEE
Pages5829-5836
Number of pages8
ISBN (Electronic)979-8-3503-2365-8
ISBN (Print)979-8-3503-2366-5
DOIs
Publication statusPublished - 2023
Publication typeA4 Article in conference proceedings
EventIEEE International Conference on Robotics and Automation - London, United Kingdom
Duration: 29 May 20232 Jun 2023

Conference

ConferenceIEEE International Conference on Robotics and Automation
Country/TerritoryUnited Kingdom
CityLondon
Period29/05/232/06/23

Publication forum classification

  • Publication forum level 2

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation'. Together they form a unique fingerprint.

Cite this