Reinforcement learning on variable impedance controller for high-precision robotic assembly

Jianlan Luo, Eugen Solowjow, Chengtao Wen, Juan Aparicio Ojea, Alice M. Agogino, Aviv Tamar, Pieter Abbeel

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Precise robotic manipulation skills are desirable in many industrial settings, reinforcement learning (RL) methods hold the promise of acquiring these skills autonomously. In this paper, we explicitly consider incorporating operational space force/torque information into reinforcement learning; this is motivated by humans heuristically mapping perceived forces to control actions, which results in completing high-precision tasks in a fairly easy manner. Our approach combines RL with force/torque information by incorporating a proper operational space force controller; where we also exploit different ablations on processing this information. Moreover, we propose a neural network architecture that generalizes to reasonable variations of the environment. We evaluate our method on the open-source Siemens Robot Learning Challenge, which requires precise and delicate force-controlled behavior to assemble a tight-fit gear wheel set.

Original languageEnglish
Title of host publication2019 International Conference on Robotics and Automation, ICRA 2019
Pages3080-3087
Number of pages8
ISBN (Electronic)9781538660263
DOIs
StatePublished - May 2019
Event2019 International Conference on Robotics and Automation, ICRA 2019 - Montreal, Canada
Duration: 20 May 201924 May 2019

Publication series

NameProceedings - IEEE International Conference on Robotics and Automation

Conference

Conference2019 International Conference on Robotics and Automation, ICRA 2019
Country/TerritoryCanada
CityMontreal
Period20/05/1924/05/19

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Artificial Intelligence
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Reinforcement learning on variable impedance controller for high-precision robotic assembly'. Together they form a unique fingerprint.

Cite this