The Effectiveness Index Intrinsic Reward for Coordinating Service Robots

Yinon Douchan, Gal A. Kaminka

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

Abstract

Modern multi-robot service robotics applications often rely on coordination capabilities at multiple levels, from global (system-wide) task allocation and selection to local (nearby) spatial coordination to avoid collisions. Often, the global methods are considered to be the heart of the multi-robot system, while local methods are tacked on to overcome intermittent, spatially limited hindrances. We challenge this general assumption. Using the Alphabet Soup simulator (which simulates order picking, an application made famous by Kiva Systems), we experiment with a set of myopic, local methods for obstacle avoidance. We report on a series of experiments with a reinforcement-learning approach that uses the Effectiveness-Index intrinsic reward to let robots learn which method to select when avoiding collisions. We show that allowing the learner to explore the space of parameterized methods results in significant improvements, even compared to the original methods provided by the simulator.

Original language: English
Title of host publication: Springer Proceedings in Advanced Robotics
Pages: 299-311
Number of pages: 13
State: Published - 2018

Publication series

Name: Springer Proceedings in Advanced Robotics
Volume: 6

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Mechanical Engineering
  • Engineering (miscellaneous)
  • Artificial Intelligence
  • Computer Science Applications
  • Applied Mathematics
