Toward fast policy search for learning legged locomotion

Marc Peter Deisenroth; Roberto Calandra; Andre Seyfarth; Jan Peters

doi:10.1109/IROS.2012.6385955

Toward fast policy search for learning legged locomotion

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/Gutachten › Beitrag in Konferenzband › Beigetragen › Begutachtung

Beitragende

Marc Peter Deisenroth - , Technische Universität Darmstadt (Autor:in)
Roberto Calandra - , Technische Universität Darmstadt (Autor:in)
Andre Seyfarth - , Technische Universität Darmstadt (Autor:in)
Jan Peters - , Technische Universität Darmstadt, Max-Planck-Institut für Intelligente Systeme (Autor:in)

Abstract

Legged locomotion is one of the most versatile forms of mobility. However, despite the importance of legged locomotion and the large number of legged robotics studies, no biped or quadruped matches the agility and versatility of their biological counterparts to date. Approaches to designing controllers for legged locomotion systems are often based on either the assumption of perfectly known dynamics or mechanical designs that substantially reduce the dimensionality of the problem. The few existing approaches for learning controllers for legged systems either require exhaustive real-world data or they improve controllers only conservatively, leading to slow learning. We present a data-efficient approach to learning feedback controllers for legged locomotive systems, based on learned probabilistic forward models for generating walking policies. On a compass walker, we show that our approach allows for learning gait policies from very little data. Moreover, we analyze learned locomotion models of a biomechanically inspired biped. Our approach has the potential to scale to high-dimensional humanoid robots with little loss in efficiency.

Details

Originalsprache	Englisch
Titel	2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012
Seiten	1787-1792
Seitenumfang	6
Publikationsstatus	Veröffentlicht - 2012
Peer-Review-Status	Ja
Extern publiziert	Ja

Publikationsreihe

Reihe	IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
ISSN	2153-0858

Konferenz

Titel	25th IEEE/RSJ International Conference on Robotics and Intelligent Systems, IROS 2012
Dauer	7 - 12 Oktober 2012
Stadt	Vilamoura, Algarve
Land	Portugal

Externe IDs

ORCID	/0000-0001-9430-8433/work/158768047

Schlagworte

ASJC Scopus Sachgebiete

Steuerungs- und Systemtechnik
Software
Maschinelles Sehen und Mustererkennung
Angewandte Informatik

Forschungsportal der TU Dresden