Policy Gradient Methods for Robotics

Peters, J; Schaal, S

doi:10.1109/IROS.2006.282564

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Conference Paper

Policy Gradient Methods for Robotics

MPS-Authors

There are no MPG-Authors in the publication available

External Resource

https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4058714
(Publisher version)

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

There are no public fulltexts stored in PuRe

Supplementary Material (public)

There is no public supplementary material available

Citation

Peters, J., & Schaal, S. (2006). Policy Gradient Methods for Robotics. In 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems (pp. 2219-2225). Los Alamitos, CA, USA: IEEE Computer Society.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0013-CFD3-6

Abstract

The acquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-structured environments. However, to date only few existing reinforcement learning methods have been scaled into the domains of high-dimensional robots such as manipulator, legged or humanoid robots. Policy gradient methods remain one of the few exceptions and have found a variety of applications. Nevertheless, the application of such methods is not without peril if done in an uninformed manner. In this paper, we give an overview on learning with policy gradient methods for robotics with a strong focus on recent advances in the field. We outline previous applications to robotics and show how the most recently developed methods can significantly improve learning performance. Finally, we evaluate our most promising algorithm in the application of hitting a baseball with an anthropomorphic arm.