Hierarchical Relative Entropy Policy Search

Many real-world problems are inherently hierarchically structured. The use of this structure in an agent's policy may well be the key to improved scalability and higher performance on motor skill tasks. However, such hierarchical structures cannot be exploited by current policy search algorithms. We concentrate on a basic, but highly relevant hierarchy: the 'mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. Using a hierarchical setup for our learning method allows us to learn not only one solution to a problem but many. We base our algorithm on a recently proposed information theoretic policy search method, which addresses the exploitation-exploration trade-off by limiting the loss of information between policy updates.
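The two-stage action selection of the 'mixed option' policy can be illustrated with a minimal sketch. The abstract does not specify any parameterization, so everything here is an assumption made for illustration: the class name `MixedOptionPolicy`, a softmax gating that is linear in the state, and linear-Gaussian option-policies. The sketch covers only sampling from such a policy, not the information theoretic update.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over a 1-D array."""
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()


class MixedOptionPolicy:
    """Hypothetical mixed-option policy: pi(a|s) = sum_o pi(o|s) * pi(a|s, o).

    Assumed parameterization (not taken from the source):
      gating pi(o|s):          softmax over linear scores of the state
      option-policy pi(a|s,o): Gaussian with a linear mean and a fixed std
    """

    def __init__(self, gating_weights, option_gains, option_stds, seed=0):
        self.gating_weights = np.asarray(gating_weights, dtype=float)  # (n_options, state_dim)
        self.option_gains = np.asarray(option_gains, dtype=float)      # (n_options, action_dim, state_dim)
        self.option_stds = np.asarray(option_stds, dtype=float)        # (n_options,)
        self.rng = np.random.default_rng(seed)

    def gating_probs(self, state):
        """Probability of selecting each option in the given state."""
        state = np.asarray(state, dtype=float)
        return softmax(self.gating_weights @ state)

    def sample(self, state):
        """First draw an option from the gating, then an action from that option."""
        state = np.asarray(state, dtype=float)
        probs = self.gating_probs(state)
        option = int(self.rng.choice(len(probs), p=probs))
        mean = self.option_gains[option] @ state
        noise = self.rng.standard_normal(mean.shape)
        action = mean + self.option_stds[option] * noise
        return option, action


# Usage: two options, a 2-D state, and a 1-D action.
policy = MixedOptionPolicy(
    gating_weights=[[1.0, 0.0], [0.0, 1.0]],
    option_gains=[[[0.5, 0.0]], [[0.0, -0.5]]],
    option_stds=[0.1, 0.2],
)
option, action = policy.sample([1.0, -1.0])
```

Because each option keeps its own parameters, different options are free to represent different solutions to the same task, which is the property the abstract highlights.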
