Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 11 von 823

Details

Autor(en) / Beteiligte
Titel
Contextual Direct Policy Search: With Regularized Covariance Matrix Estimation
Ist Teil von
  • Journal of intelligent & robotic systems, 2019-11, Vol.96 (2), p.141-157
Ort / Verlag
Dordrecht: Springer Netherlands
Erscheinungsjahr
2019
Link zum Volltext
Quelle
SpringerLink
Beschreibungen/Notizen
  • Stochastic search and optimization techniques are used in a vast number of areas, ranging from refining the design of vehicles, determining the effectiveness of new drugs, developing efficient strategies in games, or learning proper behaviors in robotics. However, they specialize for the specific problem they are solving, and if the problem’s context slightly changes, they cannot adapt properly. In fact, they require complete re-leaning in order to perform correctly in new unseen scenarios, regardless of how similar they are to previous learned environments. Contextual algorithms have recently emerged as solutions to this problem. They learn the policy for a task that depends on a given context, such that widely different contexts belonging to the same task are learned simultaneously. That being said, the state-of-the-art proposals of this class of algorithms prematurely converge, and simply cannot compete with algorithms that learn a policy for a single context. We describe the Contextual Relative Entropy Policy Search (CREPS) algorithm, which belongs to the before-mentioned class of contextual algorithms. We extend it with a technique that allows the algorithm to severely increase its performance, and we call it Contextual Relative Entropy Policy Search with Covariance Matrix Adaptation (CREPS-CMA). We propose two variants, and demonstrate their behavior in a set of classic contextual optimization problems, and on complex simulator robot tasks.
Sprache
Englisch
Identifikatoren
ISSN: 0921-0296
eISSN: 1573-0409
DOI: 10.1007/s10846-018-0968-4
Titel-ID: cdi_gale_infotracacademiconefile_A724288068

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX