Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 14 von 361
IEEE access, 2020, Vol.8, p.159639-159649
2020
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Policy Reuse for Dialog Management Using Action-Relation Probability
Ist Teil von
  • IEEE access, 2020, Vol.8, p.159639-159649
Ort / Verlag
Piscataway: IEEE
Erscheinungsjahr
2020
Quelle
EZB Electronic Journals Library
Beschreibungen/Notizen
  • We study the problem of policy adaptation for reinforcement-learning-based dialog management. Policy adaptation is a commonly used technique to alleviate the problem of data sparsity when training a goal-oriented dialog system for a new task (the target task) by using knowledge when learning policies in an existing task. The methods used by current works in dialog policy adaptation need much time and effort for adapting because they use reinforcement learning algorithms to train a new policy for the target task from scratch. In this paper, we show that a dialog policy can be learned without training by reinforcement learning in the target task. In contrast to existing works, our proposed method learns the relation in the form of probability distribution between the action sets of the source and the target tasks. Thus, we can immediately derive a policy for the target task, which significantly reduces the adaptation time. Our experiments show that the proposed method learns a new policy for the target task much more quickly. In addition, the learned policy achieves higher performance than policies created by fine-tuning when the amount of available data on the target task is limited.
Sprache
Englisch
Identifikatoren
ISSN: 2169-3536
eISSN: 2169-3536
DOI: 10.1109/ACCESS.2020.3017780
Titel-ID: cdi_crossref_primary_10_1109_ACCESS_2020_3017780

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX