Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 16 von 1403
IEEE transactions on knowledge and data engineering, 2017-02, Vol.29 (2), p.330-343
2017
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Guided HTM: Hierarchical Topic Model with Dirichlet Forest Priors
Ist Teil von
  • IEEE transactions on knowledge and data engineering, 2017-02, Vol.29 (2), p.330-343
Ort / Verlag
New York: IEEE
Erscheinungsjahr
2017
Quelle
IEEE Xplore
Beschreibungen/Notizen
  • Despite the proliferation of topic models, the organization of topics from the probabilistic models needs improvement in two ways: the better structured presentation of topics and the incorporation of domain knowledge on the corpus. The structured presentation, i.e., the hierarchical topic model, helps in categorizing similar topics; the incorporation of domain knowledge enables the concentrated sampling of predefined keywords in the mixture parameter learning. This paper presents a hierarchical topic models with incorporated domain knowledge, called Guided Hierarchical Topic Model (GHTM). Specifically, we allocated the prior information from the knowledge to the Dirichlet Forest prior. From the prior adjustment, we obtained the topic tree guided by the domain knowledge. This paper also contributes in enumerating four different knowledge extraction methods and applying the extracted knowledge to GHTM. We evaluated the performance of GHTM in terms of the hierarchical clustering accuracy, and we found a significant improvement of hierarchical clustering measured by F-measures. This improvement is also verified by the perplexity analyses. Additionally, we measured topic quality with KL-divergence and visualization, and these confirm the ability to better separate topic distributions. Finally, we tested the hierarchical topic quality through human experiments, and this also revealed significant improvements originating from the guidance.
Sprache
Englisch
Identifikatoren
ISSN: 1041-4347
eISSN: 1558-2191
DOI: 10.1109/TKDE.2016.2625790
Titel-ID: cdi_ieee_primary_7737042

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX