Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 9 von 36
Machine learning, 2015-01, Vol.98 (1-2), p.31-56
2015
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Subjectively interesting alternative clusterings
Ist Teil von
  • Machine learning, 2015-01, Vol.98 (1-2), p.31-56
Ort / Verlag
New York: Springer US
Erscheinungsjahr
2015
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
  • We deploy a recently proposed framework for mining subjectively interesting patterns from data to the problem of alternative clustering, where patterns are sets of clusters (clusterings) in the data. This framework outlines how subjective interestingness of patterns (here, clusterings) can be quantified using sound information theoretic concepts. We demonstrate how it motivates a new objective function quantifying the interestingness of a clustering, automatically accounting for a user’s prior beliefs and for redundancies between the discovered patterns. Directly searching for the optimal set of clusterings defined in this way is hard. However, the optimization problem can be solved approximately if clusterings are generated iteratively. In this iterative scheme, each subsequent clustering is maximally interesting given the whole set of previously generated clusterings, automatically trading off interestingness with non-redundancy. Although generating each clustering in an iterative fashion is computationally hard as well, we develop an approximation technique similar to spectral clustering algorithms. Our method can generate as many clusterings as the user requires. Subjective evaluation or the value of the objective function can guide the termination of the process. In addition our method allows varying the number of clusters in each successive clustering. Experiments on artificial and real-world datasets show that the mined clusterings fulfill the requirements of a good clustering solution by being both non-redundant and of high compactness. Comparison with existing solutions shows that our approach compares favourably with regard to well-known objective measures of similarity and quality of clusterings, even though it is not designed to directly optimize them.
Sprache
Englisch
Identifikatoren
ISSN: 0885-6125
eISSN: 1573-0565
DOI: 10.1007/s10994-013-5333-z
Titel-ID: cdi_proquest_miscellaneous_1660098510

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX