

Lecture notes in computer science, 2000, p.665-672

2000

Author(s) / Contributors

Title

Lightweight Document Clustering

Is part of

Place / Publisher

Berlin, Heidelberg: Springer Berlin Heidelberg

Year of publication

2000


Source

Alma/SFX Local Collection

Descriptions/Notes

A lightweight document clustering method is described that operates in high dimensions, processes tens of thousands of documents and groups them into several thousand clusters or, by varying a single parameter, into a few dozen clusters. The method uses a reduced indexing view of the original documents, where only the k best keywords of each document are indexed. An efficient procedure for clustering is specified in two parts: (a) compute k most similar documents for each document in the collection and (b) group the documents into clusters using these similarity scores. The method has been evaluated on a database of over 50,000 customer service problem reports that are reduced to 3,000 clusters and 5,000 exemplar documents. Results demonstrate efficient clustering performance with excellent group similarity measures.
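
The two-part procedure named in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how keywords are ranked, how similarity is scored, or how documents are grouped, so every such choice here is an assumption made for illustration: TF-IDF ranking for the "k best keywords", shared-keyword counts as the similarity score, and merging of linked documents with a union-find structure. The function names and the `min_score` parameter are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict
import math


def top_keywords(docs, k):
    """Reduced view: keep only the k best words per document.

    Ranking by TF-IDF is an assumption; the abstract does not name the measure.
    """
    tokenized = [doc.lower().split() for doc in docs]
    df = Counter(word for words in tokenized for word in set(words))
    n = len(docs)
    reduced = []
    for words in tokenized:
        tf = Counter(words)
        # Descending score, then alphabetical so that ties are deterministic.
        ranked = sorted(tf, key=lambda w: (-tf[w] * math.log(n / df[w]), w))
        reduced.append(set(ranked[:k]))
    return reduced


def nearest_neighbors(reduced, k):
    """Part (a): for each document, the k documents sharing the most keywords."""
    index = defaultdict(list)  # keyword -> ids of documents indexed under it
    for doc_id, words in enumerate(reduced):
        for word in words:
            index[word].append(doc_id)
    neighbors = []
    for doc_id, words in enumerate(reduced):
        scores = Counter()
        for word in words:
            for other in index[word]:
                if other != doc_id:
                    scores[other] += 1
        ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
        neighbors.append(ranked[:k])
    return neighbors


def group(neighbors, min_score):
    """Part (b): merge documents linked by a score of at least min_score.

    min_score is a hypothetical stand-in for the single tunable parameter.
    """
    parent = list(range(len(neighbors)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for doc_id, links in enumerate(neighbors):
        for other, score in links:
            if score >= min_score:
                parent[find(doc_id)] = find(other)
    clusters = defaultdict(list)
    for doc_id in range(len(neighbors)):
        clusters[find(doc_id)].append(doc_id)
    return sorted(clusters.values())


# Made-up sample reports, not data from the paper.
docs = [
    "printer driver install fails",
    "printer driver crash",
    "disk full error",
    "disk error on boot",
]
reduced = top_keywords(docs, k=4)
neighbors = nearest_neighbors(reduced, k=2)
clusters = group(neighbors, min_score=2)
print(clusters)  # [[0, 1], [2, 3]]
```

In this sketch, raising `min_score` splits the collection into more and smaller clusters, and lowering it merges them. That loosely mirrors the abstract's remark that varying a single parameter changes the number of clusters, although the parameter used in the paper may work differently.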

Language: English
Identifiers: ISBN: 9783540410669, 354041066X
ISSN: 0302-9743
eISSN: 1611-3349
DOI: 10.1007/3-540-45372-5_82
Title ID: cdi_pascalfrancis_primary_778132

Format: –
Subjects: Applied sciences, Artificial intelligence, Computer science, control theory, systems, Exact sciences and technology, Learning and adaptive systems
