An Incremental Algorithm of Text Clustering Based on Semantic Sequences
Is part of
Wuhan University journal of natural sciences, 2006-09, Vol.11 (5), p.1340-1344
Place / Publisher
Institute of Computer Software, Xi'an Jiaotong University, Xi'an 710049, Shaanxi, China
Year of publication
2006
Source
SpringerLink (Online service)
Descriptions/Notes
This paper proposes an incremental text clustering algorithm based on semantic sequences. Using the similarity relation between semantic sequences, the algorithm computes the cover of each set of similar semantic sequences and, at each step, selects the candidate cluster with the minimum entropy overlap value as a result cluster. Experimental comparison shows that, under the same conditions, the algorithm achieves higher precision than other algorithms, and the advantage is especially clear on sets of long documents.
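
The selection step described in the abstract might be sketched as follows. This is an illustrative sketch only, not the paper's implementation: the abstract does not define entropy overlap, so the formula used here (summing -(1/f) ln(1/f) over a cluster's documents, where f is the number of candidate clusters containing the document) is an assumption borrowed from frequent term-based clustering. The function names and the input format (a mapping from a candidate label to the set of document ids it covers) are hypothetical, and the incremental update and the construction of semantic sequences are not shown.

```python
import math


def entropy_overlap(cluster, candidates):
    """Assumed definition: sum of -(1/f) * ln(1/f) over the cluster's documents,
    where f counts how many candidate clusters contain the document."""
    total = 0.0
    for doc in cluster:
        f = sum(1 for docs in candidates.values() if doc in docs)
        total += -(1.0 / f) * math.log(1.0 / f)
    return total


def greedy_select(candidates):
    """Repeatedly pick the candidate with minimum entropy overlap, then remove
    its documents from the remaining candidates (hypothetical helper)."""
    remaining = {name: set(docs) for name, docs in candidates.items() if docs}
    result = []
    while remaining:
        # Sort the labels so that ties are broken deterministically.
        best = min(
            sorted(remaining),
            key=lambda name: entropy_overlap(remaining[name], remaining),
        )
        chosen = remaining.pop(best)
        result.append((best, chosen))
        # Drop already-clustered documents and discard emptied candidates.
        remaining = {
            name: docs - chosen
            for name, docs in remaining.items()
            if docs - chosen
        }
    return result


if __name__ == "__main__":
    # Hypothetical candidates keyed by a label for the shared semantic sequence.
    demo = {"a": {1, 2}, "b": {2, 3}, "c": {4}}
    for name, docs in greedy_select(demo):
        print(name, sorted(docs))
```

Under this assumed definition, a candidate whose documents appear in no other candidate has an entropy overlap of zero and is selected first, so the resulting clusters are disjoint and together cover every document that appeared in some candidate.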