Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient Sparse Spherical K-Means for Document Clustering

J. Knittel, S. Koch, und T. Ertl. Proceedings of the 21st ACM Symposium on Document Engineering, New York, NY, USA, Association for Computing Machinery, (2021)
DOI: 10.1145/3469096.3474937

Zusammenfassung

Spherical k-Means is frequently used to cluster document collections because it performs reasonably well in many settings and is computationally efficient. However, the time complexity increases linearly with the number of clusters k, which limits the suitability of the algorithm for larger values of k depending on the size of the collection. Optimizations targeted at the Euclidean k-Means algorithm largely do not apply because the cosine distance is not a metric. We therefore propose an efficient indexing structure to improve the scalability of Spherical k-Means with respect to k. Our approach exploits the sparsity of the input vectors and the convergence behavior of k-Means to reduce the number of comparisons on each iteration significantly.

Links und Ressourcen

BibTeX-Schlüssel: Knittel21Clustering
Eintragstyp: inproceedings
Adresse: New York, NY, USA
Buchtitel: Proceedings of the 21st ACM Symposium on Document Engineering
Jahr: 2021
Verlag: Association for Computing Machinery
Reihe: DocEng '21
isbn: 9781450385961
numpages: 4
articleno: 6
location: Limerick, Ireland
DOI: 10.1145/3469096.3474937
URL: https://doi.org/10.1145/3469096.3474937

PUMA

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient Sparse Spherical K-Means for Document Clustering

Zusammenfassung

Links und Ressourcen

Tags

Community

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen
(0)

PUMA

KopierenLöschenDiese Publikation zur Ablage hinzufügenCommunity-EintragVersionsverlauf dieses EintragsURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Efficient Sparse Spherical K-Means for Document Clustering

Zusammenfassung

Links und Ressourcen

Tags

Community

Zitieren Sie diese Publikation

Mehr Zitationsstile

Suchen auf

Metadaten

Kommentare und Rezensionen (0)

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient Sparse Spherical K-Means for Document Clustering

Kommentare und Rezensionen
(0)