
Document Clustering

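Document clustering is an unsupervised text mining task that groups documents with similar content without using any labels. It is used to organize large document collections and to support exploratory analysis, drawing on the body of text mining techniques consolidated by Aggarwal and Zhai (2012) and compared empirically by Steinbach, Karypis, and Kumar (2000).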
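
As a minimal sketch of what grouping documents without labels can look like, the Python example below weights terms with TF-IDF and clusters the resulting vectors with k-means under cosine similarity. The choice of TF-IDF and k-means is an assumption made for illustration, not something this section prescribes; the function names (`tfidf_vectors`, `kmeans_cosine`), the toy documents, and the fixed seed indices are all hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn whitespace-tokenized documents into L2-normalized sparse TF-IDF dicts."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF, so terms found in every document keep a nonzero weight.
        vec = {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1.0)
               for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def cosine(a, b):
    """Sparse dot product; equals cosine similarity when both inputs are unit vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def kmeans_cosine(vectors, seeds, max_iter=20):
    """k-means on unit vectors; `seeds` are indices of the initial centroids (assumed input)."""
    centroids = [dict(vectors[i]) for i in seeds]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each document goes to its most similar centroid.
        new_labels = [
            max(range(len(centroids)), key=lambda k: cosine(v, centroids[k]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: each centroid becomes the normalized sum of its members.
        for k in range(len(centroids)):
            total = {}
            for v, lab in zip(vectors, labels):
                if lab == k:
                    for t, w in v.items():
                        total[t] = total.get(t, 0.0) + w
            if not total:
                continue  # keep the previous centroid if a cluster is empty
            norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
            centroids[k] = {t: w / norm for t, w in total.items()}
    return labels


# Hypothetical toy collection: two finance sentences, two football sentences.
docs = [
    "stock market prices fell sharply",
    "investors sold stock as market prices dropped",
    "the team won the football match",
    "football fans cheered as the team scored",
]
labels = kmeans_cosine(tfidf_vectors(docs), seeds=[0, 2])
print(labels)
```

With these four toy documents and seeds 0 and 2, the two finance sentences share one label and the two football sentences share the other. A real collection would need proper tokenization, stop-word handling, and a considered choice of the number of clusters.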

Aggarwal, C. C. & Zhai, C. (2012). Mining Text Data. Springer. ISBN: 9781461432227
Steinbach, M., Karypis, G. & Kumar, V. (2000). A Comparison of Document Clustering Techniques. KDD Workshop on Text Mining.

ScholarGate. (2026, June 1). Document Clustering. ScholarGate. https://scholargate.app/zh/text-mining/document-clustering
