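Document Clustering
Document clustering is an unsupervised text mining task that groups documents with similar content without using any labels. It is used to organize large document collections and to support exploratory analysis. It draws on the body of text mining techniques consolidated by Aggarwal and Zhai (2012), and its main techniques were compared empirically by Steinbach, Karypis, and Kumar (2000).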
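As a rough illustration of grouping documents by content without labels, the sketch below weights terms with TF-IDF and then runs a small k-means-style loop under cosine similarity. It is a minimal example under stated assumptions, not the procedure of either cited source: the whitespace tokenizer, the smoothed IDF formula, the farthest-first seeding, and the names `tfidf_vectors`, `mean_vector`, and `cluster_documents` are all illustrative choices.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn raw strings into L2-normalised TF-IDF dicts (term -> weight)."""
    tokenized = [doc.lower().split() for doc in docs]  # naive tokenizer (assumption)
    n = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF, one common variant (assumption).
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def cosine(a, b):
    """Dot product of two unit-length sparse vectors, i.e. their cosine similarity."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def mean_vector(vectors):
    """Sum the member vectors and rescale to unit length to get a centroid."""
    total = Counter()
    for v in vectors:
        for t, w in v.items():
            total[t] += w
    norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
    return {t: w / norm for t, w in total.items()}


def cluster_documents(docs, k=2, iterations=10):
    """Assign each document a cluster label in range(k); no labels are used as input."""
    vectors = tfidf_vectors(docs)
    # Deterministic farthest-first seeding (assumption): start from the first
    # document, then add the document least similar to the chosen seeds.
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(min(vectors, key=lambda v: max(cosine(v, c) for c in centroids)))
    labels = [0] * len(docs)
    for _ in range(iterations):
        # Assignment step: each document joins its most similar centroid.
        labels = [max(range(k), key=lambda j: cosine(v, centroids[j])) for v in vectors]
        # Update step: recompute each centroid from its current members.
        updated = []
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            updated.append(mean_vector(members) if members else centroids[j])
        if updated == centroids:
            break
        centroids = updated
    return labels


docs = [
    "the cat sat on the mat with another cat",
    "a kitten and a cat played on the mat",
    "stock markets fell as bond yields rose",
    "investors sold stock then bought bond funds",
]
labels = cluster_documents(docs, k=2)
print(labels)
```

On this toy collection the two pet-related sentences end up in one cluster and the two finance-related sentences in the other. A real collection would usually need better tokenization, stop-word handling, and a considered choice of `k`.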
Sources
- Aggarwal, C. C. & Zhai, C. (2012). Mining Text Data. Springer. ISBN: 9781461432227
- Steinbach, M., Karypis, G. & Kumar, V. (2000). A Comparison of Document Clustering Techniques. KDD Workshop on Text Mining.
How to cite this page
ScholarGate. (2026, June 1). Document Clustering. ScholarGate. https://scholargate.app/zh/text-mining/document-clustering