ScholarGate
助手
Process / pipeline

文档聚类

文档聚类是一种无监督的文本挖掘任务,它在不使用任何标签的情况下将内容相似的文档分组。它用于组织大型文档集合和进行探索性分析,借鉴了 Aggarwal 和 Zhai (2012) 巩固的文本挖掘技术体系,并由 Steinbach、Karypis 和 Kumar (2000) 进行实证比较。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

登录

Method map

The neighbourhood of related methods — select a node to explore.

+1 more

来源

  1. Aggarwal, C. C. & Zhai, C. (2012). Mining Text Data. Springer. ISBN: 9781461432227
  2. Steinbach, M., Karypis, G. & Kumar, V. (2000). A Comparison of Document Clustering Techniques. KDD Workshop on Text Mining. link

如何引用本页

ScholarGate. (2026, June 1). Document Clustering. ScholarGate. https://scholargate.app/zh/text-mining/document-clustering

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

被引用于

ScholarGateDocument Clustering (Document Clustering). 于 2026-06-15 检索自 https://scholargate.app/zh/text-mining/document-clustering · 数据集: https://doi.org/10.5281/zenodo.20539026