方法对比
并排查看您选择的方法;存在差异的行会高亮显示。
| 在线 DBSCAN× | DBSCAN× | HDBSCAN× | 在线高斯混合模型× | |
|---|---|---|---|---|
| 领域 | 机器学习 | 机器学习 | 机器学习 | 机器学习 |
| 方法族 | Machine learning | Machine learning | Machine learning | Machine learning |
| 起源年份≠ | 1998 | 1996 | 2013 | 2000–2009 |
| 提出者≠ | Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M., & Xu, X. | Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. | Campello, R. J. G. B.; Moulavi, D.; Sander, J. | Cappé, O. & Moulines, E. (online EM formulation) |
| 类型≠ | Incremental density-based clustering | Density-based clustering algorithm | Hierarchical density-based clustering | Probabilistic clustering / density estimation (incremental) |
| 开创性文献≠ | Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M., & Xu, X. (1998). Incremental Clustering for Mining in a Data Warehousing Environment. In Proceedings of the 24th International Conference on Very Large Data Bases (VLDB), pp. 323–333. link ↗ | Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. (1996). A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. Proceedings of the 2nd KDD, 226–231. link ↗ | Campello, R. J. G. B., Moulavi, D., & Sander, J. (2013). Density-Based Clustering Based on Hierarchical Density Estimates. In J. Pei et al. (Eds.), Advances in Knowledge Discovery and Data Mining. PAKDD 2013. Lecture Notes in Computer Science, vol. 7819 (pp. 160–172). Springer, Berlin, Heidelberg. DOI ↗ | Cappé, O. & Moulines, E. (2009). On-line expectation-maximization algorithm for latent data models. Journal of the Royal Statistical Society: Series B, 71(3), 593–613. DOI ↗ |
| 别名≠ | Incremental DBSCAN, Streaming DBSCAN, Online density-based clustering, iDBSCAN | DBSCAN Kümeleme, density-based clustering, density-based spatial clustering | HDBSCAN, Hierarchical DBSCAN, hierarchical density-based clustering, HDBSCAN* | Online GMM, Incremental GMM, Streaming Gaussian Mixture Model, Sequential GMM |
| 相关≠ | 5 | 3 | 3 | 5 |
| 摘要≠ | Online DBSCAN extends the classic density-based clustering algorithm to handle continuously arriving data points without re-clustering the entire dataset from scratch. Each new observation is integrated into the existing cluster structure by local neighborhood queries, making it practical for streaming and data-warehousing scenarios where data grows incrementally. | DBSCAN is a density-based clustering algorithm, introduced by Ester, Kriegel, Sander and Xu in 1996, that groups together points lying in dense regions and flags points in sparse regions as noise. It is effective on noisy data and on clusters of irregular, non-spherical shapes. | HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise) is a density-based clustering algorithm introduced by Campello, Moulavi, and Sander in 2013. It extends DBSCAN by building a full hierarchy of density-based clusters across all density scales and then extracting a stable flat partition, making it robust to datasets where cluster densities vary substantially across regions. | Online Gaussian Mixture Model adapts the classic GMM to streaming or large-scale data by replacing full-batch EM with incremental updates — processing one observation or mini-batch at a time and continuously refining component means, covariances, and mixing weights without revisiting the entire dataset. |
| ScholarGate数据集 ↗ |
|
|
|
|