Porovnať metódy
Prezrite si vybrané metódy vedľa seba; riadky, ktoré sa líšia, sú zvýraznené.
| Online DBSCAN× | HDBSCAN× | Online Gaussov model zmesi× | |
|---|---|---|---|
| Odbor | Strojové učenie | Strojové učenie | Strojové učenie |
| Rodina | Machine learning | Machine learning | Machine learning |
| Rok vzniku≠ | 1998 | 2013 | 2000–2009 |
| Tvorca≠ | Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M., & Xu, X. | Campello, R. J. G. B.; Moulavi, D.; Sander, J. | Cappé, O. & Moulines, E. (online EM formulation) |
| Typ≠ | Incremental density-based clustering | Hierarchical density-based clustering | Probabilistic clustering / density estimation (incremental) |
| Pôvodný zdroj≠ | Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M., & Xu, X. (1998). Incremental Clustering for Mining in a Data Warehousing Environment. In Proceedings of the 24th International Conference on Very Large Data Bases (VLDB), pp. 323–333. link ↗ | Campello, R. J. G. B., Moulavi, D., & Sander, J. (2013). Density-Based Clustering Based on Hierarchical Density Estimates. In J. Pei et al. (Eds.), Advances in Knowledge Discovery and Data Mining. PAKDD 2013. Lecture Notes in Computer Science, vol. 7819 (pp. 160–172). Springer, Berlin, Heidelberg. DOI ↗ | Cappé, O. & Moulines, E. (2009). On-line expectation-maximization algorithm for latent data models. Journal of the Royal Statistical Society: Series B, 71(3), 593–613. DOI ↗ |
| Ďalšie názvy | Incremental DBSCAN, Streaming DBSCAN, Online density-based clustering, iDBSCAN | HDBSCAN, Hierarchical DBSCAN, hierarchical density-based clustering, HDBSCAN* | Online GMM, Incremental GMM, Streaming Gaussian Mixture Model, Sequential GMM |
| Príbuzné≠ | 5 | 3 | 5 |
| Zhrnutie≠ | Online DBSCAN extends the classic density-based clustering algorithm to handle continuously arriving data points without re-clustering the entire dataset from scratch. Each new observation is integrated into the existing cluster structure by local neighborhood queries, making it practical for streaming and data-warehousing scenarios where data grows incrementally. | HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise) is a density-based clustering algorithm introduced by Campello, Moulavi, and Sander in 2013. It extends DBSCAN by building a full hierarchy of density-based clusters across all density scales and then extracting a stable flat partition, making it robust to datasets where cluster densities vary substantially across regions. | Online Gaussian Mixture Model adapts the classic GMM to streaming or large-scale data by replacing full-batch EM with incremental updates — processing one observation or mini-batch at a time and continuously refining component means, covariances, and mixing weights without revisiting the entire dataset. |
| ScholarGateDátová sada ↗ |
|
|
|