विधियों की तुलना करें
चुनी हुई विधियों की आमने-सामने समीक्षा करें; भिन्नता वाली पंक्तियाँ रेखांकित हैं।
| ऑनलाइन डीबीस्कैन (Online DBSCAN)× | HDBSCAN× | ऑनलाइन के-मीन्स× | |
|---|---|---|---|
| क्षेत्र | मशीन अधिगम | मशीन अधिगम | मशीन अधिगम |
| परिवार | Machine learning | Machine learning | Machine learning |
| उद्भव वर्ष≠ | 1998 | 2013 | 1967 (online update rule); 2010 (mini-batch variant) |
| प्रवर्तक≠ | Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M., & Xu, X. | Campello, R. J. G. B.; Moulavi, D.; Sander, J. | MacQueen, J. (batch); Sculley, D. (mini-batch web-scale variant) |
| प्रकार≠ | Incremental density-based clustering | Hierarchical density-based clustering | Unsupervised clustering (online/streaming) |
| मौलिक स्रोत≠ | Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M., & Xu, X. (1998). Incremental Clustering for Mining in a Data Warehousing Environment. In Proceedings of the 24th International Conference on Very Large Data Bases (VLDB), pp. 323–333. link ↗ | Campello, R. J. G. B., Moulavi, D., & Sander, J. (2013). Density-Based Clustering Based on Hierarchical Density Estimates. In J. Pei et al. (Eds.), Advances in Knowledge Discovery and Data Mining. PAKDD 2013. Lecture Notes in Computer Science, vol. 7819 (pp. 160–172). Springer, Berlin, Heidelberg. DOI ↗ | MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, pp. 281–297. University of California Press. link ↗ |
| उपनाम | Incremental DBSCAN, Streaming DBSCAN, Online density-based clustering, iDBSCAN | HDBSCAN, Hierarchical DBSCAN, hierarchical density-based clustering, HDBSCAN* | sequential k-means, streaming k-means, incremental k-means, online clustering |
| संबंधित≠ | 5 | 3 | 4 |
| सारांश≠ | Online DBSCAN extends the classic density-based clustering algorithm to handle continuously arriving data points without re-clustering the entire dataset from scratch. Each new observation is integrated into the existing cluster structure by local neighborhood queries, making it practical for streaming and data-warehousing scenarios where data grows incrementally. | HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise) is a density-based clustering algorithm introduced by Campello, Moulavi, and Sander in 2013. It extends DBSCAN by building a full hierarchy of density-based clusters across all density scales and then extracting a stable flat partition, making it robust to datasets where cluster densities vary substantially across regions. | Online K-means is a streaming variant of the classical K-means algorithm that updates cluster centroids one observation at a time — or in small mini-batches — without storing the entire dataset in memory. It is particularly suited to large-scale, real-time, or continuously arriving data where batch recomputation would be too slow or impractical. |
| ScholarGateडेटासेट ↗ |
|
|
|