Machine learning
BIRCH — Balanced Iterative Reducing and Clustering using Hierarchies
BIRCH is a scalable, incremental clustering algorithm introduced by Zhang, Ramakrishnan, and Livny in 1996. It is designed to cluster very large datasets — potentially larger than available memory — in a single pass, by compressing the data into a compact in-memory summary structure called a CF-tree (Clustering Feature tree) before applying any standard clustering procedure.
MethodMind'de açSoonVideoSoon
Tam yöntemi oku
Members only
Sign inSign in with a free account to read this section.
Sources
- Zhang, T., Ramakrishnan, R., & Livny, M. (1996). BIRCH: An efficient data clustering method for very large databases. Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, 25(2), 103–114. DOI: 10.1145/233269.233324 ↗
- Han, J., Kamber, M., & Pei, J. (2011). Data Mining: Concepts and Techniques (3rd ed., Ch. 10). Morgan Kaufmann. ISBN: 978-0-12-381479-1