Σύγκριση μεθόδων
Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.
| Αλγόριθμος Apriori× | Κανόνες Συσχέτισης× | FP-Growth (Ανάπτυξη Συχνών Μοτίβων)× | Ομαδοποίηση K-means× | Διαδικτυακή Μάθηση× | |
|---|---|---|---|---|---|
| Πεδίο | Μηχανική Μάθηση | Μηχανική Μάθηση | Μηχανική Μάθηση | Μηχανική Μάθηση | Μηχανική Μάθηση |
| Οικογένεια | Machine learning | Machine learning | Machine learning | Machine learning | Machine learning |
| Έτος προέλευσης≠ | 1994 | 1993 | 2000 | 1967 (formalized 1982) | 1958–2000s |
| Δημιουργός≠ | Agrawal, R. & Srikant, R. | Agrawal, R., Imielinski, T., & Swami, A. | Jiawei Han, Jian Pei & Yiwen Yin | MacQueen, J. B.; Lloyd, S. P. | Rosenblatt, F.; Littlestone, N.; Shalev-Shwartz, S. (key contributors) |
| Τύπος≠ | Frequent itemset and association rule mining algorithm | Unsupervised pattern discovery | Frequent-itemset mining algorithm | Partitional clustering | Learning paradigm (sequential model update) |
| Θεμελιώδης πηγή≠ | Agrawal, R. & Srikant, R. (1994). Fast algorithms for mining association rules. Proceedings of the 20th International Conference on Very Large Data Bases (VLDB), 487–499. link ↗ | Agrawal, R., Imielinski, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, 207–216. DOI ↗ | Han, J., Pei, J., & Yin, Y. (2000). Mining frequent patterns without candidate generation. ACM SIGMOD Record, 29(2), 1–12. DOI ↗ | Lloyd, S. P. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129–137. DOI ↗ | Shalev-Shwartz, S. (2011). Online Learning and Online Convex Optimization. Foundations and Trends in Machine Learning, 4(2), 107–194. DOI ↗ |
| Εναλλακτικές ονομασίες | Apriori, frequent itemset mining, ARL-Apriori, Apriori association mining | market basket analysis, association rule mining, frequent itemset mining, affinity analysis | frequent pattern growth, FP-tree mining, FP-Growth algorithm, sık örüntü büyütme | k-means clustering, Lloyd's algorithm, k-means partitioning, hard k-means | incremental learning, sequential learning, streaming learning, online machine learning |
| Συναφείς≠ | 5 | 4 | 4 | 4 | 6 |
| Σύνοψη≠ | The Apriori algorithm, introduced by Agrawal and Srikant in 1994, is the foundational method for discovering frequent itemsets and association rules in transactional databases. It uses a breadth-first, level-wise search guided by the anti-monotone property of support to efficiently enumerate all item combinations that co-occur above a user-set minimum threshold, then extracts interpretable if-then rules from those patterns. | Association rule learning is an unsupervised technique that discovers co-occurrence patterns — 'if X then Y' implications — within large transactional datasets. Originally formalized by Agrawal, Imielinski, and Swami (1993) for supermarket basket analysis, it is now widely applied in e-commerce recommendation, health informatics, bioinformatics, and behavioral research. | FP-Growth, introduced by Jiawei Han, Jian Pei, and Yiwen Yin in 2000, mines frequent itemsets from transaction data without generating candidate sets, the costly step that slows the classic Apriori algorithm. It compresses the database into a frequent-pattern tree (FP-tree) in two scans, then grows frequent patterns recursively from that structure, making it dramatically faster than Apriori on large, dense datasets. | K-means is a classic unsupervised partitional clustering algorithm that divides a dataset into K non-overlapping groups by iteratively assigning each observation to its nearest centroid and updating centroids as the mean of their assigned points. It is one of the most widely used exploratory tools in machine learning and data analysis. | Online learning is a machine learning paradigm in which a model is updated incrementally as each new data point arrives, rather than being trained once on a fixed dataset. It is essential when data streams continuously, storage is limited, or the underlying distribution shifts over time. Theoretical performance is measured by cumulative regret relative to the best fixed predictor in hindsight. |
| ScholarGateΣύνολο δεδομένων ↗ |
|
|
|
|
|