Comparer des méthodes
Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.
| F1-score moyenné par micro-classe× | F1 moyen macro× | |
|---|---|---|
| Domaine | Évaluation de modèles | Évaluation de modèles |
| Famille | MCDM | MCDM |
| Année d'origine | 2000s | 2000s |
| Auteur d'origine | Multi-class evaluation community | Multi-class evaluation community |
| Type | Evaluation metric | Evaluation metric |
| Source fondatrice | Powers, D. M. (2011). Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1), 37-63. link ↗ | Powers, D. M. (2011). Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1), 37-63. link ↗ |
| Alias | Micro F1, Frequency-weighted average F1 | Macro F1, Unweighted average F1 |
| Apparentées≠ | 4 | 3 |
| Résumé≠ | Micro-averaged F1 computes the F1-score by aggregating true positives, false positives, and false negatives across all classes, then calculating a single metric. It is equivalent to accuracy in multi-class classification and is useful when class distributions reflect their natural importance. | Macro-averaged F1 computes the F1-score independently for each class and then takes the unweighted arithmetic mean. It treats all classes equally, regardless of their frequency in the dataset, making it useful for imbalanced multi-class problems. |
| ScholarGateJeu de données ↗ |
|
|