قارن الطرق
راجع الطرق التي اخترتها جنبًا إلى جنب؛ الصفوف المختلفة مميَّزة.
| F1 الموزون× | مقياس F1 المُصغَّر (Micro-averaged F1)× | |
|---|---|---|
| المجال | تقييم النماذج | تقييم النماذج |
| العائلة | MCDM | MCDM |
| سنة النشأة | 2000s | 2000s |
| صاحب الطريقة | Multi-class evaluation community | Multi-class evaluation community |
| النوع | Evaluation metric | Evaluation metric |
| المصدر التأسيسي | Powers, D. M. (2011). Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1), 37-63. link ↗ | Powers, D. M. (2011). Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1), 37-63. link ↗ |
| الأسماء البديلة≠ | Support-weighted F1 | Micro F1, Frequency-weighted average F1 |
| ذات صلة≠ | 3 | 4 |
| الملخص≠ | Weighted F1 computes the F1-score for each class and then takes a weighted average, where weights are proportional to the number of samples in each class (support). It provides a middle ground between macro and micro-averaging. | Micro-averaged F1 computes the F1-score by aggregating true positives, false positives, and false negatives across all classes, then calculating a single metric. It is equivalent to accuracy in multi-class classification and is useful when class distributions reflect their natural importance. |
| ScholarGateمجموعة البيانات ↗ |
|
|