مقایسهٔ روشها
روشهای انتخابی خود را کنار هم مرور کنید؛ ردیفهای متفاوت برجسته شدهاند.
| F1 وزنی (Weighted F1)× | F1 میانگین ماکرو× | |
|---|---|---|
| حوزه | ارزیابی مدل | ارزیابی مدل |
| خانواده | MCDM | MCDM |
| سال پیدایش | 2000s | 2000s |
| پدیدآور | Multi-class evaluation community | Multi-class evaluation community |
| نوع | Evaluation metric | Evaluation metric |
| منبع بنیادین | Powers, D. M. (2011). Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1), 37-63. link ↗ | Powers, D. M. (2011). Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1), 37-63. link ↗ |
| نامهای دیگر≠ | Support-weighted F1 | Macro F1, Unweighted average F1 |
| مرتبط | 3 | 3 |
| خلاصه≠ | Weighted F1 computes the F1-score for each class and then takes a weighted average, where weights are proportional to the number of samples in each class (support). It provides a middle ground between macro and micro-averaging. | Macro-averaged F1 computes the F1-score independently for each class and then takes the unweighted arithmetic mean. It treats all classes equally, regardless of their frequency in the dataset, making it useful for imbalanced multi-class problems. |
| ScholarGateمجموعهداده ↗ |
|
|