Evaluation & trust

73 methods in this family.

Featured

AccuracyAccuracy is the proportion of correct predictions among the total number of predictions made by a classification model. It is the most intuitive performance metric and measures how Adjusted R-squaredAdjusted R² is a corrected version of the coefficient of determination that accounts for the number of predictors in a regression model. Introduced by Henri Theil in 1961, it addre Adjusted Rand IndexThe Adjusted Rand Index (ARI), developed by Hubert and Arabie in 1985, is an external clustering evaluation metric that measures the agreement between a predicted clustering and a Akaike Information CriterionThe Akaike Information Criterion is an information-theoretic measure for model selection that balances goodness of fit against model complexity. Introduced by Hirotugu Akaike in 19 Balanced AccuracyBalanced accuracy is the average of recall values computed for each class separately. It corrects for class imbalance by giving equal weight to the performance on each class, regar Brier ScoreThe Brier score measures the mean squared difference between predicted probabilities and actual binary outcomes. It is a simple, interpretable metric for evaluating the accuracy of

All methods 73

Accuracy Adjusted R-squared Adjusted Rand Index Akaike Information Criterion Balanced Accuracy Brier Score BSQ Calinski-Harabasz Index Calorimeter Calibration Computerized adaptive test item analysis Confusion Matrix Counterfactual Explanations Davies-Bouldin Index Dunn Index Elbow Method Explainable Association Rules Explainable Autoencoder Anomaly Detection Explainable Decision Tree Explainable FP-Growth Explainable Gaussian Mixture Model Explainable Gaussian Process Explainable HDBSCAN Explainable Isolation Forest Explainable K-Means Explainable K-Nearest Neighbors Explainable LightGBM Explainable Naive Bayes Explainable One-Class SVM Explainable Random Forest Explainable Stacking Ensemble Explainable Support Vector Machine Explainable Voting Ensemble Explainable XGBoost F-beta Score F1-Score Fairness-Aware ML Fowlkes-Mallows Index Gap Statistic Geometric Morphometrics GQL-15 Hamming Loss Inertia (Within-Cluster Sum of Squares)Jaccard Index Lift and Gain Chart LIME Log-Loss (Cross-Entropy Loss)Longitudinal Item Analysis Macro-averaged F1 Mean Absolute Error Mean Absolute Percentage Error Mean Absolute Scaled Error Mean Squared Error Micro-averaged F1 Model Calibration Normalized Mutual Information Precision Precision-Recall AUC Price Fairness Scale R-squared Recall (Sensitivity)Robust Rasch Model Root Mean Squared Error SHAP Short form Rasch model Short-Form IRT Silhouette Score Specificity Survey Weighting Symmetric MAPE Token Bucket V-measure Weighted F1 Youdens J Statistic