Evaluation & trust
73 methods in this family.
Featured
AccuracyAccuracy is the proportion of correct predictions among the total number of predictions made by a classification model. It is the most intuitive performance metric and measures howAdjusted R-squaredAdjusted R² is a corrected version of the coefficient of determination that accounts for the number of predictors in a regression model. Introduced by Henri Theil in 1961, it addreAdjusted Rand IndexThe Adjusted Rand Index (ARI), developed by Hubert and Arabie in 1985, is an external clustering evaluation metric that measures the agreement between a predicted clustering and a Akaike Information CriterionThe Akaike Information Criterion is an information-theoretic measure for model selection that balances goodness of fit against model complexity. Introduced by Hirotugu Akaike in 19Balanced AccuracyBalanced accuracy is the average of recall values computed for each class separately. It corrects for class imbalance by giving equal weight to the performance on each class, regarBrier ScoreThe Brier score measures the mean squared difference between predicted probabilities and actual binary outcomes. It is a simple, interpretable metric for evaluating the accuracy of
All methods 73
AccuracyAdjusted R-squaredAdjusted Rand IndexAkaike Information CriterionBalanced AccuracyBrier ScoreBSQCalinski-Harabasz IndexCalorimeter CalibrationComputerized adaptive test item analysisConfusion MatrixCounterfactual ExplanationsDavies-Bouldin IndexDunn IndexElbow MethodExplainable Association RulesExplainable Autoencoder Anomaly DetectionExplainable Decision TreeExplainable FP-GrowthExplainable Gaussian Mixture ModelExplainable Gaussian ProcessExplainable HDBSCANExplainable Isolation ForestExplainable K-MeansExplainable K-Nearest NeighborsExplainable LightGBMExplainable Naive BayesExplainable One-Class SVMExplainable Random ForestExplainable Stacking EnsembleExplainable Support Vector MachineExplainable Voting EnsembleExplainable XGBoostF-beta ScoreF1-ScoreFairness-Aware MLFowlkes-Mallows IndexGap StatisticGeometric MorphometricsGQL-15Hamming LossInertia (Within-Cluster Sum of Squares)Jaccard IndexLift and Gain ChartLIMELog-Loss (Cross-Entropy Loss)Longitudinal Item AnalysisMacro-averaged F1Mean Absolute ErrorMean Absolute Percentage ErrorMean Absolute Scaled ErrorMean Squared ErrorMicro-averaged F1Model CalibrationNormalized Mutual InformationPrecisionPrecision-Recall AUCPrice Fairness ScaleR-squaredRecall (Sensitivity)Robust Rasch ModelRoot Mean Squared ErrorSHAPShort form Rasch modelShort-Form IRTSilhouette ScoreSpecificitySurvey WeightingSymmetric MAPEToken BucketV-measureWeighted F1Youdens J Statistic