Which method should I use?
Describe your research situation in a few words; we surface the methods from the library that best fit your goal and data.
Recommendations for: predict a continuous numeric outcome from several predictor variables
- Multivariate RegressionStatistics
Multivariate regression is a linear regression method that predicts several continuous dependent variables at the same time from a shared set of predictors. As developed in standard treatments such as Johnson and Wichern's Applied Multivariate Statistical Analysis (2007), each response equation can be fitted by ordinary least squares while the covariance structure of the residuals is used for joint testing across outcomes.
- Bayesian Multiple linear regressionStatistics
Bayesian Multiple Linear Regression models a continuous outcome as a linear combination of several predictors, but instead of producing a single point estimate it yields a full posterior distribution over all regression coefficients and the error variance. This makes uncertainty quantification explicit and allows seamlessly incorporating prior knowledge from theory or previous studies.
- Robust Multiple linear regressionStatistics
Robust multiple linear regression estimates the linear relationship between a continuous outcome and several predictors while being resistant to outliers and violations of the normality assumption. Instead of minimising the sum of squared residuals, it uses a bounded loss function — most commonly Huber's or Tukey's bisquare — so that extreme observations receive limited influence on the estimated coefficients.
- Robust Discriminant AnalysisStatistics
Robust Discriminant Analysis is a classification method that separates groups with a linear discriminant function while resisting the influence of outliers. It replaces the classical mean and covariance with a high-breakdown estimator such as the Minimum Covariance Determinant (MCD), an approach developed by Hawkins & McLachlan (1997) and Croux & Dehon (2001).
- Linear Regression (ML)Machine Learning
Linear regression fits a straight-line relationship between one or more input features and a continuous numeric outcome by minimising the sum of squared prediction errors. As a machine-learning model it is trained on labeled examples and evaluated on held-out data, making it the simplest supervised learning baseline for any regression task.
- Regularized linear regressionMachine Learning
Regularized linear regression adds a penalty term to the ordinary least-squares objective, shrinking or zeroing out coefficients to reduce overfitting and handle multicollinearity. The three main variants — Ridge (L2 penalty), Lasso (L1 penalty), and Elastic Net (combined L1+L2) — make linear regression usable even when features outnumber observations or predictors are highly correlated.
Common question: which method?
For the most-asked situations, the methods the library surfaces.
Which method compares the means of two or more groups?
- Independent samples t-testStatistics
- Welch t-testStatistics
- Hotelling's T² TestStatistics
Which method predicts a continuous outcome from several variables?
- Multivariate RegressionStatistics
- Bayesian Multiple linear regressionStatistics
- Robust Multiple linear regressionStatistics
Which method classifies observations into categories?
- Grey ClusteringSoft Computing
- CNN Image ClassificationDeep Learning
- YOLODeep Learning
Which method groups similar observations without labels?
- K-Means ClusteringMachine Learning
- Hierarchical ClusteringMachine Learning
- Sentence EmbeddingsDeep Learning
Which method tests the association between two variables?
- Robust CorrelationStatistics
- Cramer's VStatistics
- Spearman CorrelationStatistics
Which method reduces many correlated variables to a few factors?
- Principal Component AnalysisMachine Learning
- Partial Least SquaresMachine Learning
- Locally Linear EmbeddingMachine Learning
Which method ranks alternatives across multiple criteria?
Refine this scenario →Which method analyzes time-to-event data with censoring?
- Weibull RegressionSurvival
- Kaplan-Meier EstimatorStatistics
- Royston-Parmar ModelSurvival