So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Chẩn đoán ảnh hưởng (Khoảng cách Cook, DFFITS, Đòn bẩy)× | Ước lượng Độ lệch Tuyệt đối Trung vị (MAD)× | Ridge Regression× | |
|---|---|---|---|
| Lĩnh vực≠ | Thống kê | Thống kê | Học máy |
| Họ≠ | Regression model | Regression model | Machine learning |
| Năm ra đời≠ | 1977 | 1974 | 1970 |
| Người khởi xướng≠ | R. Dennis Cook (Cook's distance); Belsley, Kuh & Welsch (DFFITS, leverage) | Hampel (influence-curve treatment); classical robust statistics | Hoerl, A.E. & Kennard, R.W. |
| Loại≠ | Regression diagnostic | Robust scale estimator | L2-regularized linear regression |
| Công trình gốc≠ | Cook, R. D. (1977). Detection of Influential Observations in Linear Regression. Technometrics, 19(1), 15-18. DOI ↗ | Hampel, F. R. (1974). The Influence Curve and Its Role in Robust Estimation. Journal of the American Statistical Association, 69(346), 383-393. DOI ↗ | Hoerl, A.E. & Kennard, R.W. (1970). Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics, 12(1), 55–67. DOI ↗ |
| Tên gọi khác≠ | Cook's distance, DFFITS, leverage, influential observation detection | median absolute deviation, MAD scale estimator, robust scale estimation, Medyan Mutlak Sapma (MAD) Tahmini | Ridge Regresyonu, ridge regresyonu, L2-regularized regression, Tikhonov regularization |
| Liên quan≠ | 5 | 5 | 4 |
| Tóm tắt≠ | Influence diagnostics are a family of post-fit measures that quantify how much each single observation affects a fitted regression. Cook's distance was introduced by R. Dennis Cook in 1977, with leverage and DFFITS formalised by Belsley, Kuh and Welsch in 1980, to flag the observations that most strongly pull the estimated coefficients. | Median Absolute Deviation estimation is a robust measure of statistical dispersion that replaces the standard deviation when outliers are present. Rooted in the influence-curve framework formalised by Hampel (1974), it summarises the spread of a continuous variable using medians instead of means, so a single extreme value cannot distort the result. | Ridge Regression is an L2-regularized linear regression method, introduced by Arthur Hoerl and Robert Kennard in 1970, that reduces multicollinearity by adding a penalty on the size of the coefficients. It shrinks coefficients toward zero without setting any of them exactly to zero, producing more stable estimates when predictors are highly correlated. |
| ScholarGateBộ dữ liệu ↗ |
|
|
|