방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| 영향력 진단 (쿡 거리, DFFITS, 레버리지)× | 중앙값 절대 편차 (MAD) 추정× | 릿지 회귀(Ridge Regression)× | |
|---|---|---|---|
| 분야≠ | 통계학 | 통계학 | 머신러닝 |
| 계열≠ | Regression model | Regression model | Machine learning |
| 기원 연도≠ | 1977 | 1974 | 1970 |
| 창시자≠ | R. Dennis Cook (Cook's distance); Belsley, Kuh & Welsch (DFFITS, leverage) | Hampel (influence-curve treatment); classical robust statistics | Hoerl, A.E. & Kennard, R.W. |
| 유형≠ | Regression diagnostic | Robust scale estimator | L2-regularized linear regression |
| 원전≠ | Cook, R. D. (1977). Detection of Influential Observations in Linear Regression. Technometrics, 19(1), 15-18. DOI ↗ | Hampel, F. R. (1974). The Influence Curve and Its Role in Robust Estimation. Journal of the American Statistical Association, 69(346), 383-393. DOI ↗ | Hoerl, A.E. & Kennard, R.W. (1970). Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics, 12(1), 55–67. DOI ↗ |
| 별칭≠ | Cook's distance, DFFITS, leverage, influential observation detection | median absolute deviation, MAD scale estimator, robust scale estimation, Medyan Mutlak Sapma (MAD) Tahmini | Ridge Regresyonu, ridge regresyonu, L2-regularized regression, Tikhonov regularization |
| 관련≠ | 5 | 5 | 4 |
| 요약≠ | Influence diagnostics are a family of post-fit measures that quantify how much each single observation affects a fitted regression. Cook's distance was introduced by R. Dennis Cook in 1977, with leverage and DFFITS formalised by Belsley, Kuh and Welsch in 1980, to flag the observations that most strongly pull the estimated coefficients. | Median Absolute Deviation estimation is a robust measure of statistical dispersion that replaces the standard deviation when outliers are present. Rooted in the influence-curve framework formalised by Hampel (1974), it summarises the spread of a continuous variable using medians instead of means, so a single extreme value cannot distort the result. | Ridge Regression is an L2-regularized linear regression method, introduced by Arthur Hoerl and Robert Kennard in 1970, that reduces multicollinearity by adding a penalty on the size of the coefficients. It shrinks coefficients toward zero without setting any of them exactly to zero, producing more stable estimates when predictors are highly correlated. |
| ScholarGate데이터셋 ↗ |
|
|
|