Method comparison
View your selected methods side by side; rows that differ are flagged with ≠.
| | Multi-armed bandit (UCB, Thompson Sampling) | Adaptive clinical trial design |
|---|---|---|
| Domain | Experimental design | Experimental design |
| Method family | Hypothesis test | Hypothesis test |
| Year of origin≠ | 1952 | 1994 |
| Proposed by≠ | Robbins (1952); UCB1 by Auer et al. (2002); Thompson sampling by Thompson (1933) | Bauer & Köhne |
| Type≠ | Sequential decision / bandit algorithm | Adaptive hypothesis test with interim analyses |
| Seminal paper≠ | Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-Time Analysis of the Multiarmed Bandit Problem. Machine Learning, 47(2–3), 235–256. | Bauer, P. & Köhne, K. (1994). Evaluation of Experiments with Adaptive Interim Analyses. Biometrics, 50(4), 1029–1041. |
| Also known as≠ | MAB, bandit algorithm, UCB1, Thompson sampling | adaptive design, group sequential design, sample size re-estimation, platform trial |
| Related methods≠ | 4 | 3 |
| Summary≠ | The multi-armed bandit (MAB) is an adaptive experimental framework that allocates trials sequentially across competing arms to minimise cumulative regret while learning which arm performs best. Formalised by Robbins in 1952 and given finite-time regret guarantees by Auer et al. (2002), it balances exploration of uncertain arms against exploitation of the arm that currently looks best, and it can outperform a fixed-allocation A/B test when early stopping or cost-sensitive allocation matters. | Adaptive clinical trial design is a flexible experimental framework, formalised by Bauer and Köhne in 1994, in which pre-specified rules allow the trial to be modified mid-course (adjusting sample size, treatment arms, or randomisation ratios) based on accumulating interim data while controlling the Type I error rate. |

Source: ScholarGate dataset
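
The exploration/exploitation balance described in the bandit summary can be made concrete with a small simulation. The following is a minimal sketch of the UCB1 index rule from Auer et al. (2002), in which each arm is scored by its empirical mean plus the bonus sqrt(2 ln t / n_i). It is not taken from the compared entries: the function name `ucb1`, the Bernoulli reward model, the arm means, the horizon, and the seed are all illustrative assumptions.

```python
import math
import random


def ucb1(arm_means, horizon, seed=0):
    """Simulate UCB1 on hypothetical Bernoulli arms.

    Returns (pull counts per arm, total reward collected).
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms   # number of pulls per arm
    sums = [0.0] * n_arms   # cumulative reward per arm
    total = 0.0

    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Initialisation: pull every arm once so each has an estimate.
            arm = t - 1
        else:
            # UCB1 index: empirical mean + exploration bonus sqrt(2 ln t / n_i).
            arm = max(
                range(n_arms),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        # Assumed reward model: Bernoulli draw with the arm's true mean.
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total += reward
    return counts, total


if __name__ == "__main__":
    counts, total = ucb1([0.2, 0.5, 0.8], horizon=5000)
    print("pulls per arm:", counts)
    print("total reward:", total)
```

In this sketch the bonus term shrinks as an arm accumulates pulls, so rarely tried arms keep being sampled occasionally while most of the budget drifts toward the arm with the highest observed mean. A fixed 1:1 allocation, by contrast, would keep splitting pulls evenly for the whole horizon.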