ScholarGate
Compare methods

The selected methods are shown side by side.

Methods compared: Multi-armed bandit (UCB, Thompson sampling) and Sequential / group sequential trial design

Field
  Bandit: Design of experiments
  Sequential: Design of experiments

Family
  Bandit: Hypothesis test
  Sequential: Hypothesis test

Year proposed
  Bandit: 1952
  Sequential: 1979

Proposed by
  Bandit: Robbins (1952); UCB1 by Auer et al. (2002); Thompson sampling by Thompson (1933)
  Sequential: O'Brien & Fleming; Pocock; Lan & DeMets

Type
  Bandit: Sequential decision / bandit algorithm
  Sequential: Adaptive stopping trial design

Original source
  Bandit: Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-Time Analysis of the Multiarmed Bandit Problem. Machine Learning, 47(2–3), 235–256.
  Sequential: O'Brien, P.C. & Fleming, T.R. (1979). A Multiple Testing Procedure for Clinical Trials. Biometrics, 35(3), 549–556.

Also known as
  Bandit: MAB, bandit algorithm, UCB1, Thompson sampling
  Sequential: group sequential design, adaptive stopping design, Ardışık Deneme Tasarımı (Sequential / Group Sequential)

Related methods
  Bandit: 4
  Sequential: 3

Overview
  Bandit: The multi-armed bandit (MAB) is an adaptive experimental framework that allocates trials sequentially across competing arms to minimise cumulative regret while learning which arm performs best. Formalised by Robbins in 1952 and given finite-time regret guarantees by Auer et al. (2002), it balances exploration of uncertain options against exploitation of the option currently believed to be best. It tends to be preferred over classical fixed-allocation A/B testing when early stopping or cost-sensitive allocation matters.
  Sequential: Sequential and group sequential trial designs allow a study to be stopped early, or continued, based on interim analyses conducted as data accumulate. The core framework was formalised by O'Brien and Fleming in 1979 and extended by the alpha-spending approach of Lan and DeMets. It controls the overall Type I error rate across all planned looks by pre-specifying the stopping boundaries (efficacy and futility) before enrolment begins.
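The alpha-spending idea mentioned in the sequential-design overview can be illustrated with a small sketch. It assumes the two spending functions commonly attributed to Lan and DeMets (an O'Brien-Fleming-type and a Pocock-type function) and only shows how much of the overall Type I error is released at each look. The function names are hypothetical, and the sketch does not compute the critical boundaries themselves, which requires numerical integration over the joint distribution of the interim statistics.

```python
import math
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def obf_type_spending(t, alpha=0.05):
    """O'Brien-Fleming-type spending: cumulative alpha at information fraction t."""
    if not 0.0 < t <= 1.0:
        raise ValueError("information fraction t must be in (0, 1]")
    z = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    return 2.0 * (1.0 - _STD_NORMAL.cdf(z / math.sqrt(t)))


def pocock_type_spending(t, alpha=0.05):
    """Pocock-type spending: cumulative alpha at information fraction t."""
    if not 0.0 < t <= 1.0:
        raise ValueError("information fraction t must be in (0, 1]")
    return alpha * math.log(1.0 + (math.e - 1.0) * t)


def incremental_alpha(fractions, spend, alpha=0.05):
    """Alpha released at each look, given increasing information fractions."""
    released = []
    spent_so_far = 0.0
    for t in fractions:
        cumulative = spend(t, alpha)
        released.append(cumulative - spent_so_far)
        spent_so_far = cumulative
    return released


if __name__ == "__main__":
    looks = [0.25, 0.5, 0.75, 1.0]  # assumed: four equally spaced looks
    for name, fn in [("OBF-type", obf_type_spending),
                     ("Pocock-type", pocock_type_spending)]:
        print(name, [round(a, 5) for a in incremental_alpha(looks, fn)])
```

Under these assumptions, the O'Brien-Fleming-type function releases very little alpha at early looks and most of it at the final analysis, while the Pocock-type function spreads it more evenly. Both reach the full alpha at information fraction 1.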
ScholarGate dataset
  Bandit: v1, 2 sources, published
  Sequential: v1, 2 sources, published
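The exploration/exploitation balance described in the bandit overview can be sketched with UCB1, the index policy analysed by Auer et al. (2002). This is a minimal simulation under stated assumptions: Bernoulli rewards, a fixed horizon, and a hypothetical function name `ucb1`. It is meant as an illustration rather than a reference implementation.

```python
import math
import random


def ucb1(reward_probs, horizon, seed=0):
    """Simulate UCB1 on Bernoulli arms; return (pull counts, total reward)."""
    rng = random.Random(seed)
    k = len(reward_probs)
    counts = [0] * k   # number of pulls per arm
    sums = [0.0] * k   # cumulative reward per arm
    total = 0.0

    for t in range(1, horizon + 1):
        if t <= k:
            # Initialisation: pull every arm once.
            arm = t - 1
        else:
            n = t - 1  # total number of pulls made so far

            def index(j):
                # Empirical mean plus exploration bonus sqrt(2 ln n / n_j).
                return sums[j] / counts[j] + math.sqrt(2.0 * math.log(n) / counts[j])

            arm = max(range(k), key=index)

        # Assumed reward model: Bernoulli with the arm's success probability.
        reward = 1.0 if rng.random() < reward_probs[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total += reward

    return counts, total


if __name__ == "__main__":
    counts, total = ucb1([0.2, 0.8], horizon=2000)
    print("pulls per arm:", counts, "total reward:", total)
```

The bonus term shrinks as an arm is pulled more often, so rarely tried arms keep getting occasional pulls while most of the allocation drifts toward the arm with the highest observed mean.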

ScholarGate, Compare methods: Multi-Armed Bandit · Sequential Design. Retrieved 2026-06-17 from https://scholargate.app/ja/compare