So sánh phương pháp

Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.

	Thuật toán Multi-Armed Bandit (UCB, Thompson Sampling)×	Thiết kế thử nghiệm lâm sàng thích ứng ×
Lĩnh vực	Thiết kế thí nghiệm	Thiết kế thí nghiệm
Họ	Hypothesis test	Hypothesis test
Năm ra đời≠	1952	1994
Người khởi xướng≠	Robbins (1952); UCB1 by Auer et al. (2002); Thompson sampling by Thompson (1933)	Bauer & Köhne
Loại≠	Sequential decision / bandit algorithm	Adaptive hypothesis test with interim analyses
Công trình gốc≠	Auer, P., Cesa-Bianchi, N., & Fischer, P. (2002). Finite-Time Analysis of the Multiarmed Bandit Problem. Machine Learning, 47(2–3), 235–256. DOI ↗	Bauer, P. & Köhne, K. (1994). Evaluation of Experiments with Adaptive Interim Analyses. Biometrics, 50(4), 1029–1041. DOI ↗
Tên gọi khác≠	MAB, bandit algorithm, UCB1, Thompson sampling	adaptive design, group sequential design, sample size re-estimation, platform trial
Liên quan≠	4	3
Tóm tắt≠	The multi-armed bandit (MAB) is an adaptive experimental framework that allocates trials sequentially across competing arms to minimise cumulative regret while simultaneously learning which arm performs best. Formalised by Robbins in 1952 and given finite-time guarantees by Auer et al. (2002), it balances exploration of uncertain options against exploitation of currently known best options — outperforming classical A/B testing whenever early stopping or cost-sensitive allocation matters.	Adaptive clinical trial design is a flexible experimental framework, formalised by Bauer and Köhne in 1994, in which pre-specified rules allow the trial to be modified mid-course — adjusting sample size, treatment arms, or randomisation ratios — based on accumulating interim data while rigorously controlling the Type I error rate.
ScholarGateBộ dữ liệu ↗	v1 2 Nguồn tài liệu PUBLISHED	v1 2 Nguồn tài liệu PUBLISHED

Đến trang tìm kiếm → Tải xuống bản trình chiếu