Bandingkan metode

Tinjau metode pilihan Anda berdampingan; baris yang berbeda akan disorot.

	Uji A/B Adaptif ×	Eksperimen multi-lengan ×
Bidang	Desain Eksperimen	Desain Eksperimen
Keluarga	Process / pipeline	Process / pipeline
Tahun asal≠	1952 (Robbins); applied to A/B testing from ~2010s onward	1990s–2000s (clinical formalization); multi-arm concept implicit in ANOVA-era factorial designs
Pencetus≠	Herbert Robbins (bandit framework); Thompson Sampling formalized by William R. Thompson	Developed within clinical trials methodology; formalized by Parmar, Royston and colleagues (UK MRC CTU, early 2000s)
Tipe≠	Adaptive experimental design	Experimental design
Sumber perintis≠	Russo, D., Van Roy, B., Kazerouni, A., Osband, I., & Wen, Z. (2018). A Tutorial on Thompson Sampling. Foundations and Trends in Machine Learning, 11(1), 1–96. DOI ↗	Royston, P., Parmar, M. K. B., & Qian, W. (2003). Novel designs for multi-arm clinical trials with survival outcomes with an application in ovarian cancer. Statistics in Medicine, 22(14), 2239–2256. DOI ↗
Alias	adaptive AB test, bandit A/B test, multi-armed bandit testing, online adaptive experiment	multi-arm trial, multiple-arm experiment, multi-group experiment, many-arm design
Terkait≠	6	5
Ringkasan≠	An Adaptive A/B test is an experimental design that dynamically reallocates traffic or participants toward better-performing variants during the experiment itself, rather than holding allocations fixed until the end. Drawing on multi-armed bandit algorithms such as Thompson Sampling or Upper Confidence Bound (UCB), it balances the exploration of uncertain variants with the exploitation of those already showing superior performance, typically yielding higher aggregate outcomes while still producing valid inferential conclusions.	A multi-arm experiment simultaneously compares three or more treatment or intervention conditions — each called an arm — against a shared control or against one another. By testing multiple alternatives in a single study, it yields more information per participant than running separate two-group experiments sequentially, while controlling the overall Type I error rate through pre-specified comparison strategies.
ScholarGateSet data ↗	v1 2 Sumber PUBLISHED	v1 2 Sumber PUBLISHED

Ke halaman pencarian → Unduh salindia