Thompson Sampling

빠릿베짱이 2017. 6. 27. 18:40

Chinese blog: http://x-algo.cn/index.php/2016/12/15/ee-problem-and-bandit-algorithm-for-recommender-systems/

Simulator: https://learnforeverlearn.com/bandits/

Python sample code:

http://mloss.org/software/view/415/

https://github.com/bgalbraith/bandits

Paper: Analysis of Thompson Sampling for the Multi-armed Bandit Problem (link)

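Thompson Sampling simulation results

1. The policy (arm) with the best reward is selected quickly.

2. If the reward probability of the currently selected policy drops, another policy can take over.

3. If the reward probability of a policy that is not being selected rises, the algorithm adapts poorly: an unselected policy receives no new observations, so its estimate is not updated.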
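
The three observations above can be tried out with a minimal Beta-Bernoulli Thompson Sampling sketch. This is not the code behind the linked simulator or the linked repositories. The names `thompson_sampling`, `share`, and `prob_at`, the reward probabilities, the round counts, and the change point at round 500 are all assumptions chosen for illustration.

```python
import random


def thompson_sampling(prob_at, n_arms, n_rounds, seed=0):
    """Run Beta-Bernoulli Thompson Sampling and return the arm chosen each round.

    prob_at(t) is a hypothetical callback returning the true reward
    probability of every arm at round t, so the probabilities may change.
    """
    rng = random.Random(seed)
    wins = [0] * n_arms      # observed rewards per arm
    losses = [0] * n_arms    # observed non-rewards per arm
    choices = []
    for t in range(n_rounds):
        # Draw one sample per arm from its Beta(wins + 1, losses + 1) posterior.
        samples = [rng.betavariate(wins[a] + 1, losses[a] + 1) for a in range(n_arms)]
        arm = samples.index(max(samples))
        # Pull the chosen arm and update only that arm's counts.
        if rng.random() < prob_at(t)[arm]:
            wins[arm] += 1
        else:
            losses[arm] += 1
        choices.append(arm)
    return choices


def share(choices, arm, last=500):
    """Fraction of the final `last` rounds in which `arm` was chosen."""
    tail = choices[-last:]
    return tail.count(arm) / len(tail)


CHANGE = 500            # assumed round at which the reward probabilities change
BASE = [0.2, 0.5, 0.8]  # assumed initial reward probabilities

# 1. Stationary rewards: arm 2 is the best arm throughout.
stationary = thompson_sampling(lambda t: BASE, 3, 3000)
# 2. The selected arm (arm 2) drops to 0.1 after CHANGE.
dropped = thompson_sampling(lambda t: BASE if t < CHANGE else [0.2, 0.5, 0.1], 3, 3000)
# 3. A rarely selected arm (arm 0) rises to 0.95 after CHANGE.
risen = thompson_sampling(lambda t: BASE if t < CHANGE else [0.95, 0.5, 0.8], 3, 3000)

print("1. share of arm 2 in the last 500 rounds:", share(stationary, 2))
print("2. share of arm 1 in the last 500 rounds:", share(dropped, 1))
print("3. share of arm 0 in the last 500 rounds:", share(risen, 0))
```

The printed shares vary with the seed. The third scenario in particular depends on how often arm 0 happened to be tried before the change, so it is worth running with several seeds before drawing conclusions.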

License: Attribution, NonCommercial, NoDerivatives
