Thompson Sampling(톰슨 샘플링)

Algorithm

Thompson Sampling(톰슨 샘플링)

빠릿베짱이 2017. 6. 27. 18:40

중국 블로그 : http://x-algo.cn/index.php/2016/12/15/ee-problem-and-bandit-algorithm-for-recommender-systems/

시뮬레이터 : https://learnforeverlearn.com/bandits/

Python Sample code(파이썬 샘플 코드) :

http://mloss.org/software/view/415/

https://github.com/bgalbraith/bandits

논문 : Analysis of Thompson Sampling for the Multi-armed Bandit Problem ( 링크 )

톰슨 샘플링 시뮬레이션 결과( result of thompson sampling simulation)

1. 가장 좋은 보상의 정책이 빠르게 선택된다.

2. 선택된 정책의 보상 확률이 낮아지는 경우, 다른 정책이 선택될 수 있다.

3. 선택되지 않은 정책의 보상 확률이 높아지는 경우에는 잘 적용이 안되는 문제가 발생한다.

저작자표시 비영리 변경금지 (새창열림)

'Algorithm' 카테고리의 다른 글

Linear Model for Regression (0)	2017.07.06
vector similarity (0)	2017.04.13
Fast radial symmetry transform (0)	2016.05.31
Contrario (0)	2016.02.23
[KCF]Kernelized Correlation Filters - Tracking (0)	2015.12.28

현재글Thompson Sampling(톰슨 샘플링)

Jooo's life story

Live, like today is the last day to live.

눈물사료, 퍼피사료추천, Spark, Face Detection, 기호성사료, 제주도맛집, 기호성좋은사료추천, 셀프 인테리어, 협제맛집, ISO26262, 추적, 벨칸도, AdaBoost, Scala, Face Alignment, Face, Kinect, tracking, 베스트브리드, 수유 맛집,

Today :
Yesterday :

일	월	화	수	목	금	토
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

Jooo's life story