Dynamic Pricing using Multi Armed Bandit(Reinforcement Learning)

Go to main | Course Page

Section 1: Dynamic Pricing and Bandits

Introduction to Dynamic Pricing
Download Resources
What is Reinforcement Learning?
Overview of Multi Armed Bandit (MAB)
Regret
Problem Statement

Section 2: Greedy Algorithm

Greedy Strategy

Section 3: Epsilon Greedy Algorithm

Epsilon Greedy Approach

Section 4: Upper Confidence Bound

UCB Approach

Section 5: Thompson Sampling

Thompson Sampling

Published with Simplenote