Dynamic Pricing using Multi Armed Bandit(Reinforcement Learning)

Go to main | Course Page

Section 1: Dynamic Pricing and Bandits

  1. Introduction to Dynamic Pricing
  2. Download Resources
  3. What is Reinforcement Learning?
  4. Overview of Multi Armed Bandit (MAB)
  5. Regret
  6. Problem Statement

Section 2: Greedy Algorithm

  1. Greedy Strategy

Section 3: Epsilon Greedy Algorithm

  1. Epsilon Greedy Approach

Section 4: Upper Confidence Bound

  1. UCB Approach

Section 5: Thompson Sampling

  1. Thompson Sampling
Report abuse