Dynamic Pricing Using Multi-Armed Bandit (Reinforcement Learning)
Section 1: Dynamic Pricing and Bandits
- Introduction to Dynamic Pricing
- Download Resources
- What is Reinforcement Learning?
- Overview of Multi-Armed Bandit (MAB)
- Regret
- Problem Statement
Section 2: Greedy Algorithm
- Greedy Strategy
Section 3: Epsilon-Greedy Algorithm
- Epsilon-Greedy Approach
Section 4: Upper Confidence Bound
- UCB Approach
Section 5: Thompson Sampling
- Thompson Sampling