Reinforcement Learning

Learning Objectives: In this module, you will learn about developing a smart learning algorithm such that the learning becomes more and more accurate as time passes by. You will be able to define an optimal solution for an agent based on agent-environment interaction.

• What is Reinforcement Learning
• Why Reinforcement Learning
• Elements of Reinforcement Learning
• Exploration vs Exploitation dilemma
• Epsilon Greedy Algorithm
• Markov Decision Process (MDP)
• Q values and V values
• Q – Learning
• α values

• Calculating Reward
• Discounted Reward
• Calculating Optimal quantities
• Implementing Q Learning
• Setting up an Optimal Action

• Implement Reinforcement Learning using python
• Developing Q Learning model in python