Hello to all my dear friends!

This post talks about the Reinforcement Learning (RL) in the field of Artificial Intelligence (AI).

There are three types of learning in AI:

  1. Unsupervised learning – It is used for discovery of new patterns. Example is clustering group of people based on their similarities.
  2. Supervised learning – It is used to identify patterns. In this computer is feeded a lot of data with labels example photos of dogs and cats and the computer learns. Then given a photo, it can identify which is dog and which is cat.
  3. Reinforcement learning – This is different from both of above. Here computer performs random action without any supervision and is either rewarded (+1) or punished (-1) based on the action. This way computer learns itself what is the correct action to do in which situation. Example is how the robots learn to walk.

Reinforcement learning is also present in us humans. We do some action and if that gives a pain, then we do not do the same action again. Similarly for actions giving happiness, joy.

It does not require large labeled dataset like supervised learning. It is also innovative. It can learn a completely new approach/new solutions. It is also resistance to any bias unlike humans.

Real Life Use Cases of RL

  1. Google AlphaGo – Playing games like Go
  2. Google uses RL for energy efficiency in data centers
  3. Decision Service uses RL for advertisement and has improved click through rate
  4. Trendyol uses it for email advertising
  5. Alibaba uses it for advertising, which customer will click which ad.

Similar Posts

Error: GraphComment couldn't be load because your settings are invalid. Please visit your admin panel and go to the GraphComment section and enter a valid website URL/ID.