0 votes
in Artificial Intelligence by
can you please explain what is Markov's Decision process?

2 Answers

0 votes
by

The solution for a reinforcement learning problem can be achieved using the Markov decision process or MDP. Hence, MDP is used to formalize the RL problem. It can be said as the mathematical approach to solve a reinforcement learning problem. The main aim of this process is to gain maximum positive rewards by choosing the optimum policy.

MDP has four elements, which are:

1) A set of finite states S

2) A set of finite actions A

3) Rewards

4) Policy Pa

In this process, the agent performs an action A to take a transition from state S1 to S2 or from the start state to the end state, and while doing these actions, the agent gets some rewards. The series of actions taken by the agent can be defined as the policy.

0 votes
by

The mathematical approach for mapping a solution in Reinforcement Learning is called Markov’s  Decision Process (MDP). 

The following parameters are used to attain a solution using MDP:

  • Set of actions, A
  • Set of states, S
  • Reward, R
  • Policy, p
  • Value, V

Related questions

0 votes
asked Sep 23, 2020 in Artificial Intelligence by SakshiSharma
0 votes
asked Sep 22, 2020 in Artificial Intelligence by Robindeniel
...