Can you please explain what a Markov decision process is?

Question

Can you please explain what a Markov decision process is?

2 Answers

SakshiSharma · Answer 1 · 2020-09-22T03:14:30+0000

A reinforcement learning (RL) problem is usually formalized as a Markov decision process, or MDP. The MDP is the mathematical framework that describes the problem: which situations the agent can be in, what it can do, and what it is rewarded for. The aim is to find the optimal policy, that is, the policy that maximizes the total reward the agent collects.

MDP has four elements, which are:

1) A set of finite states S

2) A set of finite actions A

3) Rewards

4) Transition probabilities Pa, the probability that taking action a in one state leads to a given next state

In this process, the agent performs an action A that takes it from state S1 to state S2, and it receives a reward for that transition. Repeating this moves the agent from the start state to the end state. The policy is the rule the agent follows to pick an action in each state, so it determines the series of actions the agent takes.
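To make this concrete, here is a minimal Python sketch of a tiny MDP and an agent following a fixed policy in it. The two states, the "stay"/"move" actions, the probabilities and the rewards are all made up for illustration, and the helper names `step` and `rollout` are hypothetical, not from any library.

```python
import random

# Hypothetical two-state MDP, invented purely for illustration.
STATES = ["S1", "S2"]
ACTIONS = ["stay", "move"]

# TRANSITIONS[state][action] -> list of (probability, next_state, reward)
TRANSITIONS = {
    "S1": {
        "stay": [(1.0, "S1", 0.0)],
        "move": [(0.8, "S2", 1.0), (0.2, "S1", 0.0)],
    },
    "S2": {
        "stay": [(1.0, "S2", 0.0)],
        "move": [(1.0, "S1", 0.0)],
    },
}

# A policy maps each state to the action the agent takes there.
POLICY = {"S1": "move", "S2": "stay"}


def step(state, action, rng):
    """Sample (next_state, reward) for taking `action` in `state`."""
    outcomes = TRANSITIONS[state][action]
    weights = [prob for prob, _, _ in outcomes]
    _, next_state, reward = rng.choices(outcomes, weights=weights, k=1)[0]
    return next_state, reward


def rollout(policy, start, n_steps, seed=0):
    """Follow `policy` for n_steps; return (visited states, total reward)."""
    rng = random.Random(seed)
    state, total, visited = start, 0.0, [start]
    for _ in range(n_steps):
        state, reward = step(state, policy[state], rng)
        total += reward
        visited.append(state)
    return visited, total


if __name__ == "__main__":
    visited, total = rollout(POLICY, "S1", n_steps=10)
    print("states visited:", visited)
    print("total reward:", total)
```

Changing `POLICY` changes which actions get taken and therefore how much reward is collected, which is why finding a good policy is the goal.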

Robindeniel · Answer 2 · 2022-01-17T06:34:43+0000

A Markov Decision Process (MDP) is the mathematical framework used to model a Reinforcement Learning problem and work out its solution.

The following parameters are used to attain a solution using MDP:

Set of actions, A
Set of states, S
Reward, R
Policy, p
Value, V
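As a rough illustration of how these pieces fit together, the sketch below uses value iteration, one common way to solve a small MDP, to compute the value V of each state and then read off a policy from it. The two-state example, the discount factor `GAMMA = 0.9` and the function names are assumptions made for this sketch, not part of any particular library.

```python
# Hypothetical MDP: TRANSITIONS[state][action] -> [(probability, next_state, reward)]
TRANSITIONS = {
    "S1": {
        "stay": [(1.0, "S1", 0.0)],
        "move": [(0.8, "S2", 1.0), (0.2, "S1", 0.0)],
    },
    "S2": {
        "stay": [(1.0, "S2", 0.0)],
        "move": [(1.0, "S1", 0.0)],
    },
}

GAMMA = 0.9  # discount factor (assumed value): how much future reward counts


def q_value(outcomes, values, gamma):
    """Expected reward plus discounted value of the next state."""
    return sum(prob * (reward + gamma * values[nxt]) for prob, nxt, reward in outcomes)


def value_iteration(transitions, gamma, tol=1e-8):
    """Repeat the Bellman optimality update until the values stop changing."""
    values = {state: 0.0 for state in transitions}
    while True:
        delta = 0.0
        for state, actions in transitions.items():
            best = max(q_value(outcomes, values, gamma) for outcomes in actions.values())
            delta = max(delta, abs(best - values[state]))
            values[state] = best
        if delta < tol:
            return values


def greedy_policy(transitions, values, gamma):
    """In each state, pick the action with the highest expected value."""
    return {
        state: max(actions, key=lambda a: q_value(actions[a], values, gamma))
        for state, actions in transitions.items()
    }


if __name__ == "__main__":
    V = value_iteration(TRANSITIONS, GAMMA)
    print("values:", V)
    print("policy:", greedy_policy(TRANSITIONS, V, GAMMA))
```

Here the states, actions and rewards define the problem, the value V measures how good each state is in the long run, and the policy is the answer that comes out.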
