MATLAB: The reward gets stuck on a single value during training or randomly fluctuates (Reinforcement Learning)

Deep Learning Toolboxreinforcement learningReinforcement Learning Toolbox

I train the reinforcement learning system, and on the reward plot I have some failures during which the reward does not change. This doesn’t look normal, especially when compared with examples (Biped Robot, etc.) I believe that some rlDDPGAgentOptions settings are responsible for this, but it seems that I changed all the possible settings, but even after several thousand episodes, the system does not learn. What can be the reason for this behavior of this graph during training?

Best Answer

It could mean that the training is experiencing a local minima. You can try out a few things:

1. Change the OU noise options to favor more exploration so that the robot can explore more states and get new rewards.

2. Design a different reward function that is not too dependent on sparse rewards. From the graph (flatlines) it looks like you have a sparse reward for a state that the agent is continuously visiting.

In most cases, designing better reward functions will improve training. That being said, 350 episodes might be too early to expect good results. I would let it run for a few 1000 episodes at least before coming to a conclusion that something needs to change.

Related Solutions

MATLAB: Deep reinforcement learning for multi-agents

Hello,

The policies you will get from RL training change depending on the amount of time the agents spend exploring. Usually, if you see a situation like this where agents converge to a non-ideal solution, you may want to change the agent options to increase exploration.

Hope that helps

MATLAB: Trying to load previously saved agent fails (Reinforcement Learning Toolbox)

Just managed to solve this problem by myself.

In case anyone else has the same problem:

In order to load your saved agent you have to use the following code snippet:

load('Agent123.mat','saved_agent');
agent = saved_agent;

Best Answer

Related Solutions

MATLAB: Deep reinforcement learning for multi-agents

MATLAB: Trying to load previously saved agent fails (Reinforcement Learning Toolbox)

Related Question