MATLAB: Unable to run ‘rlwatertank’ example in R2020a

reinforcement learningReinforcement Learning Toolbox

Hello everyone

I was trying to run this example.

While I successfully ran this example in R2019b, I could not successfully train the agent for this example in R2020a.

I also tried other available examples in the documentation; however, the learning plots in Reinforcement Learning Episode Manager differed from the plots exhibited in documentation.

I should mention I followed the exact steps in documentation and did not change value of any parameter.

Is this some sort of bug in "Reinforcement Learning Toolbox" at R2020a release?

Best Answer

Hi Nima,

This is the plot I got when running the watertank example in 20a:

While this is not exactly the same as the one shown in the documentation, training still converges.

A couple of reasons why the visual is not the same as in R2019b:

We recently started using autodifferentiation under the hood, and while gradient values are still close, there are small numerical differences which lead to a different optimization route,
Each release comes with more optimization improvements like the one above, which affect training results.

Note that training should still converge. Hope this helps

Related Solutions

MATLAB: Multi-agent deep reinforcement learning

Assuming you are training multiple agents in Simulink using the Reinforcement Learning Toolbox in R2020b:

The rewards are calculated by the environment, not the agent algorithm so they should not be affected unless the environment is changing them. When you compare rewards between single and multi-agents please ensure that the state-action pairs are the same. Rewards depend on states and actions and you may get different results for different state-action pairs.
In R2020b, the agent neural networks are updated independently.

MATLAB: How to extract a trained RL Agent’s network’s weights and biases

You can get the parameters from the trained's critic representation for DQN agent. In MATLAB R2020a, see getLearnableParameters and getCritic functions (function name changes a bit since R2019b). You can follow similar steps to get the actor's parameters from actor-based agent like DDPG or PPO.

critic = getCritic(agent);
criticParams = getLearnableParameters(critic);

Best Answer

Related Solutions

MATLAB: Multi-agent deep reinforcement learning

MATLAB: How to extract a trained RL Agent’s network’s weights and biases

Related Question