Hi,
I trained the model with the unicycle kinematic and using 20 humans as obstacles and I noticed that the policy does not learn. Is there any requirement for the number of humans with unicycle kinematic?
I believe the interval (-0.06, 0.06) for the change of angle i.e., \Delta \theta is too small for the agent to navigate this dense environment.
Thanks.