- Home /
Question by
Icaro-Lima-TFG · Jun 04, 2019 at 12:07 AM ·
learningagentmachine
Discrete Rewards or Continuous Rewards? ML-Agents PPO
Which of the two forms of reward is better and why?
1. AddReward(Mathf.Cos(_rigidbody2D.rotation * Mathf.Deg2Rad));
2. if (Mathf.Cos(_rigidbody2D.rotation * Mathf.Deg2Rad) >= Mathf.Cos(10 * Mathf.Deg2Rad)) AddReward(1);
I perceived that the first form converges more faster.
Comment