Question by Icaro-Lima-TFG · Jun 04, 2019 at 12:07 AM · learningagentmachine

              Discrete Rewards or Continuous Rewards? ML-Agents PPO
Which of these two ways of giving the reward is better for training, and why?
 1. AddReward(Mathf.Cos(_rigidbody2D.rotation * Mathf.Deg2Rad));
 2. if (Mathf.Cos(_rigidbody2D.rotation * Mathf.Deg2Rad) >= Mathf.Cos(10 * Mathf.Deg2Rad)) AddReward(1);
 
 In my runs, the first form seemed to converge faster.
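
To compare the two forms outside of Unity, here is a minimal Python sketch that evaluates both reward expressions over a few tilt angles. The function names `dense_reward` and `sparse_reward` and the sample angles are my own illustration, not part of ML-Agents; the math mirrors the two `AddReward` lines above. It only shows the shape of each signal, not how PPO will actually train on it.

```python
import math


def dense_reward(angle_deg: float) -> float:
    """Form 1: the reward is the cosine of the rotation angle (in degrees)."""
    return math.cos(math.radians(angle_deg))


def sparse_reward(angle_deg: float, threshold_deg: float = 10.0) -> float:
    """Form 2: reward 1 when cos(angle) >= cos(threshold), otherwise 0."""
    if math.cos(math.radians(angle_deg)) >= math.cos(math.radians(threshold_deg)):
        return 1.0
    return 0.0


if __name__ == "__main__":
    # Hypothetical sample angles, chosen only to show the shape of each signal.
    for angle in (0, 5, 45, 90, 135, 180):
        print(
            f"{angle:>4} deg  "
            f"dense={dense_reward(angle):+.3f}  "
            f"sparse={sparse_reward(angle):.0f}"
        )
```

The first form changes with every change in angle, so each step tells the agent whether it moved closer to upright (and it goes negative past 90 degrees). The second form is 0 everywhere outside the 10 degree band, so the agent gets no information until it lands inside the band by chance. That would be consistent with the first form converging faster, though it is not proof.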