User Tools

Site Tools


cs501r_f2018:lab9

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
cs501r_f2018:lab9 [2018/11/19 21:17]
wingated
cs501r_f2018:lab9 [2018/11/19 21:17]
wingated
Line 16: Line 16:
   * 45% Proper design, creation and debugging of an actor and critic networks   * 45% Proper design, creation and debugging of an actor and critic networks
   * 25% Proper implementation of the PPO loss function and objective on cart-pole ("​CartPole-v0"​)   * 25% Proper implementation of the PPO loss function and objective on cart-pole ("​CartPole-v0"​)
-  * 20% Implementation and demonstrated learning of PPO on another domain of your choice+  * 20% Implementation and demonstrated learning of PPO on another domain of your choice ​(**except** VizDoom)
   * 10% Visualization of policy return as a function of training   * 10% Visualization of policy return as a function of training
  
cs501r_f2018/lab9.txt ยท Last modified: 2021/06/30 23:42 (external edit)