This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
cs501r_f2018:lab9 [2018/11/19 21:17] wingated |
cs501r_f2018:lab9 [2018/11/19 21:17] wingated |
||
---|---|---|---|
Line 16: | Line 16: | ||
* 45% Proper design, creation and debugging of an actor and critic networks | * 45% Proper design, creation and debugging of an actor and critic networks | ||
* 25% Proper implementation of the PPO loss function and objective on cart-pole ("CartPole-v0") | * 25% Proper implementation of the PPO loss function and objective on cart-pole ("CartPole-v0") | ||
- | * 20% Implementation and demonstrated learning of PPO on another domain of your choice | + | * 20% Implementation and demonstrated learning of PPO on another domain of your choice (**except** VizDoom) |
* 10% Visualization of policy return as a function of training | * 10% Visualization of policy return as a function of training | ||