This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cs501r_f2018:lab9 [2018/11/15 23:15] wingated |
cs501r_f2018:lab9 [2018/11/19 21:17] wingated |
||
---|---|---|---|
Line 57: | Line 57: | ||
---- | ---- | ||
====Hints and helps:==== | ====Hints and helps:==== | ||
+ | |||
+ | **Update**: Here is our | ||
+ | [[https://github.com/joshgreaves/reinforcement-learning|our lab's implementation of PPO]]. NOTE: because this code comes with a complete implementation of running on VizDoom, **you may not use that as your additional test domain.** | ||
+ | |||
Here is some code from our reference implementation. Hopefully it will serve as a good outline of what you need to do. | Here is some code from our reference implementation. Hopefully it will serve as a good outline of what you need to do. |