This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
cs501r_f2016:tmp [2016/09/24 20:44] wingated |
cs501r_f2016:tmp [2016/09/24 20:45] wingated |
||
---|---|---|---|
Line 75: | Line 75: | ||
**Important note**: the dropconnect paper has a somewhat more sophisticated inference method (that is, the method used at test time). **We will not use that method.** Instead, we will use the same inference approximation used by the Dropout paper -- we will simply scale things by the ''keep_probability''. | **Important note**: the dropconnect paper has a somewhat more sophisticated inference method (that is, the method used at test time). **We will not use that method.** Instead, we will use the same inference approximation used by the Dropout paper -- we will simply scale things by the ''keep_probability''. | ||
- | You should scan across the same values of ''keep_probability'', and you should generate the same plot. | + | You should scan across the same values of ''keep_probability'', and you should generate a similar plot. |
Dropconnect seems to want more training steps than dropout, so you should run the optimizer for 1500 iterations. | Dropconnect seems to want more training steps than dropout, so you should run the optimizer for 1500 iterations. |