|
I am trying to build a regression model using Neural Network. The final cost measure is MAE on the output(one output unit, 200 input units) Right now all my hidden units have rectifier activation. The output unit is just a linear unit with pass-through activation. Is this an effective network? It seems the network can not learn efficiently, the error(even on training set) oscillates. I tried to lower learning rate, but doesn seem to be able to find a value that makes the error go down monotonically I suspect the cost function( Thanks, Any help is appreciated! |
|
Obvious approach would be to try L2 norm. For example for ConvNets L2 SVM or L2+L1 SVM as cost function outperform L1 SVM as a cost function. ( L2 SVM as cost was introduced Y Tang, and my experience support his opinion) |