Doesn't Hinton's weight decay scheme help to keep the activations from going to their extreme values of 0 and 1? This would seem to make a feed forward net more analog-like, since the sigmoids try to remain more linear rather than step-like.