Revision as of 22:55, 16 March 2008 edit 99.231.84.156 (talk) →Training ← Previous edit		Revision as of 21:22, 6 October 2008 edit undo Robina Fox (talk \| contribs) Extended confirmed users, Pending changes reviewers 40,814 edits m →Training: link repair Next edit →
Line 14: == Training == An auto-encoder is often trained using one of the many [[~~Backpropagation~~backpropagation]] variants ([[~~Conjugate~~conjugate gradient~~\| Conjugate Gradient~~ ~~Method~~method]], [[~~Steepest~~steepest ~~Descent~~descent]], etc.) Though often reasonably effective, there are fundamental problems with using backpropagation to train networks with many hidden layers. Once the errors get backpropagated to the first few layers, they are minuscule, and quite ineffectual. This causes the network to almost always learn to reconstruct the average of all the training data. Though more advanced backpropagation methods (such as the ~~Conjugate~~conjugate ~~Gradient~~gradient ~~Method~~method) help with this to some degree, it still results in very slow learning and poor solutions. This problem is remedied by using initial weights that approximate the final solution. The process to find these initial weights is often called pretraining. A pretraining technique developed by [[Geoffrey Hinton]] for training many-layered "deep" auto-encoders involves treating each neighboring set of two layers like a [[Boltzmann machine#Restricted Boltzmann Machine\|Restricted Boltzmann Machine]] for pre-training to approximate a good solution and then using a backpropagation technique to fine-tune.

Autoencoder: Difference between revisions