Content deleted Content added
topic in bold |
|||
Line 97:
[[Pseudocode]] for a [[stochastic gradient descent]] algorithm for training a three-layer network (one hidden layer):
initialize network weights (often small random values).
'''do'''
'''for each''' training example named ex '''do'''
|