The cross entropy loss is closely related to the [[Kullback-Leibler divergence]] between the empirical distribution and the predicted distribution. Unlike many margin-based losses, it is not naturally expressed as a function of the product of the true label and the predicted value, but it is convex and can be minimized using [[stochastic gradient descent]] methods. The cross entropy loss is ubiquitous in modern [[deep learning|deep neural networks]].
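The relationship can be made explicit with the standard decomposition of cross entropy (a well-known identity, not stated in the original passage): for a true distribution <math>p</math> and a predicted distribution <math>q</math>,

:<math>H(p, q) = H(p) + D_{\mathrm{KL}}(p \parallel q),</math>

where <math>H(p)</math> is the entropy of <math>p</math>. Since <math>H(p)</math> does not depend on the model's predictions, minimizing the cross entropy <math>H(p, q)</math> over <math>q</math> is equivalent to minimizing the Kullback-Leibler divergence from the empirical distribution to the predicted distribution.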
== References ==