Regularization perspectives on support vector machines

 
Regularization perspectives on support vector machines interpret SVM as a special case of Tikhonov regularization, specifically Tikhonov regularization with the [[hinge loss]] as the loss function. This provides a theoretical framework with which to analyze SVM algorithms and compare them to other algorithms with the same goal: to [[generalize]] without [[overfitting]]. SVM was first proposed in 1995 by [[Corinna Cortes]] and [[Vladimir Vapnik]], and framed geometrically as a method for finding [[hyperplane]]s that can separate [[multidimensional]] data into two categories.<ref>{{cite journal|last=Cortes|first=Corinna|author2=Vladimir Vapnik |title=Support-Vector Networks|journal=Machine Learning|year=1995|volume=20|pages=273–297|doi=10.1007/BF00994018|url=http://www.springerlink.com/content/k238jx04hm87j80g/?MUD=MP}}</ref> This traditional geometric interpretation of SVMs provides useful intuition about how SVMs work, but is difficult to relate to other [[machine learning]] techniques for avoiding overfitting, such as [[regularization (mathematics)|regularization]], [[early stopping]], [[sparsity]] and [[Bayesian inference]]. However, once it was discovered that SVM is also a [[special case]] of Tikhonov regularization, regularization perspectives on SVM provided the theory necessary to fit SVM within a broader class of algorithms.<ref name="rosasco1"/><ref>{{cite book|last=Rifkin|first=Ryan|title=Everything Old is New Again: A Fresh Look at Historical Approaches in Machine Learning|year=2002|publisher=MIT (PhD thesis)|url=http://web.mit.edu/~9.520/www/Papers/thesis-rifkin.pdf}}
</ref><ref name="Lee 2012 67–81">{{cite journal|last=Lee|first=Yoonkyung|author1-link= Yoonkyung Lee |first2=Grace|last2=Wahba|author2-link=Grace Wahba |title=Multicategory Support Vector Machines|journal=Journal of the American Statistical Association|year=2004|volume=99|issue=465|pages=67–81|doi=10.1198/016214504000000098|url=http://www.tandfonline.com/doi/abs/10.1198/016214504000000098}}</ref> This has enabled detailed comparisons between SVM and other forms of Tikhonov regularization, and has provided theoretical grounding for why it is beneficial to use SVM's loss function, the hinge loss.<ref name="Rosasco 2004 1063–1076">{{cite journal|vauthors=Rosasco L, De Vito E, Caponnetto A, Piana M, Verri A |title=Are Loss Functions All the Same?|journal=Neural Computation|date=May 2004|volume=16|series=5|pages=1063–1076|doi=10.1162/089976604773135104|url=http://www.mitpressjournals.org/doi/pdf/10.1162/089976604773135104|pmid=15070510}}</ref>
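The regularization view described above can be made concrete with a minimal sketch: a linear SVM trained by minimizing the average hinge loss plus a Tikhonov (squared-norm) penalty. The toy data, the subgradient-descent solver, and the step-size schedule here are illustrative assumptions, not the formulations from the cited papers.

```python
import numpy as np

# Objective (linear case): (1/n) * sum_i max(0, 1 - y_i * <w, x_i>) + lam * ||w||^2
# Hypothetical two-class toy data; solved with plain subgradient descent.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 1, (20, 2)), rng.normal(2, 1, (20, 2))])
y = np.array([-1] * 20 + [1] * 20)

lam = 0.01          # regularization strength (assumed value)
w = np.zeros(2)
for t in range(1, 501):
    margins = y * (X @ w)
    # Subgradient of the average hinge loss: only points with margin < 1
    # (the "support" region) contribute; the L2 term adds 2*lam*w.
    grad = -(y[:, None] * X)[margins < 1].sum(axis=0) / len(X) + 2 * lam * w
    w -= grad / t   # decaying step size

accuracy = np.mean(np.sign(X @ w) == y)
```

Because the hinge loss is zero for points with margin at least 1, only boundary-adjacent points move `w`, which is the regularization-theoretic counterpart of the geometric support-vector picture.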
 
==Theoretical background==