Revision as of 21:45, 21 October 2024 by 45.93.75.83 (no edit summary), compared with revision as of 11:29, 12 January 2025 by 2a02:8109:b6b2:e00:202b:3a2b:6bc3:82dd (no edit summary)
{{Short description|The process of finding the optimal set of variables for a machine learning algorithm}}
In [[machine learning]], '''hyperparameter optimization'''<ref>Matthias Feurer and Frank Hutter. [https://link.springer.com/content/pdf/10.1007%2F978-3-030-05318-5_1.pdf Hyperparameter optimization]. In: ''AutoML: Methods, Systems, Challenges'', pages 3–38.</ref> or tuning is the problem of choosing a set of optimal [[Hyperparameter (machine learning)|hyperparameters]] for a learning algorithm. A hyperparameter is a [[parameter]] whose value controls the learning process and must be set before that process starts.<ref>{{cite journal |last1=Yang|first1=Li|title=On hyperparameter optimization of machine learning algorithms: Theory and practice|journal=Neurocomputing|year=2020|volume=415|pages=295–316|doi=10.1016/j.neucom.2020.07.061|arxiv=2007.15745 }}</ref><ref>{{cite journal |vauthors=Franceschi L, Donini M, Perrone V, Klein A, Archambeau C, Seeger M, Pontil M, Frasconi P |title=Hyperparameter Optimization in Machine Learning |journal=arXiv preprint |year=2024 |volume= |issue= |pages= |arxiv=2410.22854 |url=https://arxiv.org/pdf/2410.22854}}</ref>

Hyperparameter optimization determines the set of hyperparameters that yields an optimal model, that is, one that minimizes a predefined [[loss function]] on a given [[data set]].<ref name=abs1502.02127>{{cite arXiv |eprint=1502.02127|last1=Claesen|first1=Marc|title=Hyperparameter Search in Machine Learning|author2=Bart De Moor|class=cs.LG|year=2015}}</ref> The objective function takes a set of hyperparameters and returns the associated loss.<ref name=abs1502.02127/> [[Cross-validation (statistics)|Cross-validation]] is often used to estimate the model's generalization performance, and therefore to choose the hyperparameter values that maximize it.<ref name="bergstra">{{cite journal|last1=Bergstra|first1=James|last2=Bengio|first2=Yoshua|year=2012|title=Random Search for Hyper-Parameter Optimization|url=http://jmlr.csail.mit.edu/papers/volume13/bergstra12a/bergstra12a.pdf|journal=Journal of Machine Learning Research|volume=13|pages=281–305}}</ref>
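
The idea of an objective function that maps a set of hyperparameters to a cross-validated loss can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and does not come from the cited sources: the learner is a one-dimensional ridge regression with a single hyperparameter <code>lam</code>, the data are synthetic, the helper names (<code>fit_ridge_1d</code>, <code>cv_loss</code>, <code>search</code>) are hypothetical, and the search is a plain exhaustive pass over a short list of candidate values.

```python
import random


def fit_ridge_1d(xs, ys, lam):
    """Closed-form 1-D ridge fit without intercept: w = sum(x*y) / (sum(x*x) + lam)."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)


def mse(w, xs, ys):
    """Mean squared error of the prediction w * x against y."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def cv_loss(lam, xs, ys, k=5):
    """Objective function: hyperparameter value -> k-fold cross-validated loss."""
    n = len(xs)
    fold_losses = []
    for fold in range(k):
        # Every k-th point (offset by the fold index) is held out for validation.
        val_idx = set(range(fold, n, k))
        train_x = [xs[i] for i in range(n) if i not in val_idx]
        train_y = [ys[i] for i in range(n) if i not in val_idx]
        val_x = [xs[i] for i in sorted(val_idx)]
        val_y = [ys[i] for i in sorted(val_idx)]
        w = fit_ridge_1d(train_x, train_y, lam)  # lam is fixed before fitting
        fold_losses.append(mse(w, val_x, val_y))
    return sum(fold_losses) / k


def search(candidates, xs, ys, k=5):
    """Evaluate the objective on every candidate and keep the lowest loss."""
    losses = {lam: cv_loss(lam, xs, ys, k) for lam in candidates}
    best = min(losses, key=losses.get)
    return best, losses


# Synthetic data (assumed for illustration): y = 2x + Gaussian noise.
rng = random.Random(0)
xs = [rng.uniform(-1.0, 1.0) for _ in range(40)]
ys = [2.0 * x + rng.gauss(0.0, 0.5) for x in xs]

candidates = [0.0, 0.01, 0.1, 1.0, 10.0, 100.0]
best_lam, losses = search(candidates, xs, ys)
print("selected lam:", best_lam)
```

The hyperparameter <code>lam</code> is never updated by the fitting step itself; it is chosen only by comparing the cross-validated losses returned by the objective function.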

Hyperparameter optimization: Difference between revisions