Hyperparameter optimization

The same kind of machine learning model can require different constraints, weights, or learning rates to generalize to different data patterns. These measures are called hyperparameters, and they have to be tuned so that the model can optimally solve the machine learning problem. Hyperparameter optimization finds a tuple of hyperparameters that yields an optimal model, i.e. one which minimizes a predefined [[loss function]] on given independent data.<ref name=abs1502.02127>{{cite arXiv |last1=Claesen |first1=Marc |last2=De Moor |first2=Bart |title=Hyperparameter Search in Machine Learning |eprint=1502.02127 |year=2015}}</ref> The objective function takes a tuple of hyperparameters and returns the associated loss.<ref name=abs1502.02127/> [[Cross-validation (statistics)|Cross-validation]] is often used to estimate this generalization performance.<ref name="bergstra" />
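The objective described above can be sketched in a few lines of Python. The `objective` function here is a hypothetical stand-in, not part of any real library: in practice it would return a cross-validated loss estimate for a model trained with the given hyperparameters. The sketch simply evaluates every candidate tuple and keeps the one with the lowest loss.

```python
import itertools

def objective(learning_rate, reg_strength):
    # Toy loss surface for illustration only; a real objective would
    # train a model with these hyperparameters and return its
    # cross-validated loss on held-out data.
    return (learning_rate - 0.1) ** 2 + (reg_strength - 0.01) ** 2

# Candidate values for each hyperparameter (assumed for this example).
learning_rates = [0.01, 0.1, 1.0]
reg_strengths = [0.001, 0.01, 0.1]

# Exhaustively evaluate every hyperparameter tuple and keep the best.
best_tuple, best_loss = None, float("inf")
for lr, reg in itertools.product(learning_rates, reg_strengths):
    loss = objective(lr, reg)
    if loss < best_loss:
        best_tuple, best_loss = (lr, reg), loss

print(best_tuple)  # tuple of hyperparameters minimizing the toy loss
```

This exhaustive evaluation over a fixed set of candidate tuples is exactly the grid-search strategy discussed in the next section; the other approaches differ only in how they choose which tuples to evaluate.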
 
== Approaches ==
 
=== Grid search ===