Revision as of 16:41, 12 February 2015 edit John of Reading (talk \| contribs) Autopatrolled, Extended confirmed users, Pending changes reviewers 787,571 edits m Typo fixing, replaced: the the → to the, typo(s) fixed: et al → et al. (4) using AWB ← Previous edit		Revision as of 08:29, 17 February 2015 edit undo 83.140.161.134 (talk) →Unsupervised learning: Missing math tags Next edit →
Line 98: ===Unsupervised learning=== [[Unsupervised learning\|Unsupervised]] multiple kernel learning algorithms have also been proposed by Zhuang et al. The problem is defined as follows. Let <math>U={x_i}</math> be a set of unlabeled data. The kernel definition is the linear combined kernel <math>K'=\sum_{i=1}^M\beta_iK_m</math>. In this problem, the data needs to be "clustered" into groups based on the kernel distances. Let <math>B_i</math> be a the group or cluster of which <math>x_i</math> is a member. We define the loss function as <math>\sum^n_{i=1}\left\Vert x_i - \sum_{x_j\in B_i} K(x_i,x_j)x_j\right\Vert^2</math>. Furthermore, we minimize the distortion by minimizing <math>\sum_{i=1}^n\sum_{x_j\in B_i}K(x_i,x_j)\left\Vert x_i - x_j \right\Vert^2</math>. Finally, we add a regularization term to avoid overfitting. Combining these terms, we can write the minimization problem as follows. :<math>\min_{\beta,B}\sum^n_{i=1}\left\Vert x_i - \sum_{x_j\in B_i} K(x_i,x_j)x_j\right\Vert^2 + \gamma_1\sum_{i=1}^n\sum_{x_j\in B_i}K(x_i,x_j)\left\Vert x_i - x_j \right\Vert^2 + \gamma_2\sum_i \|B_i\|</math>

Multiple kernel learning: Difference between revisions