Use-case

Regularization keeps the weights of the regression model from growing too large. This matters because large weights usually cause over-fitting, which leads to poor performance on test data (the sketch below illustrates this). Regularization also suppresses oscillation and encourages smoothness (which is, however, an inductive bias in itself!)
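
A small numpy sketch of this effect, under assumptions not stated in the text (noisy sine data, a degree-9 polynomial, and a particular penalty strength chosen here for illustration): the unregularized least-squares weights blow up, while the L2-regularized (ridge) weights stay small.

```python
import numpy as np

# Illustrative data: 10 noisy samples of a sine curve (assumed, not from the text).
rng = np.random.default_rng(0)
x = np.linspace(0, 1, 10)
t = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=x.shape)

degree = 9
Phi = np.vander(x, degree + 1, increasing=True)   # design matrix [1, x, x^2, ...]

# Unregularized least squares: with as many parameters as points, the weights grow huge.
w_ols = np.linalg.lstsq(Phi, t, rcond=None)[0]

# L2-regularized (ridge) solution: solve (Phi^T Phi + lam*I) w = Phi^T t
lam = 1e-3
w_ridge = np.linalg.solve(Phi.T @ Phi + lam * np.eye(degree + 1), Phi.T @ t)

print("max |w| without regularization:", np.abs(w_ols).max())
print("max |w| with regularization:   ", np.abs(w_ridge).max())
```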

Mathematically

The regularized error function is given by:

$$\tilde{E}(\mathbf{w}) = \frac{1}{2}\sum_{n=1}^{N}\bigl(y(x_n, \mathbf{w}) - t_n\bigr)^2 + \frac{\lambda}{2}\lVert\mathbf{w}\rVert^2$$

where:

  • $\lambda$ balances the regularization term against the error term; a suitable value has to be chosen separately for each model.
  • $\frac{1}{2}$ is a scaling factor that simplifies the derivative during optimization (calculating the gradient); see the sketch after this list.
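
A minimal sketch of this error function and its gradient for a linear-in-the-parameters model $y(x, \mathbf{w}) = \boldsymbol{\phi}(x)^\top \mathbf{w}$; the function names and the random data below are assumptions made for illustration. The gradient shows how the factor of $\frac{1}{2}$ cancels the 2 produced by differentiating the squares.

```python
import numpy as np

def regularized_error(w, Phi, t, lam):
    # E(w) = 1/2 * sum_n (Phi w - t)_n^2 + lam/2 * ||w||^2
    residual = Phi @ w - t
    return 0.5 * residual @ residual + 0.5 * lam * w @ w

def regularized_error_grad(w, Phi, t, lam):
    # d/dw [1/2 ||Phi w - t||^2] = Phi^T (Phi w - t)   (the 1/2 cancels the 2)
    # d/dw [lam/2 ||w||^2]       = lam * w
    return Phi.T @ (Phi @ w - t) + lam * w

# Tiny usage example with random data (illustrative only).
rng = np.random.default_rng(0)
Phi = rng.normal(size=(20, 4))   # 20 samples, 4 basis functions
t = rng.normal(size=20)
w = np.zeros(4)
print(regularized_error(w, Phi, t, lam=0.1))
print(regularized_error_grad(w, Phi, t, lam=0.1))
```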