MAP = Regularized Least Squares

MAP solution

MAP estimation with a zero-mean Gaussian prior on the weights and a Gaussian likelihood on the data is equivalent to minimizing the sum of squared errors with L2 regularization.

MAP with Gaussian prior = Minimizing squared error + regularization

Formula:

$$
w_{MAP} = \arg\max_{w} P(w \mid D) = \arg\min_{w} \; \underbrace{\frac{1}{2} \sum_{n} \left( t_{n} - w^{T} \phi(x_{n}) \right)^{2}}_{\text{from Likelihood (Squared Error)}} + \underbrace{\frac{\lambda}{2} \lVert w \rVert^{2}}_{\text{from Prior (L2 Regularization)}}
$$
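
One way to see where the two terms come from is to write out the negative log posterior. The sketch below assumes i.i.d. Gaussian noise with variance $\sigma^{2}$ on the targets and a zero-mean isotropic prior $\mathcal{N}(0, \sigma_{p}^{2} I)$ on the weights. The symbol $\sigma^{2}$ is introduced here for illustration only.

$$
\begin{aligned}
-\ln P(w \mid D) &= -\ln P(D \mid w) - \ln P(w) + \text{const} \\
&= \frac{1}{2\sigma^{2}} \sum_{n} \left( t_{n} - w^{T} \phi(x_{n}) \right)^{2} + \frac{1}{2\sigma_{p}^{2}} \lVert w \rVert^{2} + \text{const}
\end{aligned}
$$

Multiplying by $\sigma^{2}$ does not change the minimizer and gives the objective $\frac{1}{2} \sum_{n} ( t_{n} - w^{T} \phi(x_{n}) )^{2} + \frac{\lambda}{2} \lVert w \rVert^{2}$ with $\lambda = \sigma^{2} / \sigma_{p}^{2}$. Under the additional assumption $\sigma^{2} = 1$, this is $\lambda = 1 / \sigma_{p}^{2}$.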

Key Insight:

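The regularization strength $λ$ is inversely proportional to the variance of the Gaussian prior $(σ_{p}^{2})$. With the noise variance of the likelihood taken to be 1:

$$
\lambda = \frac{1}{\sigma_{p}^{2}}
$$

For a general noise variance $σ^{2}$ this becomes $λ = σ^{2} / σ_{p}^{2}$, so the inverse relationship to $σ_{p}^{2}$ is unchanged.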

Strong Regularization (large $λ$) $\leftrightarrow$ Small Prior Variance (weights are assumed to be near zero).
Weak Regularization (small $λ$) $\leftrightarrow$ Large Prior Variance (weights are allowed to vary more).
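
The effect of the prior variance on the weights can be checked numerically. The following is a minimal sketch, not a reference implementation: it assumes unit noise variance (so that $λ = 1/σ_{p}^{2}$), uses made-up synthetic data, and solves the regularized least squares problem in closed form. The function name `map_weights` and all data values are hypothetical.

```python
import numpy as np


def map_weights(Phi, t, lam):
    """Minimize 0.5 * ||t - Phi @ w||^2 + 0.5 * lam * ||w||^2 in closed form."""
    n_features = Phi.shape[1]
    # Setting the gradient to zero gives (Phi^T Phi + lam * I) w = Phi^T t.
    return np.linalg.solve(Phi.T @ Phi + lam * np.eye(n_features), Phi.T @ t)


# Synthetic data (made up for illustration): 50 points, 5 features, unit noise.
rng = np.random.default_rng(0)
Phi = rng.normal(size=(50, 5))
w_true = np.array([2.0, -1.0, 0.5, 0.0, 3.0])
t = Phi @ w_true + rng.normal(scale=1.0, size=50)

prior_variances = [100.0, 1.0, 0.01]
norms = []
for prior_var in prior_variances:
    lam = 1.0 / prior_var  # assumes noise variance 1
    w = map_weights(Phi, t, lam)
    norms.append(np.linalg.norm(w))
    print(f"prior variance = {prior_var:6.2f}  lambda = {lam:6.2f}  ||w|| = {norms[-1]:.3f}")
```

As the prior variance shrinks, $λ$ grows and the norm of the estimated weights decreases, which is the "weights are assumed to be near zero" behaviour described above.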

Key Point: The model is linear in the parameters $w$ but can be non-linear in the inputs $x$ through the choice of basis functions $ϕ(x)$.
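
To illustrate this point, the sketch below fits a non-linear target with a model that is still linear in $w$. It assumes a polynomial basis $ϕ(x) = (1, x, x^{2}, \dots)$, a sine-shaped target, and $λ = 10^{-3}$, all chosen purely for illustration. The helper names `poly_basis` and `fit_map` are hypothetical.

```python
import numpy as np


def poly_basis(x, degree):
    """Return the design matrix with columns 1, x, x^2, ..., x^degree."""
    return np.vander(x, N=degree + 1, increasing=True)


def fit_map(Phi, t, lam):
    """Closed-form minimizer of 0.5 * ||t - Phi @ w||^2 + 0.5 * lam * ||w||^2."""
    return np.linalg.solve(Phi.T @ Phi + lam * np.eye(Phi.shape[1]), Phi.T @ t)


# Non-linear target on a 1-D grid (illustrative data, no noise added).
x = np.linspace(-1.0, 1.0, 30)
t = np.sin(np.pi * x)
lam = 1e-3

mse = {}
for degree in (1, 5):
    Phi = poly_basis(x, degree)  # non-linear in x
    w = fit_map(Phi, t, lam)     # still a linear solve in w
    mse[degree] = np.mean((Phi @ w - t) ** 2)
    print(f"degree {degree}: {len(w)} weights, training MSE = {mse[degree]:.5f}")
```

Only the basis changes between the two fits. The solve for $w$ is the same linear system in both cases, yet the degree-5 basis can follow the curved target while the degree-1 basis cannot.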

Ashu's Online Notes
