Nadaraya Watson Regressor

The Nadaraya-Watson kernel regression estimator uses kernel functions for weighting and is defined by the following formula

f (x_{q}) = i \sum y_{i} \frac{K _{σ} ( x _{i} - x _{q} )}{\sum _{j} K _{σ} ( x _{j} - x _{q} )}

The key parameter that has to be chosen is the bandwidth $σ$ of the kernel function, $K_{σ}$ (often a Gaussian kernel). This parameter controls the width of the kernel and thus the smoothness of the resulting function.

$f (x_{q})$ → Final predicted output value for the new, unseen input point $x_{q}$ .
$x_{q}$ → query point, which is the new input for which you want to make a prediction.
$K_{σ} (x_{i} - x_{q})$ : This is the kernel function. It measures the similarity or “closeness” between a training point $x_{i}$ and the new query point $x_{q}$ . The result is a scalar value that is typically large when the points are close and small when they are far apart. A common choice is the Gaussian kernel.
$\frac{K _{σ} ( x _{i} - x _{q} )}{\sum _{j} K _{σ} ( x _{j} - x _{q} )}$ : This entire fraction acts as a normalized weight. The numerator is the similarity of a single training point, and the denominator is the sum of similarities over all training points. This ensures all the weights sum to 1. The weight assigned to a training output $y_{i}$ is directly proportional to the similarity between its corresponding input $x_{i}$ and the query point $x_{q}$ .

Ashu's Online Notes

Explorer

Nadaraya Watson Regressor

Graph View

Backlinks