Fisher's Linear Discriminant Function

Motivation

We try to find a weight vector $w$ such that the Hyper Plane is $⊥$ to it.

center

The position of the Hyper Plane however is not known. We can find it using the following methods.

High-level Heuristic for Fisher’s LDF

center

Project Data onto a Line: The first step is to calculate the projection of the data onto a line, which is represented by the weight vector.
Define the Hyperplane: The discriminant hyperplane (the decision boundary) is orthogonal to the projection line.
Optimize the Hyperplane: The position of this hyperplane is then optimized to achieve the best possible separation between the classes.
Choose a Threshold: Finally, a threshold value ( $ω_{0}$ ) is selected to be used for the final discrimination between classes

Fisher Criterion

To find the Optimum classification, we maximize the Fisher Criterion. For class mean $m$ and projected line $w$ :

w * = ar g w max J (w) = \frac{( m _{2}^{'} - m _{1}^{'} ) ^{2}}{s _{1}^{2} + s _{2}^{2}} = \frac{( w ^{T} m _{2} - w ^{T} m _{1} ) ^{2}}{s _{1}^{2} + s _{2}^{2}}

We do this because it:

Maximizes Inter-Class Variance: The numerator, $(m_{2}^{'} - m_{1}^{'})^{2}$ , represents the squared distance between the means of the projected classes. Maximizing this term pushes the centers of the different classes as far apart as possible.
Minimizes Intra-Class Variance: The denominator, $s_{1}^{2} + s_{2}^{2}$ , represents the sum of the variances within each projected class. By minimizing this term, the criterion ensures that the data points within each class are tightly clustered around their respective centers.

By optimizing both of these objectives at the same time, the Fisher criterion finds a projection that reduces the overlap between the classes, making them easier to separate with a simple threshold.

source

Ashu's Online Notes

Explorer

Fisher's Linear Discriminant Function

Motivation

High-level Heuristic for Fisher’s LDF

Fisher Criterion

Graph View

Table of Contents

Backlinks