Harris Corner Detection Explained | Baeldung on Computer Science

1. Overview

In this tutorial, we’ll talk about Harris Corner Detection, a mathematical approach to detecting corners in an image. First, we’ll illustrate the mathematical formulation. Then, we’ll show the step-by-step procedure to perform the Harris Corner Detector.

2. Mathematical Formulation

A corner is a point whose local neighborhood is characterized by large intensity variation in all directions. Corners are important features in computer vision because they are points stable over changes of viewpoint and illumination. The so-called Harris Corner Detector was introduced by Chris Harris and Mike Stephens in 1988 in the paper “A Combined Corner and Edge Detector”.

Let’s consider a two-dimensional image $I$ and a patch $W$ of size $m \times m$ centered in $(x_0,y_0)$ . We want to evaluate the intensity variation occurred if the window $W$ is shifted by a small amount $(u, v)$ . Such variation can be estimated by computing the Sum of Squared Differences (SSD):

(1) $\begin{equation*} \hbox{SSD}(u, v)=\sum_{\left(x, y)\right \in W} g(x,y) \left[I\left(x, y\right)-I\left(x+u, y+v\right)\right]^{2} \end{equation*}$

where $g(x,y)$ is a window function that can be a rectangular or a Gaussian function. We need to maximize the function $\hbox{SSD}(u, v)$ for corner detection. Since $u$ and $v$ are small, the shifted intensity $I(x+u, y+v)$ can be approximated by the following first-order Taylor expansion:

(2) $\begin{equation*} I(x+u, y+v) \approx I(x, y)+u I_{x}(x, y)+v I_{y}(x, y) \end{equation*}$

where $I_x$ and $I_y$ are partial derivatives of $I$ in $x$ and $y$ direction, respectively. By substituting (2) in (1) we obtain:

(3) $\begin{equation*} \hbox{SSD}(u, v) \approx \sum_{\left(x, y)\right \in W} g(x, y)\left[u^{2} I_{x}^{2}+2 u v I_{x} I_{y}+v^{2} I_{y}^{2}\right] . \end{equation*}$

The equation (3) can be expressed in the following matrix form:

(4) $\begin{equation*} \hbox{SSD}(u, v) \approx\left[\begin{array}{ll} u & v \end{array}\right] M\left[\begin{array}{l} u \\ v \end{array}\right] \end{equation*}$

where $M$ is a $2 \times 2$ matrix computed from image derivatives:

(5) $\begin{equation*} M=\sum_{\left(x, y)\right \in W}g(x, y)\left[\begin{array}{ll} I_{x} I_{x} & I_{x} I_{y} \\ I_{x} I_{y} & I_{y} I_{y} \end{array}\right] \end{equation*}$

The matrix $M$ is called structure tensor.

The Harris detector uses the following response function that scores the presence of a corner within the patch:

(6) $\begin{equation*} R = \operatorname{det}(M)-k \operatorname{tr}(M)^{2} . \end{equation*}$

$k$ is a constant to chose in the range $[0.04, 0.06]$ . Since $M$ is a symmetric matrix, $\operatorname{det}(M) = \lambda_1 \lambda_2$ and $\operatorname{tr}(M) = \lambda_1 + \lambda_2$ where $\lambda_1$ and $\lambda_2$ are the eigenvalues of $M$ . Hence, we can express the corner response as a function of the eigenvalues of the structure tensor:

(7) $\begin{equation*} R=\lambda_{1} \lambda_{2}-k\left(\lambda_{1}+\lambda_{2}\right)^{2} . \end{equation*}$

So the eigenvalues determine whether a region is an edge, a corner or flat:

if $\lambda_1$ and $\lambda_2$ are small, then $|R|$ is small and the region is flat;
if $\lambda_1>> \lambda_2$ or viceversa, then $R <0$ and the region is an edge;
if $\lambda_1 \approx \lambda_2$ and both eigenvalues are large, then R is large and the region is a corner.

The classification of the points using the eigenvalues of the structure tensor is represented in the following figure:

3. Algorithm

Harris corner detector consists of the following steps.

Convert the original image into a grayscale image $I$ . The pixel values of $I$ are computed as a weighted sum of the corresponding $R$ , $G$ , $B$ values:
(8) $\begin{equation*} I = 0.299 \cdot R+0.587 \cdot G+0.114 \cdot B \end{equation*}$
Compute the derivatives $I_x$ and $I_y$ by convolving the image $I$ with the Sobel operator:
(9) $\begin{equation*} I_x = \begin{bmatrix} +1 & 0 & -1 \\ +2 & 0 & -2 \\ +1 & 0 & -1 \end{bmatrix} * I \quad \mbox{and} \quad I_y = \begin{bmatrix} +1 & +2 & +1 \\ 0 & 0 & 0 \\ -1 & -2 & -1 \end{bmatrix} * I \end{equation*}$
Compute the products of the derivatives $I_x I_x$ , $I_x I_y$ , $I_y I_y$ .
Convolve the images $I_x I_x$ , $I_x I_y$ , $I_y I_y$ with a Gaussian filter or a mean filter. Define the structure tensor for each pixel as expressed in eq. (5).
Compute the response function for each pixel:
(10) $\begin{equation*} R = \operatorname{det}(M)-k \operatorname{tr}(M)^{2} . \end{equation*}$
Set a threshold $T$ on the value of $R$ and find pixels with responses above this threshold. Finally, compute the non-max suppression in order to pick up the optimal corners.

4. Example

In the following figure we show a real application of the Harris Corner Detector. The corner response map (top-right image) was computed using Eq. 10 with k=0.04. The bottom-left image represents the thresholded corner response obtained with $T=10^8$ . The detected corners obtained after the application of the non-max suppression are shown in the bottom-right image.

5. Conclusion

In this article, we reviewed the Harris Corner Detector, a fundamental technique in computer vision. We explained the mathematical formulation, and finally, we explained how to implement it.

Core Concepts

Operating Systems

Artificial Intelligence

Graph Theory

Latex

Full Archive

About Baeldung

1. Overview

2. Mathematical Formulation

3. Algorithm

4. Example

5. Conclusion