
Learn through the super-clean Baeldung Pro experience:
>> Membership and Baeldung Pro.
No ads, dark-mode and 6 months free of IntelliJ Idea Ultimate to start with.
Last updated: March 18, 2024
In this tutorial, we’ll review two important concepts in computer vision, the Fundamental Matrix and the Essential Matrix.
Such matrices play a crucial role in determining the structure and motion of objects in a scene, and their understanding is essential for implementing various computer vision algorithms.
The fundamental matrix and the essential matrix are two matrices frequently used in stereo geometry whenever one wants to describe geometric relationships between pairs of images either recorded from a stereo camera or a monocular camera moving in the environment.
Suppose we have two images of the same scene and know a set of corresponding points within the two images. For example, we know that point in image A correspond to the points
in the image B:
The fundamental and essential matrices contain information about the relative orientation of the cameras that can be extracted from corresponding points.
The fundamental and the essential matrices are 3×3 homogeneous matrices with rank 2. Such rank deficiency is used for formulating the so-called coplanarity constraint that can be expressed as:
(1)
(2)
where and
are the fundamental and the essential matrix,
and
are the projections of the same point in the two images. The coplanarity constraint equations must hold for all corresponding points.
and
are generally unknowns, but they can be estimated given a set of corresponding points by solving a system of linear equations. This can be done with the so-called eight-point algorithm, which requires at least 8 corresponding points to estimate the essential or fundamental matrix straightforwardly. The method can work with more than 8 points, and additional points can improve the estimate’s accuracy.
Once and
are known, they can search for correspondences between images.
The main difference between the fundamental and the essential matrix is the type of information they encode.
The essential matrix encodes information about the rotation and translation related to the position and orientation of the cameras within the environment.
The fundamental matrix embeds the same information as
and contains information about the intrinsic parameters of both cameras. In other words, the matrix
is a purely geometric entity that has no connection to imagers properties. It maps the location of a point
in physical coordinates, as seen by the left camera, to the projection of the same point
, as seen by the right camera.
On the other hand, the fundamental matrix relates the location of the point in the two images using image coordinates (pixels). Therefore, the fundamental matrix is used for uncalibrated cameras, while the essential matrix is used for calibrated cameras.
Another difference is the number of degrees of freedom (DoF). has 7 DoF, while
has 5 DoF since it considers the cameras’ intrinsic parameters.
Here are some applications of the essential and fundamental matrix.
In this article, we reviewed the fundamental matrix and the essential matrix, two important tools in computer vision for determining the relationships between two images of a scene. We discussed their differences and provided their main applications in computer vision.