Structure from motion solves
the following problem:
Given a set of images of a
static scene with 2D points in correspondence, shown here as color-coded
points, find…
a set of 3D points P and
a rotation R and position t of
the cameras that explain the observed correspondences. In other words, when we project a point into
any of the cameras, the reprojection error between the projected and observed
2D points is low.
This problem can be formulated
as an optimization problem where we want to find the rotations R, positions t,
and 3D point locations P that minimize sum of squared reprojection errors
f. This is a non-linear least squares
problem and can be solved with algorithms such as Levenberg-Marquart. However, because the problem is non-linear,
it can be susceptible to local minima.
Therefore, it’s important to initialize the parameters of the system
carefully. In addition, we need to be
able to deal with erroneous correspondences.