Therefore, it is desirable if the global motion model we need to recover is translation, which has only two parameters. It turns out that for a pure panning motion, if we convert two images to their cylindrical maps with known focal length, the relationship between them can be represented by a translation.

Here is an example of how cylindrical maps are constructed with differently with f’s. Note how straight lines are curved.

Similarly, we can map an image to its longitude/latitude spherical coordinates as well if f is given.