OpenCV  4.6.0 Open Source Computer Vision
Fourier Transform

## Goal

• To find the Fourier Transform of images using OpenCV
• Some applications of Fourier Transform
• We will learn following functions : cv.dft() etc

## Theory

Fourier Transform is used to analyze the frequency characteristics of various filters. For images, 2D Discrete Fourier Transform (DFT) is used to find the frequency domain. A fast algorithm called Fast Fourier Transform (FFT) is used for calculation of DFT. Details about these can be found in any image processing or signal processing textbooks.

For a sinusoidal signal, $$x(t) = A \sin(2 \pi ft)$$, we can say $$f$$ is the frequency of signal, and if its frequency domain is taken, we can see a spike at $$f$$. If signal is sampled to form a discrete signal, we get the same frequency domain, but is periodic in the range $$[- \pi, \pi]$$ or $$[0,2\pi]$$ (or $$[0,N]$$ for N-point DFT). You can consider an image as a signal which is sampled in two directions. So taking fourier transform in both X and Y directions gives you the frequency representation of image.

More intuitively, for the sinusoidal signal, if the amplitude varies so fast in short time, you can say it is a high frequency signal. If it varies slowly, it is a low frequency signal. You can extend the same idea to images. Where does the amplitude varies drastically in images ? At the edge points, or noises. So we can say, edges and noises are high frequency contents in an image. If there is no much changes in amplitude, it is a low frequency component.

Performance of DFT calculation is better for some array size. It is fastest when array size is power of two. The arrays whose size is a product of 2’s, 3’s, and 5’s are also processed quite efficiently. So if you are worried about the performance of your code, you can modify the size of the array to any optimal size (by padding zeros) before finding DFT. OpenCV provides a function, cv.getOptimalDFTSize() for this.

Now we will see how to find the Fourier Transform.

## Fourier Transform in OpenCV

Performance of DFT calculation is better for some array size. It is fastest when array size is power of two. The arrays whose size is a product of 2’s, 3’s, and 5’s are also processed quite efficiently. So if you are worried about the performance of your code, you can modify the size of the array to any optimal size (by padding zeros). So how do we find this optimal size ? OpenCV provides a function, cv.getOptimalDFTSize() for this.

We use the functions: cv.dft (src, dst, flags = 0, nonzeroRows = 0)

Parameters
 src input array that could be real or complex. dst output array whose size and type depends on the flags. flags transformation flags, representing a combination of the cv.DftFlags nonzeroRows when the parameter is not zero, the function assumes that only the first nonzeroRows rows of the input array (DFT_INVERSE is not set) or only the first nonzeroRows of the output array (DFT_INVERSE is set) contain non-zeros, thus, the function can handle the rest of the rows more efficiently and save some time; this technique is very useful for calculating array cross-correlation or convolution using DFT.

cv.getOptimalDFTSize (vecsize)

Parameters
 vecsize vector size.

cv.copyMakeBorder (src, dst, top, bottom, left, right, borderType, value = new cv.Scalar())

Parameters
 src input array that could be real or complex. dst output array whose size and type depends on the flags. top parameter specifying how many top pixels in each direction from the source image rectangle to extrapolate. bottom parameter specifying how many bottom pixels in each direction from the source image rectangle to extrapolate. left parameter specifying how many left pixels in each direction from the source image rectangle to extrapolate. right parameter specifying how many right pixels in each direction from the source image rectangle to extrapolate. borderType border type. value border value if borderType == cv.BORDER_CONSTANT.

cv.magnitude (x, y, magnitude)

Parameters
 x floating-point array of x-coordinates of the vectors. y floating-point array of y-coordinates of the vectors; it must have the same size as x. magnitude output array of the same size and type as x.

cv.split (m, mv)

Parameters
 m input multi-channel array. mv output vector of arrays; the arrays themselves are reallocated, if needed.

cv.merge (mv, dst)

Parameters
 mv input vector of matrices to be merged; all the matrices in mv must have the same size and the same depth. dst output array of the same size and the same depth as mv[0]; The number of channels will be the total number of channels in the matrix array.