Prev Tutorial: Building OpenCV for Tegra with CUDA
Next Tutorial: Writing documentation for OpenCV
| |
Original author | Ana Huamán |
Compatibility | OpenCV >= 3.4.4 |
- Warning
- This tutorial can contain obsolete information.
Goal
In this tutorial you will learn how to:
Source Code
C++
- Downloadable code: Click here
- Code at glance:
#include <iostream>
int main()
{
{
std::cout << "Could not read the image: " << image_path << std::endl;
return 1;
}
imshow(
"Display window", img);
if(k == 's')
{
}
return 0;
}
Python
- Downloadable code: Click here
- Code at glance:
import cv2 as cv
import sys
if img is None:
sys.exit("Could not read the image.")
if k == ord("s"):
Explanation
C++
In OpenCV 3 we have multiple modules. Each one takes care of a different area or approach towards image processing. You could already observe this in the structure of the user guide of these tutorials itself. Before you use any of them you first need to include the header files where the content of each individual module is declared.
You'll almost always end up using the:
- core section, as here are defined the basic building blocks of the library
- imgcodecs module, which provides functions for reading and writing
- highgui module, as this contains the functions to show an image in a window
We also include the iostream to facilitate console line output and input.
By declaring using namespace cv;
, in the following, the library functions can be accessed without explicitly stating the namespace.
Python
As a first step, the OpenCV python library is imported. The proper way to do this is to additionally assign it the name cv, which is used in the following to reference the library.
import cv2 as cv
import sys
Now, let's analyze the main code. As a first step, we read the image "starry_night.jpg" from the OpenCV samples. In order to do so, a call to the cv::imread function loads the image using the file path specified by the first argument. The second argument is optional and specifies the format in which we want the image. This may be:
- IMREAD_COLOR loads the image in the BGR 8-bit format. This is the default that is used here.
- IMREAD_UNCHANGED loads the image as is (including the alpha channel if present)
- IMREAD_GRAYSCALE loads the image as an intensity one
After reading in the image data will be stored in a cv::Mat object.
C++
Python
- Note
- OpenCV offers support for the image formats Windows bitmap (bmp), portable image formats (pbm, pgm, ppm) and Sun raster (sr, ras). With help of plugins (you need to specify to use them if you build yourself the library, nevertheless in the packages we ship present by default) you may also load image formats like JPEG (jpeg, jpg, jpe), JPEG 2000 (jp2 - codenamed in the CMake as Jasper), TIFF files (tiff, tif) and portable network graphics (png). Furthermore, OpenEXR is also a possibility.
Afterwards, a check is executed, if the image was loaded correctly.
C++
if(img.empty())
{
std::cout << "Could not read the image: " << image_path << std::endl;
return 1;
}
Python
if img is None:
sys.exit("Could not read the image.")
Then, the image is shown using a call to the cv::imshow function. The first argument is the title of the window and the second argument is the cv::Mat object that will be shown.
Because we want our window to be displayed until the user presses a key (otherwise the program would end far too quickly), we use the cv::waitKey function whose only parameter is just how long should it wait for a user input (measured in milliseconds). Zero means to wait forever. The return value is the key that was pressed.
C++
imshow(
"Display window", img);
Python
In the end, the image is written to a file if the pressed key was the "s"-key. For this the cv::imwrite function is called that has the file path and the cv::Mat object as an argument.
C++
Python