Using PIL (Python Image Library) to detect image on screen

Question

I am trying to understand how I can use PIL in Python 2.7 to search the whole screen for a certain image and click on it. I've been searching around and haven't been able to find a solution. I want to create a small GUI with one button in the middle of it that when clicked will search the entire screen for a predefined image. Once the image is found the program will then click in the centre of it and end. In short the program will detect if an image is present on the users screen and click it.

I did find an interesting bit on Sikuli, but that doesn't help me because it's unable to export to an .exe.

The image that the program will look for will most likely be in the same place each time it searches, but I didn't want to hard-code the location as it has the potential to move and I don't want that being an issue later on.

What I need is the code method I would use to search for the image on screen and send back the cords to a variable.

Image explanation/example:

Reference image of rifle:

Answer 1

PIL is the wrong tool for this job. Instead you should look into openCV (open source computer vision), which has fantastic python bindings. Here is a link to an example (in C but should be easy to redo with the python bindings) that does what you are looking for, but even allows the image to be rotated, scaled, etc.

http://docs.opencv.org/doc/tutorials/features2d/feature_homography/feature_homography.html http://docs.opencv.org/doc/tutorials/features2d/detection_of_planar_objects/detection_of_planar_objects.html

Edit:

I assume you are using windows, as your example image looks like window. In this case you can use:

from PIL import ImageGrab
pil_img = ImageGrab.grab()
opencv_img = numpy.array(pil_img)

then use opencv to process the image to find sub image you are looking for.

If you want to do this cross platform, then you will need to use wxWidgets to do the screengrab: https://stackoverflow.com/a/10089645/455532

Answer 2

Even I wanted to do the same but using different module - pyautogui. I finally found the solution for my problem and I am sure this solution will also help you. You have to just go to this webpage and read the locate function topic completely and you'll be able to solve your problem.

Answer 3

我推荐你看一下 PyAutoGUI，一个有据可查的控制鼠标和键盘的库，也可以在屏幕上定位图像，找到位置，将鼠标移动到任何位置并单击位置，还可以模拟拖放，输入输入字段，双击等等。

Using PIL (Python Image Library) to detect image on screen

Question

3 answers

solution1
7 ACCPTED 2014-12-07 18:45:48

solution2
2 2020-09-29 12:42:01

solution3
0 2022-07-16 22:28:57

Using PIL (Python Image Library) to detect image on screen

Question

3 answers

solution1 7 ACCPTED 2014-12-07 18:45:48

solution2 2 2020-09-29 12:42:01

solution3 0 2022-07-16 22:28:57

solution1
7 ACCPTED 2014-12-07 18:45:48

solution2
2 2020-09-29 12:42:01

solution3
0 2022-07-16 22:28:57