如何通過圖像查找元素

Question

眾所周知，硒支持多種定位器策略以在網頁上查找元素。

但是我的要求是不同的，我有一些站點，其中硒支持的任何定位器不足以唯一地找到元素。

由於硒提供了創建自己的自定義定位器策略來查找元素的便利，我正在嘗試創建圖像定位器，它可以像使用appium一樣使用子圖像的base64 String來查找元素。

圖像定位器的要點：

使用URL啟動瀏覽器
捕獲頁面的屏幕截圖
從屏幕快照中檢測子圖像的x ， y位置
在頁面中使用x ， y位置查找元素

為了完成此任務，我將創建自定義Image定位器，如下所示：

public class ByImage extends By {

    String imageBase64String

    /**
     * @param imageBase64String
     */
    public ByImage(String imageBase64String) {
        this.imageBase64String = imageBase64String
    }

    @Override
    public List<WebElement> findElement(SearchContext context) {
        List<WebElement> els = findElements(context)
        if (els) {
            return els.get(0)
        }
        throw new NoSuchElementException("Element not found")
    }

    @Override
    public List<WebElement> findElements(SearchContext context) {
       //Get current screenshot
        byte[] screenshotByte = ((TakesScreenshot)context).getScreenshotAs(OutputType.BYTES))
        byte[] subImgToFindByte = DatatypeConverter.parseBase64Binary(imageBase64String)
        //Convert buffred image to get height and width of subimage
        BufferedImage bufferedSubImgToFind = ImageIO.read(new ByteArrayInputStream(subImgToFindByte ));

        //Here I need a mechanism to get coordinates of sub image from screenshot
        //Suppose I able to find x, y
        double x
        double y

        //Now find element using coordinates
        //Now calculate center point
        int centerX = int(x + (bufferedSubImgToFind.getWidth() / 2))
        int centerY = int(y + (bufferedSubImgToFind.getHeight() / 2))

        JavascriptExecutor js = ((JavascriptExecutor)context)

        return js.executeScript("return document.elementsFromPoint(arguments[0], arguments[1]);", centerX, centerY)
      }   
  }

現在測試用例為：

WebDriver driver = new ChromeDriver()
driver.get("<URL>")
WebElement elementByImage = driver.findElement(new ByImage("<Base64 String of the subimage>"))

我能做到的一切，除了一個更好的庫來檢測的精確坐標subimage從image找到使用坐標的元素。

有人可以建議我一種更好的方法來完成此任務嗎？

Answer 1

您可以選擇不同的選項，例如：

您可以使用Java Bindings for OpenCV來在主屏幕截圖中查找子圖像，請參閱“ 模板匹配”文章以獲取全面的說明和代碼段。
Sikuli項目提供了一些用於圖像識別/交互的簡單API
SeeTest Automation為圖像模板提供圖像識別和對象庫模式實現

Answer 2

正如@Dmitri所建議的那樣，我將使用OpenCV的Java綁定。

下載適當的OpenCV並將其解壓縮到classpath然后嘗試獲取坐標為：

import org.opencv.core.Core;
import org.opencv.core.Core.MinMaxLocResult;
import org.opencv.core.CvType;
import org.opencv.core.Mat;
import org.opencv.core.MatOfByte;
import org.opencv.core.Point;
import org.opencv.imgcodecs.Imgcodecs;
import org.opencv.imgproc.Imgproc;

byte[] screenshotByte = ((TakesScreenshot)context).getScreenshotAs(OutputType.BYTES))
byte[] subImgToFindByte = DatatypeConverter.parseBase64Binary(imageBase64String)

System.loadLibrary(Core.NATIVE_LIBRARY_NAME);
Mat source = Imgcodecs.imdecode(new MatOfByte(screenshotByte), Imgcodecs.IMREAD_UNCHANGED);
Mat template = Imgcodecs.imdecode(new MatOfByte(subImgToFindByte), Imgcodecs.IMREAD_UNCHANGED);

int result_cols = source.cols() - template.cols() + 1;
int result_rows = source.rows() - template.rows() + 1;
Mat outputImage = new Mat(result_rows, result_cols, CvType.CV_32FC1);

// Template matching method
Imgproc.matchTemplate(source, template, outputImage, Imgproc.TM_SQDIFF_NORMED);

MinMaxLocResult mmr = Core.minMaxLoc(outputImage);
// Now get the point
Point point = mmr.minLoc;
double x = point.x;
double y = point.y;

//Now get the find the element using x, y after calculating center point.
int centerX = int(x + (bufferedSubImgToFind.getWidth() / 2));
int centerY = int(y + (bufferedSubImgToFind.getHeight() / 2));

WebElement el = js.executeScript("return document.elementFromPoint(arguments[0], arguments[1]);", centerX, centerY);

希望對大家有幫助。

如何通過圖像查找元素

問題描述

2 個解決方案

解決方案1
2 2019-06-19 11:21:10

解決方案2
-1 已采納 2019-06-19 11:55:46

如何通過圖像查找元素

問題描述

2 個解決方案

解決方案1 2 2019-06-19 11:21:10

解決方案2 -1 已采納 2019-06-19 11:55:46

解決方案1
2 2019-06-19 11:21:10

解決方案2
-1 已采納 2019-06-19 11:55:46