如何在 python 中使用 Selenium WebDriver 截取部分屏幕截圖？

Question

我為此搜索了很多，但找不到解決方案。 這是一個類似的問題，在 Java 中有一個可能的解決方案。

Python中是否有類似的解決方案？

Answer 1

除了 Selenium，這個例子還需要 PIL Imaging 庫。 有時這是作為標准庫之一放入的，有時不是，但是如果您沒有它，您可以使用pip install Pillow安裝它

from selenium import webdriver
from PIL import Image
from io import BytesIO

fox = webdriver.Firefox()
fox.get('http://stackoverflow.com/')

# now that we have the preliminary stuff out of the way time to get that image :D
element = fox.find_element_by_id('hlogo') # find part of the page you want image of
location = element.location
size = element.size
png = fox.get_screenshot_as_png() # saves screenshot of entire page
fox.quit()

im = Image.open(BytesIO(png)) # uses PIL library to open image in memory

left = location['x']
top = location['y']
right = location['x'] + size['width']
bottom = location['y'] + size['height']


im = im.crop((left, top, right, bottom)) # defines crop points
im.save('screenshot.png') # saves new cropped image

最后輸出是…… Stackoverflow 標志！！！

在此處輸入圖片說明

當然，這對於僅抓取靜態圖像來說是多余的，但是如果您想抓取需要 Javascript 才能到達的東西，這可能是一個可行的解決方案。

Answer 2

在 python3.5 中為我工作

from selenium import webdriver


fox = webdriver.Firefox()
fox.get('http://stackoverflow.com/')
image = fox.find_element_by_id('hlogo').screenshot_as_png

ps

保存到文件

image=driver.find_element_by_id('hlogo').screenshot(output_file_path)

Answer 3

我寫了這個有用的python3函數。

from base64 import b64decode
from wand.image import Image
from selenium.webdriver.remote.webelement import WebElement
from selenium.webdriver.common.action_chains import ActionChains
import math

def get_element_screenshot(element: WebElement) -> bytes:
    driver = element._parent
    ActionChains(driver).move_to_element(element).perform()  # focus
    src_base64 = driver.get_screenshot_as_base64()
    scr_png = b64decode(src_base64)
    scr_img = Image(blob=scr_png)

    x = element.location["x"]
    y = element.location["y"]
    w = element.size["width"]
    h = element.size["height"]
    scr_img.crop(
        left=math.floor(x),
        top=math.floor(y),
        width=math.ceil(w),
        height=math.ceil(h),
    )
    return scr_img.make_blob()

它以字節形式返回顯示元素的 png 圖像。 限制：元素必須適合視口。
您必須安裝魔杖模塊才能使用它。

Answer 4

這是一個執行此操作的函數，在傳遞給裁剪函數之前，必須將大小轉換為整數：

from PIL import Image
from StringIO import StringIO
def capture_element(element,driver):
  location = element.location
  size = element.size
  img = driver.get_screenshot_as_png()
  img = Image.open(StringIO(img))
  left = location['x']
  top = location['y']
  right = location['x'] + size['width']
  bottom = location['y'] + size['height']
  img = img.crop((int(left), int(top), int(right), int(bottom)))
  img.save('screenshot.png')

Answer 5

擴展評論以響應 RandomPhobia 的非常好的答案，這里有兩個具有正確導入語句的解決方案，它們將打開全屏屏幕截圖而無需先保存到文件：

from selenium import webdriver
from PIL import Image
from StringIO import StringIO
import base64

DRIVER = 'chromedriver'
browser = webdriver.Chrome(DRIVER)

browser.get( "http:\\\\www.bbc.co.uk" )

img 1 = Image.open(StringIO(base64.decodestring(browser.get_screenshot_as_base64())))

img 2 = Image.open(StringIO(browser.get_screenshot_as_png()))

因為我確定你的下一個問題是，“這很好，但哪個最快？”，這里是如何確定它（我發現第一種方法在一定距離內是最快的）：

import timeit

setup = '''
from selenium import webdriver
from PIL import Image
from StringIO import StringIO
import base64

DRIVER = 'chromedriver'
browser = webdriver.Chrome(DRIVER)
browser.get( "http:\\\\www.bbc.co.uk" )

file_name = 'tmp.png'
'''

print timeit.Timer('Image.open(StringIO(browser.get_screenshot_as_png()))', setup=setup).repeat(2, 10)
print timeit.Timer('Image.open(StringIO(base64.decodestring(browser.get_screenshot_as_base64())))', setup=setup).repeat(2, 10)
print timeit.Timer('browser.get_screenshot_as_file(file_name); pil_img = Image.open(file_name)', setup=setup).repeat(2, 10)

Answer 6

元素截圖：

from PIL import Image
from io import BytesIO


image = self.browser.driver.find_element_by_class_name('example.bla.bla').screenshot_as_png
im = Image.open(BytesIO(image))  # uses PIL library to open image in memory
im.save('example.png')

Answer 7

這個名叫“ Cherri”的家伙為Selenium創建了一個包含此庫的庫。

import SeleniumUrllib as selenium
selenium_urllib = selenium()
selenium_urllib.element_screenshot(selectbyid('elementid'),'path.png')
selenium_urllib.driver ## Access normal webdriver

Answer 8

就這么簡單：

element = driver.find_element_by_class_name('myclass')
element.screenshot('screenshot.png')

Answer 9

我將@randomphobia 的答案轉換為一個函數。 我還使用了@bummis 的建議，即使用location_once_scrolled_into_view而不是location以便概括無論頁面大小。

from selenium import webdriver
from PIL import Image
from io import BytesIO

def take_screenshot(element, driver, filename='screenshot.png'):
  location = element.location_once_scrolled_into_view
  size = element.size
  png = driver.get_screenshot_as_png() # saves screenshot of entire page

  im = Image.open(BytesIO(png)) # uses PIL library to open image in memory

  left = location['x']
  top = location['y']
  right = location['x'] + size['width']
  bottom = location['y'] + size['height']


  im = im.crop((left, top, right, bottom)) # defines crop points
  im.save(filename) # saves new cropped image

這是一個要點： https : //gist.github.com/WittmannF/b714d3ceb7b6a5cd50002f11fb5a4929

如何在 python 中使用 Selenium WebDriver 截取部分屏幕截圖？

問題描述

8 個解決方案

解決方案1
145 已采納 2013-04-08 03:16:37

解決方案2
39 2017-12-24 10:13:10

解決方案3
7 2016-06-20 14:17:52

解決方案4
6 2017-06-21 21:38:16

解決方案5
5 2018-01-28 20:13:56

解決方案6
5 2019-11-25 08:47:07

解決方案7
1 2015-04-12 21:52:43

解決方案8
1 2021-02-27 19:16:19

解決方案9
0 2020-01-10 21:11:30

如何在 python 中使用 Selenium WebDriver 截取部分屏幕截圖？

問題描述

8 個解決方案

解決方案1 145 已采納 2013-04-08 03:16:37

解決方案2 39 2017-12-24 10:13:10

解決方案3 7 2016-06-20 14:17:52

解決方案4 6 2017-06-21 21:38:16

解決方案5 5 2018-01-28 20:13:56

解決方案6 5 2019-11-25 08:47:07

解決方案7 1 2015-04-12 21:52:43

解決方案8 1 2021-02-27 19:16:19

解決方案9 0 2020-01-10 21:11:30

解決方案1
145 已采納 2013-04-08 03:16:37

解決方案2
39 2017-12-24 10:13:10

解決方案3
7 2016-06-20 14:17:52

解決方案4
6 2017-06-21 21:38:16

解決方案5
5 2018-01-28 20:13:56

解決方案6
5 2019-11-25 08:47:07

解決方案7
1 2015-04-12 21:52:43

解決方案8
1 2021-02-27 19:16:19

解決方案9
0 2020-01-10 21:11:30