Python > Selenium + CSV: How to open links from list in .csv file, loop code, append data on csv?
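I'm currently building a scraper. I've managed to write the code, but I need some coding help or tutorial suggestions for the following steps:
1) Open a list of links stored in a .csv file with webdriver
2) Run the same scraping code for every link in the list
3) Append the output to a .csv file
Basic structure of the code: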
from selenium import webdriver
import time
import csv
from selenium.webdriver.common.keys import Keys

driver = webdriver.Chrome
#driver.get("...link from csv file..."), e.g. with open('links.csv', 'r') as file: etc...
time.sleep(5)
elements = driver.find_elements_by_class_name('data-xl')

csvfile = "output.csv"
with open(csvfile, "w", newline="") as output:
    writer = csv.writer(output)
    # column headers
    writer.writerow(["Reads", "Average Time Spent", "Impressions", "Read Time", "Likes", "Publication Shares", "Times Stacked", "Link-Outs"])

driver.quit()
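Questions:
1) How do I open a .csv with Python Selenium and visit the links one row at a time (1, +1, +1...)?
2) How do I loop the code over all the links visited in step (1) so that, even if an error occurs (e.g. "element not found"), processing continues with the next item in the .csv?
3) How do I create headers in the .csv? (Note: the code structure above is inaccurate.)
4) How do I write the output to the .csv in append mode, without overwriting existing rows?
Any tips on how to achieve the steps above would be helpful.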
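First, you have to change driver = webdriver.Chrome
to driver = webdriver.Chrome()
Without the parentheses, driver is only a reference to the Chrome class, and no browser instance is created.
Here is the full code.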
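from selenium import webdriver
from selenium.webdriver.common.by import By
import time
import csv

# link.csv below (one URL per row)
# https://google.com
# https://google.com
# https://google.com

driver = webdriver.Chrome()

f = open('link.csv', 'r', encoding='utf-8')
reader = csv.reader(f)
w = open('output.csv', 'w', newline="", encoding="utf-8")
writer = csv.writer(w)

# Write the header row once, before visiting any link
writer.writerow(
    ["Reads", "Average Time Spent", "Impressions", "Read Time", "Likes", "Publication Shares", "Times Stacked",
     "Link-Outs"])

for line in reader:
    # The first column of each row holds the URL
    driver.get(line[0])
    time.sleep(5)
    # find_element_by_xpath was removed in Selenium 4; use find_element(By.XPATH, ...)
    element = driver.find_element(By.XPATH, '//img[@alt="Google"]')
    # Build one data row from the located element(s) and pass it to writer.writerow(...)

f.close()
w.close()
driver.quit()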
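
The code above opens output.csv in "w" mode, which truncates the file on every run, and it stops at the first link that raises an error, so questions (2) and (4) are still open. What follows is only a minimal sketch of one way to handle them, not a definitive implementation. The helper names read_links, append_row and scrape_all are hypothetical, and scrape_one stands for whatever function does the actual Selenium work for a single link. The browser is deliberately kept out of the sketch so that it runs on its own.

```python
import csv
import os

# Header names taken from the question; adjust to the real columns.
HEADERS = ["Reads", "Average Time Spent", "Impressions", "Read Time",
           "Likes", "Publication Shares", "Times Stacked", "Link-Outs"]


def read_links(path):
    """Return the first column of every non-empty row of the CSV at `path`."""
    with open(path, newline="", encoding="utf-8") as f:
        return [row[0].strip() for row in csv.reader(f) if row and row[0].strip()]


def append_row(path, row, headers=HEADERS):
    """Append `row` to `path`; write `headers` first if the file is new or empty."""
    need_header = not os.path.exists(path) or os.path.getsize(path) == 0
    with open(path, "a", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        if need_header:
            writer.writerow(headers)
        writer.writerow(row)


def scrape_all(links, scrape_one, out_path):
    """Call `scrape_one(link)` for each link and append the returned row.

    A link whose scrape raises an exception is reported and skipped, so the
    loop always moves on to the next item. Returns the list of failed links.
    """
    failed = []
    for link in links:
        try:
            row = scrape_one(link)
        except Exception as exc:  # e.g. Selenium's NoSuchElementException
            print(f"Skipping {link}: {exc}")
            failed.append(link)
            continue
        append_row(out_path, row)
    return failed
```

In the real script, scrape_one would call driver.get(link), wait for the page, and return a list of values built from something like driver.find_elements(By.CLASS_NAME, 'data-xl'). Because each row is appended as soon as it is scraped, rows that were already written survive a crash later in the run, and running the script again adds to output.csv instead of replacing it.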