繁体   English   中英

使用Selenium WebDriver在python中编写文本文件时发生StaleElementException

[英]StaleElementException while writing a text file in python using selenium webdriver

我正在尝试从TripAdvisor刮取一家酒店的评论,并将其写在一个文本文件中。 到目前为止,代码运行良好,只是不时地在我正在编写文本文件的行上抛出StaleElementException。 这是我的代码:

for num in range(page_count):
    try:
        if num != 0:
            try:
                nxt = WebDriverWait(driver, 10).until(EC.presence_of_element_located((By.CSS_SELECTOR, "a.nav.next.rndBtn.ui_button.primary.taLnk")))
                #nxt = driver.find_element_by_css_selector("a.nav.next.rndBtn.ui_button.primary.taLnk")
                nxt.click()
                driver.implicitly_wait(5)
            except NoSuchElementException:
                driver.refresh()
                #driver.implicitly_wait(5)
                nxt = WebDriverWait(driver, 10).until(EC.presence_of_element_located((By.CSS_SELECTOR, "a.nav.next.rndBtn.ui_button.primary.taLnk")))
                nxt.click()
                driver.implicitly_wait(5)
        try:
            more = WebDriverWait(driver, 10).until(EC.element_to_be_clickable((By.CSS_SELECTOR, "span.taLnk.ulBlueLinks")))
            #more = driver.find_element_by_css_selector("span.taLnk.ulBlueLinks")
            more.click()
            time.sleep(1)
        except TimeoutException:
            print("There is no 'more' button on page %d" % (num+1))
        except WebDriverException:
            nxt = WebDriverWait(driver, 10).until(EC.presence_of_element_located((By.CSS_SELECTOR, "span.taLnk.ulBlueLinks")))
            nxt.click()
            driver.implicitly_wait(5)

        review_result = WebDriverWait(driver, 10).until(EC.presence_of_all_elements_located((By.CLASS_NAME, 'entry')))

        with open('New_Review.txt', 'a') as fid:
            for review in review_result:
                    fid.write(unidecode(review.text))
                    fid.write(sep)
                    fid.write(line_break)
        print ("processing done till page number %d" % (num+1))
    except StaleElementReferenceException:
        driver.refresh()
        driver.implicitly_wait(5)

        try:
            more = WebDriverWait(driver, 10).until(EC.element_to_be_clickable((By.CSS_SELECTOR, "span.taLnk.ulBlueLinks")))
            #more = driver.find_element_by_css_selector("span.taLnk.ulBlueLinks")
            more.click()
        except TimeoutException:
            print("There is no 'more' button on page %d" % (num+1))
        except WebDriverException:
            nxt = WebDriverWait(driver, 10).until(EC.presence_of_element_located((By.CSS_SELECTOR, "span.taLnk.ulBlueLinks")))
            nxt.click()
            driver.implicitly_wait(5)

        review_result = WebDriverWait(driver, 10).until(EC.presence_of_all_elements_located((By.CLASS_NAME, 'entry')))

        #print (review_result[1].text)
        with open('New_Review.csv', 'a') as fid:
            writer = csv.writer(fid, delimiter = ',', line_break = '\n')
            for review in review_result:
                    fid.write(unidecode(review.text))
                    fid.write(sep)
                    fid.write(line_break)
        print ("processing done till page number %d" % (num+1))    

这是错误:

StaleElementReferenceException:失效元素引用:元素未附加到页面文档

追溯给出了这一行:

fid.write(unidecode(review.text))

我已经尝试处理该异常,但是它对我不起作用,并且我很难确定我到底在哪里错。 任何帮助表示赞赏!

尝试创建辅助方法,例如

def get_text(locator):
   staled = True
   while staled:
     try:
        return WebDriverWait(driver, 10).until(EC.presence_of_element(locator)).text
    except StaleElementReferenceException:
       `log something or limit retry to certain times`

然后更改获取文字的方式

review_result = WebDriverWait(driver, 10).until(EC.presence_of_all_elements_located())
num_of_review - review_result.length
        with open('New_Review.txt', 'a') as fid:
            for index in range(1, num_of_review):
                    review_text = get_text((By.XPATH, "//*[@class='entry'][index]"))
                    fid.write(unidecode(review_text))
                    fid.write(sep)
                    fid.write(line_break)

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM