簡體   English   中英

selenium 或者美湯從動態表格單元格中獲取數據

[英]selenium or beautiful soup to get data from dynamic table cell

我正在嘗試獲取 web 表內的字段文本,數據字段在表單元格內及其動態。我正在使用 python 腳本來完成此任務。 我已經通過下面嘗試過

1- get attributes using x path , innerText , innerHTML, textContent, value, resulting None or Html.
2 - beautiful soup  - Returning HTML and with lxml returning none

HTML:

<td>
<div class="fieldsbox" id="xfe54" style="visibility: visible;">
<input readonly="" isoutputcontrol="true" xformstype="output" id="policy_number"
 xql="tns:CHDRNUM" databoundelement="true" __parent="tblResults" class="input output" 
absolutexpath="tns:CHDRNUM" doebivalidate="false" title="Value for Policy No." style="" 
ref="tns:CHDRNUM" _intable="true"></div>
</td>

美麗的湯

url = "https://cms.bharti-axagi.co.in/home/CMS/com/bagi/cms/Loginforms/CMS_LoginScreen.caf"
    crom_driver.get(url)
    time.sleep(5)
    content = crom_driver.page_source
    soup = bs(content, "html.parser")
    data = soup.findAll("table", {"id": "CMS_CLAIMS_DETAILSTable"})

    print(data)

通過XPATH

    previous_Claim_Policy =  crom_driver.find_element_by_xpath(xpath_string).get_attribute('value')

    previous_Claim_Policy_inner_text = crom_driver.find_element_by_xpath(xpath_string).get_attribute('innerText')

    previous_Claim_Policy_txt = crom_driver.find_element_by_xpath(xpath_string).text

    previous_Claim_Policy_txt_content = crom_driver.find_element_by_xpath(xpath_string).get_attribute('textContent')

    previous_Claim_Policy_txt_html_inner = crom_driver.find_element_by_xpath(xpath_string).get_attribute('innerHTML')

我嘗試了完整的 x 路徑,它通過 get_attibute('value') 解決了

xpath_string=''/html/body/div[2]/div[3]/div/div[2]/fieldset/div/div/div[1]/div/div[2]/div/div/table/ tbody/tr[2]/td[2]/div/input''

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM